Asking questions about protein structure determination

Click For Summary

Discussion Overview

The discussion revolves around theoretical approaches to determining protein structure and dynamics, particularly in the context of computational methods and modeling techniques. Participants explore the implications of DNA sequencing and the potential for computational power to aid in predicting protein structures.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested

Main Points Raised

  • Some participants inquire about purely theoretical approaches to protein structure determination, questioning the efficiency of computational methods based on one-dimensional DNA data.
  • Homology modeling is presented as a method that assumes similar sequences lead to similar structures, while physics-based modeling relies on force fields to predict structures ab initio.
  • There is mention of improvements in physics-based methods over the past decade, including the development of supercomputers for protein folding simulations.
  • Additional factors influencing protein configuration beyond the amino acid sequence are discussed, including interactions during ribosome translation and the role of chaperone proteins.
  • One participant notes that protein folding is considered a "hard" problem, with references to crowd-sourced efforts to tackle it.

Areas of Agreement / Disagreement

Participants express varying views on the effectiveness of different modeling approaches, with no consensus on a singular method being superior. The discussion reflects multiple competing perspectives on the challenges and advancements in protein structure prediction.

Contextual Notes

Some limitations are noted, such as the dependence on the accuracy of sequence comparisons in homology modeling and the complexities introduced by external factors like ribosomal interactions and chaperone proteins.

docnet
Messages
796
Reaction score
487
TL;DR
Asking questions about protein structure determination
Hi,

I tend to write down questions/thoughts that come to me as I read articles that are interestsing. I'm reading the intro to solution state NMR in a biophysics textbook.

Any there any purely theoretical approaches to determine protein structure / dynamics ?
The 1 dimensional digital code of DNA is easily obtained. And, wouldn’t it be more efficient to develop a sophisticated computer software that runs on pure computational power to determine the 3d structure, when we input the 1 dimensional data? Would this immense computational power? Does P = NP?

Leonard Susskind said calculations involving the folding of a hypothetical molecule of a hundred atoms results in an energy diagram with over 10^100 discrete “valleys” representing locally stable conformations. Still, with constraints given by the amino acids and their sequence, we would be able to lower that number significantly. Given proteins obey the laws of thermodynamics, will the task of determining protein structure be no longer a difficult task one day?
 
Biology news on Phys.org
docnet said:
Any there any purely theoretical approaches to determine protein structure / dynamics ?
The 1 dimensional digital code of DNA is easily obtained. And, wouldn’t it be more efficient to develop a sophisticated computer software that runs on pure computational power to determine the 3d structure, when we input the 1 dimensional data? Would this immense computational power? Does P = NP?

In general, there are two approaches to protein structure prediction 1) homology modeling and 2) physics-based modeling.

Homology modeling makes the assumption that similar sequences will lead to similar structures. These approaches will compare the amino acid sequence of interest to the many thousands of known protein structures and uses that information to guide structure prediction. In contrast, physics-based models start with just the amino acid sequence and a "force field" describing the various intermolecular forces that govern protein folding and use that information to predict the protein's structure ab initio.

Homology modeling approaches are by far the most accurate and efficient methods available, and their power only grows as more protein structures are determined. Of course, the most effective modeling tools combine elements of both approaches. For example, they would use homology modeling to build an initial model for the protein then refine the model using physics based methods.

There have, however, been major improvement to physics-based methods in the past decade. For example, a former hedge fund manager interested in the protein folding problem built a supercomputer specifically for performing protein folding simulations and has had success modeling the folding of small, fast folding proteins (see for example, https://www.nature.com/news/2010/101014/full/news.2010.541.html).
 
  • Like
Likes   Reactions: epenguin, jim mcnamara, docnet and 1 other person
Along the lines of the physics based approach, there are additional factors, beyond just the amino acid sequence, that can affect the 3D configuration.

Proteins are made from one end to the other.
As the forming amino acid string is extruded from the ribosome, the initial groups of amino acids will form little bits of more stable 3D configuration.
After later parts are added and extruded, new conformations might arise for the already extruded parts.
This might actually make the calculations easily due to the fewer parts involved initially.
Interactions with the parts of ribosomal RNA's and proteins that are exposed to the elongating peptide chain may also influence the conformations that the growing chain of amino acids experience as stable. Presumably the ribosomal influences would be common to most/all proteins, since most ribosomes would be expected to be the same.

However, there are cases where additional proteins (chaperone proteins) can interact the growing peptide chain, adding influences on the protein's stable configuration that are independent of the amino acid sequence's influence on the ultimate shape the protein settles into.

These can add additional wrinkles to making predictions.
 
  • Like
Likes   Reactions: docnet
Ygggdrasil said:
In general, there are two approaches to protein structure prediction 1) homology modeling and 2) physics-based modeling.

Homology modeling makes the assumption that similar sequences will lead to similar structures. These approaches will compare the amino acid sequence of interest to the many thousands of known protein structures and uses that information to guide structure prediction. In contrast, physics-based models start with just the amino acid sequence and a "force field" describing the various intermolecular forces that govern protein folding and use that information to predict the protein's structure ab initio.

Homology modeling approaches are by far the most accurate and efficient methods available, and their power only grows as more protein structures are determined. Of course, the most effective modeling tools combine elements of both approaches. For example, they would use homology modeling to build an initial model for the protein then refine the model using physics based methods.

There have, however, been major improvement to physics-based methods in the past decade. For example, a former hedge fund manager interested in the protein folding problem built a supercomputer specifically for performing protein folding simulations and has had success modeling the folding of small, fast folding proteins (see for example, https://www.nature.com/news/2010/101014/full/news.2010.541.html).

Thank you for the detailed answer and the link. I enjoy reading your intelligent responses to PF on a regular basis.
 
docnet said:
Thank you for the detailed answer and the link. I enjoy reading your intelligent responses to PF on a regular basis.

Thanks! Here are two articles on the subject of protein folding that have more information on predicting protein structure. The first is a shorter, more general and accessible review:
https://science.sciencemag.org/content/338/6110/1042.full

While the second is a much longer and more technical review article that goes into a lot more detail:
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2735084/

Both are somewhat old articles as I have not followed the field so closely in recent years, so unfortunately, I don't know off the top of my head any good newer pieces that might be of interest.
 
  • Like
Likes   Reactions: docnet

Similar threads

  • · Replies 1 ·
Replies
1
Views
14K
Replies
24
Views
8K
  • · Replies 20 ·
Replies
20
Views
8K
  • · Replies 94 ·
4
Replies
94
Views
14K
  • · Replies 5 ·
Replies
5
Views
5K
  • · Replies 2 ·
Replies
2
Views
3K
  • · Replies 7 ·
Replies
7
Views
4K
  • · Replies 9 ·
Replies
9
Views
4K