-
Folding Rate Optimization Promotes Frustrated Interactions in Entangled Protein Structures
Authors:
Federico Norbiato,
Flavio Seno,
Antonio Trovato,
Marco Baiesi
Abstract:
Many native structures of proteins accomodate complex topological motifs such as knots, lassos, and other geometrical entanglements. How proteins can fold quickly even in the presence of such topological obstacles is a debated question in structural biology. Recently, the hypothesis that energetic frustration might be a mechanism to avoid topological frustration has been put forward based on the e…
▽ More
Many native structures of proteins accomodate complex topological motifs such as knots, lassos, and other geometrical entanglements. How proteins can fold quickly even in the presence of such topological obstacles is a debated question in structural biology. Recently, the hypothesis that energetic frustration might be a mechanism to avoid topological frustration has been put forward based on the empirical observation that loops involved in entanglements are stabilized by weak interactions between amino-acids at their extrema. To verify this idea, we use a toy lattice model for the folding of proteins into two almost identical structures, one entangled and one not. As expected, the folding time is longer when random sequences folds into the entangled structure. This holds also under an evolutionary pressure simulated by optimizing the folding time. It turns out that optmized protein sequences in the entangled structure are in fact characterized by frustrated interactions at the closures of entangled loops. This phenomenon is much less enhanced in the control case where the entanglement is not present. Our findings, which are in agreement with experimental observations, corroborate the idea that an evolutionary pressure shapes the folding funnel to avoid topological and kinetic traps.
△ Less
Submitted 19 November, 2019;
originally announced November 2019.
-
Sequence and structural patterns detected in entangled proteins reveal the importance of co-translational folding
Authors:
Marco Baiesi,
Enzo Orlandini,
Flavio Seno,
Antonio Trovato
Abstract:
Proteins must fold quickly to acquire their biologically functional three-dimensional native structures. Hence, these are mainly stabilized by local contacts, while intricate topologies such as knots are rare. Here, we reveal the existence of specific patterns adopted by protein sequences and structures to deal with backbone self-entanglement. A large scale analysis of the Protein Data Bank shows…
▽ More
Proteins must fold quickly to acquire their biologically functional three-dimensional native structures. Hence, these are mainly stabilized by local contacts, while intricate topologies such as knots are rare. Here, we reveal the existence of specific patterns adopted by protein sequences and structures to deal with backbone self-entanglement. A large scale analysis of the Protein Data Bank shows that loops significantly intertwined with another chain portion are typically closed by weakly bound amino acids. Why is this energetic frustration maintained? A possible picture is that entangled loops are formed only toward the end of the folding process to avoid kinetic traps. Consistently, these loops are more frequently found to be wrapped around a portion of the chain on their N-terminal side, the one translated earlier at the ribosome. Finally, these motifs are less abundant in natural native states than in simulated protein-like structures, yet they appear in 32% of proteins, which in some cases display an amazingly complex intertwining.
△ Less
Submitted 31 May, 2019; v1 submitted 6 September, 2018;
originally announced September 2018.
-
Exploring the correlation between the folding rates of proteins and the entanglement of their native states
Authors:
Marco Baiesi,
Enzo Orlandini,
Flavio Seno,
Antonio Trovato
Abstract:
The folding of a protein towards its native state is a rather complicated process. However there are empirical evidences that the folding time correlates with the contact order, a simple measure of the spatial organisation of the native state of the protein. Contact order is related to the average length of the main chain loops formed by amino acids which are in contact. Here we argue that folding…
▽ More
The folding of a protein towards its native state is a rather complicated process. However there are empirical evidences that the folding time correlates with the contact order, a simple measure of the spatial organisation of the native state of the protein. Contact order is related to the average length of the main chain loops formed by amino acids which are in contact. Here we argue that folding kinetics can be influenced also by the entanglement that loops may undergo within the overall three dimensional protein structure. In order to explore such possibility, we introduce a novel descriptor, which we call "maximum intrachain contact entanglement". Specifically, we measure the maximum Gaussian entanglement between any looped portion of a protein and any other non-overlapping subchain of the same protein, which is easily computed by discretized line integrals on the coordinates of the $C_α$ atoms. By analyzing experimental data sets of two-state and multistate folders, we show that also the new index is a good predictor of the folding rate. Moreover, being only partially correlated with previous methods, it can be integrated with them to yield more accurate predictions.
△ Less
Submitted 26 October, 2017; v1 submitted 6 September, 2017;
originally announced September 2017.
-
Linking in domain-swapped protein dimers
Authors:
Marco Baiesi,
Enzo Orlandini,
Antonio Trovato,
Flavio Seno
Abstract:
The presence of knots has been observed in a small fraction of single-domain proteins and related to their thermodynamic and kinetic properties. The exchanging of identical structural elements, typical of domain-swapped proteins, make such dimers suitable candidates to validate the possibility that mutual entanglement between chains may play a similar role for protein complexes. We suggest that su…
▽ More
The presence of knots has been observed in a small fraction of single-domain proteins and related to their thermodynamic and kinetic properties. The exchanging of identical structural elements, typical of domain-swapped proteins, make such dimers suitable candidates to validate the possibility that mutual entanglement between chains may play a similar role for protein complexes. We suggest that such entanglement is captured by the linking number. This represents, for two closed curves, the number of times that each curve winds around the other. We show that closing the curves is not necessary, as a novel parameter $G'$, termed Gaussian entanglement, is strongly correlated with the linking number. Based on $110$ non redundant domain-swapped dimers, our analysis evidences a high fraction of chains with a significant intertwining, that is with $|G'| > 1$. We report that Nature promotes configurations with negative mutual entanglement and surprisingly, it seems to suppress intertwining in long protein dimers. Supported by numerical simulations of dimer dissociation, our results provide a novel topology-based classification of protein-swapped dimers together with some preliminary evidence of its impact on their physical and biological properties.
△ Less
Submitted 30 August, 2016; v1 submitted 5 July, 2016;
originally announced July 2016.
-
Protein sequence and structure: Is one more fundamental than the other?
Authors:
Jayanth R. Banavar,
Trinh X. Hoang,
Flavio Seno,
Antonio Trovato,
Amos Maritan
Abstract:
We argue that protein native state structures reside in a novel "phase" of matter which confers on proteins their many amazing characteristics. This phase arises from the common features of all globular proteins and is characterized by a sequence-independent free energy landscape with relatively few low energy minima with funnel-like character. The choice of a sequence that fits well into one of t…
▽ More
We argue that protein native state structures reside in a novel "phase" of matter which confers on proteins their many amazing characteristics. This phase arises from the common features of all globular proteins and is characterized by a sequence-independent free energy landscape with relatively few low energy minima with funnel-like character. The choice of a sequence that fits well into one of these predetermined structures facilitates rapid and cooperative folding. Our model calculations show that this novel phase facilitates the formation of an efficient route for sequence design starting from random peptides.
△ Less
Submitted 12 April, 2012;
originally announced April 2012.
-
Fibril elongation mechanisms of HET-s prion-forming domain: Topological evidence for growth polarity
Authors:
Marco Baiesi,
Flavio Seno,
Antonio Trovato
Abstract:
The prion-forming C-terminal domain of the fungal prion HET-s forms infectious amyloid fibrils at physiological pH. The conformational switch from the non-prion soluble form to the prion fibrillar form is believed to have a functional role, since HET-s in its prion form participates in a recognition process of different fungal strains. Based on the knowledge of the high-resolution structure of HET…
▽ More
The prion-forming C-terminal domain of the fungal prion HET-s forms infectious amyloid fibrils at physiological pH. The conformational switch from the non-prion soluble form to the prion fibrillar form is believed to have a functional role, since HET-s in its prion form participates in a recognition process of different fungal strains. Based on the knowledge of the high-resolution structure of HET-s(218-289) (the prion forming-domain) in its fibrillar form, we here present a numerical simulation of the fibril growth process which emphasizes the role of the topological properties of the fibrillar structure. An accurate thermodynamic analysis of the way an intervening HET-s chain is recruited to the tip of the growing fibril suggests that elongation proceeds through a dock and lock mechanism. First, the chain docks onto the fibril by forming the longest $β$-strands. Then, the re-arrangement in the fibrillar form of all the rest of molecule takes place. Interestingly, we predict also that one side of the HET-s fibril is more suitable for substaining its growth with respect to the other. The resulting strong polarity of fibril growth is a consequence of the complex topology of HET-s fibrillar structure, since the central loop of the intervening chain plays a crucially different role in favouring or not the attachment of the C-terminus tail to the fibril, depending on the growth side.
△ Less
Submitted 19 March, 2012;
originally announced March 2012.
-
Simple solvation potential for coarse-grained models of proteins
Authors:
A. Bhattacharyay,
A. Trovato,
F. Seno
Abstract:
We formulate a simple solvation potential based on a coarsed-grain representation of amino acids with two spheres modeling the $C_α$ atom and an effective side-chain centroid. The potential relies on a new method for estimating the buried area of residues, based on counting the effective number of burying neighbours in a suitable way. This latter quantity shows a good correlation with the buried…
▽ More
We formulate a simple solvation potential based on a coarsed-grain representation of amino acids with two spheres modeling the $C_α$ atom and an effective side-chain centroid. The potential relies on a new method for estimating the buried area of residues, based on counting the effective number of burying neighbours in a suitable way. This latter quantity shows a good correlation with the buried area of residues computed from all atom crystallographic structures. We check the discriminatory power of the solvation potential alone to identify the native fold of a protein from a set of decoys and show the potential to be considerably selective.
△ Less
Submitted 22 June, 2006;
originally announced June 2006.
-
Geometry of proteins: hydrogen bonding, sterics and marginally compact tubes
Authors:
Jayanth R. Banavar,
Marek Cieplak,
Alessandro Flammini,
Trinh X. Hoang,
Randall D. Kamien,
Timothy Lezon,
Davide Marenduzzo,
Amos Maritan,
Flavio Seno,
Yehuda Snir,
Antonio Trovato
Abstract:
The functionality of proteins is related to their structure in the native state. Protein structures are made up of emergent building blocks of helices and almost planar sheets. A simple coarse-grained geometrical model of a flexible tube barely subject to compaction provides a unified framework for understanding the common character of globular proteins.We argue that a recent critique of the tub…
▽ More
The functionality of proteins is related to their structure in the native state. Protein structures are made up of emergent building blocks of helices and almost planar sheets. A simple coarse-grained geometrical model of a flexible tube barely subject to compaction provides a unified framework for understanding the common character of globular proteins.We argue that a recent critique of the tube idea is not well founded.
△ Less
Submitted 20 March, 2006; v1 submitted 27 May, 2005;
originally announced May 2005.
-
Geometrical model for the native-state folds of proteins
Authors:
Trinh X. Hoang,
Antonio Trovato,
Flavio Seno,
Jayanth R. Banavar,
Amos Maritan
Abstract:
We recently introduced a physical model [Hoang et al., P. Natl. Acad. Sci. USA (2004), Banavar et al., Phys. Rev. E (2004)] for proteins which incorporates, in an approximate manner, several key features such as the inherent anisotropy of a chain molecule, the geometrical and energetic constraints placed by the hydrogen bonds and sterics, and the role played by hydrophobicity. Within this framew…
▽ More
We recently introduced a physical model [Hoang et al., P. Natl. Acad. Sci. USA (2004), Banavar et al., Phys. Rev. E (2004)] for proteins which incorporates, in an approximate manner, several key features such as the inherent anisotropy of a chain molecule, the geometrical and energetic constraints placed by the hydrogen bonds and sterics, and the role played by hydrophobicity. Within this framework, marginally compact conformations resembling the native state folds of proteins emerge as broad competing minima in the free energy landscape even for a homopolymer. Here we show how the introduction of sequence heterogeneity using a simple scheme of just two types of amino acids, hydrophobic (H) and polar (P), and sequence design allows a selected putative native fold to become the free energy minimum at low temperature. The folding transition exhibits thermodynamic cooperativity, if one neglects the degeneracy between two different low energy conformations sharing the same fold topology.
△ Less
Submitted 17 May, 2005;
originally announced May 2005.
-
Unified perspective on proteins: A physics approach
Authors:
Jayanth R. Banavar,
Trinh Xuan Hoang,
Amos Maritan,
Flavio Seno,
Antonio Trovato
Abstract:
We study a physical system which, while devoid of the complexity one usually associates with proteins, nevertheless displays a remarkable array of protein-like properties. The constructive hypothesis that this striking resemblance is not accidental leads not only to a unified framework for understanding protein folding, amyloid formation and protein interactions but also has implications for nat…
▽ More
We study a physical system which, while devoid of the complexity one usually associates with proteins, nevertheless displays a remarkable array of protein-like properties. The constructive hypothesis that this striking resemblance is not accidental leads not only to a unified framework for understanding protein folding, amyloid formation and protein interactions but also has implications for natural selection.
△ Less
Submitted 12 October, 2004;
originally announced October 2004.
-
Geometry and symmetry presculpt the free-energy landscape of proteins
Authors:
Trinh Xuan Hoang,
Antonio Trovato,
Flavio Seno,
Jayanth R. Banavar,
Amos Maritan
Abstract:
We present a simple physical model which demonstrates that the native state folds of proteins can emerge on the basis of considerations of geometry and symmetry. We show that the inherent anisotropy of a chain molecule, the geometrical and energetic constraints placed by the hydrogen bonds and sterics, and hydrophobicity are sufficient to yield a free energy landscape with broad minima even for…
▽ More
We present a simple physical model which demonstrates that the native state folds of proteins can emerge on the basis of considerations of geometry and symmetry. We show that the inherent anisotropy of a chain molecule, the geometrical and energetic constraints placed by the hydrogen bonds and sterics, and hydrophobicity are sufficient to yield a free energy landscape with broad minima even for a homopolymer. These minima correspond to marginally compact structures comprising the menu of folds that proteins choose from to house their native-states in. Our results provide a general framework for understanding the common characteristics of globular proteins.
△ Less
Submitted 25 May, 2004;
originally announced May 2004.
-
Complete Phase Diagram of DNA Unzipping: Eye, Y-fork and triple point
Authors:
Rajeev Kapri,
Somendra M. Bhattacharjee,
Flavio Seno
Abstract:
We study the unzipping of double stranded DNA (dsDNA) by applying a pulling force at a fraction $s$ $(0 \le s \le 1)$ from the anchored end. From exact analytical and numerical results, the complete phase diagram is presented. The phase diagram shows a strong ensemble dependence for various values of $s$. In addition, we show the existence of an ``eye'' phase and a triple point.
We study the unzipping of double stranded DNA (dsDNA) by applying a pulling force at a fraction $s$ $(0 \le s \le 1)$ from the anchored end. From exact analytical and numerical results, the complete phase diagram is presented. The phase diagram shows a strong ensemble dependence for various values of $s$. In addition, we show the existence of an ``eye'' phase and a triple point.
△ Less
Submitted 8 November, 2004; v1 submitted 31 March, 2004;
originally announced March 2004.
-
A new perspective on the analysis of helix-helix packing preferences in globular proteins
Authors:
A. Trovato,
F. Seno
Abstract:
For many years it had been believed that steric compatibility of helix interfaces could be the source of the observed preference for particular angles between neighbouring helices as emerging from statistical analysis of protein databanks. Several elegant models describing how side chains on helices can interdigitate without steric clashes were able to account quite reasonably for the observed d…
▽ More
For many years it had been believed that steric compatibility of helix interfaces could be the source of the observed preference for particular angles between neighbouring helices as emerging from statistical analysis of protein databanks. Several elegant models describing how side chains on helices can interdigitate without steric clashes were able to account quite reasonably for the observed distributions. However, it was later recognized (Bowie, 1997 and Walther, 1998) that the ``bare'' measured angle distribution should be corrected to avoid statistical bias. Disappointingly, the rescaled distributions dramatically lost their similarity with theoretical predictions casting many doubts on the validity of the geometrical assumptions and models. In this report we elucidate a few points concerning the proper choice of the random reference distribution. In particular we show the existence of crucial corrections due to the correct implementation of the approach used to discriminate whether two helices are in contact or not and to measure their relative orientations. By using this new rescaling, the ``true'' packing angle preferences are well described, even more than with the original ``bare'' distribution, by regular packing models.
△ Less
Submitted 18 April, 2003;
originally announced April 2003.
-
Elucidation of the disulfide folding pathway of hirudin by a topology-based approach
Authors:
C. Micheletti,
V. De Filippis,
A. Maritan,
F. Seno
Abstract:
A theoretical model for the folding of proteins containing disulfide bonds is introduced. The model exploits the knowledge of the native state to favour the progressive establishment of native interactions. At variance with traditional approaches based on native topology, not all native bonds are treated in the same way; in particular, a suitable energy term is introduced to account for the spec…
▽ More
A theoretical model for the folding of proteins containing disulfide bonds is introduced. The model exploits the knowledge of the native state to favour the progressive establishment of native interactions. At variance with traditional approaches based on native topology, not all native bonds are treated in the same way; in particular, a suitable energy term is introduced to account for the special strength of disulfide bonds (irrespective of whether they are native or not) as well as their ability to undergo intra-molecular reshuffling. The model thus possesses the minimal ingredients necessary to investigated the much debated issue of whether the re-folding process occurs through partially structured intermediates with native or non-native disulfide bonds. This strategy is applied to a context of particular interest, the re-folding process of Hirudin, a thrombin-specific protease inhibitor, for which conflicting folding pathways have been proposed. We show that the only two parameters in the model (temperature and disulfide strength) can be tuned to reproduce well a set of experimental transitions between species with different number of formed disulfide. This model is then used to provide a characterisation of the folding process and a detailed description of the species involved in the rate-limiting step of Hirudin refolding.
△ Less
Submitted 28 February, 2003;
originally announced February 2003.
-
Helicase on DNA: A Phase coexistence based mechanism
Authors:
Somendra M. Bhattacharjee,
Flavio Seno
Abstract:
We propose a phase coexistence based mechanism for activity of helicases, ubiquitous enzymes that unwind double stranded DNA. The helicase-DNA complex constitutes a fixed-stretch ensemble that entails a coexistence of domains of zipped and unzipped phases of DNA, separated by a domain wall. The motor action of the helicase leads to a change in the position of the fixed constraint thereby shiftin…
▽ More
We propose a phase coexistence based mechanism for activity of helicases, ubiquitous enzymes that unwind double stranded DNA. The helicase-DNA complex constitutes a fixed-stretch ensemble that entails a coexistence of domains of zipped and unzipped phases of DNA, separated by a domain wall. The motor action of the helicase leads to a change in the position of the fixed constraint thereby shifting the domain wall on dsDNA. We associate this off-equilibrium domain wall motion with the unzipping activity of helicase. We show that this proposal gives a clear and consistent explanation of the main observed features of helicases.
△ Less
Submitted 24 March, 2003; v1 submitted 13 May, 2002;
originally announced May 2002.
-
Mechanical denaturation of DNA: existence of a low temperature denaturation
Authors:
Enzo Orlandini,
Somendra M. Bhattacharjee,
Davide Marenduzzo,
Amos Maritan,
Flavio Seno
Abstract:
Recent theoretical predictions on DNA mechanical separation induced by pulling forces are numerically tested within a model in which self-avoidance for DNA strands is fully taken into account. DNA strands are described by interacting pairs of self avoiding walks (SAW) which are pulled apart by a force applied at the two extremities. The whole phase diagram is spanned by extensive Monte Carlo (MC…
▽ More
Recent theoretical predictions on DNA mechanical separation induced by pulling forces are numerically tested within a model in which self-avoidance for DNA strands is fully taken into account. DNA strands are described by interacting pairs of self avoiding walks (SAW) which are pulled apart by a force applied at the two extremities. The whole phase diagram is spanned by extensive Monte Carlo (MC) simulations and the existence of a low temperature denaturation is confirmed. The basic features of the phase diagram and the re-entrant phase boundary are also obtained with a simple heuristic argument based on an energy-entropy estimate.
△ Less
Submitted 27 September, 2001;
originally announced September 2001.
-
Geometrical aspects of protein folding
Authors:
Jay Banavar,
Amos Maritan,
Cristian Micheletti,
Flavio Seno
Abstract:
These lectures will address two questions. Is there a simple variational principle underlying the existence of secondary motifs in the native state of proteins? Is there a general approach which can qualitatively capture the salient features of the folding process and which may be useful for interpreting and guiding experiments? Here, we present three different approaches to the first question,…
▽ More
These lectures will address two questions. Is there a simple variational principle underlying the existence of secondary motifs in the native state of proteins? Is there a general approach which can qualitatively capture the salient features of the folding process and which may be useful for interpreting and guiding experiments? Here, we present three different approaches to the first question, which demonstrate the key role played by the topology of the native state of proteins. The second question pertaining to the folding dynamics of proteins remains a challenging problem -- a detailed description capturing the interactions between amino acids among each other and with the solvent is a daunting task. We address this issue building on the lessons learned in tackling the first question and apply the resulting method to the folding of various proteins including HIV protease and membrane proteins. The results that will be presented open a fascinating perspective: the two questions appear to be intimately related. The variety of results reported here all provide evidence in favour of the special criteria adopted by nature in the selection of viable protein folds, ranging from optimal compactness to maximum dynamical and geometrical accessibility of the native states.
△ Less
Submitted 10 May, 2001;
originally announced May 2001.
-
Deciphering the folding kinetics of transmembrane helical proteins
Authors:
E. Orlandini,
F. Seno,
J. R. Banavar,
A. Laio,
A. Maritan
Abstract:
Nearly a quarter of genomic sequences and almost half of all receptors that are likely to be targets for drug design are integral membrane proteins. Understanding the detailed mechanisms of the folding of membrane proteins is a largely unsolved, key problem in structural biology. Here, we introduce a general model and use computer simulations to study the equilibrium properties and the folding k…
▽ More
Nearly a quarter of genomic sequences and almost half of all receptors that are likely to be targets for drug design are integral membrane proteins. Understanding the detailed mechanisms of the folding of membrane proteins is a largely unsolved, key problem in structural biology. Here, we introduce a general model and use computer simulations to study the equilibrium properties and the folding kinetics of a $C_α$-based two helix bundle fragment (comprised of 66 amino-acids) of Bacteriorhodopsin. Various intermediates are identified and their free energy are calculated toghether with the free energy barrier between them. In 40% of folding trajectories, the folding rate is considerably increased by the presence of non-obligatory intermediates acting as traps. In all cases, a substantial portion of the helices is rapidly formed. This initial stage is followed by a long period of consolidation of the helices accompanied by their correct packing within the membrane. Our results provide the framework for understanding the variety of folding pathways of helical transmembrane proteins.
△ Less
Submitted 30 March, 2001;
originally announced March 2001.
-
Dynamical scaling of the DNA unzipping transition
Authors:
D. Marenduzzo,
S. M. Bhattacharjee,
A. Maritan,
E. Orlandini,
F. Seno
Abstract:
We report studies of the equilibrium and the dynamics of a general set of lattice models which capture the essence of the force-induced or mechanical DNA unzipping transition. Besides yielding the whole equilibrium phase diagram in the force vs temperature plane, which reveals the presence of an interesting re-entrant unzipping transition for low T, these models enable us to characterize the dyn…
▽ More
We report studies of the equilibrium and the dynamics of a general set of lattice models which capture the essence of the force-induced or mechanical DNA unzipping transition. Besides yielding the whole equilibrium phase diagram in the force vs temperature plane, which reveals the presence of an interesting re-entrant unzipping transition for low T, these models enable us to characterize the dynamics of the process starting from a non-equilibrium initial condition. The thermal melting of the DNA strands displays a model dependent time evolution. On the contrary, our results suggest that the dynamical mechanism for the unzipping by force is very robust and the scaling behaviour does not depend on the details of the description we adopt.
△ Less
Submitted 6 March, 2001;
originally announced March 2001.
-
Learning effective amino acid interactions through iterative stochastic techniques
Authors:
Cristian Micheletti,
Flavio Seno,
Jayanth Banavar,
Amos Maritan
Abstract:
The prediction of the three-dimensional structures of the native state of proteins from the sequences of their amino acids is one of the most important challenges in molecular biology. An essential ingredient to solve this problem within coarse-grained models is the task of deducing effective interaction potentials between the amino acids. Over the years several techniques have been developed to…
▽ More
The prediction of the three-dimensional structures of the native state of proteins from the sequences of their amino acids is one of the most important challenges in molecular biology. An essential ingredient to solve this problem within coarse-grained models is the task of deducing effective interaction potentials between the amino acids. Over the years several techniques have been developed to extract potentials that are able to discriminate satisfactorily between the native and non-native folds of a pre-assigned protein sequence. In general, when these potentials are used in actual dynamical folding simulations, they lead to a drift of the native structure outside the quasi-native basin. In this study, we present and validate an approach to overcome this difficulty. By exploiting several numerical and analytical tools we set up a rigorous iterative scheme to extract potentials satisfying a pre-requisite of any viable potential: the stabilization of proteins within their native basin (less than 3-4 Å$ $ cRMS). The scheme is flexible and is demonstrated to be applicable to a variety of parametrizations of the energy function and provides, in each case, the optimal potentials.
△ Less
Submitted 15 December, 2000;
originally announced December 2000.
-
Recurrent oligomers in proteins - an optimal scheme reconciling accurate and concise backbone representations in automated folding and design studies
Authors:
Cristian Micheletti,
Flavio Seno,
Amos Maritan
Abstract:
A novel scheme is introduced to capture the spatial correlations of consecutive amino acids in naturally occurring proteins. This knowledge-based strategy is able to carry out optimally automated subdivisions of protein fragments into classes of similarity. The goal is to provide the minimal set of protein oligomers (termed ``oligons'' for brevity) that is able to represent any other fragment. A…
▽ More
A novel scheme is introduced to capture the spatial correlations of consecutive amino acids in naturally occurring proteins. This knowledge-based strategy is able to carry out optimally automated subdivisions of protein fragments into classes of similarity. The goal is to provide the minimal set of protein oligomers (termed ``oligons'' for brevity) that is able to represent any other fragment. At variance with previous studies where recurrent local motifs were classified, our concern is to provide simplified protein representations that have been optimised for use in automated folding and/or design attempts. In such contexts it is paramount to limit the number of degrees of freedom per amino acid without incurring in loss of accuracy of structural representations. The suggested method finds, by construction, the optimal compromise between these needs. Several possible oligon lengths are considered. It is shown that meaningful classifications cannot be done for lengths greater than 6 or smaller than 4. Different contexts are considered were oligons of length 5 or 6 are recommendable. With only a few dozen of oligons of such length, virtually any protein can be reproduced within typical experimental uncertainties. Structural data for the oligons is made publicly available.
△ Less
Submitted 19 October, 2000;
originally announced October 2000.
-
Protein structures and optimal folding emerging from a geometrical variational principle
Authors:
Cristian Micheletti,
Jayanth R. Banavar,
Amos Maritan,
Flavio Seno
Abstract:
Novel numerical techniques, validated by an analysis of barnase and chymotrypsin inhibitor, are used to elucidate the paramount role played by the geometry of the protein backbone in steering the folding to the correct native state. It is found that, irrespective of the sequence, the native state of a protein has exceedingly large number of conformations with a given amount of structural overlap…
▽ More
Novel numerical techniques, validated by an analysis of barnase and chymotrypsin inhibitor, are used to elucidate the paramount role played by the geometry of the protein backbone in steering the folding to the correct native state. It is found that, irrespective of the sequence, the native state of a protein has exceedingly large number of conformations with a given amount of structural overlap compared to other compact artificial backbones; moreover the conformational entropies of unrelated proteins of the same length are nearly equal at any given stage of folding. These results are suggestive of an extremality principle underlying protein evolution, which, in turn, is shown to be associated with the emergence of secondary structures.
△ Less
Submitted 17 November, 1998;
originally announced November 1998.
-
Variational approach to protein design and extraction of interaction potentials
Authors:
Flavio Seno,
Cristian Micheletti,
Amos Maritan,
Jayanth R. Banavar
Abstract:
We present and discuss a novel approach to the direct and inverse protein folding problem. The proposed strategy is based on a variational approach that allows the simultaneous extraction of amino acid interactions and the low-temperature free energy of sequences of amino acids. The knowledge-based technique is simple and straightforward to implement even for realistic off-lattice proteins becau…
▽ More
We present and discuss a novel approach to the direct and inverse protein folding problem. The proposed strategy is based on a variational approach that allows the simultaneous extraction of amino acid interactions and the low-temperature free energy of sequences of amino acids. The knowledge-based technique is simple and straightforward to implement even for realistic off-lattice proteins because it does not entail threading-like procedures. Its validity is assessed in the context of a lattice model by means of a variety of stringent checks.
△ Less
Submitted 6 April, 1998;
originally announced April 1998.
-
Determination of Interaction Potentials of Amino Acids from Native Protein Structures: Test on Simple Lattice Models
Authors:
Jort van Mourik,
Cecilia Clementi,
Amos Maritan,
Flavio Seno,
J. R. Banavar
Abstract:
We propose a novel method for the determination of the effective interaction potential between the amino acids of a protein. The strategy is based on the combination of a new optimization procedure and a geometrical argument, which also uncovers the shortcomings of any optimization procedure. The strategy can be applied on any data set of native structures such as those available from the Protei…
▽ More
We propose a novel method for the determination of the effective interaction potential between the amino acids of a protein. The strategy is based on the combination of a new optimization procedure and a geometrical argument, which also uncovers the shortcomings of any optimization procedure. The strategy can be applied on any data set of native structures such as those available from the Protein Data Bank (PDB). In this work, however, we explain and test our approach on simple lattice models, where the true interactions are known a priori. Excellent agreement is obtained between the extracted and the true potentials even for modest numbers of protein structures in the PDB. Comparisons with other methods are also discussed.
△ Less
Submitted 15 January, 1998; v1 submitted 14 January, 1998;
originally announced January 1998.
-
Steric constraints in model proteins
Authors:
Cristian Micheletti,
Jayanth R. Banavar,
Amos Maritan,
Flavio Seno
Abstract:
A simple lattice model for proteins that allows for distinct sizes of the amino acids is presented. The model is found to lead to a significant number of conformations that are the unique ground state of one or more sequences or encodable. Furthermore, several of the encodable structures are highly designable and are the non-degenerate ground state of several sequences. Even though the native st…
▽ More
A simple lattice model for proteins that allows for distinct sizes of the amino acids is presented. The model is found to lead to a significant number of conformations that are the unique ground state of one or more sequences or encodable. Furthermore, several of the encodable structures are highly designable and are the non-degenerate ground state of several sequences. Even though the native state conformations are typically compact, not all compact conformations are encodable. The incorporation of the hydrophobic and polar nature of amino acids further enhances the attractive features of the model.
△ Less
Submitted 23 December, 1997;
originally announced December 1997.
-
Inverse design of proteins with hydrophobic and polar amino acids
Authors:
C. Micheletti,
F. Seno,
A. Maritan,
J. R. Banavar
Abstract:
A two amino acid (hydrophobic and polar) scheme is used to perform the design on target conformations corresponding to the native states of twenty single chain proteins. Strikingly, the percentage of successful identification of the nature of the residues benchmarked against naturally occurring proteins and their homologues is around 75 % independent of the complexity of the design procedure. Ty…
▽ More
A two amino acid (hydrophobic and polar) scheme is used to perform the design on target conformations corresponding to the native states of twenty single chain proteins. Strikingly, the percentage of successful identification of the nature of the residues benchmarked against naturally occurring proteins and their homologues is around 75 % independent of the complexity of the design procedure. Typically, the lowest success rate occurs for residues such as alanine that have a high secondary structure functionality. Using a simple lattice model, we argue that one possible shortcoming of the model studied may involve the coarse-graining of the twenty kinds of amino acids into just two effective types.
△ Less
Submitted 11 December, 1997;
originally announced December 1997.
-
Protein design in a lattice model of hydrophobic and polar amino acids
Authors:
C. Micheletti,
F. Seno,
A. Maritan,
J. R. Banavar
Abstract:
A general strategy is described for finding which amino acid sequences have native states in a desired conformation (inverse design). The approach is used to design sequences of 48 hydrophobic and polar aminoacids on three-dimensional lattice structures. Previous studies employing a sequence-space Monte-Carlo technique resulted in the successful design of one sequence in ten attempts. The presen…
▽ More
A general strategy is described for finding which amino acid sequences have native states in a desired conformation (inverse design). The approach is used to design sequences of 48 hydrophobic and polar aminoacids on three-dimensional lattice structures. Previous studies employing a sequence-space Monte-Carlo technique resulted in the successful design of one sequence in ten attempts. The present work also entails the exploration of conformations that compete significantly with the target structure for being its ground state. The design procedure is successful in all the ten cases.
△ Less
Submitted 21 November, 1997;
originally announced November 1997.