Matches in SemOpenAlex for { <https://semopenalex.org/work/W1945828453> ?p ?o ?g. }
- W1945828453 abstract "The evolutionary trajectory of a protein through sequence space is constrained by function and three-dimensional (3D) structure. Residues in spatial proximity tend to co-evolve, yet attempts to invert the evolutionary record to identify these constraints and use them to computationally fold proteins have so far been unsuccessful. Here, we show that co-variation of residue pairs, observed in a large protein family, provides sufficient information to determine 3D protein structure. Using a data-constrained maximum entropy model of the multiple sequence alignment, we identify pairs of statistically coupled residue positions which are expected to be close in the protein fold, termed contacts inferred from evolutionary information (EICs). To assess the amount of information about the protein fold contained in these coupled pairs, we evaluate the accuracy of predicted 3D structures for proteins of 50-260 residues, from 15 diverse protein families, including a G-protein coupled receptor. These structure predictions are de novo, i.e., they do not use homology modeling or sequence-similar fragments from known structures. The resulting low Cα-RMSD error range of 2.7-5.1 Å, over at least 75% of the protein, indicates the potential for predicting essentially correct 3D structures for the thousands of protein families that have no known structure, provided they include a sufficiently large number of divergent sample sequences. With the current enormous growth in sequence information based on new sequencing technology, this opens the door to a comprehensive survey of protein 3D structures, including many not currently accessible to the experimental methods of structural genomics. This advance has potential applications in many biological contexts, such as synthetic biology, identification of functional sites in proteins and interpretation of the functional impact of genetic variants." @default.
- W1945828453 created "2016-06-24" @default.
- W1945828453 creator A5008166270 @default.
- W1945828453 creator A5010467904 @default.
- W1945828453 creator A5011435835 @default.
- W1945828453 creator A5022191527 @default.
- W1945828453 creator A5041685522 @default.
- W1945828453 creator A5046017741 @default.
- W1945828453 creator A5082475080 @default.
- W1945828453 date "2011-10-25" @default.
- W1945828453 modified "2023-09-27" @default.
- W1945828453 title "3D Protein Structure Predicted from Sequence" @default.
- W1945828453 cites W1514077080 @default.
- W1945828453 cites W1791999417 @default.
- W1945828453 cites W1975622380 @default.
- W1945828453 cites W1976666280 @default.
- W1945828453 cites W1979762151 @default.
- W1945828453 cites W1982420443 @default.
- W1945828453 cites W1985946228 @default.
- W1945828453 cites W1989415425 @default.
- W1945828453 cites W1990700294 @default.
- W1945828453 cites W1995017064 @default.
- W1945828453 cites W2004461664 @default.
- W1945828453 cites W2006333907 @default.
- W1945828453 cites W2011399646 @default.
- W1945828453 cites W2015470526 @default.
- W1945828453 cites W2021769301 @default.
- W1945828453 cites W2025018109 @default.
- W1945828453 cites W2033806446 @default.
- W1945828453 cites W2034557959 @default.
- W1945828453 cites W2037312364 @default.
- W1945828453 cites W2042661033 @default.
- W1945828453 cites W2049617698 @default.
- W1945828453 cites W2056265093 @default.
- W1945828453 cites W2059567258 @default.
- W1945828453 cites W2072843928 @default.
- W1945828453 cites W2073758233 @default.
- W1945828453 cites W2075460992 @default.
- W1945828453 cites W2079871896 @default.
- W1945828453 cites W2088638514 @default.
- W1945828453 cites W2089035513 @default.
- W1945828453 cites W2091194233 @default.
- W1945828453 cites W2091582418 @default.
- W1945828453 cites W2092024874 @default.
- W1945828453 cites W2093769098 @default.
- W1945828453 cites W2096863352 @default.
- W1945828453 cites W2098138211 @default.
- W1945828453 cites W2099667027 @default.
- W1945828453 cites W2101052277 @default.
- W1945828453 cites W2101348065 @default.
- W1945828453 cites W2109091716 @default.
- W1945828453 cites W2109839728 @default.
- W1945828453 cites W2110483430 @default.
- W1945828453 cites W2111010158 @default.
- W1945828453 cites W2113178668 @default.
- W1945828453 cites W2113358325 @default.
- W1945828453 cites W2114115133 @default.
- W1945828453 cites W2117020466 @default.
- W1945828453 cites W2118756701 @default.
- W1945828453 cites W2120549641 @default.
- W1945828453 cites W2126679079 @default.
- W1945828453 cites W2135509505 @default.
- W1945828453 cites W2136624897 @default.
- W1945828453 cites W2137965757 @default.
- W1945828453 cites W2138059406 @default.
- W1945828453 cites W2138551420 @default.
- W1945828453 cites W2141718854 @default.
- W1945828453 cites W2141885858 @default.
- W1945828453 cites W2141920824 @default.
- W1945828453 cites W2147238273 @default.
- W1945828453 cites W2148171419 @default.
- W1945828453 cites W2151457629 @default.
- W1945828453 cites W2154083418 @default.
- W1945828453 cites W2157938277 @default.
- W1945828453 cites W2158580600 @default.
- W1945828453 cites W2161151688 @default.
- W1945828453 cites W2171641243 @default.
- W1945828453 cites W3098888484 @default.
- W1945828453 hasPublicationYear "2011" @default.
- W1945828453 type Work @default.
- W1945828453 sameAs 1945828453 @default.
- W1945828453 citedByCount "1" @default.
- W1945828453 countsByYear W19458284532012 @default.
- W1945828453 crossrefType "posted-content" @default.
- W1945828453 hasAuthorship W1945828453A5008166270 @default.
- W1945828453 hasAuthorship W1945828453A5010467904 @default.
- W1945828453 hasAuthorship W1945828453A5011435835 @default.
- W1945828453 hasAuthorship W1945828453A5022191527 @default.
- W1945828453 hasAuthorship W1945828453A5041685522 @default.
- W1945828453 hasAuthorship W1945828453A5046017741 @default.
- W1945828453 hasAuthorship W1945828453A5082475080 @default.
- W1945828453 hasConcept C10010492 @default.
- W1945828453 hasConcept C104317684 @default.
- W1945828453 hasConcept C167625842 @default.
- W1945828453 hasConcept C169627665 @default.
- W1945828453 hasConcept C171897839 @default.
- W1945828453 hasConcept C18051474 @default.
- W1945828453 hasConcept C181199279 @default.
- W1945828453 hasConcept C2778112365 @default.
- W1945828453 hasConcept C41008148 @default.