Matches in SemOpenAlex for { <https://semopenalex.org/work/W2892487830> ?p ?o ?g. }
- W2892487830 abstract "Abstract The ability to routinely design functional proteins, in a targeted manner, would have enormous implications for biomedical research and therapeutic development. Computational protein design (CPD) offers the potential to fulfill this need, and though recent years have brought considerable progress in the field, major limitations remain. Current state-of-the-art approaches to CPD aim to capture the determinants of structure from physical principles. While this has led to many successful designs, it does have strong limitations associated with inaccuracies in physical modeling, such that a robust general solution to CPD has yet to be found. Here we propose a fundamentally novel design framework—one based on identifying and applying patterns of sequence-structure compatibility found in known proteins, rather than approximating them from models of inter-atomic interactions. Specifically, we systematically decompose the target structure to be designed into structural building blocks we call TERMs (tertiary motifs) and use rapid structure search against the Protein Data Bank (PDB) to identify sequence patterns associated with each TERM from known protein structures that contain it. These results are then combined to produce a sequence-level pseudo-energy model that can score any sequence for compatibility with the target structure. This model can then be used to extract the optimal-scoring sequence via combinatorial optimization or otherwise sample the sequence space predicted to be well compatible with folding to the target. Here we carry out extensive computational analyses, showing that our method, which we dub dTERMen (design with TERM energies): 1) produces native-like sequences given native crystallographic or NMR backbones, 2) produces sequence-structure compatibility scores that correlate with thermodynamic stability, and 3) is able to predict experimental success of designed sequences generated with other methods, and 4) designs sequences that are found to fold to the desired target by structure prediction more frequently than sequences designed with an atomistic method. As an experimental validation of dTERMen, we perform a total surface redesign of Red Fluorescent Protein mCherry, marking a total of 64 residues as variable. The single sequence identified as optimal by dTERMen harbors 48 mutations relative to mCherry, but nevertheless folds, is monomeric in solution, exhibits similar stability to chemical denaturation as mCherry, and even preserves the fluorescence property. Our results strongly argue that the PDB is now sufficiently large to enable proteins to be designed by using only examples of structural motifs from unrelated proteins. This is highly significant, given that the structural database will only continue to grow, and signals the possibility of a whole host of novel data-driven CPD methods. Because such methods are likely to have orthogonal strengths relative to existing techniques, they could represent an important step towards removing remaining barriers to robust CPD." @default.
- W2892487830 created "2018-10-05" @default.
- W2892487830 creator A5000219088 @default.
- W2892487830 creator A5031865026 @default.
- W2892487830 creator A5055760120 @default.
- W2892487830 date "2018-10-01" @default.
- W2892487830 modified "2023-09-26" @default.
- W2892487830 title "A general-purpose protein design framework based on mining sequence-structure relationships in known protein structures" @default.
- W2892487830 cites W1547701085 @default.
- W2892487830 cites W1721447212 @default.
- W2892487830 cites W1756401784 @default.
- W2892487830 cites W1909149576 @default.
- W2892487830 cites W1923888192 @default.
- W2892487830 cites W1968673134 @default.
- W2892487830 cites W1969644422 @default.
- W2892487830 cites W1974312616 @default.
- W2892487830 cites W1976258326 @default.
- W2892487830 cites W1980703922 @default.
- W2892487830 cites W1984148408 @default.
- W2892487830 cites W1987134040 @default.
- W2892487830 cites W1992426410 @default.
- W2892487830 cites W1994802225 @default.
- W2892487830 cites W1998704463 @default.
- W2892487830 cites W2009365528 @default.
- W2892487830 cites W2011658408 @default.
- W2892487830 cites W2012987401 @default.
- W2892487830 cites W2014245863 @default.
- W2892487830 cites W2038139711 @default.
- W2892487830 cites W2053983828 @default.
- W2892487830 cites W2054525948 @default.
- W2892487830 cites W2058001460 @default.
- W2892487830 cites W2060757799 @default.
- W2892487830 cites W2065247480 @default.
- W2892487830 cites W2074655761 @default.
- W2892487830 cites W2085497102 @default.
- W2892487830 cites W2089473150 @default.
- W2892487830 cites W2092087990 @default.
- W2892487830 cites W2092364334 @default.
- W2892487830 cites W2096648003 @default.
- W2892487830 cites W2101220662 @default.
- W2892487830 cites W2102245393 @default.
- W2892487830 cites W2104720751 @default.
- W2892487830 cites W2105417856 @default.
- W2892487830 cites W2106146363 @default.
- W2892487830 cites W2108664112 @default.
- W2892487830 cites W2111686080 @default.
- W2892487830 cites W2113528117 @default.
- W2892487830 cites W2121627241 @default.
- W2892487830 cites W2126570516 @default.
- W2892487830 cites W2134059200 @default.
- W2892487830 cites W2134731229 @default.
- W2892487830 cites W2135815512 @default.
- W2892487830 cites W2139881678 @default.
- W2892487830 cites W2140831051 @default.
- W2892487830 cites W2141920771 @default.
- W2892487830 cites W2142134255 @default.
- W2892487830 cites W2148165483 @default.
- W2892487830 cites W2150806079 @default.
- W2892487830 cites W2161151688 @default.
- W2892487830 cites W2170060741 @default.
- W2892487830 cites W2173280871 @default.
- W2892487830 cites W2201713963 @default.
- W2892487830 cites W2286345835 @default.
- W2892487830 cites W2340987618 @default.
- W2892487830 cites W2466231879 @default.
- W2892487830 cites W2511393753 @default.
- W2892487830 cites W2519539312 @default.
- W2892487830 cites W2537114341 @default.
- W2892487830 cites W2548003600 @default.
- W2892487830 cites W2557252183 @default.
- W2892487830 cites W2557450030 @default.
- W2892487830 cites W2558272290 @default.
- W2892487830 cites W2559539924 @default.
- W2892487830 cites W2607853144 @default.
- W2892487830 cites W2617533773 @default.
- W2892487830 cites W2735621019 @default.
- W2892487830 cites W2763971786 @default.
- W2892487830 cites W2774178591 @default.
- W2892487830 cites W2778365038 @default.
- W2892487830 cites W2785273668 @default.
- W2892487830 cites W2790686372 @default.
- W2892487830 cites W2798528327 @default.
- W2892487830 cites W3047703793 @default.
- W2892487830 cites W4236236547 @default.
- W2892487830 cites W4251787227 @default.
- W2892487830 cites W834977926 @default.
- W2892487830 doi "https://doi.org/10.1101/431635" @default.
- W2892487830 hasPublicationYear "2018" @default.
- W2892487830 type Work @default.
- W2892487830 sameAs 2892487830 @default.
- W2892487830 citedByCount "2" @default.
- W2892487830 countsByYear W28924878302018 @default.
- W2892487830 countsByYear W28924878302023 @default.
- W2892487830 crossrefType "posted-content" @default.
- W2892487830 hasAuthorship W2892487830A5000219088 @default.
- W2892487830 hasAuthorship W2892487830A5031865026 @default.
- W2892487830 hasAuthorship W2892487830A5055760120 @default.
- W2892487830 hasBestOaLocation W28924878301 @default.
- W2892487830 hasConcept C104317684 @default.
- W2892487830 hasConcept C119145174 @default.