Matches in SemOpenAlex for { <https://semopenalex.org/work/W2153193090> ?p ?o ?g. }
- W2153193090 endingPage "802" @default.
- W2153193090 startingPage "793" @default.
- W2153193090 abstract "Structure comparisons of all representative proteins have been done. Employing the relative root mean square deviation (RMSD) from native enables the assessment of the statistical significance of structure alignments of different lengths in terms of a Z-score. Two conclusions emerge: first, proteins with their native fold can be distinguished by their Z-score. Second and somewhat surprising, all small proteins up to 100 residues in length have significant structure alignments to other proteins in a different secondary structure and fold class; i.e. 24.0% of them have 60% coverage by a template protein with a RMSD below 3.5 Å and 6.0% have 70% coverage. If the restriction that we align proteins only having different secondary structure types is removed, then in a representative benchmark set of proteins of 200 residues or smaller, 93% can be aligned to a single template structure (with average sequence identity of 9.8%), with a RMSD less than 4 Å, and 79% average coverage. In this sense, the current Protein Data Bank (PDB) is almost a covering set of small protein structures. The length of the aligned region (relative to the whole protein length) does not differ among the top hit proteins, indicating that protein structure space is highly dense. For larger proteins, non-related proteins can cover a significant portion of the structure. Moreover, these top hit proteins are aligned to different parts of the target protein, so that almost the entire molecule can be covered when combined. The number of proteins required to cover a target protein is very small, e.g. the top ten hit proteins can give 90% coverage below a RMSD of 3.5 Å for proteins up to 320 residues long. These results give a new view of the nature of protein structure space, and its implications for protein structure prediction are discussed." @default.
- W2153193090 created "2016-06-24" @default.
- W2153193090 creator A5010808261 @default.
- W2153193090 creator A5062293219 @default.
- W2153193090 date "2003-12-01" @default.
- W2153193090 modified "2023-09-26" @default.
- W2153193090 title "The PDB is a Covering Set of Small Protein Structures" @default.
- W2153193090 cites W1979147581 @default.
- W2153193090 cites W1991679665 @default.
- W2153193090 cites W1992558926 @default.
- W2153193090 cites W1993273781 @default.
- W2153193090 cites W1993279060 @default.
- W2153193090 cites W1996073320 @default.
- W2153193090 cites W2004646218 @default.
- W2153193090 cites W2008922668 @default.
- W2153193090 cites W2011235285 @default.
- W2153193090 cites W2013507130 @default.
- W2153193090 cites W2014734789 @default.
- W2153193090 cites W2015292449 @default.
- W2153193090 cites W2022015698 @default.
- W2153193090 cites W2022058405 @default.
- W2153193090 cites W2023790301 @default.
- W2153193090 cites W2024755629 @default.
- W2153193090 cites W2034822406 @default.
- W2153193090 cites W2050330741 @default.
- W2153193090 cites W2053281575 @default.
- W2153193090 cites W2055640060 @default.
- W2153193090 cites W2061693113 @default.
- W2153193090 cites W2062327179 @default.
- W2153193090 cites W2065283382 @default.
- W2153193090 cites W2071818582 @default.
- W2153193090 cites W2072375395 @default.
- W2153193090 cites W2097785476 @default.
- W2153193090 cites W2108067237 @default.
- W2153193090 cites W2108107014 @default.
- W2153193090 cites W2112615307 @default.
- W2153193090 cites W2118581335 @default.
- W2153193090 cites W2122520496 @default.
- W2153193090 cites W2124506510 @default.
- W2153193090 cites W2130479394 @default.
- W2153193090 cites W2133307208 @default.
- W2153193090 cites W2137736208 @default.
- W2153193090 cites W2144483796 @default.
- W2153193090 cites W2152326664 @default.
- W2153193090 cites W2153187042 @default.
- W2153193090 cites W2155479906 @default.
- W2153193090 cites W2155480785 @default.
- W2153193090 cites W4246356731 @default.
- W2153193090 doi "https://doi.org/10.1016/j.jmb.2003.10.027" @default.
- W2153193090 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/14636603" @default.
- W2153193090 hasPublicationYear "2003" @default.
- W2153193090 type Work @default.
- W2153193090 sameAs 2153193090 @default.
- W2153193090 citedByCount "108" @default.
- W2153193090 countsByYear W21531930902012 @default.
- W2153193090 countsByYear W21531930902013 @default.
- W2153193090 countsByYear W21531930902014 @default.
- W2153193090 countsByYear W21531930902015 @default.
- W2153193090 countsByYear W21531930902016 @default.
- W2153193090 countsByYear W21531930902017 @default.
- W2153193090 countsByYear W21531930902018 @default.
- W2153193090 countsByYear W21531930902019 @default.
- W2153193090 countsByYear W21531930902020 @default.
- W2153193090 countsByYear W21531930902021 @default.
- W2153193090 countsByYear W21531930902022 @default.
- W2153193090 crossrefType "journal-article" @default.
- W2153193090 hasAuthorship W2153193090A5010808261 @default.
- W2153193090 hasAuthorship W2153193090A5062293219 @default.
- W2153193090 hasBestOaLocation W21531930902 @default.
- W2153193090 hasConcept C104317684 @default.
- W2153193090 hasConcept C119145174 @default.
- W2153193090 hasConcept C167625842 @default.
- W2153193090 hasConcept C18051474 @default.
- W2153193090 hasConcept C185592680 @default.
- W2153193090 hasConcept C200307862 @default.
- W2153193090 hasConcept C33923547 @default.
- W2153193090 hasConcept C45484198 @default.
- W2153193090 hasConcept C4668613 @default.
- W2153193090 hasConcept C47701112 @default.
- W2153193090 hasConcept C55493867 @default.
- W2153193090 hasConcept C62614982 @default.
- W2153193090 hasConcept C65556437 @default.
- W2153193090 hasConcept C70721500 @default.
- W2153193090 hasConcept C8010536 @default.
- W2153193090 hasConcept C86803240 @default.
- W2153193090 hasConceptScore W2153193090C104317684 @default.
- W2153193090 hasConceptScore W2153193090C119145174 @default.
- W2153193090 hasConceptScore W2153193090C167625842 @default.
- W2153193090 hasConceptScore W2153193090C18051474 @default.
- W2153193090 hasConceptScore W2153193090C185592680 @default.
- W2153193090 hasConceptScore W2153193090C200307862 @default.
- W2153193090 hasConceptScore W2153193090C33923547 @default.
- W2153193090 hasConceptScore W2153193090C45484198 @default.
- W2153193090 hasConceptScore W2153193090C4668613 @default.
- W2153193090 hasConceptScore W2153193090C47701112 @default.
- W2153193090 hasConceptScore W2153193090C55493867 @default.
- W2153193090 hasConceptScore W2153193090C62614982 @default.
- W2153193090 hasConceptScore W2153193090C65556437 @default.