Matches in SemOpenAlex for { <https://semopenalex.org/work/W2076165098> ?p ?o ?g. }
- W2076165098 endingPage "979" @default.
- W2076165098 startingPage "962" @default.
- W2076165098 abstract "Protein functional annotation relies on the identification of accurate relationships, sequence divergence being a key factor. This is especially evident when distant protein relationships are demonstrated only with three-dimensional structures. To address this challenge, we describe a computational approach to purposefully bridge gaps between related protein families through directed design of protein-like “linker” sequences. For this, we represented SCOP domain families, integrated with sequence homologues, as multiple profiles and performed HMM-HMM alignments between related domain families. Where convincing alignments were achieved, we applied a roulette wheel-based method to design 3,611,010 protein-like sequences corresponding to 374 SCOP folds. To analyze their ability to link proteins in homology searches, we used 3024 queries to search two databases, one containing only natural sequences and another one additionally containing designed sequences. Our results showed that augmented database searches showed up to 30% improvement in fold coverage for over 74% of the folds, with 52 folds achieving all theoretically possible connections. Although sequences could not be designed between some families, the availability of designed sequences between other families within the fold established the sequence continuum to demonstrate 373 difficult relationships. Ultimately, as a practical and realistic extension, we demonstrate that such protein-like sequences can be “plugged-into” routine and generic sequence database searches to empower not only remote homology detection but also fold recognition. Our richly statistically supported findings show that complementary searches in both databases will increase the effectiveness of sequence-based searches in recognizing all homologues sharing a common fold." @default.
- W2076165098 created "2016-06-24" @default.
- W2076165098 creator A5057235883 @default.
- W2076165098 creator A5076779054 @default.
- W2076165098 creator A5076920718 @default.
- W2076165098 creator A5085789773 @default.
- W2076165098 date "2014-02-01" @default.
- W2076165098 modified "2023-10-18" @default.
- W2076165098 title "Filling-in Void and Sparse Regions in Protein Sequence Space by Protein-Like Artificial Sequences Enables Remarkable Enhancement in Remote Homology Detection Capability" @default.
- W2076165098 cites W1810634920 @default.
- W2076165098 cites W1969051510 @default.
- W2076165098 cites W1988625445 @default.
- W2076165098 cites W1997110152 @default.
- W2076165098 cites W2015507989 @default.
- W2076165098 cites W2017727500 @default.
- W2076165098 cites W2036149515 @default.
- W2076165098 cites W2036792999 @default.
- W2076165098 cites W2038438679 @default.
- W2076165098 cites W2042033548 @default.
- W2076165098 cites W2051210555 @default.
- W2076165098 cites W2053671774 @default.
- W2076165098 cites W2053756634 @default.
- W2076165098 cites W2060491201 @default.
- W2076165098 cites W2067913741 @default.
- W2076165098 cites W2069458148 @default.
- W2076165098 cites W2076048958 @default.
- W2076165098 cites W2077235131 @default.
- W2076165098 cites W2080336769 @default.
- W2076165098 cites W2085277871 @default.
- W2076165098 cites W2085497102 @default.
- W2076165098 cites W2101335101 @default.
- W2076165098 cites W2105118494 @default.
- W2076165098 cites W2110156441 @default.
- W2076165098 cites W2112380236 @default.
- W2076165098 cites W2120251063 @default.
- W2076165098 cites W2122687161 @default.
- W2076165098 cites W2124871329 @default.
- W2076165098 cites W2126377763 @default.
- W2076165098 cites W2134571902 @default.
- W2076165098 cites W2139240863 @default.
- W2076165098 cites W2143705344 @default.
- W2076165098 cites W2151457629 @default.
- W2076165098 cites W2151831732 @default.
- W2076165098 cites W2156125289 @default.
- W2076165098 cites W2158714788 @default.
- W2076165098 cites W2159675211 @default.
- W2076165098 cites W2163940928 @default.
- W2076165098 cites W2164684045 @default.
- W2076165098 cites W3147254695 @default.
- W2076165098 cites W4246643264 @default.
- W2076165098 doi "https://doi.org/10.1016/j.jmb.2013.11.026" @default.
- W2076165098 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/24316367" @default.
- W2076165098 hasPublicationYear "2014" @default.
- W2076165098 type Work @default.
- W2076165098 sameAs 2076165098 @default.
- W2076165098 citedByCount "13" @default.
- W2076165098 countsByYear W20761650982014 @default.
- W2076165098 countsByYear W20761650982015 @default.
- W2076165098 countsByYear W20761650982016 @default.
- W2076165098 countsByYear W20761650982018 @default.
- W2076165098 countsByYear W20761650982020 @default.
- W2076165098 countsByYear W20761650982021 @default.
- W2076165098 countsByYear W20761650982022 @default.
- W2076165098 crossrefType "journal-article" @default.
- W2076165098 hasAuthorship W2076165098A5057235883 @default.
- W2076165098 hasAuthorship W2076165098A5076779054 @default.
- W2076165098 hasAuthorship W2076165098A5076920718 @default.
- W2076165098 hasAuthorship W2076165098A5085789773 @default.
- W2076165098 hasConcept C10010492 @default.
- W2076165098 hasConcept C104317684 @default.
- W2076165098 hasConcept C105082737 @default.
- W2076165098 hasConcept C136475424 @default.
- W2076165098 hasConcept C144292202 @default.
- W2076165098 hasConcept C154945302 @default.
- W2076165098 hasConcept C165525559 @default.
- W2076165098 hasConcept C167625842 @default.
- W2076165098 hasConcept C169627665 @default.
- W2076165098 hasConcept C171897839 @default.
- W2076165098 hasConcept C181199279 @default.
- W2076165098 hasConcept C23224414 @default.
- W2076165098 hasConcept C2776321320 @default.
- W2076165098 hasConcept C2778112365 @default.
- W2076165098 hasConcept C41008148 @default.
- W2076165098 hasConcept C41584329 @default.
- W2076165098 hasConcept C45484198 @default.
- W2076165098 hasConcept C4668613 @default.
- W2076165098 hasConcept C47701112 @default.
- W2076165098 hasConcept C54355233 @default.
- W2076165098 hasConcept C55493867 @default.
- W2076165098 hasConcept C58773245 @default.
- W2076165098 hasConcept C70721500 @default.
- W2076165098 hasConcept C86803240 @default.
- W2076165098 hasConceptScore W2076165098C10010492 @default.
- W2076165098 hasConceptScore W2076165098C104317684 @default.
- W2076165098 hasConceptScore W2076165098C105082737 @default.
- W2076165098 hasConceptScore W2076165098C136475424 @default.
- W2076165098 hasConceptScore W2076165098C144292202 @default.
- W2076165098 hasConceptScore W2076165098C154945302 @default.