Matches in SemOpenAlex for { <https://semopenalex.org/work/W3045820076> ?p ?o ?g. }
- W3045820076 endingPage "1700" @default.
- W3045820076 startingPage "1688" @default.
- W3045820076 abstract "High divergence in protein sequences makes the detection of distant protein relationships through homology‐based approaches challenging. Grouping protein sequences into families, through similarities in either sequence or 3‐D structure, facilitates in the improved recognition of protein relationships. In addition, strategically designed protein‐like sequences have been shown to bridge distant structural domain families by serving as artificial linkers. In this study, we have augmented a search database of known protein domain families with such designed sequences, with the intention of providing functional clues to domain families of unknown structure. When assessed using representative query sequences from each family, we obtain a success rate of 94% in protein domain families of known structure. Further, we demonstrate that the augmented search space enabled fold recognition for 582 families with no structural information available a priori. Additionally, we were able to provide reliable functional relationships for 610 orphan families. We discuss the application of our method in predicting functional roles through select examples for DUF4922, DUF5131, and DUF5085. Our approach also detects new associations between families that were previously not known to be related, as demonstrated through new sub‐groups of the RNA polymerase domain among three distinct RNA viruses. Taken together, designed sequences‐augmented search databases direct the detection of meaningful relationships between distant protein families. In turn, they enable fold recognition and offer reliable pointers to potential functional sites that may be probed further through direct mutagenesis studies." @default.
- W3045820076 created "2020-08-03" @default.
- W3045820076 creator A5044528301 @default.
- W3045820076 creator A5076920718 @default.
- W3045820076 date "2020-08-31" @default.
- W3045820076 modified "2023-09-26" @default.
- W3045820076 title "Artificial protein sequences enable recognition of vicinal and distant protein functional relationships" @default.
- W3045820076 cites W1711482514 @default.
- W3045820076 cites W1803102843 @default.
- W3045820076 cites W1964219644 @default.
- W3045820076 cites W1965827393 @default.
- W3045820076 cites W1977316565 @default.
- W3045820076 cites W1985127008 @default.
- W3045820076 cites W1986155816 @default.
- W3045820076 cites W1994025789 @default.
- W3045820076 cites W1997110152 @default.
- W3045820076 cites W2015530520 @default.
- W3045820076 cites W2045564656 @default.
- W3045820076 cites W2055043387 @default.
- W3045820076 cites W2055264790 @default.
- W3045820076 cites W2066142272 @default.
- W3045820076 cites W2069458148 @default.
- W3045820076 cites W2076165098 @default.
- W3045820076 cites W2079914556 @default.
- W3045820076 cites W2092018396 @default.
- W3045820076 cites W2096250458 @default.
- W3045820076 cites W2097270746 @default.
- W3045820076 cites W2100033550 @default.
- W3045820076 cites W2102424043 @default.
- W3045820076 cites W2108067237 @default.
- W3045820076 cites W2110156441 @default.
- W3045820076 cites W2110668908 @default.
- W3045820076 cites W2115138247 @default.
- W3045820076 cites W2116423958 @default.
- W3045820076 cites W2124871329 @default.
- W3045820076 cites W2133312664 @default.
- W3045820076 cites W2136280642 @default.
- W3045820076 cites W2138122982 @default.
- W3045820076 cites W2143705344 @default.
- W3045820076 cites W2144813419 @default.
- W3045820076 cites W2145268834 @default.
- W3045820076 cites W2145931647 @default.
- W3045820076 cites W2147840025 @default.
- W3045820076 cites W2157484293 @default.
- W3045820076 cites W2157925921 @default.
- W3045820076 cites W2158714788 @default.
- W3045820076 cites W2159679199 @default.
- W3045820076 cites W2166372707 @default.
- W3045820076 cites W2168974130 @default.
- W3045820076 cites W2224056471 @default.
- W3045820076 cites W2305946779 @default.
- W3045820076 cites W2332848608 @default.
- W3045820076 cites W2426189081 @default.
- W3045820076 cites W2518510166 @default.
- W3045820076 cites W2620385970 @default.
- W3045820076 cites W2800194647 @default.
- W3045820076 cites W2898402099 @default.
- W3045820076 cites W2900701906 @default.
- W3045820076 cites W2902353954 @default.
- W3045820076 cites W2903675822 @default.
- W3045820076 cites W2915932155 @default.
- W3045820076 cites W2964233912 @default.
- W3045820076 doi "https://doi.org/10.1002/prot.25986" @default.
- W3045820076 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/32725917" @default.
- W3045820076 hasPublicationYear "2020" @default.
- W3045820076 type Work @default.
- W3045820076 sameAs 3045820076 @default.
- W3045820076 citedByCount "1" @default.
- W3045820076 countsByYear W30458200762022 @default.
- W3045820076 crossrefType "journal-article" @default.
- W3045820076 hasAuthorship W3045820076A5044528301 @default.
- W3045820076 hasAuthorship W3045820076A5076920718 @default.
- W3045820076 hasConcept C104317684 @default.
- W3045820076 hasConcept C134306372 @default.
- W3045820076 hasConcept C136475424 @default.
- W3045820076 hasConcept C144292202 @default.
- W3045820076 hasConcept C167625842 @default.
- W3045820076 hasConcept C171897839 @default.
- W3045820076 hasConcept C200307862 @default.
- W3045820076 hasConcept C33923547 @default.
- W3045820076 hasConcept C36503486 @default.
- W3045820076 hasConcept C41008148 @default.
- W3045820076 hasConcept C41584329 @default.
- W3045820076 hasConcept C45484198 @default.
- W3045820076 hasConcept C4668613 @default.
- W3045820076 hasConcept C47701112 @default.
- W3045820076 hasConcept C54355233 @default.
- W3045820076 hasConcept C55493867 @default.
- W3045820076 hasConcept C70721500 @default.
- W3045820076 hasConcept C86803240 @default.
- W3045820076 hasConceptScore W3045820076C104317684 @default.
- W3045820076 hasConceptScore W3045820076C134306372 @default.
- W3045820076 hasConceptScore W3045820076C136475424 @default.
- W3045820076 hasConceptScore W3045820076C144292202 @default.
- W3045820076 hasConceptScore W3045820076C167625842 @default.
- W3045820076 hasConceptScore W3045820076C171897839 @default.
- W3045820076 hasConceptScore W3045820076C200307862 @default.
- W3045820076 hasConceptScore W3045820076C33923547 @default.