Matches in SemOpenAlex for { <https://semopenalex.org/work/W2100094101> ?p ?o ?g. }
- W2100094101 endingPage "768" @default.
- W2100094101 startingPage "758" @default.
- W2100094101 abstract "Protein function shift can be predicted from sequence comparisons, either using positive selection signals or evolutionary rate estimation. None of the methods have been validated on large datasets, however. Here we investigate existing and novel methods for protein function shift prediction, and benchmark the accuracy against a large dataset of proteins with known enzymatic functions. Function change was predicted between subfamilies by identifying two kinds of sites in a multiple sequence alignment: Conservation‐Shifting Sites (CSS), which are conserved in two subfamilies using two different amino acid types, and Rate‐Shifting Sites (RSS), which have different evolutionary rates in two subfamilies. CSS were predicted by a new entropy‐based method, and RSS using the Rate‐Shift program. In principle, the more CSS and RSS between two subfamilies, the more likely a function shift between them. A test dataset was built by extracting subfamilies from Pfam with different EC numbers that belong to the same domain family. Subfamilies were generated automatically using a phylogenetic tree‐based program, BETE. The dataset comprised 997 subfamily pairs with four or more members per subfamily. We observed a significant increase in CSS and RSS for subfamily comparisons with different EC numbers compared to cases with same EC numbers. The discrimination was better using RSS than CSS, and was more pronounced for larger families. Combining RSS and CSS by discriminant analysis improved classification accuracy to 71%. The method was applied to the Pfam database and the results are available at http://FunShift.cgb.ki.se. A closer examination of some superfamily comparisons showed that single EC numbers sometimes embody distinct functional classes. Hence, the measured accuracy of function shift is underestimated. Proteins 2005. © 2005 Wiley‐Liss, Inc." @default.
- W2100094101 created "2016-06-24" @default.
- W2100094101 creator A5026460419 @default.
- W2100094101 creator A5032432650 @default.
- W2100094101 date "2005-07-06" @default.
- W2100094101 modified "2023-10-17" @default.
- W2100094101 title "Large-scale prediction of function shift in protein families with a focus on enzymatic function" @default.
- W2100094101 cites W1539076962 @default.
- W2100094101 cites W1568709468 @default.
- W2100094101 cites W1573258040 @default.
- W2100094101 cites W1620564540 @default.
- W2100094101 cites W1830381718 @default.
- W2100094101 cites W1913751214 @default.
- W2100094101 cites W1927239524 @default.
- W2100094101 cites W1964814810 @default.
- W2100094101 cites W1979824026 @default.
- W2100094101 cites W1995924392 @default.
- W2100094101 cites W2019907599 @default.
- W2100094101 cites W2027536884 @default.
- W2100094101 cites W2041856646 @default.
- W2100094101 cites W2043033724 @default.
- W2100094101 cites W2046575349 @default.
- W2100094101 cites W2049203248 @default.
- W2100094101 cites W2072175110 @default.
- W2100094101 cites W2090678715 @default.
- W2100094101 cites W2092414509 @default.
- W2100094101 cites W2098425296 @default.
- W2100094101 cites W2111973517 @default.
- W2100094101 cites W2112492958 @default.
- W2100094101 cites W2116111392 @default.
- W2100094101 cites W2116990139 @default.
- W2100094101 cites W2120532481 @default.
- W2100094101 cites W2137991504 @default.
- W2100094101 cites W2139919097 @default.
- W2100094101 cites W2141187118 @default.
- W2100094101 cites W2146213764 @default.
- W2100094101 cites W2151831732 @default.
- W2100094101 cites W2152091041 @default.
- W2100094101 cites W2163748594 @default.
- W2100094101 cites W2167689124 @default.
- W2100094101 cites W4213149192 @default.
- W2100094101 doi "https://doi.org/10.1002/prot.20550" @default.
- W2100094101 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/16001403" @default.
- W2100094101 hasPublicationYear "2005" @default.
- W2100094101 type Work @default.
- W2100094101 sameAs 2100094101 @default.
- W2100094101 citedByCount "36" @default.
- W2100094101 countsByYear W21000941012012 @default.
- W2100094101 countsByYear W21000941012013 @default.
- W2100094101 countsByYear W21000941012014 @default.
- W2100094101 countsByYear W21000941012016 @default.
- W2100094101 countsByYear W21000941012021 @default.
- W2100094101 countsByYear W21000941012022 @default.
- W2100094101 crossrefType "journal-article" @default.
- W2100094101 hasAuthorship W2100094101A5026460419 @default.
- W2100094101 hasAuthorship W2100094101A5032432650 @default.
- W2100094101 hasBestOaLocation W21000941012 @default.
- W2100094101 hasConcept C104317684 @default.
- W2100094101 hasConcept C111919701 @default.
- W2100094101 hasConcept C124101348 @default.
- W2100094101 hasConcept C14036430 @default.
- W2100094101 hasConcept C167625842 @default.
- W2100094101 hasConcept C193252679 @default.
- W2100094101 hasConcept C207060522 @default.
- W2100094101 hasConcept C2385561 @default.
- W2100094101 hasConcept C2986374874 @default.
- W2100094101 hasConcept C41008148 @default.
- W2100094101 hasConcept C45484198 @default.
- W2100094101 hasConcept C50929876 @default.
- W2100094101 hasConcept C54355233 @default.
- W2100094101 hasConcept C70721500 @default.
- W2100094101 hasConcept C86803240 @default.
- W2100094101 hasConcept C88031987 @default.
- W2100094101 hasConceptScore W2100094101C104317684 @default.
- W2100094101 hasConceptScore W2100094101C111919701 @default.
- W2100094101 hasConceptScore W2100094101C124101348 @default.
- W2100094101 hasConceptScore W2100094101C14036430 @default.
- W2100094101 hasConceptScore W2100094101C167625842 @default.
- W2100094101 hasConceptScore W2100094101C193252679 @default.
- W2100094101 hasConceptScore W2100094101C207060522 @default.
- W2100094101 hasConceptScore W2100094101C2385561 @default.
- W2100094101 hasConceptScore W2100094101C2986374874 @default.
- W2100094101 hasConceptScore W2100094101C41008148 @default.
- W2100094101 hasConceptScore W2100094101C45484198 @default.
- W2100094101 hasConceptScore W2100094101C50929876 @default.
- W2100094101 hasConceptScore W2100094101C54355233 @default.
- W2100094101 hasConceptScore W2100094101C70721500 @default.
- W2100094101 hasConceptScore W2100094101C86803240 @default.
- W2100094101 hasConceptScore W2100094101C88031987 @default.
- W2100094101 hasIssue "4" @default.
- W2100094101 hasLocation W21000941011 @default.
- W2100094101 hasLocation W21000941012 @default.
- W2100094101 hasLocation W21000941013 @default.
- W2100094101 hasOpenAccess W2100094101 @default.
- W2100094101 hasPrimaryLocation W21000941011 @default.
- W2100094101 hasRelatedWork W1482324242 @default.
- W2100094101 hasRelatedWork W1981307089 @default.
- W2100094101 hasRelatedWork W1985408726 @default.
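
The abstract above defines Conservation-Shifting Sites (CSS) as alignment columns that are conserved in both subfamilies but with different amino acid types, and says they were predicted with an entropy-based method. The record does not give that method's details, so the following is only a minimal sketch of the general idea, not the authors' implementation. The function names, the `max_entropy` threshold of 0.5 bits, the rule of skipping gapped columns, and the toy sequences are all assumptions made for illustration.

```python
import math
from collections import Counter


def column_entropy(column):
    """Shannon entropy (in bits) of the residues in one alignment column."""
    counts = Counter(column)
    total = len(column)
    return sum((n / total) * math.log2(total / n) for n in counts.values())


def conservation_shifting_sites(sub_a, sub_b, max_entropy=0.5):
    """Return 0-based column indices that look like CSS.

    sub_a, sub_b: lists of equal-length aligned sequences, one list per
    subfamily. A column is reported when its entropy is at most
    max_entropy in BOTH subfamilies (hypothetical threshold) and the most
    common residue differs between the two subfamilies.
    """
    sites = []
    columns = zip(zip(*sub_a), zip(*sub_b))
    for index, (col_a, col_b) in enumerate(columns):
        # Assumption: columns containing gaps are ignored entirely.
        if "-" in col_a or "-" in col_b:
            continue
        conserved_a = column_entropy(col_a) <= max_entropy
        conserved_b = column_entropy(col_b) <= max_entropy
        if conserved_a and conserved_b:
            top_a = Counter(col_a).most_common(1)[0][0]
            top_b = Counter(col_b).most_common(1)[0][0]
            if top_a != top_b:
                sites.append(index)
    return sites


# Toy subfamilies (invented data): only column 1 is conserved in both
# subfamilies with different residues (C versus G).
subfamily_a = ["ACDK", "ACDR", "ACDK", "ACEK"]
subfamily_b = ["AGDL", "AGDM", "AGDI", "AGEV"]
print(conservation_shifting_sites(subfamily_a, subfamily_b))  # prints [1]
```

In this sketch the count of reported sites per subfamily pair would play the role of the CSS score described in the abstract; the Rate-Shifting Sites and the discriminant analysis step are not covered.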