Matches in SemOpenAlex for { <https://semopenalex.org/work/W1490503023> ?p ?o ?g. }
- W1490503023 abstract "Predicted open reading frames (ORFs) that lack detectable homology to known proteins are termed ORFans. Despite their prevalence in metagenomes, the extent to which ORFans encode real proteins, the degree to which they can be annotated, and their functional contributions, remain unclear. To gain insights into these questions, we applied sensitive remote-homology detection methods to functionally analyze ORFans from soil, marine, and human gut metagenome collections. ORFans were identified, clustered into sequence families, and annotated through profile-profile comparison to proteins of known structure. We found that a considerable number of metagenomic ORFans (73,896 of 484,121, 15.3%) exhibit significant remote homology to structurally characterized proteins, providing a means for ORFan functional profiling. The extent of detected remote homology far exceeds that obtained for artificial protein families (1.4%). As expected for real genes, the predicted functions of ORFans are significantly similar to the functions of their gene neighbors (p < 0.001). Compared to the functional profiles predicted through standard homology searches, ORFans show biologically intriguing differences. Many ORFan-enriched functions are virus-related and tend to reflect biological processes associated with extreme sequence diversity. Each environment also possesses a large number of unique ORFan families and functions, including some known to play important community roles such as gut microbial polysaccharide digestion. Lastly, ORFans are a valuable resource for finding novel enzymes of interest, as we demonstrate through the identification of hundreds of novel ORFan metalloproteases that all possess a signature catalytic motif despite a general lack of similarity to known proteins. Our ORFan functional predictions are a valuable resource for discovering novel protein families and exploring the boundaries of protein sequence space. All remote homology predictions are available at http://doxey.uwaterloo.ca/ORFans." @default.
- W1490503023 created "2016-06-24" @default.
- W1490503023 creator A5003877983 @default.
- W1490503023 creator A5032187526 @default.
- W1490503023 creator A5057232313 @default.
- W1490503023 creator A5069360590 @default.
- W1490503023 date "2015-07-21" @default.
- W1490503023 modified "2023-10-18" @default.
- W1490503023 title "Remote homology and the functions of metagenomic dark matter" @default.
- W1490503023 cites W1488354887 @default.
- W1490503023 cites W1520718236 @default.
- W1490503023 cites W1551442557 @default.
- W1490503023 cites W1791999417 @default.
- W1490503023 cites W1963723331 @default.
- W1490503023 cites W1965186869 @default.
- W1490503023 cites W1969250533 @default.
- W1490503023 cites W1971738871 @default.
- W1490503023 cites W1972314084 @default.
- W1490503023 cites W1983995085 @default.
- W1490503023 cites W1998996716 @default.
- W1490503023 cites W2000471751 @default.
- W1490503023 cites W2001738143 @default.
- W1490503023 cites W2006465310 @default.
- W1490503023 cites W2015426350 @default.
- W1490503023 cites W2018756605 @default.
- W1490503023 cites W2019377841 @default.
- W1490503023 cites W2025853251 @default.
- W1490503023 cites W2028707451 @default.
- W1490503023 cites W2029743552 @default.
- W1490503023 cites W2030818758 @default.
- W1490503023 cites W2037562977 @default.
- W1490503023 cites W2041115444 @default.
- W1490503023 cites W2045564656 @default.
- W1490503023 cites W2051210555 @default.
- W1490503023 cites W2055777573 @default.
- W1490503023 cites W2059223767 @default.
- W1490503023 cites W2061654988 @default.
- W1490503023 cites W2061704677 @default.
- W1490503023 cites W2062491586 @default.
- W1490503023 cites W2065643553 @default.
- W1490503023 cites W2069689025 @default.
- W1490503023 cites W2081219382 @default.
- W1490503023 cites W2081832301 @default.
- W1490503023 cites W2093830129 @default.
- W1490503023 cites W2095854550 @default.
- W1490503023 cites W2097186593 @default.
- W1490503023 cites W2107424557 @default.
- W1490503023 cites W2108035844 @default.
- W1490503023 cites W2108929776 @default.
- W1490503023 cites W2109715166 @default.
- W1490503023 cites W2110650347 @default.
- W1490503023 cites W2114548104 @default.
- W1490503023 cites W2116423958 @default.
- W1490503023 cites W2124560890 @default.
- W1490503023 cites W2125826054 @default.
- W1490503023 cites W2126809954 @default.
- W1490503023 cites W2131186249 @default.
- W1490503023 cites W2132801908 @default.
- W1490503023 cites W2143339264 @default.
- W1490503023 cites W2145268834 @default.
- W1490503023 cites W2145336165 @default.
- W1490503023 cites W2147526198 @default.
- W1490503023 cites W2147783737 @default.
- W1490503023 cites W2153544371 @default.
- W1490503023 cites W2154510716 @default.
- W1490503023 cites W2154654747 @default.
- W1490503023 cites W2158714788 @default.
- W1490503023 cites W2159261956 @default.
- W1490503023 cites W2159591897 @default.
- W1490503023 cites W2161794223 @default.
- W1490503023 cites W2162888916 @default.
- W1490503023 cites W2165979915 @default.
- W1490503023 cites W2168867830 @default.
- W1490503023 cites W2169748335 @default.
- W1490503023 cites W2169762780 @default.
- W1490503023 cites W4211222361 @default.
- W1490503023 cites W4229487902 @default.
- W1490503023 cites W4250908540 @default.
- W1490503023 doi "https://doi.org/10.3389/fgene.2015.00234" @default.
- W1490503023 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/4508852" @default.
- W1490503023 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/26257768" @default.
- W1490503023 hasPublicationYear "2015" @default.
- W1490503023 type Work @default.
- W1490503023 sameAs 1490503023 @default.
- W1490503023 citedByCount "32" @default.
- W1490503023 countsByYear W14905030232015 @default.
- W1490503023 countsByYear W14905030232016 @default.
- W1490503023 countsByYear W14905030232017 @default.
- W1490503023 countsByYear W14905030232018 @default.
- W1490503023 countsByYear W14905030232019 @default.
- W1490503023 countsByYear W14905030232020 @default.
- W1490503023 countsByYear W14905030232021 @default.
- W1490503023 countsByYear W14905030232022 @default.
- W1490503023 countsByYear W14905030232023 @default.
- W1490503023 crossrefType "journal-article" @default.
- W1490503023 hasAuthorship W1490503023A5003877983 @default.
- W1490503023 hasAuthorship W1490503023A5032187526 @default.
- W1490503023 hasAuthorship W1490503023A5057232313 @default.
- W1490503023 hasAuthorship W1490503023A5069360590 @default.
- W1490503023 hasBestOaLocation W14905030231 @default.