Matches in SemOpenAlex for { <https://semopenalex.org/work/W2147442130> ?p ?o ?g. }
- W2147442130 abstract "Protein domains are commonly used to assess the functional roles and evolutionary relationships of proteins and protein families. Here, we use the Pfam protein family database to examine a set of candidate partial domains. Pfam protein domains are often thought of as evolutionarily indivisible, structurally compact, units from which larger functional proteins are assembled; however, almost 4% of Pfam27 PfamA domains are shorter than 50% of their family model length, suggesting that more than half of the domain is missing at those locations. To better understand the structural nature of partial domains in proteins, we examined 30,961 partial domain regions from 136 domain families contained in a representative subset of PfamA domains (RefProtDom2 or RPD2). We characterized three types of apparent partial domains: split domains, bounded partials, and unbounded partials. We find that bounded partial domains are over-represented in eukaryotes and in lower quality protein predictions, suggesting that they often result from inaccurate genome assemblies or gene models. We also find that a large percentage of unbounded partial domains produce long alignments, which suggests that their annotation as a partial is an alignment artifact; yet some can be found as partials in other sequence contexts. Partial domains are largely the result of alignment and annotation artifacts and should be viewed with caution. The presence of partial domain annotations in proteins should raise the concern that the prediction of the protein's gene may be incomplete. In general, protein domains can be considered the structural building blocks of proteins." @default.
- W2147442130 created "2016-06-24" @default.
- W2147442130 creator A5043942506 @default.
- W2147442130 creator A5073334733 @default.
- W2147442130 date "2015-05-15" @default.
- W2147442130 modified "2023-09-29" @default.
- W2147442130 title "Most partial domains in proteins are alignment and annotation artifacts" @default.
- W2147442130 cites W1980092242 @default.
- W2147442130 cites W1996241166 @default.
- W2147442130 cites W2081495471 @default.
- W2147442130 cites W2089499735 @default.
- W2147442130 cites W2091506983 @default.
- W2147442130 cites W2094713937 @default.
- W2147442130 cites W2101335101 @default.
- W2147442130 cites W2107644675 @default.
- W2147442130 cites W2120691184 @default.
- W2147442130 cites W2122576958 @default.
- W2147442130 cites W2130358901 @default.
- W2147442130 cites W2132495279 @default.
- W2147442130 cites W2136280642 @default.
- W2147442130 cites W2141885858 @default.
- W2147442130 cites W2157484293 @default.
- W2147442130 cites W2168708922 @default.
- W2147442130 cites W2169686324 @default.
- W2147442130 cites W2605068739 @default.
- W2147442130 cites W4210623056 @default.
- W2147442130 cites W4210767115 @default.
- W2147442130 cites W4320301318 @default.
- W2147442130 doi "https://doi.org/10.1186/s13059-015-0656-7" @default.
- W2147442130 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/4443539" @default.
- W2147442130 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/25976240" @default.
- W2147442130 hasPublicationYear "2015" @default.
- W2147442130 type Work @default.
- W2147442130 sameAs 2147442130 @default.
- W2147442130 citedByCount "24" @default.
- W2147442130 countsByYear W21474421302015 @default.
- W2147442130 countsByYear W21474421302016 @default.
- W2147442130 countsByYear W21474421302017 @default.
- W2147442130 countsByYear W21474421302018 @default.
- W2147442130 countsByYear W21474421302019 @default.
- W2147442130 countsByYear W21474421302021 @default.
- W2147442130 countsByYear W21474421302022 @default.
- W2147442130 countsByYear W21474421302023 @default.
- W2147442130 crossrefType "journal-article" @default.
- W2147442130 hasAuthorship W2147442130A5043942506 @default.
- W2147442130 hasAuthorship W2147442130A5073334733 @default.
- W2147442130 hasBestOaLocation W21474421301 @default.
- W2147442130 hasConcept C104317684 @default.
- W2147442130 hasConcept C134306372 @default.
- W2147442130 hasConcept C141231307 @default.
- W2147442130 hasConcept C144292202 @default.
- W2147442130 hasConcept C167625842 @default.
- W2147442130 hasConcept C171897839 @default.
- W2147442130 hasConcept C177264268 @default.
- W2147442130 hasConcept C192772702 @default.
- W2147442130 hasConcept C199360897 @default.
- W2147442130 hasConcept C2776321320 @default.
- W2147442130 hasConcept C2908923196 @default.
- W2147442130 hasConcept C33923547 @default.
- W2147442130 hasConcept C36503486 @default.
- W2147442130 hasConcept C41008148 @default.
- W2147442130 hasConcept C45484198 @default.
- W2147442130 hasConcept C47701112 @default.
- W2147442130 hasConcept C54355233 @default.
- W2147442130 hasConcept C55493867 @default.
- W2147442130 hasConcept C70721500 @default.
- W2147442130 hasConcept C78458016 @default.
- W2147442130 hasConcept C86803240 @default.
- W2147442130 hasConceptScore W2147442130C104317684 @default.
- W2147442130 hasConceptScore W2147442130C134306372 @default.
- W2147442130 hasConceptScore W2147442130C141231307 @default.
- W2147442130 hasConceptScore W2147442130C144292202 @default.
- W2147442130 hasConceptScore W2147442130C167625842 @default.
- W2147442130 hasConceptScore W2147442130C171897839 @default.
- W2147442130 hasConceptScore W2147442130C177264268 @default.
- W2147442130 hasConceptScore W2147442130C192772702 @default.
- W2147442130 hasConceptScore W2147442130C199360897 @default.
- W2147442130 hasConceptScore W2147442130C2776321320 @default.
- W2147442130 hasConceptScore W2147442130C2908923196 @default.
- W2147442130 hasConceptScore W2147442130C33923547 @default.
- W2147442130 hasConceptScore W2147442130C36503486 @default.
- W2147442130 hasConceptScore W2147442130C41008148 @default.
- W2147442130 hasConceptScore W2147442130C45484198 @default.
- W2147442130 hasConceptScore W2147442130C47701112 @default.
- W2147442130 hasConceptScore W2147442130C54355233 @default.
- W2147442130 hasConceptScore W2147442130C55493867 @default.
- W2147442130 hasConceptScore W2147442130C70721500 @default.
- W2147442130 hasConceptScore W2147442130C78458016 @default.
- W2147442130 hasConceptScore W2147442130C86803240 @default.
- W2147442130 hasIssue "1" @default.
- W2147442130 hasLocation W21474421301 @default.
- W2147442130 hasLocation W21474421302 @default.
- W2147442130 hasLocation W21474421303 @default.
- W2147442130 hasLocation W21474421304 @default.
- W2147442130 hasOpenAccess W2147442130 @default.
- W2147442130 hasPrimaryLocation W21474421301 @default.
- W2147442130 hasRelatedWork W1966852334 @default.
- W2147442130 hasRelatedWork W2037993016 @default.
- W2147442130 hasRelatedWork W2058721531 @default.
- W2147442130 hasRelatedWork W2118652015 @default.