Matches in SemOpenAlex for { <https://semopenalex.org/work/W2079268119> ?p ?o ?g. }
- W2079268119 abstract "Background: The identification of protein domains plays an important role in protein structure comparison. Domain query size and composition are critical to structure similarity search algorithms such as the Vector Alignment Search Tool (VAST), the method employed for computing related protein structures in the NCBI Entrez system. Currently, domains identified on the basis of structural compactness are used for VAST computations. In this study, we investigated how alternative definitions of domains, derived from conserved sequence alignments in the Conserved Domain Database (CDD), would affect the domain comparisons and structure similarity search performance of VAST. Results: Alternative domains, which have significantly different secondary structure composition from those based on structurally compact units, were identified based on the alignment footprints of curated protein sequence domain families. Our analysis indicates that domain boundaries disagree on roughly 8% of protein chains in the medium redundancy subset of the Molecular Modeling Database (MMDB). These conflicting sequence-based domain boundaries perform slightly better than structure domains in structure similarity searches, and there are interesting cases in which structure similarity search performance is markedly improved. Conclusion: Structure similarity searches using domain boundaries based on conserved sequence information can provide an additional method for investigators to identify interesting similarities between proteins with known structures. Because of the improvement in performance of structure similarity searches using sequence domain boundaries, we are in the process of implementing their inclusion into the VAST search and MMDB resources in the NCBI Entrez system." @default.
- W2079268119 created "2016-06-24" @default.
- W2079268119 creator A5040349721 @default.
- W2079268119 creator A5059431543 @default.
- W2079268119 creator A5065601142 @default.
- W2079268119 creator A5081728565 @default.
- W2079268119 date "2009-05-19" @default.
- W2079268119 modified "2023-09-25" @default.
- W2079268119 title "Improving protein structure similarity searches using domain boundaries based on conserved sequence information" @default.
- W2079268119 cites W1508885706 @default.
- W2079268119 cites W1965266514 @default.
- W2079268119 cites W1965910641 @default.
- W2079268119 cites W1972718882 @default.
- W2079268119 cites W1977922844 @default.
- W2079268119 cites W1986510280 @default.
- W2079268119 cites W1990453950 @default.
- W2079268119 cites W2005049999 @default.
- W2079268119 cites W2005313213 @default.
- W2079268119 cites W2009528611 @default.
- W2079268119 cites W2013025153 @default.
- W2079268119 cites W2015292449 @default.
- W2079268119 cites W2022058405 @default.
- W2079268119 cites W2049355199 @default.
- W2079268119 cites W2055043387 @default.
- W2079268119 cites W2092638671 @default.
- W2079268119 cites W2094406540 @default.
- W2079268119 cites W2095179527 @default.
- W2079268119 cites W2097471670 @default.
- W2079268119 cites W2108067237 @default.
- W2079268119 cites W2150434889 @default.
- W2079268119 cites W2152326664 @default.
- W2079268119 cites W2152517415 @default.
- W2079268119 cites W2155606054 @default.
- W2079268119 cites W2157242929 @default.
- W2079268119 cites W2169395813 @default.
- W2079268119 cites W4230224839 @default.
- W2079268119 doi "https://doi.org/10.1186/1472-6807-9-33" @default.
- W2079268119 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/2694201" @default.
- W2079268119 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/19454035" @default.
- W2079268119 hasPublicationYear "2009" @default.
- W2079268119 type Work @default.
- W2079268119 sameAs 2079268119 @default.
- W2079268119 citedByCount "6" @default.
- W2079268119 countsByYear W20792681192012 @default.
- W2079268119 countsByYear W20792681192014 @default.
- W2079268119 crossrefType "journal-article" @default.
- W2079268119 hasAuthorship W2079268119A5040349721 @default.
- W2079268119 hasAuthorship W2079268119A5059431543 @default.
- W2079268119 hasAuthorship W2079268119A5065601142 @default.
- W2079268119 hasAuthorship W2079268119A5081728565 @default.
- W2079268119 hasBestOaLocation W20792681191 @default.
- W2079268119 hasConcept C10010492 @default.
- W2079268119 hasConcept C103278499 @default.
- W2079268119 hasConcept C104317684 @default.
- W2079268119 hasConcept C115961682 @default.
- W2079268119 hasConcept C124101348 @default.
- W2079268119 hasConcept C134306372 @default.
- W2079268119 hasConcept C136475424 @default.
- W2079268119 hasConcept C139489369 @default.
- W2079268119 hasConcept C144292202 @default.
- W2079268119 hasConcept C154945302 @default.
- W2079268119 hasConcept C167625842 @default.
- W2079268119 hasConcept C199216141 @default.
- W2079268119 hasConcept C2778112365 @default.
- W2079268119 hasConcept C33923547 @default.
- W2079268119 hasConcept C36503486 @default.
- W2079268119 hasConcept C41008148 @default.
- W2079268119 hasConcept C41584329 @default.
- W2079268119 hasConcept C45484198 @default.
- W2079268119 hasConcept C4668613 @default.
- W2079268119 hasConcept C47701112 @default.
- W2079268119 hasConcept C54355233 @default.
- W2079268119 hasConcept C55493867 @default.
- W2079268119 hasConcept C60644358 @default.
- W2079268119 hasConcept C70721500 @default.
- W2079268119 hasConcept C86803240 @default.
- W2079268119 hasConcept C88031987 @default.
- W2079268119 hasConceptScore W2079268119C10010492 @default.
- W2079268119 hasConceptScore W2079268119C103278499 @default.
- W2079268119 hasConceptScore W2079268119C104317684 @default.
- W2079268119 hasConceptScore W2079268119C115961682 @default.
- W2079268119 hasConceptScore W2079268119C124101348 @default.
- W2079268119 hasConceptScore W2079268119C134306372 @default.
- W2079268119 hasConceptScore W2079268119C136475424 @default.
- W2079268119 hasConceptScore W2079268119C139489369 @default.
- W2079268119 hasConceptScore W2079268119C144292202 @default.
- W2079268119 hasConceptScore W2079268119C154945302 @default.
- W2079268119 hasConceptScore W2079268119C167625842 @default.
- W2079268119 hasConceptScore W2079268119C199216141 @default.
- W2079268119 hasConceptScore W2079268119C2778112365 @default.
- W2079268119 hasConceptScore W2079268119C33923547 @default.
- W2079268119 hasConceptScore W2079268119C36503486 @default.
- W2079268119 hasConceptScore W2079268119C41008148 @default.
- W2079268119 hasConceptScore W2079268119C41584329 @default.
- W2079268119 hasConceptScore W2079268119C45484198 @default.
- W2079268119 hasConceptScore W2079268119C4668613 @default.
- W2079268119 hasConceptScore W2079268119C47701112 @default.
- W2079268119 hasConceptScore W2079268119C54355233 @default.
- W2079268119 hasConceptScore W2079268119C55493867 @default.
- W2079268119 hasConceptScore W2079268119C60644358 @default.