Matches in SemOpenAlex for { <https://semopenalex.org/work/W2004337816> ?p ?o ?g. }
- W2004337816 endingPage "11" @default.
- W2004337816 startingPage "11" @default.
- W2004337816 abstract "The detection of conserved residue clusters on a protein structure is one of the effective strategies for the prediction of functional protein regions. Various methods, such as Evolutionary Trace, have been developed based on this strategy. In such approaches, the conserved residues are identified through comparisons of homologous amino acid sequences. Therefore, the selection of homologous sequences is a critical step. It is empirically known that a certain degree of sequence divergence in the set of homologous sequences is required for the identification of conserved residues. However, the development of a method to select homologous sequences appropriate for the identification of conserved residues has not been sufficiently addressed. An objective and general method to select appropriate homologous sequences is desired for the efficient prediction of functional regions. We have developed a novel index to select the sequences appropriate for the identification of conserved residues, and implemented the index within our method to predict the functional regions of a protein. The implementation of the index improved the performance of the functional region prediction. The index represents the degree of conserved residue clustering on the tertiary structure of the protein. For this purpose, the structure and sequence information were integrated within the index by the application of spatial statistics. Spatial statistics is a field of statistics in which not only the attributes but also the geometrical coordinates of the data are considered simultaneously. Higher degrees of clustering generate larger index scores. We adopted the set of homologous sequences with the highest index score, under the assumption that the best prediction accuracy is obtained when the degree of clustering is the maximum. 
The set of sequences selected by the index led to higher functional region prediction performance than the sets of sequences selected by other sequence-based methods. Appropriate homologous sequences are selected automatically and objectively by the index. Such sequence selection improved the performance of functional region prediction. As far as we know, this is the first approach in which spatial statistics have been applied to protein analyses. Such integration of structure and sequence information would be useful for other bioinformatics problems." @default.
- W2004337816 created "2016-06-24" @default.
- W2004337816 creator A5045205001 @default.
- W2004337816 creator A5064128993 @default.
- W2004337816 date "2012-01-01" @default.
- W2004337816 modified "2023-09-27" @default.
- W2004337816 title "Functional region prediction with a set of appropriate homologous sequences: an index for sequence selection by integrating structure and sequence information with spatial statistics" @default.
- W2004337816 cites W1561011671 @default.
- W2004337816 cites W1964640688 @default.
- W2004337816 cites W1965582988 @default.
- W2004337816 cites W1966122249 @default.
- W2004337816 cites W1970999032 @default.
- W2004337816 cites W1992358035 @default.
- W2004337816 cites W2003788979 @default.
- W2004337816 cites W2018078151 @default.
- W2004337816 cites W2027036222 @default.
- W2004337816 cites W2028750420 @default.
- W2004337816 cites W2043199972 @default.
- W2004337816 cites W2043886357 @default.
- W2004337816 cites W2059067075 @default.
- W2004337816 cites W2064430199 @default.
- W2004337816 cites W2087918275 @default.
- W2004337816 cites W2090699441 @default.
- W2004337816 cites W2099254366 @default.
- W2004337816 cites W2099628648 @default.
- W2004337816 cites W2102931666 @default.
- W2004337816 cites W2107903949 @default.
- W2004337816 cites W2113024639 @default.
- W2004337816 cites W2115595474 @default.
- W2004337816 cites W2119362018 @default.
- W2004337816 cites W2119498937 @default.
- W2004337816 cites W2127296617 @default.
- W2004337816 cites W2128674962 @default.
- W2004337816 cites W2132395520 @default.
- W2004337816 cites W2137991504 @default.
- W2004337816 cites W2143210482 @default.
- W2004337816 cites W2150444353 @default.
- W2004337816 cites W2150825041 @default.
- W2004337816 cites W2154388132 @default.
- W2004337816 cites W2155469822 @default.
- W2004337816 cites W2162574056 @default.
- W2004337816 cites W2169890434 @default.
- W2004337816 doi "https://doi.org/10.1186/1472-6807-12-11" @default.
- W2004337816 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/3533907" @default.
- W2004337816 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/22643026" @default.
- W2004337816 hasPublicationYear "2012" @default.
- W2004337816 type Work @default.
- W2004337816 sameAs 2004337816 @default.
- W2004337816 citedByCount "5" @default.
- W2004337816 countsByYear W20043378162013 @default.
- W2004337816 countsByYear W20043378162016 @default.
- W2004337816 countsByYear W20043378162019 @default.
- W2004337816 countsByYear W20043378162021 @default.
- W2004337816 crossrefType "journal-article" @default.
- W2004337816 hasAuthorship W2004337816A5045205001 @default.
- W2004337816 hasAuthorship W2004337816A5064128993 @default.
- W2004337816 hasBestOaLocation W20043378161 @default.
- W2004337816 hasConcept C10010492 @default.
- W2004337816 hasConcept C104317684 @default.
- W2004337816 hasConcept C124101348 @default.
- W2004337816 hasConcept C154945302 @default.
- W2004337816 hasConcept C167625842 @default.
- W2004337816 hasConcept C177264268 @default.
- W2004337816 hasConcept C178180057 @default.
- W2004337816 hasConcept C180384323 @default.
- W2004337816 hasConcept C199216141 @default.
- W2004337816 hasConcept C199360897 @default.
- W2004337816 hasConcept C2778112365 @default.
- W2004337816 hasConcept C33923547 @default.
- W2004337816 hasConcept C41008148 @default.
- W2004337816 hasConcept C45484198 @default.
- W2004337816 hasConcept C54355233 @default.
- W2004337816 hasConcept C61053724 @default.
- W2004337816 hasConcept C70721500 @default.
- W2004337816 hasConcept C73555534 @default.
- W2004337816 hasConcept C81917197 @default.
- W2004337816 hasConcept C86803240 @default.
- W2004337816 hasConcept C88031987 @default.
- W2004337816 hasConceptScore W2004337816C10010492 @default.
- W2004337816 hasConceptScore W2004337816C104317684 @default.
- W2004337816 hasConceptScore W2004337816C124101348 @default.
- W2004337816 hasConceptScore W2004337816C154945302 @default.
- W2004337816 hasConceptScore W2004337816C167625842 @default.
- W2004337816 hasConceptScore W2004337816C177264268 @default.
- W2004337816 hasConceptScore W2004337816C178180057 @default.
- W2004337816 hasConceptScore W2004337816C180384323 @default.
- W2004337816 hasConceptScore W2004337816C199216141 @default.
- W2004337816 hasConceptScore W2004337816C199360897 @default.
- W2004337816 hasConceptScore W2004337816C2778112365 @default.
- W2004337816 hasConceptScore W2004337816C33923547 @default.
- W2004337816 hasConceptScore W2004337816C41008148 @default.
- W2004337816 hasConceptScore W2004337816C45484198 @default.
- W2004337816 hasConceptScore W2004337816C54355233 @default.
- W2004337816 hasConceptScore W2004337816C61053724 @default.
- W2004337816 hasConceptScore W2004337816C70721500 @default.
- W2004337816 hasConceptScore W2004337816C73555534 @default.
- W2004337816 hasConceptScore W2004337816C81917197 @default.
- W2004337816 hasConceptScore W2004337816C86803240 @default.