Matches in SemOpenAlex for { <https://semopenalex.org/work/W2143065269> ?p ?o ?g. }
- W2143065269 abstract "Background: Structural variations caused by a wide range of physico-chemical and biological sources directly influence the function of a protein. For enzymatic proteins, the structure and chemistry of the catalytic binding site residues can be loosely defined as a substructure of the protein. Comparative analysis of drug-receptor substructures across and within species has been used for lead evaluation. Substructure-level similarity between the binding sites of functionally similar proteins has also been used to identify instances of convergent evolution among proteins. In functionally homologous protein families, shared chemistry and geometry at catalytic sites provide a common, local point of comparison among proteins that may differ significantly at the sequence, fold, or domain topology levels. Results: This paper describes two key results that can be used separately or in combination for protein function analysis. The Family-wise Analysis of SubStructural Templates (FASST) method uses all-against-all substructure comparison to determine Substructural Clusters (SCs). SCs characterize the binding site substructural variation within a protein family. In this paper we focus on examples of automatically determined SCs that can be linked to phylogenetic distance between family members, segregation by conformation, and organization by homology among convergent protein lineages. The Motif Ensemble Statistical Hypothesis (MESH) framework constructs a representative motif for each protein cluster among the SCs determined by FASST to build motif ensembles that are shown through a series of function prediction experiments to improve the function prediction power of existing motifs. Conclusions: FASST contributes a critical feedback and assessment step to existing binding site substructure identification methods and can be used for the thorough investigation of structure-function relationships. The application of MESH allows for an automated, statistically rigorous procedure for incorporating structural variation data into protein function prediction pipelines. Our work provides an unbiased, automated assessment of the structural variability of identified binding site substructures among protein structure families and a technique for exploring the relation of substructural variation to protein function. As available proteomic data continues to expand, the techniques proposed will be indispensable for the large-scale analysis and interpretation of structural data." @default.
- W2143065269 created "2016-06-24" @default.
- W2143065269 creator A5035723276 @default.
- W2143065269 creator A5061397940 @default.
- W2143065269 creator A5062182398 @default.
- W2143065269 creator A5067205988 @default.
- W2143065269 creator A5087145328 @default.
- W2143065269 date "2010-05-11" @default.
- W2143065269 modified "2023-10-10" @default.
- W2143065269 title "Analysis of substructural variation in families of enzymatic proteins with applications to protein function prediction" @default.
- W2143065269 cites W1550745757 @default.
- W2143065269 cites W1810417380 @default.
- W2143065269 cites W1971857738 @default.
- W2143065269 cites W1981575795 @default.
- W2143065269 cites W1986154009 @default.
- W2143065269 cites W1986199568 @default.
- W2143065269 cites W1986969007 @default.
- W2143065269 cites W1988681425 @default.
- W2143065269 cites W1991781000 @default.
- W2143065269 cites W1992115216 @default.
- W2143065269 cites W1995413875 @default.
- W2143065269 cites W1998014636 @default.
- W2143065269 cites W2011832962 @default.
- W2143065269 cites W2013628732 @default.
- W2143065269 cites W2013987468 @default.
- W2143065269 cites W2014474863 @default.
- W2143065269 cites W2015147525 @default.
- W2143065269 cites W2017016843 @default.
- W2143065269 cites W2017727500 @default.
- W2143065269 cites W2020276026 @default.
- W2143065269 cites W2021954541 @default.
- W2143065269 cites W2023790301 @default.
- W2143065269 cites W2025157967 @default.
- W2143065269 cites W2042647264 @default.
- W2143065269 cites W2044041242 @default.
- W2143065269 cites W2044128233 @default.
- W2143065269 cites W2044687262 @default.
- W2143065269 cites W2048767410 @default.
- W2143065269 cites W2049905965 @default.
- W2143065269 cites W2051895548 @default.
- W2143065269 cites W2060851522 @default.
- W2143065269 cites W2062352340 @default.
- W2143065269 cites W2063935375 @default.
- W2143065269 cites W2066521937 @default.
- W2143065269 cites W2070318172 @default.
- W2143065269 cites W2074192927 @default.
- W2143065269 cites W2074388759 @default.
- W2143065269 cites W2077412035 @default.
- W2143065269 cites W2079659761 @default.
- W2143065269 cites W2088534163 @default.
- W2143065269 cites W2092053194 @default.
- W2143065269 cites W2100494857 @default.
- W2143065269 cites W2101449937 @default.
- W2143065269 cites W2102333330 @default.
- W2143065269 cites W2103935383 @default.
- W2143065269 cites W2104245738 @default.
- W2143065269 cites W2106882534 @default.
- W2143065269 cites W2108067237 @default.
- W2143065269 cites W2108909656 @default.
- W2143065269 cites W2111108708 @default.
- W2143065269 cites W2115012618 @default.
- W2143065269 cites W2121704271 @default.
- W2143065269 cites W2121926265 @default.
- W2143065269 cites W2130479394 @default.
- W2143065269 cites W2132629607 @default.
- W2143065269 cites W2137502578 @default.
- W2143065269 cites W2137507165 @default.
- W2143065269 cites W2137991504 @default.
- W2143065269 cites W2146940292 @default.
- W2143065269 cites W2150065588 @default.
- W2143065269 cites W2150444353 @default.
- W2143065269 cites W2151884301 @default.
- W2143065269 cites W2152326664 @default.
- W2143065269 cites W2153504900 @default.
- W2143065269 cites W2156575953 @default.
- W2143065269 cites W2168865322 @default.
- W2143065269 cites W2171172158 @default.
- W2143065269 cites W2171602500 @default.
- W2143065269 cites W2171985093 @default.
- W2143065269 cites W2314337911 @default.
- W2143065269 cites W4234330104 @default.
- W2143065269 cites W4292023222 @default.
- W2143065269 doi "https://doi.org/10.1186/1471-2105-11-242" @default.
- W2143065269 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/2885373" @default.
- W2143065269 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/20459833" @default.
- W2143065269 hasPublicationYear "2010" @default.
- W2143065269 type Work @default.
- W2143065269 sameAs 2143065269 @default.
- W2143065269 citedByCount "14" @default.
- W2143065269 countsByYear W21430652692012 @default.
- W2143065269 countsByYear W21430652692013 @default.
- W2143065269 countsByYear W21430652692014 @default.
- W2143065269 countsByYear W21430652692015 @default.
- W2143065269 countsByYear W21430652692017 @default.
- W2143065269 countsByYear W21430652692021 @default.
- W2143065269 countsByYear W21430652692022 @default.
- W2143065269 countsByYear W21430652692023 @default.
- W2143065269 crossrefType "journal-article" @default.
- W2143065269 hasAuthorship W2143065269A5035723276 @default.
- W2143065269 hasAuthorship W2143065269A5061397940 @default.
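The matches above follow a regular shape: subject, predicate, object, then the graph name after `@`. As a minimal sketch (not part of SemOpenAlex itself), the following Python parses such listing lines into tuples and collects the cited works. The regular expression, the `Match` type, and the helper names `parse_line` and `cited_works` are assumptions made for illustration, and the sketch assumes each match sits on a single line.

```python
import re
from typing import Iterable, List, NamedTuple, Optional


class Match(NamedTuple):
    """One row of the listing: subject, predicate, object, graph (hypothetical type)."""
    subject: str
    predicate: str
    obj: str
    graph: str


# Assumed line shape: "- SUBJECT PREDICATE OBJECT @GRAPH."
# The object is matched lazily so quoted literals containing spaces stay intact.
LINE_RE = re.compile(r'^-\s+(\S+)\s+(\S+)\s+(.*?)\s+@(\w+)\.\s*$', re.DOTALL)


def parse_line(line: str) -> Optional[Match]:
    """Parse one listing line; return None if it does not fit the assumed shape."""
    m = LINE_RE.match(line.strip())
    if m is None:
        return None
    subject, predicate, obj, graph = m.groups()
    # Drop the surrounding double quotes of literal values such as dates and titles.
    if len(obj) >= 2 and obj[0] == '"' and obj[-1] == '"':
        obj = obj[1:-1]
    return Match(subject, predicate, obj, graph)


def cited_works(lines: Iterable[str]) -> List[str]:
    """Return the object of every 'cites' match, in listing order."""
    parsed = (parse_line(line) for line in lines)
    return [m.obj for m in parsed if m is not None and m.predicate == "cites"]


# Sample lines copied from the listing above.
sample = [
    '- W2143065269 date "2010-05-11" @default.',
    '- W2143065269 cites W1550745757 @default.',
    '- W2143065269 cites W1810417380 @default.',
    '- W2143065269 type Work @default.',
]

if __name__ == "__main__":
    for row in sample:
        print(parse_line(row))
    print(cited_works(sample))
```

Returning `None` for lines that do not fit the pattern, such as the header line, lets the caller filter them out without a separate validation pass.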