Matches in SemOpenAlex for { <https://semopenalex.org/work/W73004682> ?p ?o ?g. }
- W73004682 abstract "The number of protein sequences being deposited in databases is currently growing rapidly as a result of large-scale high throughput genome sequencing efforts. A large proportion of these sequences have no experimentally determined structure. Also, relatively few have high quality, specific, experimentally determined functions. Due to the time, cost and technical complexity of experimental procedures for the determination of protein function this situation is unlikely to change in the near future. Therefore, one of the major challenges for bioinformatics is the ability to automatically assign highly accurate, high-specificity functional information to these unknown protein sequences. As yet this problem has not been successfully solved to a level both acceptable in terms of detailed accuracy and reliability for use as a basis for detailed biological analysis on a genome wide, automated, high-throughput scale. This research thesis aims to address this shortfall through the provision and benchmarking of methods that can be used towards improving the accuracy of high-specificity protein function prediction from enzyme sequences. The datasets used in these studies are multiple alignments of evolutionarily related protein sequences, identified through the use of BLAST sequence database searches. Firstly, a number of non-standard amino acid substitution matrices were used to re-score the benchmark multiple sequence alignments. A subset of these matrices were shown to improve the accuracy of specific function annotation, when compared to both the original BLAST sequence similarity ordering and a random sequence selection model. Following this, two established methods for the identification of functional specificity determining amino acid residues (fSDRs) were used to identify regions within the aligned sequences that are functionally and phylogenetically informative. These localised sequence regions were then used to re-score the aligned sequences and provide an assessment of their ability to improve the specific functional annotation of the benchmark sequence sets. Finally, a machine learning approach (support vector machines) was followed to evaluate the possibility of identifying fSDRs, which improve the annotation accuracy, directly from alignments of closely related protein sequences without prior knowledge of their specific functional sub-types. The performance of this SVM based method was then assessed by applying it to the automatic functional assignment of a number of well studied classes of enzymes." @default.
- W73004682 created "2016-06-24" @default.
- W73004682 creator A5050065279 @default.
- W73004682 date "2011-06-28" @default.
- W73004682 modified "2023-09-26" @default.
- W73004682 title "High specificity automatic function assignment for enzyme sequences" @default.
- W73004682 cites W146451012 @default.
- W73004682 cites W1498183065 @default.
- W73004682 cites W1513332069 @default.
- W73004682 cites W1530704185 @default.
- W73004682 cites W1561011671 @default.
- W73004682 cites W1576520375 @default.
- W73004682 cites W1592614241 @default.
- W73004682 cites W1604938182 @default.
- W73004682 cites W1964640688 @default.
- W73004682 cites W1965582988 @default.
- W73004682 cites W1969051510 @default.
- W73004682 cites W1969080446 @default.
- W73004682 cites W1969738839 @default.
- W73004682 cites W1970697508 @default.
- W73004682 cites W1971155547 @default.
- W73004682 cites W1971449260 @default.
- W73004682 cites W1975283401 @default.
- W73004682 cites W1975316279 @default.
- W73004682 cites W1975796937 @default.
- W73004682 cites W1976927254 @default.
- W73004682 cites W1990453950 @default.
- W73004682 cites W1993267991 @default.
- W73004682 cites W1993393468 @default.
- W73004682 cites W1995924392 @default.
- W73004682 cites W1996423252 @default.
- W73004682 cites W1996874226 @default.
- W73004682 cites W1999875603 @default.
- W73004682 cites W2003788979 @default.
- W73004682 cites W2005120335 @default.
- W73004682 cites W2009570821 @default.
- W73004682 cites W2010730039 @default.
- W73004682 cites W2015292449 @default.
- W73004682 cites W2017519756 @default.
- W73004682 cites W2027036222 @default.
- W73004682 cites W2030966943 @default.
- W73004682 cites W2032838501 @default.
- W73004682 cites W2035720976 @default.
- W73004682 cites W2043199972 @default.
- W73004682 cites W2043904638 @default.
- W73004682 cites W2044350743 @default.
- W73004682 cites W2050889354 @default.
- W73004682 cites W2055043387 @default.
- W73004682 cites W2059067075 @default.
- W73004682 cites W2060791849 @default.
- W73004682 cites W2062880866 @default.
- W73004682 cites W2065898587 @default.
- W73004682 cites W2066468520 @default.
- W73004682 cites W2069179738 @default.
- W73004682 cites W2074231493 @default.
- W73004682 cites W2081269341 @default.
- W73004682 cites W2083654996 @default.
- W73004682 cites W2085277871 @default.
- W73004682 cites W2090699441 @default.
- W73004682 cites W2092199526 @default.
- W73004682 cites W2098223336 @default.
- W73004682 cites W2099393019 @default.
- W73004682 cites W2100320834 @default.
- W73004682 cites W2101229087 @default.
- W73004682 cites W2103017472 @default.
- W73004682 cites W2103111028 @default.
- W73004682 cites W2103150692 @default.
- W73004682 cites W2105869636 @default.
- W73004682 cites W2106737886 @default.
- W73004682 cites W2106882534 @default.
- W73004682 cites W2107251251 @default.
- W73004682 cites W2108067237 @default.
- W73004682 cites W2108491719 @default.
- W73004682 cites W2109943925 @default.
- W73004682 cites W2111973517 @default.
- W73004682 cites W2112162865 @default.
- W73004682 cites W2114886480 @default.
- W73004682 cites W2115595474 @default.
- W73004682 cites W2116423958 @default.
- W73004682 cites W2116973522 @default.
- W73004682 cites W2119027485 @default.
- W73004682 cites W2119498937 @default.
- W73004682 cites W2120122292 @default.
- W73004682 cites W2120772351 @default.
- W73004682 cites W2123858481 @default.
- W73004682 cites W2124410686 @default.
- W73004682 cites W2124815061 @default.
- W73004682 cites W2124908487 @default.
- W73004682 cites W2124931735 @default.
- W73004682 cites W2125546020 @default.
- W73004682 cites W2126975650 @default.
- W73004682 cites W2127774996 @default.
- W73004682 cites W2130479394 @default.
- W73004682 cites W2132109794 @default.
- W73004682 cites W2132926880 @default.
- W73004682 cites W2133312664 @default.
- W73004682 cites W2133787379 @default.
- W73004682 cites W2134789671 @default.
- W73004682 cites W2135123683 @default.
- W73004682 cites W2136134567 @default.