Matches in SemOpenAlex for { <https://semopenalex.org/work/W2109084433> ?p ?o ?g. }
- W2109084433 abstract "This thesis describes the development and application of machine learning-basedmethods for the prediction of alpha-helical transmembrane proteinstructure from sequence alone. It is divided into six chapters.Chapter 1 provides an introduction to membrane structure and dynamics,membrane protein classes and families, and membrane protein structure prediction.Chapter 2 describes a topological study of the transmembrane proteinCLN3 using a consensus of bioinformatic approaches constrained by experimentaldata. Mutations in CLN3 can cause juvenile neuronal ceroidlipofuscinosis, or Batten disease, an inherited neurodegenerative lysosomalstorage disease affecting children, therefore such studies are importantfor directing further experimental work into this incurable illness.Chapter 3 explores the possibility of using biologically meaningful signaturesdescribed as regular expressions to influence the assignment of insideand outside loop locations during transmembrane topology prediction. Usingthis approach, it was possilbe to modify a recent topology prediction methodleading to an improvement of 6% prediction accuracy using a standard data set.Chapter 4 describes the development of a novel support vector machine-basedtopology predictor that integrates both signal peptide and re-entrant helix prediction,benchmarked with full cross-validation on a novel data set of sequences withknown crystal structures. The method achieves state-of-the-art performance in predictingtopology and discriminating between globular and transmembrane proteins.We also present the results of applying these tools to a number of complete genomes.Chapter 5 describes a novel approach to predict lipid exposure, residuecontacts, helix-helix interactions and finally the optimal helical packing arrangement of transmembrane proteins. It is based on two support vectormachine classifiers that predict per residue lipid exposure and residue contacts,which are used to determine helix-helix interaction with up to 65%accuracy. The method is also able to discriminate native from decoy helicalpacking arrangements with up to 70% accuracy. Finally, a force-directedalgorithm is employed to construct the optimal helical packing arrangementwhich demonstrates success for proteins containing up to 13 transmembrane helices.The final chapter summarises the major contributions of this thesis to biology,before future perspectives for TM protein structure prediction are discussed." @default.
- W2109084433 created "2016-06-24" @default.
- W2109084433 creator A5002637452 @default.
- W2109084433 date "2010-12-28" @default.
- W2109084433 modified "2023-09-26" @default.
- W2109084433 title "Transmembrane protein structure prediction using machine learning" @default.
- W2109084433 cites W1030488403 @default.
- W2109084433 cites W105780082 @default.
- W2109084433 cites W11515763 @default.
- W2109084433 cites W1488986178 @default.
- W2109084433 cites W1526754730 @default.
- W2109084433 cites W1548178570 @default.
- W2109084433 cites W1563088657 @default.
- W2109084433 cites W1567609013 @default.
- W2109084433 cites W1576520375 @default.
- W2109084433 cites W1578815762 @default.
- W2109084433 cites W1603609979 @default.
- W2109084433 cites W1603859820 @default.
- W2109084433 cites W1604938182 @default.
- W2109084433 cites W1605593725 @default.
- W2109084433 cites W1663973292 @default.
- W2109084433 cites W1869944834 @default.
- W2109084433 cites W1939505221 @default.
- W2109084433 cites W1952814327 @default.
- W2109084433 cites W1966309781 @default.
- W2109084433 cites W1966876001 @default.
- W2109084433 cites W1970605146 @default.
- W2109084433 cites W1971147414 @default.
- W2109084433 cites W1971751969 @default.
- W2109084433 cites W1974682887 @default.
- W2109084433 cites W1975304761 @default.
- W2109084433 cites W1978140423 @default.
- W2109084433 cites W1978354625 @default.
- W2109084433 cites W1978650123 @default.
- W2109084433 cites W1978763597 @default.
- W2109084433 cites W1985015125 @default.
- W2109084433 cites W1988284480 @default.
- W2109084433 cites W1989934196 @default.
- W2109084433 cites W1991655565 @default.
- W2109084433 cites W1994392652 @default.
- W2109084433 cites W1996049812 @default.
- W2109084433 cites W1996518688 @default.
- W2109084433 cites W1997640953 @default.
- W2109084433 cites W1997886400 @default.
- W2109084433 cites W1999455032 @default.
- W2109084433 cites W2002832354 @default.
- W2109084433 cites W2005393972 @default.
- W2109084433 cites W2006977540 @default.
- W2109084433 cites W2009073161 @default.
- W2109084433 cites W2012516949 @default.
- W2109084433 cites W2012653714 @default.
- W2109084433 cites W2015403140 @default.
- W2109084433 cites W2015761041 @default.
- W2109084433 cites W2016343967 @default.
- W2109084433 cites W2016441295 @default.
- W2109084433 cites W2019187626 @default.
- W2109084433 cites W2020652394 @default.
- W2109084433 cites W2023763252 @default.
- W2109084433 cites W2026593599 @default.
- W2109084433 cites W2027991357 @default.
- W2109084433 cites W2028131589 @default.
- W2109084433 cites W2031368570 @default.
- W2109084433 cites W2031644342 @default.
- W2109084433 cites W2032433700 @default.
- W2109084433 cites W2033323125 @default.
- W2109084433 cites W2033354942 @default.
- W2109084433 cites W2033948133 @default.
- W2109084433 cites W2034177352 @default.
- W2109084433 cites W2034410027 @default.
- W2109084433 cites W2034726494 @default.
- W2109084433 cites W2036123216 @default.
- W2109084433 cites W2037661134 @default.
- W2109084433 cites W2038827150 @default.
- W2109084433 cites W2039330662 @default.
- W2109084433 cites W2042096080 @default.
- W2109084433 cites W2042182494 @default.
- W2109084433 cites W2042521088 @default.
- W2109084433 cites W2042761559 @default.
- W2109084433 cites W2046267877 @default.
- W2109084433 cites W2048951759 @default.
- W2109084433 cites W2049330624 @default.
- W2109084433 cites W2051831487 @default.
- W2109084433 cites W2052060433 @default.
- W2109084433 cites W2052935924 @default.
- W2109084433 cites W2055136196 @default.
- W2109084433 cites W2056402029 @default.
- W2109084433 cites W2056825408 @default.
- W2109084433 cites W2058658881 @default.
- W2109084433 cites W2059382484 @default.
- W2109084433 cites W2059571424 @default.
- W2109084433 cites W2059784037 @default.
- W2109084433 cites W2060421713 @default.
- W2109084433 cites W2062355909 @default.
- W2109084433 cites W2062709131 @default.
- W2109084433 cites W2064728365 @default.
- W2109084433 cites W2065335146 @default.
- W2109084433 cites W2065876439 @default.
- W2109084433 cites W2066600235 @default.
- W2109084433 cites W2067148494 @default.
- W2109084433 cites W2067204369 @default.