Matches in SemOpenAlex for { <https://semopenalex.org/work/W2123186393> ?p ?o ?g. }
- W2123186393 endingPage "35" @default.
- W2123186393 startingPage "27" @default.
- W2123186393 abstract "Computational prediction of signal peptides (SPs) and their cleavage sites is of great importance in computational biology; however, there is currently no method capable of reliably predicting the SPs of archaea, owing to the limited number of experimentally verified proteins with SPs. We performed an extensive literature search to identify archaeal proteins with experimentally verified SPs and found 69 such proteins, the largest number ever reported. A detailed analysis of these sequences revealed some unique features of the SPs of archaea, such as the distinctive amino acid composition of the hydrophobic region, with a higher than expected occurrence of isoleucine, and a cleavage site that more closely resembles the sequences of gram-positives, with almost equal amounts of alanine and valine at position -3 before the cleavage site and a dominant alanine at position -1, followed in abundance by serine and glycine. Using these proteins as a training set, we trained a hidden Markov model method that predicts the presence of SPs and their cleavage sites and also discriminates such proteins from cytoplasmic and transmembrane ones. The method performs satisfactorily, yielding, in a 35-fold cross-validation procedure, a sensitivity of 100% and a specificity of 98.41%, with a Matthews correlation coefficient of 0.964. This method is currently the only one available for the prediction of secretory SPs in archaea, and it performs consistently and significantly better than other available predictors that were trained on sequences of eukaryotic or bacterial origin. Searching 48 completely sequenced archaeal genomes, we identified 9437 putative SPs. The method, PRED-SIGNAL, and the results are freely available for academic users at http://bioinformatics.biol.uoa.gr/PRED-SIGNAL/, and we anticipate that it will be a valuable tool for the computational analysis of archaeal genomes." @default.
- W2123186393 created "2016-06-24" @default.
- W2123186393 creator A5023208593 @default.
- W2123186393 creator A5035334094 @default.
- W2123186393 creator A5073654780 @default.
- W2123186393 creator A5077134817 @default.
- W2123186393 creator A5087868319 @default.
- W2123186393 date "2008-11-06" @default.
- W2123186393 modified "2023-09-27" @default.
- W2123186393 title "Prediction of signal peptides in archaea" @default.
- W2123186393 cites W1486425970 @default.
- W2123186393 cites W1495098782 @default.
- W2123186393 cites W1503396257 @default.
- W2123186393 cites W1507468227 @default.
- W2123186393 cites W1529347870 @default.
- W2123186393 cites W1529576101 @default.
- W2123186393 cites W1540000608 @default.
- W2123186393 cites W1543209415 @default.
- W2123186393 cites W1544679009 @default.
- W2123186393 cites W1549790155 @default.
- W2123186393 cites W1561608454 @default.
- W2123186393 cites W1602915164 @default.
- W2123186393 cites W1602952910 @default.
- W2123186393 cites W1626475480 @default.
- W2123186393 cites W1935158889 @default.
- W2123186393 cites W1939505221 @default.
- W2123186393 cites W1953597614 @default.
- W2123186393 cites W1961322092 @default.
- W2123186393 cites W1963849924 @default.
- W2123186393 cites W1968136381 @default.
- W2123186393 cites W1971147414 @default.
- W2123186393 cites W1975224955 @default.
- W2123186393 cites W1981985298 @default.
- W2123186393 cites W1985092735 @default.
- W2123186393 cites W1986522582 @default.
- W2123186393 cites W1988284480 @default.
- W2123186393 cites W1995759416 @default.
- W2123186393 cites W1998861389 @default.
- W2123186393 cites W1999449468 @default.
- W2123186393 cites W2001586974 @default.
- W2123186393 cites W2003538307 @default.
- W2123186393 cites W2007315854 @default.
- W2123186393 cites W2008683693 @default.
- W2123186393 cites W2009160571 @default.
- W2123186393 cites W2013595954 @default.
- W2123186393 cites W2014199958 @default.
- W2123186393 cites W2016093063 @default.
- W2123186393 cites W2022086113 @default.
- W2123186393 cites W2030207380 @default.
- W2123186393 cites W2035663056 @default.
- W2123186393 cites W2038605297 @default.
- W2123186393 cites W2041877620 @default.
- W2123186393 cites W2043865115 @default.
- W2123186393 cites W2045293454 @default.
- W2123186393 cites W2047041570 @default.
- W2123186393 cites W2051610226 @default.
- W2123186393 cites W2056162167 @default.
- W2123186393 cites W2061320085 @default.
- W2123186393 cites W2061534050 @default.
- W2123186393 cites W2062195385 @default.
- W2123186393 cites W2070581754 @default.
- W2123186393 cites W2070585770 @default.
- W2123186393 cites W2070985854 @default.
- W2123186393 cites W2088922360 @default.
- W2123186393 cites W2094141112 @default.
- W2123186393 cites W2097388832 @default.
- W2123186393 cites W2103188931 @default.
- W2123186393 cites W2105734162 @default.
- W2123186393 cites W2105986552 @default.
- W2123186393 cites W2107432340 @default.
- W2123186393 cites W2107943123 @default.
- W2123186393 cites W2110612352 @default.
- W2123186393 cites W2113150793 @default.
- W2123186393 cites W2119261344 @default.
- W2123186393 cites W2119452210 @default.
- W2123186393 cites W2124088653 @default.
- W2123186393 cites W2126549518 @default.
- W2123186393 cites W2126650464 @default.
- W2123186393 cites W2129336587 @default.
- W2123186393 cites W2130393314 @default.
- W2123186393 cites W2131778683 @default.
- W2123186393 cites W2134229777 @default.
- W2123186393 cites W2134759004 @default.
- W2123186393 cites W2138879032 @default.
- W2123186393 cites W2139026836 @default.
- W2123186393 cites W2141195253 @default.
- W2123186393 cites W2143355106 @default.
- W2123186393 cites W2148945900 @default.
- W2123186393 cites W2149649966 @default.
- W2123186393 cites W2152770371 @default.
- W2123186393 cites W2153536578 @default.
- W2123186393 cites W2153671224 @default.
- W2123186393 cites W2156099263 @default.
- W2123186393 cites W2156444956 @default.
- W2123186393 cites W2156657683 @default.
- W2123186393 cites W2158266834 @default.
- W2123186393 cites W2158623906 @default.
- W2123186393 cites W2161732286 @default.