Matches in SemOpenAlex for { <https://semopenalex.org/work/W2003204068> ?p ?o ?g. }
- W2003204068 abstract "Ubiquitylation plays an important role in regulating protein functions. Recently, experimental methods were developed toward effective identification of ubiquitylation sites. To efficiently explore more undiscovered ubiquitylation sites, this study aims to develop an accurate sequence-based prediction method to identify promising ubiquitylation sites. We established a ubiquitylation dataset consisting of 157 ubiquitylation sites and 3676 putative non-ubiquitylation sites extracted from 105 proteins in the UbiProt database. This study first evaluates promising sequence-based features and classifiers for the prediction of ubiquitylation sites by assessing three kinds of features (amino acid identity, evolutionary information, and physicochemical property) and three classifiers (support vector machine, k-nearest neighbor, and Naïve Bayes). Results show that the set of 531 physicochemical properties and the support vector machine (SVM) are the best feature type and classifier, respectively; their combination achieves a prediction accuracy of 72.19% under leave-one-out cross-validation. Consequently, an informative physicochemical property mining algorithm (IPMA) is proposed to select an informative subset of the 531 physicochemical properties. A prediction system, UbiPred, was implemented using an SVM with the feature set of 31 informative physicochemical properties selected by IPMA, which improves the accuracy from 72.19% to 84.44%. To further analyze the informative physicochemical properties, the decision tree method C5.0 was used to acquire if-then rule-based knowledge for predicting ubiquitylation sites. UbiPred can screen promising ubiquitylation sites from putative non-ubiquitylation sites using prediction scores. By applying UbiPred, 23 promising ubiquitylation sites were identified from an independent dataset of 3424 putative non-ubiquitylation sites; these sites were also validated using the obtained prediction rules.
We have proposed an algorithm, IPMA, for mining informative physicochemical properties from protein sequences to build an SVM-based prediction system, UbiPred. UbiPred predicts ubiquitylation sites, each accompanied by a prediction score, to help biologists identify promising sites for experimental verification. UbiPred has been implemented as a web server and is available at http://iclab.life.nctu.edu.tw/ubipred." @default.
- W2003204068 created "2016-06-24" @default.
- W2003204068 creator A5007171835 @default.
- W2003204068 creator A5066866717 @default.
- W2003204068 date "2008-07-15" @default.
- W2003204068 modified "2023-10-16" @default.
- W2003204068 title "Computational identification of ubiquitylation sites from protein sequences" @default.
- W2003204068 cites W1538162617 @default.
- W2003204068 cites W1974471014 @default.
- W2003204068 cites W1990018583 @default.
- W2003204068 cites W1991634033 @default.
- W2003204068 cites W1996819669 @default.
- W2003204068 cites W2003154891 @default.
- W2003204068 cites W2007805002 @default.
- W2003204068 cites W2020194549 @default.
- W2003204068 cites W2026575016 @default.
- W2003204068 cites W2029093937 @default.
- W2003204068 cites W2029820720 @default.
- W2003204068 cites W2031084895 @default.
- W2003204068 cites W2031300832 @default.
- W2003204068 cites W2043338013 @default.
- W2003204068 cites W2073454559 @default.
- W2003204068 cites W2076925894 @default.
- W2003204068 cites W2080553628 @default.
- W2003204068 cites W2095900655 @default.
- W2003204068 cites W2101466114 @default.
- W2003204068 cites W2125983734 @default.
- W2003204068 cites W2126389772 @default.
- W2003204068 cites W2140628140 @default.
- W2003204068 cites W2156125289 @default.
- W2003204068 cites W2157759203 @default.
- W2003204068 cites W2158266834 @default.
- W2003204068 cites W2158714788 @default.
- W2003204068 cites W2164492849 @default.
- W2003204068 cites W2167485812 @default.
- W2003204068 doi "https://doi.org/10.1186/1471-2105-9-310" @default.
- W2003204068 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/2488362" @default.
- W2003204068 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/18625080" @default.
- W2003204068 hasPublicationYear "2008" @default.
- W2003204068 type Work @default.
- W2003204068 sameAs 2003204068 @default.
- W2003204068 citedByCount "161" @default.
- W2003204068 countsByYear W20032040682012 @default.
- W2003204068 countsByYear W20032040682013 @default.
- W2003204068 countsByYear W20032040682014 @default.
- W2003204068 countsByYear W20032040682015 @default.
- W2003204068 countsByYear W20032040682016 @default.
- W2003204068 countsByYear W20032040682017 @default.
- W2003204068 countsByYear W20032040682018 @default.
- W2003204068 countsByYear W20032040682019 @default.
- W2003204068 countsByYear W20032040682020 @default.
- W2003204068 countsByYear W20032040682021 @default.
- W2003204068 countsByYear W20032040682022 @default.
- W2003204068 countsByYear W20032040682023 @default.
- W2003204068 crossrefType "journal-article" @default.
- W2003204068 hasAuthorship W2003204068A5007171835 @default.
- W2003204068 hasAuthorship W2003204068A5066866717 @default.
- W2003204068 hasBestOaLocation W20032040681 @default.
- W2003204068 hasConcept C10010492 @default.
- W2003204068 hasConcept C104317684 @default.
- W2003204068 hasConcept C116834253 @default.
- W2003204068 hasConcept C119857082 @default.
- W2003204068 hasConcept C12267149 @default.
- W2003204068 hasConcept C124101348 @default.
- W2003204068 hasConcept C153180895 @default.
- W2003204068 hasConcept C154945302 @default.
- W2003204068 hasConcept C167625842 @default.
- W2003204068 hasConcept C177264268 @default.
- W2003204068 hasConcept C199360897 @default.
- W2003204068 hasConcept C25602115 @default.
- W2003204068 hasConcept C41008148 @default.
- W2003204068 hasConcept C55493867 @default.
- W2003204068 hasConcept C59822182 @default.
- W2003204068 hasConcept C60644358 @default.
- W2003204068 hasConcept C70721500 @default.
- W2003204068 hasConcept C86803240 @default.
- W2003204068 hasConcept C95623464 @default.
- W2003204068 hasConceptScore W2003204068C10010492 @default.
- W2003204068 hasConceptScore W2003204068C104317684 @default.
- W2003204068 hasConceptScore W2003204068C116834253 @default.
- W2003204068 hasConceptScore W2003204068C119857082 @default.
- W2003204068 hasConceptScore W2003204068C12267149 @default.
- W2003204068 hasConceptScore W2003204068C124101348 @default.
- W2003204068 hasConceptScore W2003204068C153180895 @default.
- W2003204068 hasConceptScore W2003204068C154945302 @default.
- W2003204068 hasConceptScore W2003204068C167625842 @default.
- W2003204068 hasConceptScore W2003204068C177264268 @default.
- W2003204068 hasConceptScore W2003204068C199360897 @default.
- W2003204068 hasConceptScore W2003204068C25602115 @default.
- W2003204068 hasConceptScore W2003204068C41008148 @default.
- W2003204068 hasConceptScore W2003204068C55493867 @default.
- W2003204068 hasConceptScore W2003204068C59822182 @default.
- W2003204068 hasConceptScore W2003204068C60644358 @default.
- W2003204068 hasConceptScore W2003204068C70721500 @default.
- W2003204068 hasConceptScore W2003204068C86803240 @default.
- W2003204068 hasConceptScore W2003204068C95623464 @default.
- W2003204068 hasIssue "1" @default.
- W2003204068 hasLocation W20032040681 @default.
- W2003204068 hasLocation W20032040682 @default.
- W2003204068 hasLocation W20032040683 @default.
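
The listing above follows a regular shape: a leading "- ", then subject, predicate, object, and a graph name introduced by "@" and closed by a period. As a minimal sketch of how such lines could be turned into tuples for further processing, the Python below parses that shape with a regular expression. The function name `parse_listing_line` and the pattern are illustrative assumptions, not part of SemOpenAlex or any library, and the sketch assumes each statement sits on a single line, so a multi-line literal such as the abstract would need to be joined first.

```python
import re

# Assumed line shape (taken from the listing above, not from any official spec):
#   - SUBJECT PREDICATE OBJECT @GRAPH.
LINE_RE = re.compile(r'^- (\S+) (\S+) (.+) @(\S+?)\.$')


def parse_listing_line(line):
    """Split one listing line into (subject, predicate, object, graph).

    Returns None for lines that do not match the assumed shape,
    such as the "Matches in SemOpenAlex for ..." header.
    """
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    subject, predicate, obj, graph = match.groups()
    # Quoted literals lose their surrounding quotes; bare tokens stay as identifiers.
    if len(obj) >= 2 and obj[0] == '"' and obj[-1] == '"':
        obj = obj[1:-1]
    return subject, predicate, obj, graph


# A few lines copied from the listing above.
sample = [
    '- W2003204068 date "2008-07-15" @default.',
    '- W2003204068 cites W1538162617 @default.',
    '- W2003204068 citedByCount "161" @default.',
]

rows = [parse_listing_line(s) for s in sample]

# Collect the objects of every "cites" statement.
cited = [obj for (_, predicate, obj, _) in rows if predicate == "cites"]
```

Filtering the parsed tuples by predicate, as in the last line, is one way to pull out the cited works or the concept identifiers without handling each line by hand.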