Matches in SemOpenAlex for { <https://semopenalex.org/work/W2023232162> ?p ?o ?g. }
- W2023232162 endingPage "133" @default.
- W2023232162 startingPage "120" @default.
- W2023232162 abstract "Identifying the subcellular localization of proteins is particularly helpful in the functional annotation of gene products. In this study, we use Machine Learning and Exploratory Data Analysis (EDA) techniques to examine and characterize amino acid sequences of human proteins localized in nine cellular compartments. A dataset of 3,749 protein sequences representing human proteins was extracted from the SWISS-PROT database. Feature vectors were created to capture specific amino acid sequence characteristics. Relative to a Support Vector Machine, a Multi-layer Perceptron, and a Naïve Bayes classifier, the C4.5 Decision Tree algorithm was the most consistent performer across all nine compartments in reliably predicting the subcellular localization of proteins based on their amino acid sequences (average Precision=0.88; average Sensitivity=0.86). Furthermore, EDA graphics characterized essential features of proteins in each compartment. As examples, proteins localized to the plasma membrane had higher proportions of hydrophobic amino acids; cytoplasmic proteins had higher proportions of neutral amino acids; and mitochondrial proteins had higher proportions of neutral amino acids and lower proportions of polar amino acids. These data showed that the C4.5 classifier and EDA tools can be effective for characterizing and predicting the subcellular localization of human proteins based on their amino acid sequences." @default.
- W2023232162 created "2016-06-24" @default.
- W2023232162 creator A5020172912 @default.
- W2023232162 creator A5041931115 @default.
- W2023232162 creator A5050528341 @default.
- W2023232162 date "2006-01-01" @default.
- W2023232162 modified "2023-09-27" @default.
- W2023232162 title "Predicting the Subcellular Localization of Human Proteins Using Machine Learning and Exploratory Data Analysis" @default.
- W2023232162 cites W1816488618 @default.
- W2023232162 cites W1979228662 @default.
- W2023232162 cites W2006552405 @default.
- W2023232162 cites W2007360228 @default.
- W2023232162 cites W2012481064 @default.
- W2023232162 cites W2025131366 @default.
- W2023232162 cites W2029987169 @default.
- W2023232162 cites W2040630248 @default.
- W2023232162 cites W2042084565 @default.
- W2023232162 cites W2044573154 @default.
- W2023232162 cites W2076469978 @default.
- W2023232162 cites W2077307563 @default.
- W2023232162 cites W2078914183 @default.
- W2023232162 cites W2094027833 @default.
- W2023232162 cites W2096049852 @default.
- W2023232162 cites W2105270502 @default.
- W2023232162 cites W2109109045 @default.
- W2023232162 cites W2115998090 @default.
- W2023232162 cites W2121869529 @default.
- W2023232162 cites W2143197682 @default.
- W2023232162 cites W2152458080 @default.
- W2023232162 cites W2158120166 @default.
- W2023232162 cites W2160979370 @default.
- W2023232162 cites W2164703259 @default.
- W2023232162 cites W2169805130 @default.
- W2023232162 cites W2954364924 @default.
- W2023232162 doi "https://doi.org/10.1016/s1672-0229(06)60023-5" @default.
- W2023232162 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/2709537" @default.
- W2023232162 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/16970551" @default.
- W2023232162 hasPublicationYear "2006" @default.
- W2023232162 type Work @default.
- W2023232162 sameAs 2023232162 @default.
- W2023232162 citedByCount "9" @default.
- W2023232162 countsByYear W20232321622014 @default.
- W2023232162 countsByYear W20232321622017 @default.
- W2023232162 countsByYear W20232321622019 @default.
- W2023232162 countsByYear W20232321622020 @default.
- W2023232162 crossrefType "journal-article" @default.
- W2023232162 hasAuthorship W2023232162A5020172912 @default.
- W2023232162 hasAuthorship W2023232162A5041931115 @default.
- W2023232162 hasAuthorship W2023232162A5050528341 @default.
- W2023232162 hasBestOaLocation W20232321621 @default.
- W2023232162 hasConcept C104317684 @default.
- W2023232162 hasConcept C12267149 @default.
- W2023232162 hasConcept C140051345 @default.
- W2023232162 hasConcept C154945302 @default.
- W2023232162 hasConcept C190062978 @default.
- W2023232162 hasConcept C2776879804 @default.
- W2023232162 hasConcept C2780362125 @default.
- W2023232162 hasConcept C41008148 @default.
- W2023232162 hasConcept C515207424 @default.
- W2023232162 hasConcept C52001869 @default.
- W2023232162 hasConcept C55493867 @default.
- W2023232162 hasConcept C70721500 @default.
- W2023232162 hasConcept C86803240 @default.
- W2023232162 hasConcept C95623464 @default.
- W2023232162 hasConceptScore W2023232162C104317684 @default.
- W2023232162 hasConceptScore W2023232162C12267149 @default.
- W2023232162 hasConceptScore W2023232162C140051345 @default.
- W2023232162 hasConceptScore W2023232162C154945302 @default.
- W2023232162 hasConceptScore W2023232162C190062978 @default.
- W2023232162 hasConceptScore W2023232162C2776879804 @default.
- W2023232162 hasConceptScore W2023232162C2780362125 @default.
- W2023232162 hasConceptScore W2023232162C41008148 @default.
- W2023232162 hasConceptScore W2023232162C515207424 @default.
- W2023232162 hasConceptScore W2023232162C52001869 @default.
- W2023232162 hasConceptScore W2023232162C55493867 @default.
- W2023232162 hasConceptScore W2023232162C70721500 @default.
- W2023232162 hasConceptScore W2023232162C86803240 @default.
- W2023232162 hasConceptScore W2023232162C95623464 @default.
- W2023232162 hasIssue "2" @default.
- W2023232162 hasLocation W20232321621 @default.
- W2023232162 hasLocation W20232321622 @default.
- W2023232162 hasLocation W20232321623 @default.
- W2023232162 hasLocation W20232321624 @default.
- W2023232162 hasOpenAccess W2023232162 @default.
- W2023232162 hasPrimaryLocation W20232321621 @default.
- W2023232162 hasRelatedWork W1989603884 @default.
- W2023232162 hasRelatedWork W2042112205 @default.
- W2023232162 hasRelatedWork W2076925894 @default.
- W2023232162 hasRelatedWork W2077919959 @default.
- W2023232162 hasRelatedWork W2117701146 @default.
- W2023232162 hasRelatedWork W2127103767 @default.
- W2023232162 hasRelatedWork W2384565783 @default.
- W2023232162 hasRelatedWork W2620675477 @default.
- W2023232162 hasRelatedWork W3035434756 @default.
- W2023232162 hasRelatedWork W4230059417 @default.
- W2023232162 hasVolume "4" @default.
- W2023232162 isParatext "false" @default.
- W2023232162 isRetracted "false" @default.
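
Each entry above follows the pattern `- subject predicate object @graph.`, mirroring the `?p ?o ?g` variables in the header line. As a minimal sketch, assuming the listing is available as plain text lines exactly as shown here, the entries can be parsed into tuples and grouped by predicate. The names `Quad`, `parse_line`, and `group_by_predicate` are hypothetical helpers introduced for illustration; they are not part of SemOpenAlex or any library.

```python
import re
from collections import defaultdict
from typing import Dict, Iterable, List, NamedTuple, Optional


class Quad(NamedTuple):
    """One listing entry: subject, predicate, object, and named graph."""
    subject: str
    predicate: str
    obj: str
    graph: str


# Assumed line shape (taken from the listing above):
#   - SUBJECT PREDICATE OBJECT @GRAPH.
# The object may contain spaces when it is a quoted literal.
LINE_RE = re.compile(r'^-\s+(\S+)\s+(\S+)\s+(.*)\s+@(\S+?)\.$')


def parse_line(line: str) -> Optional[Quad]:
    """Parse one listing line; return None for non-entry lines (e.g. the header)."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    subject, predicate, obj, graph = match.groups()
    # Literal objects are wrapped in double quotes; drop the wrapper only.
    if len(obj) >= 2 and obj[0] == '"' and obj[-1] == '"':
        obj = obj[1:-1]
    return Quad(subject, predicate, obj, graph)


def group_by_predicate(lines: Iterable[str]) -> Dict[str, List[str]]:
    """Collect object values per predicate, preserving listing order."""
    grouped: Dict[str, List[str]] = defaultdict(list)
    for line in lines:
        quad = parse_line(line)
        if quad is not None:
            grouped[quad.predicate].append(quad.obj)
    return dict(grouped)


if __name__ == "__main__":
    sample = [
        'Matches in SemOpenAlex for { <https://semopenalex.org/work/W2023232162> ?p ?o ?g. }',
        '- W2023232162 endingPage "133" @default.',
        '- W2023232162 startingPage "120" @default.',
        '- W2023232162 cites W1816488618 @default.',
        '- W2023232162 cites W1979228662 @default.',
    ]
    for predicate, values in group_by_predicate(sample).items():
        print(predicate, values)
```

Grouping by predicate makes multi-valued properties such as `cites`, `hasConcept`, and `hasRelatedWork` easy to count or iterate over, while single-valued ones such as `title` or `doi` end up as one-element lists.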