Matches in SemOpenAlex for { <https://semopenalex.org/work/W2074406564> ?p ?o ?g. }
- W2074406564 endingPage "e85139" @default.
- W2074406564 startingPage "e85139" @default.
- W2074406564 abstract "Structured Logistic Regression (SLR) is a newly developed machine learning tool first proposed in the context of text categorization. Current availability of extensive protein sequence databases calls for an automated method to reliably classify sequences and SLR seems well-suited for this task. The classification of P-type ATPases, a large family of ATP-driven membrane pumps transporting essential cations, was selected as a test-case that would generate important biological information as well as provide a proof-of-concept for the application of SLR to a large scale bioinformatics problem.Using SLR, we have built classifiers to identify and automatically categorize P-type ATPases into one of 11 pre-defined classes. The SLR-classifiers are compared to a Hidden Markov Model approach and shown to be highly accurate and scalable. Representing the bulk of currently known sequences, we analysed 9.3 million sequences in the UniProtKB and attempted to classify a large number of P-type ATPases. To examine the distribution of pumps on organisms, we also applied SLR to 1,123 complete genomes from the Entrez genome database. Finally, we analysed the predicted membrane topology of the identified P-type ATPases.Using the SLR-based classification tool we are able to run a large scale study of P-type ATPases. This study provides proof-of-concept for the application of SLR to a bioinformatics problem and the analysis of P-type ATPases pinpoints new and interesting targets for further biochemical characterization and structural analysis." @default.
- W2074406564 created "2016-06-24" @default.
- W2074406564 creator A5001251924 @default.
- W2074406564 creator A5024719774 @default.
- W2074406564 creator A5036245919 @default.
- W2074406564 creator A5054845773 @default.
- W2074406564 creator A5057382760 @default.
- W2074406564 creator A5065384282 @default.
- W2074406564 creator A5077863600 @default.
- W2074406564 creator A5086046361 @default.
- W2074406564 date "2014-01-20" @default.
- W2074406564 modified "2023-10-12" @default.
- W2074406564 title "Large Scale Identification and Categorization of Protein Sequences Using Structured Logistic Regression" @default.
- W2074406564 cites W115160791 @default.
- W2074406564 cites W1414945947 @default.
- W2074406564 cites W148367990 @default.
- W2074406564 cites W1542816909 @default.
- W2074406564 cites W1857358525 @default.
- W2074406564 cites W1968740095 @default.
- W2074406564 cites W1971147414 @default.
- W2074406564 cites W1976145314 @default.
- W2074406564 cites W1981519049 @default.
- W2074406564 cites W1991548431 @default.
- W2074406564 cites W2000428723 @default.
- W2074406564 cites W2006035084 @default.
- W2074406564 cites W2008719391 @default.
- W2074406564 cites W2019958839 @default.
- W2074406564 cites W2038119705 @default.
- W2074406564 cites W2040356073 @default.
- W2074406564 cites W2042028629 @default.
- W2074406564 cites W2043551256 @default.
- W2074406564 cites W2048146794 @default.
- W2074406564 cites W2058658881 @default.
- W2074406564 cites W2059496396 @default.
- W2074406564 cites W2067999317 @default.
- W2074406564 cites W2068633356 @default.
- W2074406564 cites W2078151980 @default.
- W2074406564 cites W2079357395 @default.
- W2074406564 cites W2087726424 @default.
- W2074406564 cites W2088782754 @default.
- W2074406564 cites W2111802622 @default.
- W2074406564 cites W2118488195 @default.
- W2074406564 cites W2119362018 @default.
- W2074406564 cites W2122851256 @default.
- W2074406564 cites W2124549482 @default.
- W2074406564 cites W2125121305 @default.
- W2074406564 cites W2132926880 @default.
- W2074406564 cites W2135022976 @default.
- W2074406564 cites W2153331570 @default.
- W2074406564 cites W2157451936 @default.
- W2074406564 cites W2158714788 @default.
- W2074406564 cites W2165555639 @default.
- W2074406564 cites W367800729 @default.
- W2074406564 cites W4210623056 @default.
- W2074406564 cites W4211208250 @default.
- W2074406564 cites W4236711181 @default.
- W2074406564 doi "https://doi.org/10.1371/journal.pone.0085139" @default.
- W2074406564 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/3896382" @default.
- W2074406564 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/24465495" @default.
- W2074406564 hasPublicationYear "2014" @default.
- W2074406564 type Work @default.
- W2074406564 sameAs 2074406564 @default.
- W2074406564 citedByCount "12" @default.
- W2074406564 countsByYear W20744065642015 @default.
- W2074406564 countsByYear W20744065642016 @default.
- W2074406564 countsByYear W20744065642017 @default.
- W2074406564 countsByYear W20744065642018 @default.
- W2074406564 countsByYear W20744065642019 @default.
- W2074406564 countsByYear W20744065642021 @default.
- W2074406564 countsByYear W20744065642023 @default.
- W2074406564 crossrefType "journal-article" @default.
- W2074406564 hasAuthorship W2074406564A5001251924 @default.
- W2074406564 hasAuthorship W2074406564A5024719774 @default.
- W2074406564 hasAuthorship W2074406564A5036245919 @default.
- W2074406564 hasAuthorship W2074406564A5054845773 @default.
- W2074406564 hasAuthorship W2074406564A5057382760 @default.
- W2074406564 hasAuthorship W2074406564A5065384282 @default.
- W2074406564 hasAuthorship W2074406564A5077863600 @default.
- W2074406564 hasAuthorship W2074406564A5086046361 @default.
- W2074406564 hasBestOaLocation W20744065641 @default.
- W2074406564 hasConcept C104317684 @default.
- W2074406564 hasConcept C116834253 @default.
- W2074406564 hasConcept C119857082 @default.
- W2074406564 hasConcept C124101348 @default.
- W2074406564 hasConcept C151730666 @default.
- W2074406564 hasConcept C154945302 @default.
- W2074406564 hasConcept C202264299 @default.
- W2074406564 hasConcept C2779343474 @default.
- W2074406564 hasConcept C41008148 @default.
- W2074406564 hasConcept C48044578 @default.
- W2074406564 hasConcept C54355233 @default.
- W2074406564 hasConcept C59822182 @default.
- W2074406564 hasConcept C60644358 @default.
- W2074406564 hasConcept C70721500 @default.
- W2074406564 hasConcept C77088390 @default.
- W2074406564 hasConcept C86803240 @default.
- W2074406564 hasConcept C94124525 @default.
- W2074406564 hasConceptScore W2074406564C104317684 @default.