Matches in SemOpenAlex for { <https://semopenalex.org/work/W2891585877> ?p ?o ?g. }
- W2891585877 endingPage "1124" @default.
- W2891585877 startingPage "1116" @default.
- W2891585877 abstract "Motivation: Most automatic functional annotation methods assign Gene Ontology (GO) terms to proteins based on annotations of highly similar proteins. We advocate that proteins that are less similar are still informative. Also, despite their simplicity and structure, GO terms seem to be hard for computers to learn, in particular the Biological Process ontology, which has the most terms (>29 000). We propose to use Label-Space Dimensionality Reduction (LSDR) techniques to exploit the redundancy of GO terms and transform them into a more compact latent representation that is easier to predict. Results: We compare proteins using a sequence similarity profile (SSP) to a set of annotated training proteins. We introduce two new LSDR methods, one based on the structure of the GO, and one based on semantic similarity of terms. We show that these LSDR methods, as well as three existing ones, improve the Critical Assessment of Functional Annotation performance of several function prediction algorithms. Cross-validation experiments on Arabidopsis thaliana proteins pinpoint the superiority of our GO-aware LSDR over generic LSDR. Our experiments on A. thaliana proteins show that the SSP representation in combination with a kNN classifier outperforms state-of-the-art and baseline methods in terms of cross-validated F-measure. Availability and implementation: Source code for the experiments is available at https://github.com/stamakro/SSP-LSDR. Supplementary information: Supplementary data are available at Bioinformatics online." @default.
- W2891585877 created "2018-09-27" @default.
- W2891585877 creator A5038211775 @default.
- W2891585877 creator A5050761197 @default.
- W2891585877 creator A5052699801 @default.
- W2891585877 date "2018-08-29" @default.
- W2891585877 modified "2023-10-11" @default.
- W2891585877 title "Improving protein function prediction using protein sequence and GO-term similarities" @default.
- W2891585877 cites W1144107824 @default.
- W2891585877 cites W1663797894 @default.
- W2891585877 cites W1705245392 @default.
- W2891585877 cites W1967542092 @default.
- W2891585877 cites W1973163442 @default.
- W2891585877 cites W1973714307 @default.
- W2891585877 cites W2027869746 @default.
- W2891585877 cites W2035832348 @default.
- W2891585877 cites W2045156863 @default.
- W2891585877 cites W2047606947 @default.
- W2891585877 cites W2054118996 @default.
- W2891585877 cites W2060148724 @default.
- W2891585877 cites W2063291945 @default.
- W2891585877 cites W2063345857 @default.
- W2891585877 cites W2084168100 @default.
- W2891585877 cites W2094124219 @default.
- W2891585877 cites W2098432760 @default.
- W2891585877 cites W2102814386 @default.
- W2891585877 cites W2103017472 @default.
- W2891585877 cites W2108256858 @default.
- W2891585877 cites W2117486996 @default.
- W2891585877 cites W2120930809 @default.
- W2891585877 cites W2124735751 @default.
- W2891585877 cites W2125282516 @default.
- W2891585877 cites W2140735973 @default.
- W2891585877 cites W2150563191 @default.
- W2891585877 cites W2161955763 @default.
- W2891585877 cites W2162813650 @default.
- W2891585877 cites W2168118433 @default.
- W2891585877 cites W2168662600 @default.
- W2891585877 cites W2227395312 @default.
- W2891585877 cites W2520368209 @default.
- W2891585877 cites W2562531153 @default.
- W2891585877 cites W2565684071 @default.
- W2891585877 cites W2612647668 @default.
- W2891585877 cites W2615066396 @default.
- W2891585877 cites W2763133603 @default.
- W2891585877 doi "https://doi.org/10.1093/bioinformatics/bty751" @default.
- W2891585877 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/6449755" @default.
- W2891585877 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/30169569" @default.
- W2891585877 hasPublicationYear "2018" @default.
- W2891585877 type Work @default.
- W2891585877 sameAs 2891585877 @default.
- W2891585877 citedByCount "15" @default.
- W2891585877 countsByYear W28915858772019 @default.
- W2891585877 countsByYear W28915858772020 @default.
- W2891585877 countsByYear W28915858772021 @default.
- W2891585877 countsByYear W28915858772022 @default.
- W2891585877 countsByYear W28915858772023 @default.
- W2891585877 crossrefType "journal-article" @default.
- W2891585877 hasAuthorship W2891585877A5038211775 @default.
- W2891585877 hasAuthorship W2891585877A5050761197 @default.
- W2891585877 hasAuthorship W2891585877A5052699801 @default.
- W2891585877 hasBestOaLocation W28915858771 @default.
- W2891585877 hasConcept C104317684 @default.
- W2891585877 hasConcept C111919701 @default.
- W2891585877 hasConcept C119857082 @default.
- W2891585877 hasConcept C124101348 @default.
- W2891585877 hasConcept C150194340 @default.
- W2891585877 hasConcept C154945302 @default.
- W2891585877 hasConcept C165696696 @default.
- W2891585877 hasConcept C207060522 @default.
- W2891585877 hasConcept C2776321320 @default.
- W2891585877 hasConcept C2778112365 @default.
- W2891585877 hasConcept C2986374874 @default.
- W2891585877 hasConcept C2987395477 @default.
- W2891585877 hasConcept C38652104 @default.
- W2891585877 hasConcept C41008148 @default.
- W2891585877 hasConcept C43126263 @default.
- W2891585877 hasConcept C54355233 @default.
- W2891585877 hasConcept C55493867 @default.
- W2891585877 hasConcept C86803240 @default.
- W2891585877 hasConcept C95623464 @default.
- W2891585877 hasConceptScore W2891585877C104317684 @default.
- W2891585877 hasConceptScore W2891585877C111919701 @default.
- W2891585877 hasConceptScore W2891585877C119857082 @default.
- W2891585877 hasConceptScore W2891585877C124101348 @default.
- W2891585877 hasConceptScore W2891585877C150194340 @default.
- W2891585877 hasConceptScore W2891585877C154945302 @default.
- W2891585877 hasConceptScore W2891585877C165696696 @default.
- W2891585877 hasConceptScore W2891585877C207060522 @default.
- W2891585877 hasConceptScore W2891585877C2776321320 @default.
- W2891585877 hasConceptScore W2891585877C2778112365 @default.
- W2891585877 hasConceptScore W2891585877C2986374874 @default.
- W2891585877 hasConceptScore W2891585877C2987395477 @default.
- W2891585877 hasConceptScore W2891585877C38652104 @default.
- W2891585877 hasConceptScore W2891585877C41008148 @default.
- W2891585877 hasConceptScore W2891585877C43126263 @default.
- W2891585877 hasConceptScore W2891585877C54355233 @default.
- W2891585877 hasConceptScore W2891585877C55493867 @default.