Matches in SemOpenAlex for { <https://semopenalex.org/work/W2040778567> ?p ?o ?g. }
- W2040778567 abstract "Text mining has spurred huge interest in the domain of biology. The goal of the BioCreAtIvE exercise was to evaluate the performance of current text mining systems. We participated in Task 2, which addressed assigning Gene Ontology terms to human proteins and selecting relevant evidence from full-text documents. We approached it as a modified form of the document classification task. We used a supervised machine-learning approach (based on support vector machines) to assign protein function and select passages that support the assignments. As classification features, we used a protein's co-occurring terms that were automatically extracted from documents.The results evaluated by curators were modest, and quite variable for different problems: in many cases we have relatively good assignment of GO terms to proteins, but the selected supporting text was typically non-relevant (precision spanning from 3% to 50%). The method appears to work best when a substantial set of relevant documents is obtained, while it works poorly on single documents and/or short passages. The initial results suggest that our approach can also mine annotations from text even when an explicit statement relating a protein to a GO term is absent.A machine learning approach to mining protein function predictions from text can yield good performance only if sufficient training data is available, and significant amount of supporting data is used for prediction. The most promising results are for combined document retrieval and GO term assignment, which calls for the integration of methods developed in BioCreAtIvE Task 1 and Task 2." @default.
- W2040778567 created "2016-06-24" @default.
- W2040778567 creator A5005912060 @default.
- W2040778567 creator A5073899463 @default.
- W2040778567 creator A5086617519 @default.
- W2040778567 date "2005-05-01" @default.
- W2040778567 modified "2023-10-16" @default.
- W2040778567 title "Mining protein function from text using term-based support vector machines" @default.
- W2040778567 cites W1493036841 @default.
- W2040778567 cites W1987213598 @default.
- W2040778567 cites W1991570251 @default.
- W2040778567 cites W2018874418 @default.
- W2040778567 cites W2020801982 @default.
- W2040778567 cites W2039612385 @default.
- W2040778567 cites W2049107599 @default.
- W2040778567 cites W2056119436 @default.
- W2040778567 cites W2061166439 @default.
- W2040778567 cites W2067096102 @default.
- W2040778567 cites W2104768328 @default.
- W2040778567 cites W2106093878 @default.
- W2040778567 cites W2135937947 @default.
- W2040778567 cites W2138548161 @default.
- W2040778567 cites W2139259976 @default.
- W2040778567 cites W4243819499 @default.
- W2040778567 doi "https://doi.org/10.1186/1471-2105-6-s1-s22" @default.
- W2040778567 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/1869015" @default.
- W2040778567 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/15960835" @default.
- W2040778567 hasPublicationYear "2005" @default.
- W2040778567 type Work @default.
- W2040778567 sameAs 2040778567 @default.
- W2040778567 citedByCount "48" @default.
- W2040778567 countsByYear W20407785672012 @default.
- W2040778567 countsByYear W20407785672013 @default.
- W2040778567 countsByYear W20407785672014 @default.
- W2040778567 countsByYear W20407785672016 @default.
- W2040778567 countsByYear W20407785672017 @default.
- W2040778567 countsByYear W20407785672018 @default.
- W2040778567 countsByYear W20407785672019 @default.
- W2040778567 countsByYear W20407785672020 @default.
- W2040778567 countsByYear W20407785672021 @default.
- W2040778567 countsByYear W20407785672022 @default.
- W2040778567 countsByYear W20407785672023 @default.
- W2040778567 crossrefType "journal-article" @default.
- W2040778567 hasAuthorship W2040778567A5005912060 @default.
- W2040778567 hasAuthorship W2040778567A5073899463 @default.
- W2040778567 hasAuthorship W2040778567A5086617519 @default.
- W2040778567 hasBestOaLocation W20407785671 @default.
- W2040778567 hasConcept C119857082 @default.
- W2040778567 hasConcept C121332964 @default.
- W2040778567 hasConcept C12267149 @default.
- W2040778567 hasConcept C124101348 @default.
- W2040778567 hasConcept C14036430 @default.
- W2040778567 hasConcept C154945302 @default.
- W2040778567 hasConcept C162324750 @default.
- W2040778567 hasConcept C165141518 @default.
- W2040778567 hasConcept C177264268 @default.
- W2040778567 hasConcept C17744445 @default.
- W2040778567 hasConcept C187736073 @default.
- W2040778567 hasConcept C199360897 @default.
- W2040778567 hasConcept C199539241 @default.
- W2040778567 hasConcept C204321447 @default.
- W2040778567 hasConcept C23123220 @default.
- W2040778567 hasConcept C2777026412 @default.
- W2040778567 hasConcept C2780451532 @default.
- W2040778567 hasConcept C41008148 @default.
- W2040778567 hasConcept C61797465 @default.
- W2040778567 hasConcept C62520636 @default.
- W2040778567 hasConcept C71472368 @default.
- W2040778567 hasConcept C78458016 @default.
- W2040778567 hasConcept C86803240 @default.
- W2040778567 hasConceptScore W2040778567C119857082 @default.
- W2040778567 hasConceptScore W2040778567C121332964 @default.
- W2040778567 hasConceptScore W2040778567C12267149 @default.
- W2040778567 hasConceptScore W2040778567C124101348 @default.
- W2040778567 hasConceptScore W2040778567C14036430 @default.
- W2040778567 hasConceptScore W2040778567C154945302 @default.
- W2040778567 hasConceptScore W2040778567C162324750 @default.
- W2040778567 hasConceptScore W2040778567C165141518 @default.
- W2040778567 hasConceptScore W2040778567C177264268 @default.
- W2040778567 hasConceptScore W2040778567C17744445 @default.
- W2040778567 hasConceptScore W2040778567C187736073 @default.
- W2040778567 hasConceptScore W2040778567C199360897 @default.
- W2040778567 hasConceptScore W2040778567C199539241 @default.
- W2040778567 hasConceptScore W2040778567C204321447 @default.
- W2040778567 hasConceptScore W2040778567C23123220 @default.
- W2040778567 hasConceptScore W2040778567C2777026412 @default.
- W2040778567 hasConceptScore W2040778567C2780451532 @default.
- W2040778567 hasConceptScore W2040778567C41008148 @default.
- W2040778567 hasConceptScore W2040778567C61797465 @default.
- W2040778567 hasConceptScore W2040778567C62520636 @default.
- W2040778567 hasConceptScore W2040778567C71472368 @default.
- W2040778567 hasConceptScore W2040778567C78458016 @default.
- W2040778567 hasConceptScore W2040778567C86803240 @default.
- W2040778567 hasIssue "S1" @default.
- W2040778567 hasLocation W20407785671 @default.
- W2040778567 hasLocation W20407785672 @default.
- W2040778567 hasLocation W20407785673 @default.
- W2040778567 hasLocation W20407785674 @default.
- W2040778567 hasOpenAccess W2040778567 @default.
- W2040778567 hasPrimaryLocation W20407785671 @default.