Matches in SemOpenAlex for { <https://semopenalex.org/work/W2053562938> ?p ?o ?g. }
Showing items 1 to 90 of 90 with 100 items per page.
- W2053562938 abstract "Literature mining is expected to help not only with automatically sifting through huge biomedical literature and annotation databases, but also with linking bio-chemical entities to appropriate functional hypotheses. However, there has been very limited success in testing literature mining methods due to the lack of large, objectively validated test sets or gold standards. To improve this situation we created a large-scale test of literature mining methods and resources. We report on a specific implementation of this test: how well can the Pfam protein family classification be replicated from independently mining different literature/annotation resources? We test and compare different keyterm sets as well as different algorithms for issuing protein family predictions. We find that protein families can indeed be automatically predicted from the literature. Using words from PubMed abstracts, of 3663 proteins tested, over 75% were correctly assigned to one of 618 Pfam families. For 90% of proteins the correct Pfam family was among the top 5 ranked families. We found that protein family prediction is far superior with keywords extracted from PubMed abstracts than with GO annotations or MeSH keyterms, suggesting that the text itself (in combination with the vector space model) is superior to GO and MeSH as a literature mining resource, at least for detecting protein family membership. Finally, we show that Shannon's entropy can be exploited to improve prediction by facilitating the integration of the different literature sources tested." @default.
- W2053562938 created "2016-06-24" @default.
- W2053562938 creator A5057955464 @default.
- W2053562938 creator A5065682719 @default.
- W2053562938 creator A5075220413 @default.
- W2053562938 creator A5084069668 @default.
- W2053562938 creator A5089757137 @default.
- W2053562938 date "2005-12-01" @default.
- W2053562938 modified "2023-09-26" @default.
- W2053562938 title "LARGE-SCALE TESTING OF BIBLIOME INFORMATICS USING PFAM PROTEIN FAMILIES" @default.
- W2053562938 cites W1566012138 @default.
- W2053562938 cites W1586623508 @default.
- W2053562938 cites W1600047650 @default.
- W2053562938 cites W1660390307 @default.
- W2053562938 cites W1661470104 @default.
- W2053562938 cites W2025672980 @default.
- W2053562938 cites W2045340002 @default.
- W2053562938 cites W2061327015 @default.
- W2053562938 cites W2081566793 @default.
- W2053562938 cites W2091978351 @default.
- W2053562938 cites W2092795373 @default.
- W2053562938 cites W2110203109 @default.
- W2053562938 cites W2126276057 @default.
- W2053562938 cites W2129448726 @default.
- W2053562938 cites W2149942697 @default.
- W2053562938 cites W2159203162 @default.
- W2053562938 cites W2405987207 @default.
- W2053562938 cites W2413372017 @default.
- W2053562938 cites W24306466 @default.
- W2053562938 cites W2799061466 @default.
- W2053562938 cites W37629172 @default.
- W2053562938 doi "https://doi.org/10.1142/9789812701626_0008" @default.
- W2053562938 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/17094229" @default.
- W2053562938 hasPublicationYear "2005" @default.
- W2053562938 type Work @default.
- W2053562938 sameAs 2053562938 @default.
- W2053562938 citedByCount "6" @default.
- W2053562938 countsByYear W20535629382012 @default.
- W2053562938 crossrefType "proceedings-article" @default.
- W2053562938 hasAuthorship W2053562938A5057955464 @default.
- W2053562938 hasAuthorship W2053562938A5065682719 @default.
- W2053562938 hasAuthorship W2053562938A5075220413 @default.
- W2053562938 hasAuthorship W2053562938A5084069668 @default.
- W2053562938 hasAuthorship W2053562938A5089757137 @default.
- W2053562938 hasConcept C104317684 @default.
- W2053562938 hasConcept C119599485 @default.
- W2053562938 hasConcept C124101348 @default.
- W2053562938 hasConcept C127413603 @default.
- W2053562938 hasConcept C154945302 @default.
- W2053562938 hasConcept C171897839 @default.
- W2053562938 hasConcept C191630685 @default.
- W2053562938 hasConcept C23123220 @default.
- W2053562938 hasConcept C2522767166 @default.
- W2053562938 hasConcept C2776321320 @default.
- W2053562938 hasConcept C41008148 @default.
- W2053562938 hasConcept C54355233 @default.
- W2053562938 hasConcept C70721500 @default.
- W2053562938 hasConcept C86803240 @default.
- W2053562938 hasConceptScore W2053562938C104317684 @default.
- W2053562938 hasConceptScore W2053562938C119599485 @default.
- W2053562938 hasConceptScore W2053562938C124101348 @default.
- W2053562938 hasConceptScore W2053562938C127413603 @default.
- W2053562938 hasConceptScore W2053562938C154945302 @default.
- W2053562938 hasConceptScore W2053562938C171897839 @default.
- W2053562938 hasConceptScore W2053562938C191630685 @default.
- W2053562938 hasConceptScore W2053562938C23123220 @default.
- W2053562938 hasConceptScore W2053562938C2522767166 @default.
- W2053562938 hasConceptScore W2053562938C2776321320 @default.
- W2053562938 hasConceptScore W2053562938C41008148 @default.
- W2053562938 hasConceptScore W2053562938C54355233 @default.
- W2053562938 hasConceptScore W2053562938C70721500 @default.
- W2053562938 hasConceptScore W2053562938C86803240 @default.
- W2053562938 hasLocation W20535629381 @default.
- W2053562938 hasLocation W20535629382 @default.
- W2053562938 hasOpenAccess W2053562938 @default.
- W2053562938 hasPrimaryLocation W20535629381 @default.
- W2053562938 hasRelatedWork W151193258 @default.
- W2053562938 hasRelatedWork W1598723751 @default.
- W2053562938 hasRelatedWork W1607472309 @default.
- W2053562938 hasRelatedWork W1892467659 @default.
- W2053562938 hasRelatedWork W2175120218 @default.
- W2053562938 hasRelatedWork W2384888906 @default.
- W2053562938 hasRelatedWork W2472885054 @default.
- W2053562938 hasRelatedWork W2892559408 @default.
- W2053562938 hasRelatedWork W3083711586 @default.
- W2053562938 hasRelatedWork W77472803 @default.
- W2053562938 isParatext "false" @default.
- W2053562938 isRetracted "false" @default.
- W2053562938 magId "2053562938" @default.
- W2053562938 workType "article" @default.