Matches in SemOpenAlex for { <https://semopenalex.org/work/W2116760667> ?p ?o ?g. }
- W2116760667 abstract "As we are witnessing a great interest in identifying and extracting chemical entities in academic articles, many approaches have been proposed to solve this problem. In this work we describe a probabilistic framework that allows for the output of multiple information extraction systems to be combined in a systematic way. The identified entities are assigned a probability score that reflects the extractors' confidence, without the need for each individual extractor to generate a probability score. We quantitively compared the performance of multiple chemical tokenizers to measure the effect of tokenization on extraction accuracy. Later, a single Conditional Random Fields (CRF) extractor that utilizes the best performing tokenizer is built using a unique collection of features such as word embeddings and Soundex codes, which, to the best of our knowledge, has not been explored in this context before.The ensemble of multiple extractors outperforms each extractor's individual performance during the CHEMDNER challenge. When the runs were optimized to favor recall, the ensemble approach achieved the second highest recall on unseen entities. As for the single CRF model with novel features, the extractor achieves an F1 score of 83.3% on the test set, without any post processing or abbreviation matching.Ensemble information extraction is effective when multiple stand alone extractors are to be used, and produces higher performance than individual off the shelf extractors. The novel features introduced in the single CRF model are sufficient to achieve very competitive F1 score using a simple standalone extractor." @default.
- W2116760667 created "2016-06-24" @default.
- W2116760667 creator A5001294898 @default.
- W2116760667 creator A5054253075 @default.
- W2116760667 date "2015-01-19" @default.
- W2116760667 modified "2023-10-01" @default.
- W2116760667 title "Chemical entity extraction using CRF and an ensemble of extractors" @default.
- W2116760667 cites W1524134026 @default.
- W2116760667 cites W2009556891 @default.
- W2116760667 cites W2056451646 @default.
- W2116760667 cites W2061167720 @default.
- W2116760667 cites W2101553882 @default.
- W2116760667 cites W2107005506 @default.
- W2116760667 cites W2121244856 @default.
- W2116760667 cites W2136796599 @default.
- W2116760667 cites W2145870108 @default.
- W2116760667 cites W2165671627 @default.
- W2116760667 cites W2169986726 @default.
- W2116760667 cites W2172216479 @default.
- W2116760667 cites W28412257 @default.
- W2116760667 doi "https://doi.org/10.1186/1758-2946-7-s1-s12" @default.
- W2116760667 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/4331688" @default.
- W2116760667 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/25810769" @default.
- W2116760667 hasPublicationYear "2015" @default.
- W2116760667 type Work @default.
- W2116760667 sameAs 2116760667 @default.
- W2116760667 citedByCount "15" @default.
- W2116760667 countsByYear W21167606672015 @default.
- W2116760667 countsByYear W21167606672016 @default.
- W2116760667 countsByYear W21167606672018 @default.
- W2116760667 countsByYear W21167606672019 @default.
- W2116760667 countsByYear W21167606672020 @default.
- W2116760667 countsByYear W21167606672021 @default.
- W2116760667 countsByYear W21167606672022 @default.
- W2116760667 crossrefType "journal-article" @default.
- W2116760667 hasAuthorship W2116760667A5001294898 @default.
- W2116760667 hasAuthorship W2116760667A5054253075 @default.
- W2116760667 hasBestOaLocation W21167606671 @default.
- W2116760667 hasConcept C105795698 @default.
- W2116760667 hasConcept C117978034 @default.
- W2116760667 hasConcept C119857082 @default.
- W2116760667 hasConcept C124101348 @default.
- W2116760667 hasConcept C127413603 @default.
- W2116760667 hasConcept C148524875 @default.
- W2116760667 hasConcept C151730666 @default.
- W2116760667 hasConcept C152565575 @default.
- W2116760667 hasConcept C154945302 @default.
- W2116760667 hasConcept C165064840 @default.
- W2116760667 hasConcept C169258074 @default.
- W2116760667 hasConcept C176982825 @default.
- W2116760667 hasConcept C177264268 @default.
- W2116760667 hasConcept C199360897 @default.
- W2116760667 hasConcept C21880701 @default.
- W2116760667 hasConcept C2524010 @default.
- W2116760667 hasConcept C2779343474 @default.
- W2116760667 hasConcept C33923547 @default.
- W2116760667 hasConcept C41008148 @default.
- W2116760667 hasConcept C49937458 @default.
- W2116760667 hasConcept C81669768 @default.
- W2116760667 hasConcept C86803240 @default.
- W2116760667 hasConcept C90805587 @default.
- W2116760667 hasConceptScore W2116760667C105795698 @default.
- W2116760667 hasConceptScore W2116760667C117978034 @default.
- W2116760667 hasConceptScore W2116760667C119857082 @default.
- W2116760667 hasConceptScore W2116760667C124101348 @default.
- W2116760667 hasConceptScore W2116760667C127413603 @default.
- W2116760667 hasConceptScore W2116760667C148524875 @default.
- W2116760667 hasConceptScore W2116760667C151730666 @default.
- W2116760667 hasConceptScore W2116760667C152565575 @default.
- W2116760667 hasConceptScore W2116760667C154945302 @default.
- W2116760667 hasConceptScore W2116760667C165064840 @default.
- W2116760667 hasConceptScore W2116760667C169258074 @default.
- W2116760667 hasConceptScore W2116760667C176982825 @default.
- W2116760667 hasConceptScore W2116760667C177264268 @default.
- W2116760667 hasConceptScore W2116760667C199360897 @default.
- W2116760667 hasConceptScore W2116760667C21880701 @default.
- W2116760667 hasConceptScore W2116760667C2524010 @default.
- W2116760667 hasConceptScore W2116760667C2779343474 @default.
- W2116760667 hasConceptScore W2116760667C33923547 @default.
- W2116760667 hasConceptScore W2116760667C41008148 @default.
- W2116760667 hasConceptScore W2116760667C49937458 @default.
- W2116760667 hasConceptScore W2116760667C81669768 @default.
- W2116760667 hasConceptScore W2116760667C86803240 @default.
- W2116760667 hasConceptScore W2116760667C90805587 @default.
- W2116760667 hasIssue "S1" @default.
- W2116760667 hasLocation W21167606671 @default.
- W2116760667 hasLocation W21167606672 @default.
- W2116760667 hasLocation W21167606673 @default.
- W2116760667 hasLocation W21167606674 @default.
- W2116760667 hasOpenAccess W2116760667 @default.
- W2116760667 hasPrimaryLocation W21167606671 @default.
- W2116760667 hasRelatedWork W2116760667 @default.
- W2116760667 hasRelatedWork W2947903144 @default.
- W2116760667 hasRelatedWork W3109997267 @default.
- W2116760667 hasRelatedWork W3188307501 @default.
- W2116760667 hasRelatedWork W4207072088 @default.
- W2116760667 hasRelatedWork W4297900598 @default.
- W2116760667 hasRelatedWork W4360612004 @default.
- W2116760667 hasRelatedWork W4381956280 @default.
- W2116760667 hasRelatedWork W4382864507 @default.