Matches in SemOpenAlex for { <https://semopenalex.org/work/W4297461106> ?p ?o ?g. }
- W4297461106 abstract "Chemical identification involves finding chemical entities in text (i.e. named entity recognition) and assigning unique identifiers to the entities (i.e. named entity normalization). While current models are developed and evaluated based on article titles and abstracts, their effectiveness has not been thoroughly verified in full text. In this paper, we identify two limitations of models in tagging full-text articles: (1) low generalizability to unseen mentions and (2) tagging inconsistency. We use simple training and post-processing methods to address the limitations such as transfer learning and mention-wise majority voting. We also present a hybrid model for the normalization task that utilizes the high recall of a neural model while maintaining the high precision of a dictionary model. In the BioCreative VII NLM-Chem track challenge, our best model achieves 86.72 and 78.31 F1 scores in named entity recognition and normalization, significantly outperforming the median (83.73 and 77.49 F1 scores) and taking first place in named entity recognition. In a post-challenge evaluation, we re-implement our model and obtain 84.70 F1 score in the normalization task, outperforming the best score in the challenge by 3.34 F1 score. Database URL: https://github.com/dmis-lab/bc7-chem-id." @default.
- W4297461106 created "2022-09-29" @default.
- W4297461106 creator A5012809586 @default.
- W4297461106 creator A5020746366 @default.
- W4297461106 creator A5038983063 @default.
- W4297461106 creator A5076917278 @default.
- W4297461106 creator A5083644268 @default.
- W4297461106 date "2022-01-01" @default.
- W4297461106 modified "2023-09-26" @default.
- W4297461106 title "Full-text chemical identification with improved generalizability and tagging consistency" @default.
- W4297461106 cites W1623072288 @default.
- W4297461106 cites W2145870108 @default.
- W4297461106 cites W2149369282 @default.
- W4297461106 cites W2346452181 @default.
- W4297461106 cites W2533611849 @default.
- W4297461106 cites W2573492843 @default.
- W4297461106 cites W2765742249 @default.
- W4297461106 cites W2769387903 @default.
- W4297461106 cites W2809349863 @default.
- W4297461106 cites W2911489562 @default.
- W4297461106 cites W3046375318 @default.
- W4297461106 cites W3105491236 @default.
- W4297461106 cites W3115908473 @default.
- W4297461106 cites W3137481621 @default.
- W4297461106 cites W3164540570 @default.
- W4297461106 cites W4221142212 @default.
- W4297461106 cites W4226154421 @default.
- W4297461106 doi "https://doi.org/10.1093/database/baac074" @default.
- W4297461106 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/36170114" @default.
- W4297461106 hasPublicationYear "2022" @default.
- W4297461106 type Work @default.
- W4297461106 citedByCount "0" @default.
- W4297461106 crossrefType "journal-article" @default.
- W4297461106 hasAuthorship W4297461106A5012809586 @default.
- W4297461106 hasAuthorship W4297461106A5020746366 @default.
- W4297461106 hasAuthorship W4297461106A5038983063 @default.
- W4297461106 hasAuthorship W4297461106A5076917278 @default.
- W4297461106 hasAuthorship W4297461106A5083644268 @default.
- W4297461106 hasBestOaLocation W42974611061 @default.
- W4297461106 hasConcept C105795698 @default.
- W4297461106 hasConcept C116834253 @default.
- W4297461106 hasConcept C119857082 @default.
- W4297461106 hasConcept C136886441 @default.
- W4297461106 hasConcept C144024400 @default.
- W4297461106 hasConcept C148524875 @default.
- W4297461106 hasConcept C154504017 @default.
- W4297461106 hasConcept C154945302 @default.
- W4297461106 hasConcept C162324750 @default.
- W4297461106 hasConcept C187736073 @default.
- W4297461106 hasConcept C19165224 @default.
- W4297461106 hasConcept C199360897 @default.
- W4297461106 hasConcept C204321447 @default.
- W4297461106 hasConcept C23123220 @default.
- W4297461106 hasConcept C27158222 @default.
- W4297461106 hasConcept C2776436953 @default.
- W4297461106 hasConcept C2779135771 @default.
- W4297461106 hasConcept C2780451532 @default.
- W4297461106 hasConcept C33923547 @default.
- W4297461106 hasConcept C41008148 @default.
- W4297461106 hasConcept C59822182 @default.
- W4297461106 hasConcept C86803240 @default.
- W4297461106 hasConceptScore W4297461106C105795698 @default.
- W4297461106 hasConceptScore W4297461106C116834253 @default.
- W4297461106 hasConceptScore W4297461106C119857082 @default.
- W4297461106 hasConceptScore W4297461106C136886441 @default.
- W4297461106 hasConceptScore W4297461106C144024400 @default.
- W4297461106 hasConceptScore W4297461106C148524875 @default.
- W4297461106 hasConceptScore W4297461106C154504017 @default.
- W4297461106 hasConceptScore W4297461106C154945302 @default.
- W4297461106 hasConceptScore W4297461106C162324750 @default.
- W4297461106 hasConceptScore W4297461106C187736073 @default.
- W4297461106 hasConceptScore W4297461106C19165224 @default.
- W4297461106 hasConceptScore W4297461106C199360897 @default.
- W4297461106 hasConceptScore W4297461106C204321447 @default.
- W4297461106 hasConceptScore W4297461106C23123220 @default.
- W4297461106 hasConceptScore W4297461106C27158222 @default.
- W4297461106 hasConceptScore W4297461106C2776436953 @default.
- W4297461106 hasConceptScore W4297461106C2779135771 @default.
- W4297461106 hasConceptScore W4297461106C2780451532 @default.
- W4297461106 hasConceptScore W4297461106C33923547 @default.
- W4297461106 hasConceptScore W4297461106C41008148 @default.
- W4297461106 hasConceptScore W4297461106C59822182 @default.
- W4297461106 hasConceptScore W4297461106C86803240 @default.
- W4297461106 hasLocation W42974611061 @default.
- W4297461106 hasLocation W42974611062 @default.
- W4297461106 hasLocation W42974611063 @default.
- W4297461106 hasOpenAccess W4297461106 @default.
- W4297461106 hasPrimaryLocation W42974611061 @default.
- W4297461106 hasRelatedWork W2129038679 @default.
- W4297461106 hasRelatedWork W2395078704 @default.
- W4297461106 hasRelatedWork W2405038964 @default.
- W4297461106 hasRelatedWork W2773616286 @default.
- W4297461106 hasRelatedWork W2947903144 @default.
- W4297461106 hasRelatedWork W2952732525 @default.
- W4297461106 hasRelatedWork W3194539120 @default.
- W4297461106 hasRelatedWork W4200511449 @default.
- W4297461106 hasRelatedWork W4254089628 @default.
- W4297461106 hasRelatedWork W4297461106 @default.
- W4297461106 hasVolume "2022" @default.
- W4297461106 isParatext "false" @default.