Matches in SemOpenAlex for { <https://semopenalex.org/work/W4385496630> ?p ?o ?g. }
- W4385496630 endingPage "104461" @default.
- W4385496630 startingPage "104461" @default.
- W4385496630 abstract "Electronic Clinical Narratives (ECNs) store valuable individual’s health information. However, there are few available open-source data. Besides, ECNs can be structurally heterogeneous, ranging from documents with explicit section headings or titles to unstructured notes. This lack of structure complicates building automatic systems and their evaluation. The aim of the present work is to provide the scientific community with a Spanish open-source dataset to build and evaluate automatic section identification systems. Together with this dataset, the purpose is to design and implement a suitable evaluation measure and a fine-tuned language model adapted to the task. A corpus of unstructured clinical records, in this case progress notes written in Spanish, was annotated with seven major section types. Existing metrics for the presented task were thoroughly assessed and, based on the most suitable one, we defined a new B2 metric better tailored given the task. The annotated corpus, as well as the designed new evaluation script and a baseline model are freely available for the community. This model reaches an average B2 score of 71.3 on our open source dataset and an average B2 of 67.0 in data scarcity scenarios where the target corpus and its structure differs from the dataset used for training the LM. Although section identification in unstructured clinical narratives is challenging, this work shows that it is possible to build competitive automatic systems when both data and the right evaluation metrics are available. The annotated data, the implemented evaluation scripts, and the section identification Language Model are open-sourced hoping that this contribution will foster the building of more and better systems." @default.
- W4385496630 created "2023-08-03" @default.
- W4385496630 creator A5006193015 @default.
- W4385496630 creator A5030279705 @default.
- W4385496630 creator A5030328561 @default.
- W4385496630 creator A5076692436 @default.
- W4385496630 creator A5077699885 @default.
- W4385496630 creator A5088401849 @default.
- W4385496630 date "2023-09-01" @default.
- W4385496630 modified "2023-09-27" @default.
- W4385496630 title "An open source corpus and automatic tool for section identification in Spanish health records" @default.
- W4385496630 cites W1748329829 @default.
- W4385496630 cites W2125674401 @default.
- W4385496630 cites W2159083595 @default.
- W4385496630 cites W2190421341 @default.
- W4385496630 cites W2396881363 @default.
- W4385496630 cites W2612374874 @default.
- W4385496630 cites W2768488789 @default.
- W4385496630 cites W2795959543 @default.
- W4385496630 cites W2808897169 @default.
- W4385496630 cites W2884462261 @default.
- W4385496630 cites W2914694065 @default.
- W4385496630 cites W2946575095 @default.
- W4385496630 cites W2958747773 @default.
- W4385496630 cites W2962705025 @default.
- W4385496630 cites W3131698376 @default.
- W4385496630 cites W3183363647 @default.
- W4385496630 cites W3201524630 @default.
- W4385496630 cites W4236059846 @default.
- W4385496630 doi "https://doi.org/10.1016/j.jbi.2023.104461" @default.
- W4385496630 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/37536643" @default.
- W4385496630 hasPublicationYear "2023" @default.
- W4385496630 type Work @default.
- W4385496630 citedByCount "0" @default.
- W4385496630 crossrefType "journal-article" @default.
- W4385496630 hasAuthorship W4385496630A5006193015 @default.
- W4385496630 hasAuthorship W4385496630A5030279705 @default.
- W4385496630 hasAuthorship W4385496630A5030328561 @default.
- W4385496630 hasAuthorship W4385496630A5076692436 @default.
- W4385496630 hasAuthorship W4385496630A5077699885 @default.
- W4385496630 hasAuthorship W4385496630A5088401849 @default.
- W4385496630 hasConcept C111368507 @default.
- W4385496630 hasConcept C111919701 @default.
- W4385496630 hasConcept C116834253 @default.
- W4385496630 hasConcept C12725497 @default.
- W4385496630 hasConcept C127313418 @default.
- W4385496630 hasConcept C154945302 @default.
- W4385496630 hasConcept C162324750 @default.
- W4385496630 hasConcept C176217482 @default.
- W4385496630 hasConcept C187736073 @default.
- W4385496630 hasConcept C199360897 @default.
- W4385496630 hasConcept C204321447 @default.
- W4385496630 hasConcept C21547014 @default.
- W4385496630 hasConcept C23123220 @default.
- W4385496630 hasConcept C2522767166 @default.
- W4385496630 hasConcept C2780129039 @default.
- W4385496630 hasConcept C2780451532 @default.
- W4385496630 hasConcept C41008148 @default.
- W4385496630 hasConcept C59822182 @default.
- W4385496630 hasConcept C61423126 @default.
- W4385496630 hasConcept C86803240 @default.
- W4385496630 hasConceptScore W4385496630C111368507 @default.
- W4385496630 hasConceptScore W4385496630C111919701 @default.
- W4385496630 hasConceptScore W4385496630C116834253 @default.
- W4385496630 hasConceptScore W4385496630C12725497 @default.
- W4385496630 hasConceptScore W4385496630C127313418 @default.
- W4385496630 hasConceptScore W4385496630C154945302 @default.
- W4385496630 hasConceptScore W4385496630C162324750 @default.
- W4385496630 hasConceptScore W4385496630C176217482 @default.
- W4385496630 hasConceptScore W4385496630C187736073 @default.
- W4385496630 hasConceptScore W4385496630C199360897 @default.
- W4385496630 hasConceptScore W4385496630C204321447 @default.
- W4385496630 hasConceptScore W4385496630C21547014 @default.
- W4385496630 hasConceptScore W4385496630C23123220 @default.
- W4385496630 hasConceptScore W4385496630C2522767166 @default.
- W4385496630 hasConceptScore W4385496630C2780129039 @default.
- W4385496630 hasConceptScore W4385496630C2780451532 @default.
- W4385496630 hasConceptScore W4385496630C41008148 @default.
- W4385496630 hasConceptScore W4385496630C59822182 @default.
- W4385496630 hasConceptScore W4385496630C61423126 @default.
- W4385496630 hasConceptScore W4385496630C86803240 @default.
- W4385496630 hasLocation W43854966301 @default.
- W4385496630 hasLocation W43854966302 @default.
- W4385496630 hasOpenAccess W4385496630 @default.
- W4385496630 hasPrimaryLocation W43854966301 @default.
- W4385496630 hasRelatedWork W2086733238 @default.
- W4385496630 hasRelatedWork W2355288082 @default.
- W4385496630 hasRelatedWork W2357241418 @default.
- W4385496630 hasRelatedWork W2368553372 @default.
- W4385496630 hasRelatedWork W2395078704 @default.
- W4385496630 hasRelatedWork W2787190016 @default.
- W4385496630 hasRelatedWork W2809330710 @default.
- W4385496630 hasRelatedWork W3037322406 @default.
- W4385496630 hasRelatedWork W4294661698 @default.
- W4385496630 hasRelatedWork W4319453497 @default.
- W4385496630 hasVolume "145" @default.
- W4385496630 isParatext "false" @default.
- W4385496630 isRetracted "false" @default.