Matches in SemOpenAlex for { <https://semopenalex.org/work/W4287754559> ?p ?o ?g. }
Showing items 1 to 77 of 77.
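The pattern in the header can be approximated as an ordinary SPARQL query. The sketch below only builds the query text and a GET request URL; it sends nothing. The endpoint URL, the function names, and the `LIMIT` value are assumptions for illustration and are not confirmed by this page. The graph variable `?g` is left out because every row here is reported under `@default`.

```python
from urllib.parse import urlencode

WORK_IRI = "https://semopenalex.org/work/W4287754559"
# Assumption: illustrative endpoint URL, not taken from this page.
SPARQL_ENDPOINT = "https://semopenalex.org/sparql"


def build_query(subject_iri: str, limit: int = 100) -> str:
    """Return a SPARQL SELECT listing every predicate/object of one subject."""
    return (
        "SELECT ?p ?o WHERE { "
        f"<{subject_iri}> ?p ?o . "
        "} "
        f"LIMIT {limit}"
    )


def build_request_url(endpoint: str, query: str) -> str:
    """Encode the query in the `query` parameter of a GET request."""
    return endpoint + "?" + urlencode({"query": query})


query = build_query(WORK_IRI)
url = build_request_url(SPARQL_ENDPOINT, query)
print(query)
print(url)
```

Passing the resulting URL to any HTTP client with an `Accept: application/sparql-results+json` header would be the usual next step, assuming the endpoint follows the SPARQL 1.1 Protocol.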
- W4287754559 abstract "Large-scale and high-quality corpora are necessary for evaluating machine reading comprehension models on a low-resource language like Vietnamese. In addition, machine reading comprehension (MRC) for the health domain offers great potential for practical applications; however, there is still very little MRC research in this domain. This paper presents ViNewsQA as a new corpus for the Vietnamese language to evaluate healthcare reading comprehension models. The corpus comprises 22,057 human-generated question-answer pairs. Crowd-workers create the questions and their answers based on a collection of over 4,416 online Vietnamese healthcare news articles, where the answers comprise spans extracted from the corresponding articles. In particular, we develop a process of creating a corpus for Vietnamese machine reading comprehension. Comprehensive evaluations demonstrate that our corpus requires abilities beyond simple reasoning such as word matching, demanding difficult reasoning based on single- or multiple-sentence information. We conduct experiments using different types of machine reading comprehension methods to establish the first baseline performances for comparison with further models. We also measure human performance on the corpus and compare it with several powerful neural network-based and transfer learning-based models. Our experiments show that the best machine model is ALBERT, which achieves an exact match score of 65.26% and an F1-score of 84.89% on our corpus. The significant differences between humans and the best-performing model (14.53% in EM and 10.90% in F1-score) on the test set of our corpus indicate that improvements in ViNewsQA could be explored in future studies. Our corpus is publicly available on our website for research purposes to encourage the research community to make these improvements." @default.
- W4287754559 created "2022-07-26" @default.
- W4287754559 creator A5009894498 @default.
- W4287754559 creator A5033137339 @default.
- W4287754559 creator A5045127260 @default.
- W4287754559 creator A5046262541 @default.
- W4287754559 creator A5055531030 @default.
- W4287754559 date "2020-06-19" @default.
- W4287754559 modified "2023-10-17" @default.
- W4287754559 title "New Vietnamese Corpus for Machine Reading Comprehension of Health News Articles" @default.
- W4287754559 doi "https://doi.org/10.48550/arxiv.2006.11138" @default.
- W4287754559 hasPublicationYear "2020" @default.
- W4287754559 type Work @default.
- W4287754559 citedByCount "0" @default.
- W4287754559 crossrefType "posted-content" @default.
- W4287754559 hasAuthorship W4287754559A5009894498 @default.
- W4287754559 hasAuthorship W4287754559A5033137339 @default.
- W4287754559 hasAuthorship W4287754559A5045127260 @default.
- W4287754559 hasAuthorship W4287754559A5046262541 @default.
- W4287754559 hasAuthorship W4287754559A5055531030 @default.
- W4287754559 hasBestOaLocation W42877545591 @default.
- W4287754559 hasConcept C103621254 @default.
- W4287754559 hasConcept C105795698 @default.
- W4287754559 hasConcept C111472728 @default.
- W4287754559 hasConcept C134306372 @default.
- W4287754559 hasConcept C137293760 @default.
- W4287754559 hasConcept C138885662 @default.
- W4287754559 hasConcept C154945302 @default.
- W4287754559 hasConcept C165064840 @default.
- W4287754559 hasConcept C199360897 @default.
- W4287754559 hasConcept C203005215 @default.
- W4287754559 hasConcept C204321447 @default.
- W4287754559 hasConcept C2777530160 @default.
- W4287754559 hasConcept C2778780117 @default.
- W4287754559 hasConcept C2779530757 @default.
- W4287754559 hasConcept C33923547 @default.
- W4287754559 hasConcept C36503486 @default.
- W4287754559 hasConcept C41008148 @default.
- W4287754559 hasConcept C41895202 @default.
- W4287754559 hasConcept C511192102 @default.
- W4287754559 hasConcept C554936623 @default.
- W4287754559 hasConceptScore W4287754559C103621254 @default.
- W4287754559 hasConceptScore W4287754559C105795698 @default.
- W4287754559 hasConceptScore W4287754559C111472728 @default.
- W4287754559 hasConceptScore W4287754559C134306372 @default.
- W4287754559 hasConceptScore W4287754559C137293760 @default.
- W4287754559 hasConceptScore W4287754559C138885662 @default.
- W4287754559 hasConceptScore W4287754559C154945302 @default.
- W4287754559 hasConceptScore W4287754559C165064840 @default.
- W4287754559 hasConceptScore W4287754559C199360897 @default.
- W4287754559 hasConceptScore W4287754559C203005215 @default.
- W4287754559 hasConceptScore W4287754559C204321447 @default.
- W4287754559 hasConceptScore W4287754559C2777530160 @default.
- W4287754559 hasConceptScore W4287754559C2778780117 @default.
- W4287754559 hasConceptScore W4287754559C2779530757 @default.
- W4287754559 hasConceptScore W4287754559C33923547 @default.
- W4287754559 hasConceptScore W4287754559C36503486 @default.
- W4287754559 hasConceptScore W4287754559C41008148 @default.
- W4287754559 hasConceptScore W4287754559C41895202 @default.
- W4287754559 hasConceptScore W4287754559C511192102 @default.
- W4287754559 hasConceptScore W4287754559C554936623 @default.
- W4287754559 hasLocation W42877545591 @default.
- W4287754559 hasOpenAccess W4287754559 @default.
- W4287754559 hasPrimaryLocation W42877545591 @default.
- W4287754559 hasRelatedWork W1978971213 @default.
- W4287754559 hasRelatedWork W3000271051 @default.
- W4287754559 hasRelatedWork W3093843097 @default.
- W4287754559 hasRelatedWork W3107474891 @default.
- W4287754559 hasRelatedWork W3108520605 @default.
- W4287754559 hasRelatedWork W3111156164 @default.
- W4287754559 hasRelatedWork W3156577902 @default.
- W4287754559 hasRelatedWork W3162043819 @default.
- W4287754559 hasRelatedWork W4287179757 @default.
- W4287754559 hasRelatedWork W4287754559 @default.
- W4287754559 isParatext "false" @default.
- W4287754559 isRetracted "false" @default.
- W4287754559 workType "article" @default.