Matches in SemOpenAlex for { <https://semopenalex.org/work/W2985964562> ?p ?o ?g. }
Showing items 1 to 89 of 89 with 100 items per page.
- W2985964562 abstract "The field of question answering (QA) has seen rapid growth in new tasks and modeling approaches in recent years. Large scale datasets and focus on challenging linguistic phenomena have driven development in neural models, some of which have achieved parity with human performance in limited cases. However, an examination of state-of-the-art model output reveals that a gap remains in reasoning ability compared to a human, and performance tends to degrade when models are exposed to less-constrained tasks. We are interested in more clearly defining the strengths and limitations of leading models across diverse QA challenges, intending to help future researchers with identifying pathways to generalizable performance. We conduct extensive qualitative and quantitative analyses on the results of four models across four datasets and relate common errors to model capabilities. We also illustrate limitations in the datasets we examine and discuss a way forward for achieving generalizable models and datasets that broadly test QA capabilities." @default.
- W2985964562 created "2019-11-22" @default.
- W2985964562 creator A5002450533 @default.
- W2985964562 creator A5038055180 @default.
- W2985964562 creator A5055340301 @default.
- W2985964562 creator A5063241234 @default.
- W2985964562 creator A5068836463 @default.
- W2985964562 date "2019-01-01" @default.
- W2985964562 modified "2023-09-26" @default.
- W2985964562 title "Bend but Don’t Break? Multi-Challenge Stress Test for QA Models" @default.
- W2985964562 cites W1801866228 @default.
- W2985964562 cites W2267186426 @default.
- W2985964562 cites W2551396370 @default.
- W2985964562 cites W2606974598 @default.
- W2985964562 cites W2609826708 @default.
- W2985964562 cites W2734823783 @default.
- W2985964562 cites W2742122443 @default.
- W2985964562 cites W2798858969 @default.
- W2985964562 cites W2799081691 @default.
- W2985964562 cites W2886441967 @default.
- W2985964562 cites W2889787757 @default.
- W2985964562 cites W2892280852 @default.
- W2985964562 cites W2893268956 @default.
- W2985964562 cites W2951873305 @default.
- W2985964562 cites W2962727366 @default.
- W2985964562 cites W2962816513 @default.
- W2985964562 cites W2963323070 @default.
- W2985964562 cites W2963339397 @default.
- W2985964562 cites W2963341493 @default.
- W2985964562 cites W2963341956 @default.
- W2985964562 cites W2963343509 @default.
- W2985964562 cites W2963403868 @default.
- W2985964562 cites W2963748441 @default.
- W2985964562 cites W2963866616 @default.
- W2985964562 cites W2963963993 @default.
- W2985964562 cites W2964207259 @default.
- W2985964562 doi "https://doi.org/10.18653/v1/d19-5818" @default.
- W2985964562 hasPublicationYear "2019" @default.
- W2985964562 type Work @default.
- W2985964562 sameAs 2985964562 @default.
- W2985964562 citedByCount "7" @default.
- W2985964562 countsByYear W29859645622020 @default.
- W2985964562 countsByYear W29859645622021 @default.
- W2985964562 countsByYear W29859645622022 @default.
- W2985964562 crossrefType "proceedings-article" @default.
- W2985964562 hasAuthorship W2985964562A5002450533 @default.
- W2985964562 hasAuthorship W2985964562A5038055180 @default.
- W2985964562 hasAuthorship W2985964562A5055340301 @default.
- W2985964562 hasAuthorship W2985964562A5063241234 @default.
- W2985964562 hasAuthorship W2985964562A5068836463 @default.
- W2985964562 hasConcept C119857082 @default.
- W2985964562 hasConcept C120665830 @default.
- W2985964562 hasConcept C121332964 @default.
- W2985964562 hasConcept C154945302 @default.
- W2985964562 hasConcept C192209626 @default.
- W2985964562 hasConcept C202444582 @default.
- W2985964562 hasConcept C2522767166 @default.
- W2985964562 hasConcept C33923547 @default.
- W2985964562 hasConcept C41008148 @default.
- W2985964562 hasConcept C44291984 @default.
- W2985964562 hasConcept C9652623 @default.
- W2985964562 hasConceptScore W2985964562C119857082 @default.
- W2985964562 hasConceptScore W2985964562C120665830 @default.
- W2985964562 hasConceptScore W2985964562C121332964 @default.
- W2985964562 hasConceptScore W2985964562C154945302 @default.
- W2985964562 hasConceptScore W2985964562C192209626 @default.
- W2985964562 hasConceptScore W2985964562C202444582 @default.
- W2985964562 hasConceptScore W2985964562C2522767166 @default.
- W2985964562 hasConceptScore W2985964562C33923547 @default.
- W2985964562 hasConceptScore W2985964562C41008148 @default.
- W2985964562 hasConceptScore W2985964562C44291984 @default.
- W2985964562 hasConceptScore W2985964562C9652623 @default.
- W2985964562 hasLocation W29859645621 @default.
- W2985964562 hasOpenAccess W2985964562 @default.
- W2985964562 hasPrimaryLocation W29859645621 @default.
- W2985964562 hasRelatedWork W1517909231 @default.
- W2985964562 hasRelatedWork W1571404427 @default.
- W2985964562 hasRelatedWork W2130575083 @default.
- W2985964562 hasRelatedWork W2789244308 @default.
- W2985964562 hasRelatedWork W2916492174 @default.
- W2985964562 hasRelatedWork W3107474891 @default.
- W2985964562 hasRelatedWork W3208943668 @default.
- W2985964562 hasRelatedWork W4212887618 @default.
- W2985964562 hasRelatedWork W4288754364 @default.
- W2985964562 hasRelatedWork W2613333037 @default.
- W2985964562 isParatext "false" @default.
- W2985964562 isRetracted "false" @default.
- W2985964562 magId "2985964562" @default.
- W2985964562 workType "article" @default.