Matches in SemOpenAlex for { <https://semopenalex.org/work/W4368408045> ?p ?o ?g. }
- W4368408045 abstract "In this paper, we examine several methods of acquiring Czech data for automated fact-checking, which is a task commonly modeled as a classification of textual claim veracity w.r.t. a corpus of trusted ground truths. We attempt to collect sets of data in form of a factual claim, evidence within the ground truth corpus, and its veracity label (supported, refuted or not enough info). As a first attempt, we generate a Czech version of the large-scale FEVER dataset built on top of Wikipedia corpus. We take a hybrid approach of machine translation and document alignment; the approach and the tools we provide can be easily applied to other languages. We discuss its weaknesses and inaccuracies, propose a future approach for their cleaning and publish the 127k resulting translations, as well as a version of such dataset reliably applicable for the Natural Language Inference task - the CsFEVER-NLI. Furthermore, we collect a novel dataset of 3,097 claims, which is annotated using the corpus of 2.2M articles of Czech News Agency. We present its extended annotation methodology based on the FEVER approach, and, as the underlying corpus is kept a trade secret, we also publish a standalone version of the dataset for the task of Natural Language Inference we call CTKFactsNLI. We analyze both acquired datasets for spurious cues - annotation patterns leading to model overfitting. CTKFacts is further examined for inter-annotator agreement, thoroughly cleaned, and a typology of common annotator errors is extracted. Finally, we provide baseline models for all stages of the fact-checking pipeline and publish the NLI datasets, as well as our annotation platform and other experimental data." @default.
- W4368408045 created "2023-05-05" @default.
- W4368408045 creator A5041989043 @default.
- W4368408045 creator A5049289726 @default.
- W4368408045 creator A5075686754 @default.
- W4368408045 creator A5083321457 @default.
- W4368408045 creator A5089326901 @default.
- W4368408045 date "2023-05-03" @default.
- W4368408045 modified "2023-09-30" @default.
- W4368408045 title "CsFEVER and CTKFacts: acquiring Czech data for fact verification" @default.
- W4368408045 cites W1975879668 @default.
- W4368408045 cites W2019029324 @default.
- W4368408045 cites W2058634939 @default.
- W4368408045 cites W2061504941 @default.
- W4368408045 cites W2097645005 @default.
- W4368408045 cites W2103664125 @default.
- W4368408045 cites W2251648400 @default.
- W4368408045 cites W2337875011 @default.
- W4368408045 cites W2413794162 @default.
- W4368408045 cites W2751368487 @default.
- W4368408045 cites W2803728898 @default.
- W4368408045 cites W2891555348 @default.
- W4368408045 cites W2952594430 @default.
- W4368408045 cites W2952638691 @default.
- W4368408045 cites W2952984539 @default.
- W4368408045 cites W2962985038 @default.
- W4368408045 cites W2963341956 @default.
- W4368408045 cites W2963416784 @default.
- W4368408045 cites W2963748441 @default.
- W4368408045 cites W2963961878 @default.
- W4368408045 cites W2964060837 @default.
- W4368408045 cites W2964068236 @default.
- W4368408045 cites W2970641574 @default.
- W4368408045 cites W2970716846 @default.
- W4368408045 cites W2988092105 @default.
- W4368408045 cites W2991654209 @default.
- W4368408045 cites W2998702515 @default.
- W4368408045 cites W3021397474 @default.
- W4368408045 cites W3082760180 @default.
- W4368408045 cites W3090350559 @default.
- W4368408045 cites W3170180819 @default.
- W4368408045 cites W3173365702 @default.
- W4368408045 cites W3175864309 @default.
- W4368408045 cites W3177233335 @default.
- W4368408045 cites W3180230246 @default.
- W4368408045 cites W3191286626 @default.
- W4368408045 cites W3203765809 @default.
- W4368408045 cites W3206435361 @default.
- W4368408045 cites W3207988762 @default.
- W4368408045 cites W3212606841 @default.
- W4368408045 cites W4205866829 @default.
- W4368408045 cites W4288280763 @default.
- W4368408045 doi "https://doi.org/10.1007/s10579-023-09654-3" @default.
- W4368408045 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/37360264" @default.
- W4368408045 hasPublicationYear "2023" @default.
- W4368408045 type Work @default.
- W4368408045 citedByCount "0" @default.
- W4368408045 crossrefType "journal-article" @default.
- W4368408045 hasAuthorship W4368408045A5041989043 @default.
- W4368408045 hasAuthorship W4368408045A5049289726 @default.
- W4368408045 hasAuthorship W4368408045A5075686754 @default.
- W4368408045 hasAuthorship W4368408045A5083321457 @default.
- W4368408045 hasAuthorship W4368408045A5089326901 @default.
- W4368408045 hasBestOaLocation W43684080451 @default.
- W4368408045 hasConcept C111368507 @default.
- W4368408045 hasConcept C112698675 @default.
- W4368408045 hasConcept C119857082 @default.
- W4368408045 hasConcept C12725497 @default.
- W4368408045 hasConcept C127313418 @default.
- W4368408045 hasConcept C138885662 @default.
- W4368408045 hasConcept C144133560 @default.
- W4368408045 hasConcept C146849305 @default.
- W4368408045 hasConcept C154945302 @default.
- W4368408045 hasConcept C162324750 @default.
- W4368408045 hasConcept C187736073 @default.
- W4368408045 hasConcept C203005215 @default.
- W4368408045 hasConcept C204321447 @default.
- W4368408045 hasConcept C22019652 @default.
- W4368408045 hasConcept C23123220 @default.
- W4368408045 hasConcept C2776214188 @default.
- W4368408045 hasConcept C2776321320 @default.
- W4368408045 hasConcept C2777842544 @default.
- W4368408045 hasConcept C2780451532 @default.
- W4368408045 hasConcept C41008148 @default.
- W4368408045 hasConcept C41458344 @default.
- W4368408045 hasConcept C41895202 @default.
- W4368408045 hasConcept C50644808 @default.
- W4368408045 hasConcept C97256817 @default.
- W4368408045 hasConceptScore W4368408045C111368507 @default.
- W4368408045 hasConceptScore W4368408045C112698675 @default.
- W4368408045 hasConceptScore W4368408045C119857082 @default.
- W4368408045 hasConceptScore W4368408045C12725497 @default.
- W4368408045 hasConceptScore W4368408045C127313418 @default.
- W4368408045 hasConceptScore W4368408045C138885662 @default.
- W4368408045 hasConceptScore W4368408045C144133560 @default.
- W4368408045 hasConceptScore W4368408045C146849305 @default.
- W4368408045 hasConceptScore W4368408045C154945302 @default.
- W4368408045 hasConceptScore W4368408045C162324750 @default.
- W4368408045 hasConceptScore W4368408045C187736073 @default.
- W4368408045 hasConceptScore W4368408045C203005215 @default.