Matches in SemOpenAlex for { <https://semopenalex.org/work/W2000332471> ?p ?o ?g. }
- W2000332471 endingPage "325" @default.
- W2000332471 startingPage "293" @default.
- W2000332471 abstract "While human annotation is crucial for many natural language processing tasks, it is often very expensive and time-consuming. Inspired by previous work on crowdsourcing, we investigate the viability of using non-expert labels instead of gold standard annotations from experts for a machine learning approach to automatic readability prediction. In order to do so, we evaluate two different methodologies to assess the readability of a wide variety of text material: a more traditional setup in which expert readers make readability judgments and a crowdsourcing setup for users who are not necessarily experts. For this purpose two assessment tools were implemented: a tool where expert readers can rank a batch of texts based on readability, and a lightweight crowdsourcing tool, which invites users to provide pairwise comparisons. To validate this approach, readability assessments for a corpus of written Dutch generic texts were gathered. By collecting multiple assessments per text, we explicitly wanted to level out readers' background knowledge and attitude. Our findings show that the assessments collected through both methodologies are highly consistent and that crowdsourcing is a viable alternative to expert labeling. This is good news, as crowdsourcing is more lightweight to use and can reach a much wider audience of potential annotators. By performing a set of basic machine learning experiments using a feature set that mainly encodes basic lexical and morpho-syntactic information, we further illustrate how the collected data can be used to perform text comparisons or to assign an absolute readability score to an individual text. We do not focus on optimising the algorithms to achieve the best possible results for the learning tasks, but carry them out to illustrate the various possibilities of our data sets. The results on different data sets, however, show that our system outperforms the readability formulas and a baseline language modelling approach. We conclude that readability assessment by comparing texts is a polyvalent methodology, which can be adapted to specific domains and target audiences if required." @default.
- W2000332471 created "2016-06-24" @default.
- W2000332471 creator A5016968290 @default.
- W2000332471 creator A5019867041 @default.
- W2000332471 creator A5049195161 @default.
- W2000332471 creator A5056749857 @default.
- W2000332471 creator A5075631545 @default.
- W2000332471 creator A5081335221 @default.
- W2000332471 date "2012-12-14" @default.
- W2000332471 modified "2023-10-01" @default.
- W2000332471 title "Using the crowd for readability prediction" @default.
- W2000332471 cites W1507711477 @default.
- W2000332471 cites W1967390364 @default.
- W2000332471 cites W1982643343 @default.
- W2000332471 cites W1988230003 @default.
- W2000332471 cites W2019416425 @default.
- W2000332471 cites W2062585132 @default.
- W2000332471 cites W2065240770 @default.
- W2000332471 cites W2080515068 @default.
- W2000332471 cites W2094394862 @default.
- W2000332471 cites W2106695994 @default.
- W2000332471 cites W2109115148 @default.
- W2000332471 cites W2115968643 @default.
- W2000332471 cites W2164866017 @default.
- W2000332471 cites W2323266328 @default.
- W2000332471 cites W31070363 @default.
- W2000332471 cites W4252080790 @default.
- W2000332471 doi "https://doi.org/10.1017/s1351324912000344" @default.
- W2000332471 hasPublicationYear "2012" @default.
- W2000332471 type Work @default.
- W2000332471 sameAs 2000332471 @default.
- W2000332471 citedByCount "40" @default.
- W2000332471 countsByYear W20003324712013 @default.
- W2000332471 countsByYear W20003324712014 @default.
- W2000332471 countsByYear W20003324712015 @default.
- W2000332471 countsByYear W20003324712016 @default.
- W2000332471 countsByYear W20003324712017 @default.
- W2000332471 countsByYear W20003324712018 @default.
- W2000332471 countsByYear W20003324712019 @default.
- W2000332471 countsByYear W20003324712020 @default.
- W2000332471 countsByYear W20003324712021 @default.
- W2000332471 countsByYear W20003324712022 @default.
- W2000332471 countsByYear W20003324712023 @default.
- W2000332471 crossrefType "journal-article" @default.
- W2000332471 hasAuthorship W2000332471A5016968290 @default.
- W2000332471 hasAuthorship W2000332471A5019867041 @default.
- W2000332471 hasAuthorship W2000332471A5049195161 @default.
- W2000332471 hasAuthorship W2000332471A5056749857 @default.
- W2000332471 hasAuthorship W2000332471A5075631545 @default.
- W2000332471 hasAuthorship W2000332471A5081335221 @default.
- W2000332471 hasBestOaLocation W20003324712 @default.
- W2000332471 hasConcept C119857082 @default.
- W2000332471 hasConcept C136197465 @default.
- W2000332471 hasConcept C136764020 @default.
- W2000332471 hasConcept C138885662 @default.
- W2000332471 hasConcept C154945302 @default.
- W2000332471 hasConcept C177264268 @default.
- W2000332471 hasConcept C184898388 @default.
- W2000332471 hasConcept C199360897 @default.
- W2000332471 hasConcept C204321447 @default.
- W2000332471 hasConcept C23123220 @default.
- W2000332471 hasConcept C2776401178 @default.
- W2000332471 hasConcept C2778143727 @default.
- W2000332471 hasConcept C41008148 @default.
- W2000332471 hasConcept C41895202 @default.
- W2000332471 hasConcept C62230096 @default.
- W2000332471 hasConceptScore W2000332471C119857082 @default.
- W2000332471 hasConceptScore W2000332471C136197465 @default.
- W2000332471 hasConceptScore W2000332471C136764020 @default.
- W2000332471 hasConceptScore W2000332471C138885662 @default.
- W2000332471 hasConceptScore W2000332471C154945302 @default.
- W2000332471 hasConceptScore W2000332471C177264268 @default.
- W2000332471 hasConceptScore W2000332471C184898388 @default.
- W2000332471 hasConceptScore W2000332471C199360897 @default.
- W2000332471 hasConceptScore W2000332471C204321447 @default.
- W2000332471 hasConceptScore W2000332471C23123220 @default.
- W2000332471 hasConceptScore W2000332471C2776401178 @default.
- W2000332471 hasConceptScore W2000332471C2778143727 @default.
- W2000332471 hasConceptScore W2000332471C41008148 @default.
- W2000332471 hasConceptScore W2000332471C41895202 @default.
- W2000332471 hasConceptScore W2000332471C62230096 @default.
- W2000332471 hasIssue "3" @default.
- W2000332471 hasLocation W20003324711 @default.
- W2000332471 hasLocation W20003324712 @default.
- W2000332471 hasOpenAccess W2000332471 @default.
- W2000332471 hasPrimaryLocation W20003324711 @default.
- W2000332471 hasRelatedWork W2000332471 @default.
- W2000332471 hasRelatedWork W2051200425 @default.
- W2000332471 hasRelatedWork W2081830265 @default.
- W2000332471 hasRelatedWork W2507137711 @default.
- W2000332471 hasRelatedWork W2583490418 @default.
- W2000332471 hasRelatedWork W2604943478 @default.
- W2000332471 hasRelatedWork W2952262154 @default.
- W2000332471 hasRelatedWork W3208689818 @default.
- W2000332471 hasRelatedWork W4286891255 @default.
- W2000332471 hasRelatedWork W4300110141 @default.
- W2000332471 hasVolume "20" @default.
- W2000332471 isParatext "false" @default.