Matches in SemOpenAlex for { <https://semopenalex.org/work/W3182155247> ?p ?o ?g. }
Showing items 1 to 75 of 75 with 100 items per page.
- W3182155247 abstract "Phrase Detectives Corpus Version 2 was developed by the School of Computer Science and Electronic Engineering at the University of Essex and consists of approximately 407,000 tokens across 537 documents anaphorically annotated by the Phrase Detectives Game, an online interactive game-with-a-purpose (GWAP) designed to collect data about English anaphoric coreference. This release constitutes a new version of the Phrase Detectives Corpus (LDC2017T08) that adds significantly more annotated tokens to the data set and supplies for each markable a substantial number of judgments expressed by the players and a silver label annotation based on the probabilistic aggregation method for anaphoric information. The use of GWAPs for creating language resources is growing. In general, they employ non-monetary incentives, such as entertainment, to motivate participation and can be successful for large-scale persistent annotation efforts. Two projects that collect linguistic resources via Phrase Detectives and other similar language-oriented GWAPs are DALI (Disagreements and Language Interpretation), led by Queen Mary University of London and the University of Essex, and the LDC NIEUW (Novel Incentives and Workflows in Linguistic Data Annotation) project through its game site Lingo Boingo, in collaboration with Queen Mary University, the University of Essex and other partners. Data: The documents in the corpus are taken from Wikipedia articles and from narrative text in Project Gutenberg. The annotation is a simplified form of the coding scheme used in The ARRAU Corpus of Anaphoric Information (LDC2013T22). Players were asked to classify markables as referring or non-referring. Referring noun phrases could be classified either as discourse-new or discourse-old (referring to the same entity as a previous mention). Two types of non-referring expressions are identified: expletives and predicative NPs (called 'properties').
Discourse-old markables include so-called split antecedent plurals, as in 'Mary met John. They had dinner together.' All player judgments are stored in MAS-XML format; they average 20 judgments per markable, with up to 90 judgments in one case. A silver label extracted from those judgments using the MPA probabilistic annotation method (Paun et al., 2018) is also provided. Wikipedia articles are presented as HTML, and all other source files are presented as plain text. All text is encoded as UTF-8. Annotations are released in three formats: (1) MAS-XML (the format in the first release), (2) a CoNLL-style format based on the CoNLL 2011 and 2012 shared tasks on coreference and (3) CRAC 2018 format." @default.
- W3182155247 created "2021-07-19" @default.
- W3182155247 creator A5003328793 @default.
- W3182155247 creator A5014534985 @default.
- W3182155247 creator A5019552564 @default.
- W3182155247 creator A5047065550 @default.
- W3182155247 creator A5067137759 @default.
- W3182155247 date "2019-07-15" @default.
- W3182155247 modified "2023-09-23" @default.
- W3182155247 title "Phrase Detectives Corpus Version 2" @default.
- W3182155247 hasPublicationYear "2019" @default.
- W3182155247 type Work @default.
- W3182155247 sameAs 3182155247 @default.
- W3182155247 citedByCount "0" @default.
- W3182155247 crossrefType "dataset" @default.
- W3182155247 hasAuthorship W3182155247A5003328793 @default.
- W3182155247 hasAuthorship W3182155247A5014534985 @default.
- W3182155247 hasAuthorship W3182155247A5019552564 @default.
- W3182155247 hasAuthorship W3182155247A5047065550 @default.
- W3182155247 hasAuthorship W3182155247A5067137759 @default.
- W3182155247 hasConcept C121934690 @default.
- W3182155247 hasConcept C138268822 @default.
- W3182155247 hasConcept C138885662 @default.
- W3182155247 hasConcept C153962237 @default.
- W3182155247 hasConcept C154945302 @default.
- W3182155247 hasConcept C199033989 @default.
- W3182155247 hasConcept C204321447 @default.
- W3182155247 hasConcept C26022165 @default.
- W3182155247 hasConcept C2776224158 @default.
- W3182155247 hasConcept C2776321320 @default.
- W3182155247 hasConcept C28076734 @default.
- W3182155247 hasConcept C41008148 @default.
- W3182155247 hasConcept C41895202 @default.
- W3182155247 hasConcept C532629269 @default.
- W3182155247 hasConceptScore W3182155247C121934690 @default.
- W3182155247 hasConceptScore W3182155247C138268822 @default.
- W3182155247 hasConceptScore W3182155247C138885662 @default.
- W3182155247 hasConceptScore W3182155247C153962237 @default.
- W3182155247 hasConceptScore W3182155247C154945302 @default.
- W3182155247 hasConceptScore W3182155247C199033989 @default.
- W3182155247 hasConceptScore W3182155247C204321447 @default.
- W3182155247 hasConceptScore W3182155247C26022165 @default.
- W3182155247 hasConceptScore W3182155247C2776224158 @default.
- W3182155247 hasConceptScore W3182155247C2776321320 @default.
- W3182155247 hasConceptScore W3182155247C28076734 @default.
- W3182155247 hasConceptScore W3182155247C41008148 @default.
- W3182155247 hasConceptScore W3182155247C41895202 @default.
- W3182155247 hasConceptScore W3182155247C532629269 @default.
- W3182155247 hasLocation W31821552471 @default.
- W3182155247 hasOpenAccess W3182155247 @default.
- W3182155247 hasPrimaryLocation W31821552471 @default.
- W3182155247 hasRelatedWork W116192430 @default.
- W3182155247 hasRelatedWork W116700124 @default.
- W3182155247 hasRelatedWork W185240168 @default.
- W3182155247 hasRelatedWork W1981379484 @default.
- W3182155247 hasRelatedWork W2250203420 @default.
- W3182155247 hasRelatedWork W2544589406 @default.
- W3182155247 hasRelatedWork W2567922298 @default.
- W3182155247 hasRelatedWork W2573772233 @default.
- W3182155247 hasRelatedWork W2575233155 @default.
- W3182155247 hasRelatedWork W2626574723 @default.
- W3182155247 hasRelatedWork W2759600630 @default.
- W3182155247 hasRelatedWork W2806360462 @default.
- W3182155247 hasRelatedWork W2888824653 @default.
- W3182155247 hasRelatedWork W2911699369 @default.
- W3182155247 hasRelatedWork W3029801984 @default.
- W3182155247 hasRelatedWork W3179541312 @default.
- W3182155247 hasRelatedWork W3186788573 @default.
- W3182155247 hasRelatedWork W576550507 @default.
- W3182155247 hasRelatedWork W62273090 @default.
- W3182155247 hasRelatedWork W3196552872 @default.
- W3182155247 isParatext "false" @default.
- W3182155247 isRetracted "false" @default.
- W3182155247 magId "3182155247" @default.
- W3182155247 workType "dataset" @default.
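The rows above all follow the same shape: a leading dash, a subject, a predicate, an object (either a quoted literal or a bare identifier), and a graph name after `@`. As a minimal sketch, assuming that shape holds for every row, the listing can be read back into tuples and grouped by predicate. The names `ROW`, `parse_row` and `group_by_predicate` are hypothetical helpers introduced for illustration; they are not part of SemOpenAlex or any library.

```python
import re
from collections import defaultdict

# Assumed row shape (inferred from the listing, not an official format):
#   - <subject> <predicate> <object> @<graph>.
# The object is either a double-quoted literal or a bare identifier.
# DOTALL lets a quoted literal span a line break, as the abstract does.
ROW = re.compile(r'^- (\S+) (\S+) (?:"(.*)"|(\S+)) @(\S+)\.$', re.DOTALL)


def parse_row(line):
    """Return (subject, predicate, object, graph), or None if the line is not a row."""
    m = ROW.match(line.strip())
    if m is None:
        return None
    subject, predicate, literal, ident, graph = m.groups()
    obj = literal if literal is not None else ident
    return subject, predicate, obj, graph


def group_by_predicate(lines):
    """Collect the objects of all parsable rows under their predicate name."""
    grouped = defaultdict(list)
    for line in lines:
        row = parse_row(line)
        if row is not None:
            grouped[row[1]].append(row[2])
    return dict(grouped)


# A few rows copied from the listing above.
sample = [
    '- W3182155247 title "Phrase Detectives Corpus Version 2" @default.',
    '- W3182155247 creator A5003328793 @default.',
    '- W3182155247 creator A5014534985 @default.',
    '- W3182155247 citedByCount "0" @default.',
]
grouped = group_by_predicate(sample)
print(grouped["creator"])
```

Grouping by predicate makes repeated properties such as `creator`, `hasConcept` and `hasRelatedWork` easy to count or iterate over, while single-valued properties such as `title` end up as one-element lists. Lines that do not match the assumed shape, such as the page header, are skipped rather than raising an error.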