Matches in SemOpenAlex for { <https://semopenalex.org/work/W4288021280> ?p ?o ?g. }
Showing items 1 to 59 of
59
with 100 items per page.
- W4288021280 abstract "Idiomatic expressions like `out of the woods' and `up the ante' present a range of difficulties for natural language processing applications. We present work on the annotation and extraction of what we term potentially idiomatic expressions (PIEs), a subclass of multiword expressions covering both literal and non-literal uses of idiomatic expressions. Existing corpora of PIEs are small and have limited coverage of different PIE types, which hampers research. To further progress on the extraction and disambiguation of potentially idiomatic expressions, larger corpora of PIEs are required. In addition, larger corpora are a potential source for valuable linguistic insights into idiomatic expressions and their variability. We propose automatic tools to facilitate the building of larger PIE corpora, by investigating the feasibility of using dictionary-based extraction of PIEs as a pre-extraction tool for English. We do this by assessing the reliability and coverage of idiom dictionaries, the annotation of a PIE corpus, and the automatic extraction of PIEs from a large corpus. Results show that combinations of dictionaries are a reliable source of idiomatic expressions, that PIEs can be annotated with a high reliability (0.74-0.91 Fleiss' Kappa), and that parse-based PIE extraction yields highly accurate performance (88% F1-score). Combining complementary PIE extraction methods increases reliability further, to over 92% F1-score. Moreover, the extraction method presented here could be extended to other types of multiword expressions and to other languages, given that sufficient NLP tools are available." @default.
- W4288021280 created "2022-07-26" @default.
- W4288021280 creator A5034237344 @default.
- W4288021280 creator A5040564747 @default.
- W4288021280 creator A5057798904 @default.
- W4288021280 date "2019-11-20" @default.
- W4288021280 modified "2023-09-24" @default.
- W4288021280 title "Casting a Wide Net: Robust Extraction of Potentially Idiomatic Expressions" @default.
- W4288021280 doi "https://doi.org/10.48550/arxiv.1911.08829" @default.
- W4288021280 hasPublicationYear "2019" @default.
- W4288021280 type Work @default.
- W4288021280 citedByCount "0" @default.
- W4288021280 crossrefType "posted-content" @default.
- W4288021280 hasAuthorship W4288021280A5034237344 @default.
- W4288021280 hasAuthorship W4288021280A5040564747 @default.
- W4288021280 hasAuthorship W4288021280A5057798904 @default.
- W4288021280 hasBestOaLocation W42880212801 @default.
- W4288021280 hasConcept C11413529 @default.
- W4288021280 hasConcept C121332964 @default.
- W4288021280 hasConcept C154945302 @default.
- W4288021280 hasConcept C163258240 @default.
- W4288021280 hasConcept C186644900 @default.
- W4288021280 hasConcept C199360897 @default.
- W4288021280 hasConcept C204321447 @default.
- W4288021280 hasConcept C2776321320 @default.
- W4288021280 hasConcept C2780882242 @default.
- W4288021280 hasConcept C41008148 @default.
- W4288021280 hasConcept C43214815 @default.
- W4288021280 hasConcept C62520636 @default.
- W4288021280 hasConcept C90559484 @default.
- W4288021280 hasConceptScore W4288021280C11413529 @default.
- W4288021280 hasConceptScore W4288021280C121332964 @default.
- W4288021280 hasConceptScore W4288021280C154945302 @default.
- W4288021280 hasConceptScore W4288021280C163258240 @default.
- W4288021280 hasConceptScore W4288021280C186644900 @default.
- W4288021280 hasConceptScore W4288021280C199360897 @default.
- W4288021280 hasConceptScore W4288021280C204321447 @default.
- W4288021280 hasConceptScore W4288021280C2776321320 @default.
- W4288021280 hasConceptScore W4288021280C2780882242 @default.
- W4288021280 hasConceptScore W4288021280C41008148 @default.
- W4288021280 hasConceptScore W4288021280C43214815 @default.
- W4288021280 hasConceptScore W4288021280C62520636 @default.
- W4288021280 hasConceptScore W4288021280C90559484 @default.
- W4288021280 hasLocation W42880212801 @default.
- W4288021280 hasOpenAccess W4288021280 @default.
- W4288021280 hasPrimaryLocation W42880212801 @default.
- W4288021280 hasRelatedWork W11531451 @default.
- W4288021280 hasRelatedWork W1503459176 @default.
- W4288021280 hasRelatedWork W1552159754 @default.
- W4288021280 hasRelatedWork W1575587935 @default.
- W4288021280 hasRelatedWork W1892467659 @default.
- W4288021280 hasRelatedWork W1930331324 @default.
- W4288021280 hasRelatedWork W2389552174 @default.
- W4288021280 hasRelatedWork W2502722637 @default.
- W4288021280 hasRelatedWork W2754381625 @default.
- W4288021280 hasRelatedWork W2903680434 @default.
- W4288021280 isParatext "false" @default.
- W4288021280 isRetracted "false" @default.
- W4288021280 workType "article" @default.