Matches in SemOpenAlex for { <https://semopenalex.org/work/W2337329403> ?p ?o ?g. }
- W2337329403 abstract "Despite enjoying the status of an official EU language, Irish is considered a minority language. As with most minority languages, it is a `low-density' language, which means it lacks important linguistic and Natural Language Processing (NLP) resources. Relative to better-resourced languages such as English or French, for example, little research has been carried out on computational analysis or processing of Irish. Parsing is the method of analysing the linguistic structure of text, and it is an invaluable processing step that is required for many different types of language technology applications. As a verb-initial language, Irish has several features that are uncharacteristic of many languages previously studied in parsing research. Our work broadens the application of NLP methods to less studied language structures and provides a basis on which future work in Irish NLP is possible. We report on the development of a dependency treebank that serves as training data for the first full Irish dependency parser. We discuss the linguistic structures of Irish, and the motivation behind the design of our annotation scheme. Our work also examines various methods of employing semi-automated approaches to treebank development. We overcome the relatively small pool of linguistic and technological resources available for the Irish language with these approaches, and show that even in early stages of development, parsing results for Irish are promising. What counts as a sufficient number of trees for training a parser varies according to languages. Through empirical methods, we explore the impact our treebank's size and content has on parsing accuracy for Irish. We also discuss our work in crosslingual studies through converting our treebank to a universal annotation scheme. Finally we extend our Irish NLP work to the unstructured user-generated text of Irish tweets. We report on the creation of a POS-tagged corpus of Irish tweets and the training of statistical POS-tagging models. We show how existing resources can be leveraged for this domain-adapted resource development." @default.
- W2337329403 created "2016-06-24" @default.
- W2337329403 creator A5026085889 @default.
- W2337329403 date "2016-03-01" @default.
- W2337329403 modified "2023-09-24" @default.
- W2337329403 title "Irish dependency treebanking and parsing" @default.
- W2337329403 cites W1014376541 @default.
- W2337329403 cites W103758133 @default.
- W2337329403 cites W106326161 @default.
- W2337329403 cites W113196894 @default.
- W2337329403 cites W1185336703 @default.
- W2337329403 cites W127819914 @default.
- W2337329403 cites W13173216 @default.
- W2337329403 cites W145835004 @default.
- W2337329403 cites W1488763748 @default.
- W2337329403 cites W1491975949 @default.
- W2337329403 cites W1493820313 @default.
- W2337329403 cites W149972807 @default.
- W2337329403 cites W1500431650 @default.
- W2337329403 cites W150422849 @default.
- W2337329403 cites W1510819241 @default.
- W2337329403 cites W1511887618 @default.
- W2337329403 cites W1527181724 @default.
- W2337329403 cites W153277182 @default.
- W2337329403 cites W1533480149 @default.
- W2337329403 cites W1535015163 @default.
- W2337329403 cites W1539312853 @default.
- W2337329403 cites W1567570606 @default.
- W2337329403 cites W1569915133 @default.
- W2337329403 cites W1578503079 @default.
- W2337329403 cites W1580375566 @default.
- W2337329403 cites W1582588624 @default.
- W2337329403 cites W1586073462 @default.
- W2337329403 cites W1592072150 @default.
- W2337329403 cites W1600844763 @default.
- W2337329403 cites W1632114991 @default.
- W2337329403 cites W1671799483 @default.
- W2337329403 cites W1722351164 @default.
- W2337329403 cites W1745817670 @default.
- W2337329403 cites W179033058 @default.
- W2337329403 cites W183066880 @default.
- W2337329403 cites W1847370584 @default.
- W2337329403 cites W1865928303 @default.
- W2337329403 cites W1909180194 @default.
- W2337329403 cites W1911662473 @default.
- W2337329403 cites W1962622193 @default.
- W2337329403 cites W1963684808 @default.
- W2337329403 cites W1968499222 @default.
- W2337329403 cites W1973775139 @default.
- W2337329403 cites W1993202648 @default.
- W2337329403 cites W2000196122 @default.
- W2337329403 cites W2002586403 @default.
- W2337329403 cites W2002664886 @default.
- W2337329403 cites W2026976290 @default.
- W2337329403 cites W2027979924 @default.
- W2337329403 cites W2035955017 @default.
- W2337329403 cites W2052449326 @default.
- W2337329403 cites W2053154970 @default.
- W2337329403 cites W2060097798 @default.
- W2337329403 cites W2062478322 @default.
- W2337329403 cites W2065157922 @default.
- W2337329403 cites W2066832356 @default.
- W2337329403 cites W2069196424 @default.
- W2337329403 cites W2080021732 @default.
- W2337329403 cites W2085989833 @default.
- W2337329403 cites W2088198454 @default.
- W2337329403 cites W2089505529 @default.
- W2337329403 cites W2093647425 @default.
- W2337329403 cites W2094061585 @default.
- W2337329403 cites W2095981462 @default.
- W2337329403 cites W2097173846 @default.
- W2337329403 cites W2098301261 @default.
- W2337329403 cites W2101761627 @default.
- W2337329403 cites W2102331280 @default.
- W2337329403 cites W2105103433 @default.
- W2337329403 cites W2108460050 @default.
- W2337329403 cites W2113691817 @default.
- W2337329403 cites W2113788796 @default.
- W2337329403 cites W2114663556 @default.
- W2337329403 cites W2118875438 @default.
- W2337329403 cites W2122922578 @default.
- W2337329403 cites W2123325748 @default.
- W2337329403 cites W2124772738 @default.
- W2337329403 cites W2130867674 @default.
- W2337329403 cites W2133853925 @default.
- W2337329403 cites W2134354099 @default.
- W2337329403 cites W2135547275 @default.
- W2337329403 cites W2138517257 @default.
- W2337329403 cites W2138909885 @default.
- W2337329403 cites W2139885235 @default.
- W2337329403 cites W2139907445 @default.
- W2337329403 cites W2141766660 @default.
- W2337329403 cites W2142708806 @default.
- W2337329403 cites W2143995218 @default.
- W2337329403 cites W2145837098 @default.
- W2337329403 cites W2149467188 @default.
- W2337329403 cites W2151023586 @default.
- W2337329403 cites W2152691628 @default.
- W2337329403 cites W2153072998 @default.
- W2337329403 cites W2153800732 @default.