Matches in SemOpenAlex for { <https://semopenalex.org/work/W3118374080> ?p ?o ?g. }
Showing items 1 to 100 of
100
with 100 items per page.
- W3118374080 endingPage "149" @default.
- W3118374080 startingPage "140" @default.
- W3118374080 abstract "Occitan is a Romance language spoken mainly in the south of France. It has no official status in the country, it is not standardized and displays important diatopic variation resulting in a rich system of dialects. Recently, a first treebank for this language was created. However, this corpus is based exclusively on texts in the Lengadocian dialect. Our paper describes the work aimed at extending the existing corpus with content in three new dialects, namely Gascon, Provencau and Lemosin. We describe both the annotation of initial content in these new varieties of Occitan and experiments allowing us to identify the most efficient method for further enrichment of the corpus. We observe that parsing models trained on Occitan dialects achieve better results than a delexicalized model trained on other Romance languages despite the latter training corpus being much larger (20K vs 900K tokens). The results of the native Occitan models show an important impact of cross-dialectal lexical variation, whereas syntactic variation seems to affect the systems less. We hope that the resulting corpus, incorporating several Occitan varieties, will facilitate the training of robust NLP tools, capable of processing all kinds of Occitan texts." @default.
- W3118374080 created "2021-01-18" @default.
- W3118374080 creator A5032752012 @default.
- W3118374080 creator A5045668765 @default.
- W3118374080 creator A5047160739 @default.
- W3118374080 creator A5070434269 @default.
- W3118374080 creator A5072763617 @default.
- W3118374080 creator A5076578554 @default.
- W3118374080 date "2020-12-01" @default.
- W3118374080 modified "2023-10-04" @default.
- W3118374080 title "A Four-Dialect Treebank for Occitan: Building Process and Parsing Experiments" @default.
- W3118374080 cites W1565169561 @default.
- W3118374080 cites W1569915133 @default.
- W3118374080 cites W2027979924 @default.
- W3118374080 cites W2134354099 @default.
- W3118374080 cites W2250173480 @default.
- W3118374080 cites W2251324968 @default.
- W3118374080 cites W2251443950 @default.
- W3118374080 cites W2265796885 @default.
- W3118374080 cites W2569308312 @default.
- W3118374080 cites W2578468121 @default.
- W3118374080 cites W2805082254 @default.
- W3118374080 cites W2887636719 @default.
- W3118374080 cites W2994621060 @default.
- W3118374080 cites W3032771801 @default.
- W3118374080 cites W3117254193 @default.
- W3118374080 cites W3127568958 @default.
- W3118374080 cites W331019419 @default.
- W3118374080 cites W38718431 @default.
- W3118374080 cites W647189206 @default.
- W3118374080 hasPublicationYear "2020" @default.
- W3118374080 type Work @default.
- W3118374080 sameAs 3118374080 @default.
- W3118374080 citedByCount "0" @default.
- W3118374080 crossrefType "proceedings-article" @default.
- W3118374080 hasAuthorship W3118374080A5032752012 @default.
- W3118374080 hasAuthorship W3118374080A5045668765 @default.
- W3118374080 hasAuthorship W3118374080A5047160739 @default.
- W3118374080 hasAuthorship W3118374080A5070434269 @default.
- W3118374080 hasAuthorship W3118374080A5072763617 @default.
- W3118374080 hasAuthorship W3118374080A5076578554 @default.
- W3118374080 hasConcept C121332964 @default.
- W3118374080 hasConcept C138885662 @default.
- W3118374080 hasConcept C154945302 @default.
- W3118374080 hasConcept C161831844 @default.
- W3118374080 hasConcept C186644900 @default.
- W3118374080 hasConcept C199360897 @default.
- W3118374080 hasConcept C204321447 @default.
- W3118374080 hasConcept C206134035 @default.
- W3118374080 hasConcept C2776321320 @default.
- W3118374080 hasConcept C2778334786 @default.
- W3118374080 hasConcept C41008148 @default.
- W3118374080 hasConcept C41132520 @default.
- W3118374080 hasConcept C41895202 @default.
- W3118374080 hasConcept C44870925 @default.
- W3118374080 hasConcept C98045186 @default.
- W3118374080 hasConceptScore W3118374080C121332964 @default.
- W3118374080 hasConceptScore W3118374080C138885662 @default.
- W3118374080 hasConceptScore W3118374080C154945302 @default.
- W3118374080 hasConceptScore W3118374080C161831844 @default.
- W3118374080 hasConceptScore W3118374080C186644900 @default.
- W3118374080 hasConceptScore W3118374080C199360897 @default.
- W3118374080 hasConceptScore W3118374080C204321447 @default.
- W3118374080 hasConceptScore W3118374080C206134035 @default.
- W3118374080 hasConceptScore W3118374080C2776321320 @default.
- W3118374080 hasConceptScore W3118374080C2778334786 @default.
- W3118374080 hasConceptScore W3118374080C41008148 @default.
- W3118374080 hasConceptScore W3118374080C41132520 @default.
- W3118374080 hasConceptScore W3118374080C41895202 @default.
- W3118374080 hasConceptScore W3118374080C44870925 @default.
- W3118374080 hasConceptScore W3118374080C98045186 @default.
- W3118374080 hasLocation W31183740801 @default.
- W3118374080 hasOpenAccess W3118374080 @default.
- W3118374080 hasPrimaryLocation W31183740801 @default.
- W3118374080 hasRelatedWork W181876033 @default.
- W3118374080 hasRelatedWork W1998988916 @default.
- W3118374080 hasRelatedWork W2041283357 @default.
- W3118374080 hasRelatedWork W2144283235 @default.
- W3118374080 hasRelatedWork W2245468790 @default.
- W3118374080 hasRelatedWork W2251899375 @default.
- W3118374080 hasRelatedWork W2251992530 @default.
- W3118374080 hasRelatedWork W2276492818 @default.
- W3118374080 hasRelatedWork W2328729146 @default.
- W3118374080 hasRelatedWork W2575828199 @default.
- W3118374080 hasRelatedWork W2576362958 @default.
- W3118374080 hasRelatedWork W2576860871 @default.
- W3118374080 hasRelatedWork W2740647784 @default.
- W3118374080 hasRelatedWork W2757758134 @default.
- W3118374080 hasRelatedWork W2994621060 @default.
- W3118374080 hasRelatedWork W3032432775 @default.
- W3118374080 hasRelatedWork W3043431263 @default.
- W3118374080 hasRelatedWork W3088331576 @default.
- W3118374080 hasRelatedWork W41492599 @default.
- W3118374080 hasRelatedWork W46355541 @default.
- W3118374080 isParatext "false" @default.
- W3118374080 isRetracted "false" @default.
- W3118374080 magId "3118374080" @default.
- W3118374080 workType "article" @default.