Matches in SemOpenAlex for { <https://semopenalex.org/work/W4240560945> ?p ?o ?g. }
Showing items 1 to 67 of
67
with 100 items per page.
- W4240560945 abstract "Setswana, a Bantu language in the Sotho group, is one of the eleven official languages of South Africa. The language is characterised by a disjunctive orthography, mainly affecting the important word category of verbs. In particular, verbal prefixal morphemes are usually written disjunctively, while suffixal morphemes follow a conjunctive writing style. Therefore, Setswana tokenisation cannot be based solely on whitespace, as is the case in many alphabetic, segmented languages, including the conjunctively written Nguni group of South African Bantu languages. This paper shows how a combination of two tokeniser transducers and a finite-state (rule-based) morphological analyser may be combined to effectively solve the Setswana tokenisation problem. The approach has the important advantage of bringing the processing of Setswana beyond the morphological analysis level in line with what is appropriate for the Nguni languages. This means that the challenge of the disjunctive orthography is met at the tokenisation/morphological analysis level and does not in principle propagate to subsequent levels of analysis such as POS tagging and shallow parsing, etc. Indeed, the approach ensures that an aspect such as orthography does not obfuscate sound linguistics and, ultimately, proper semantic analysis, which remains the ultimate aim of linguistic analysis and therefore also computational linguistic analysis." @default.
- W4240560945 created "2022-05-12" @default.
- W4240560945 creator A5015337389 @default.
- W4240560945 creator A5018993029 @default.
- W4240560945 creator A5036798968 @default.
- W4240560945 creator A5041725366 @default.
- W4240560945 date "2009-01-01" @default.
- W4240560945 modified "2023-10-16" @default.
- W4240560945 title "Setswana tokenisation and computational verb morphology" @default.
- W4240560945 doi "https://doi.org/10.3115/1564508.1564522" @default.
- W4240560945 hasPublicationYear "2009" @default.
- W4240560945 type Work @default.
- W4240560945 citedByCount "5" @default.
- W4240560945 countsByYear W42405609452014 @default.
- W4240560945 countsByYear W42405609452017 @default.
- W4240560945 countsByYear W42405609452019 @default.
- W4240560945 countsByYear W42405609452022 @default.
- W4240560945 crossrefType "proceedings-article" @default.
- W4240560945 hasAuthorship W4240560945A5015337389 @default.
- W4240560945 hasAuthorship W4240560945A5018993029 @default.
- W4240560945 hasAuthorship W4240560945A5036798968 @default.
- W4240560945 hasAuthorship W4240560945A5041725366 @default.
- W4240560945 hasBestOaLocation W42405609451 @default.
- W4240560945 hasConcept C108494575 @default.
- W4240560945 hasConcept C138885662 @default.
- W4240560945 hasConcept C150670947 @default.
- W4240560945 hasConcept C154945302 @default.
- W4240560945 hasConcept C161831844 @default.
- W4240560945 hasConcept C165297611 @default.
- W4240560945 hasConcept C186644900 @default.
- W4240560945 hasConcept C204321447 @default.
- W4240560945 hasConcept C2776397901 @default.
- W4240560945 hasConcept C2780566098 @default.
- W4240560945 hasConcept C41008148 @default.
- W4240560945 hasConcept C41895202 @default.
- W4240560945 hasConcept C554936623 @default.
- W4240560945 hasConcept C99878080 @default.
- W4240560945 hasConceptScore W4240560945C108494575 @default.
- W4240560945 hasConceptScore W4240560945C138885662 @default.
- W4240560945 hasConceptScore W4240560945C150670947 @default.
- W4240560945 hasConceptScore W4240560945C154945302 @default.
- W4240560945 hasConceptScore W4240560945C161831844 @default.
- W4240560945 hasConceptScore W4240560945C165297611 @default.
- W4240560945 hasConceptScore W4240560945C186644900 @default.
- W4240560945 hasConceptScore W4240560945C204321447 @default.
- W4240560945 hasConceptScore W4240560945C2776397901 @default.
- W4240560945 hasConceptScore W4240560945C2780566098 @default.
- W4240560945 hasConceptScore W4240560945C41008148 @default.
- W4240560945 hasConceptScore W4240560945C41895202 @default.
- W4240560945 hasConceptScore W4240560945C554936623 @default.
- W4240560945 hasConceptScore W4240560945C99878080 @default.
- W4240560945 hasLocation W42405609451 @default.
- W4240560945 hasOpenAccess W4240560945 @default.
- W4240560945 hasPrimaryLocation W42405609451 @default.
- W4240560945 hasRelatedWork W1538801958 @default.
- W4240560945 hasRelatedWork W1585034923 @default.
- W4240560945 hasRelatedWork W2046859240 @default.
- W4240560945 hasRelatedWork W2067255402 @default.
- W4240560945 hasRelatedWork W2100618589 @default.
- W4240560945 hasRelatedWork W2978485045 @default.
- W4240560945 hasRelatedWork W3194008465 @default.
- W4240560945 hasRelatedWork W3201127864 @default.
- W4240560945 hasRelatedWork W1551406738 @default.
- W4240560945 hasRelatedWork W2580451037 @default.
- W4240560945 isParatext "false" @default.
- W4240560945 isRetracted "false" @default.
- W4240560945 workType "article" @default.