Matches in SemOpenAlex for { <https://semopenalex.org/work/W3012964921> ?p ?o ?g. }
Showing items 1 to 91 of 91 with 100 items per page.
- W3012964921 abstract "Yoruba is a widely spoken West African language with a writing system rich in orthographic and tonal diacritics. They provide morphological information, are crucial for lexical disambiguation, pronunciation and are vital for any computational Speech or Natural Language Processing tasks. However, diacritic marks are commonly excluded from electronic texts due to limited device and application support as well as general education on proper usage. We report on recent efforts at dataset cultivation. By aggregating and improving disparate texts from the web and various personal libraries, we were able to significantly grow our clean Yoruba dataset from a majority Biblical text corpus with three sources to millions of tokens from over a dozen sources. We evaluate updated diacritic restoration models on a new, general purpose, public-domain Yoruba evaluation dataset of modern journalistic news text, selected to be multi-purpose and reflecting contemporary usage. All pre-trained models, datasets and source-code have been released as an open-source project to advance efforts on Yoruba language technology." @default.
- W3012964921 created "2020-04-03" @default.
- W3012964921 creator A5014289850 @default.
- W3012964921 creator A5016069022 @default.
- W3012964921 creator A5036857292 @default.
- W3012964921 creator A5048759633 @default.
- W3012964921 creator A5051594349 @default.
- W3012964921 creator A5059533703 @default.
- W3012964921 creator A5088658365 @default.
- W3012964921 date "2020-03-23" @default.
- W3012964921 modified "2023-09-27" @default.
- W3012964921 title "Improving Yorùbá Diacritic Restoration" @default.
- W3012964921 cites W2067861964 @default.
- W3012964921 cites W2149995043 @default.
- W3012964921 cites W2396775433 @default.
- W3012964921 cites W2493916176 @default.
- W3012964921 cites W2742700688 @default.
- W3012964921 cites W2963212250 @default.
- W3012964921 cites W3031473890 @default.
- W3012964921 hasPublicationYear "2020" @default.
- W3012964921 type Work @default.
- W3012964921 sameAs 3012964921 @default.
- W3012964921 citedByCount "2" @default.
- W3012964921 countsByYear W30129649212020 @default.
- W3012964921 crossrefType "posted-content" @default.
- W3012964921 hasAuthorship W3012964921A5014289850 @default.
- W3012964921 hasAuthorship W3012964921A5016069022 @default.
- W3012964921 hasAuthorship W3012964921A5036857292 @default.
- W3012964921 hasAuthorship W3012964921A5048759633 @default.
- W3012964921 hasAuthorship W3012964921A5051594349 @default.
- W3012964921 hasAuthorship W3012964921A5059533703 @default.
- W3012964921 hasAuthorship W3012964921A5088658365 @default.
- W3012964921 hasConcept C134306372 @default.
- W3012964921 hasConcept C136764020 @default.
- W3012964921 hasConcept C138885662 @default.
- W3012964921 hasConcept C154945302 @default.
- W3012964921 hasConcept C177264268 @default.
- W3012964921 hasConcept C185181809 @default.
- W3012964921 hasConcept C199360897 @default.
- W3012964921 hasConcept C204321447 @default.
- W3012964921 hasConcept C2776760102 @default.
- W3012964921 hasConcept C2777568999 @default.
- W3012964921 hasConcept C2780844864 @default.
- W3012964921 hasConcept C33923547 @default.
- W3012964921 hasConcept C36503486 @default.
- W3012964921 hasConcept C41008148 @default.
- W3012964921 hasConcept C41895202 @default.
- W3012964921 hasConcept C94375191 @default.
- W3012964921 hasConceptScore W3012964921C134306372 @default.
- W3012964921 hasConceptScore W3012964921C136764020 @default.
- W3012964921 hasConceptScore W3012964921C138885662 @default.
- W3012964921 hasConceptScore W3012964921C154945302 @default.
- W3012964921 hasConceptScore W3012964921C177264268 @default.
- W3012964921 hasConceptScore W3012964921C185181809 @default.
- W3012964921 hasConceptScore W3012964921C199360897 @default.
- W3012964921 hasConceptScore W3012964921C204321447 @default.
- W3012964921 hasConceptScore W3012964921C2776760102 @default.
- W3012964921 hasConceptScore W3012964921C2777568999 @default.
- W3012964921 hasConceptScore W3012964921C2780844864 @default.
- W3012964921 hasConceptScore W3012964921C33923547 @default.
- W3012964921 hasConceptScore W3012964921C36503486 @default.
- W3012964921 hasConceptScore W3012964921C41008148 @default.
- W3012964921 hasConceptScore W3012964921C41895202 @default.
- W3012964921 hasConceptScore W3012964921C94375191 @default.
- W3012964921 hasLocation W30129649211 @default.
- W3012964921 hasOpenAccess W3012964921 @default.
- W3012964921 hasPrimaryLocation W30129649211 @default.
- W3012964921 hasRelatedWork W2100777930 @default.
- W3012964921 hasRelatedWork W2123693498 @default.
- W3012964921 hasRelatedWork W2239919291 @default.
- W3012964921 hasRelatedWork W2336575964 @default.
- W3012964921 hasRelatedWork W2409439155 @default.
- W3012964921 hasRelatedWork W2597551331 @default.
- W3012964921 hasRelatedWork W2740489261 @default.
- W3012964921 hasRelatedWork W2886567646 @default.
- W3012964921 hasRelatedWork W2949663109 @default.
- W3012964921 hasRelatedWork W2963678298 @default.
- W3012964921 hasRelatedWork W2981526238 @default.
- W3012964921 hasRelatedWork W2981821110 @default.
- W3012964921 hasRelatedWork W3004753145 @default.
- W3012964921 hasRelatedWork W3013173896 @default.
- W3012964921 hasRelatedWork W3023933836 @default.
- W3012964921 hasRelatedWork W3030901678 @default.
- W3012964921 hasRelatedWork W3031473890 @default.
- W3012964921 hasRelatedWork W3088649398 @default.
- W3012964921 hasRelatedWork W3118580427 @default.
- W3012964921 hasRelatedWork W3178190498 @default.
- W3012964921 isParatext "false" @default.
- W3012964921 isRetracted "false" @default.
- W3012964921 magId "3012964921" @default.
- W3012964921 workType "article" @default.