Matches in SemOpenAlex for { <https://semopenalex.org/work/W4309739688> ?p ?o ?g. }
Showing items 1 to 93 of
93
with 100 items per page.
- W4309739688 endingPage "11727" @default.
- W4309739688 startingPage "11727" @default.
- W4309739688 abstract "More than 7000 languages are spoken in the world today. Amharic is one of the languages spoken in the East African country Ethiopia. A lot of speech data is being made every day in different languages as machines are getting better at processing and have improved storing capacity. However, searching for a particular word with its respective time frame inside a given audio file is a challenge. Since Amharic has its own distinguishing characteristics, such as glottal, palatal, and labialized consonants, it is not possible to directly use models that are developed for other languages. A popular approach in developing systems for searching particular information in speech involves using an automatic speech recognition (ASR) module that generates the text version of the speech where the word or phrase is searched based on text query. However, it is not possible to transcribe a long audio file without segmentation, which in turn affects the performance of the ASR module. In this paper, we are reporting our investigation on the effects of manual and automatic speech segmentation of Amharic audio files in a spiritual domain. We have used manual segmentation as a baseline for our investigation and found out that sentence-like automatic segmentation resulted in a word error rate (WER) closer to the WER achieved on the manually segmented test speech. Based on the experimental results, we propose Amharic speech search using text word query (ASSTWQ) based on automatic sentence-like segmentation. Since we have achieved lower WER using the previously developed speech corpus, which is in a broadcast news domain, together with the in-domain speech corpus, we recommend using both in- and out-domain speech corpora to develop the Amharic ASR module. The performance of the proposed ASR is a WER of 53% that needs further improvement. Combining two language models (LMs) developed using training text from the two different domains (spiritual and broadcast news) allowed a WER reduction from 53% to 46%. Therefore, we have developed two ASSTWQ systems using the two ASR modules with WERs of 53% and 46%." @default.
- W4309739688 created "2022-11-29" @default.
- W4309739688 creator A5004912976 @default.
- W4309739688 creator A5007418167 @default.
- W4309739688 creator A5076128522 @default.
- W4309739688 creator A5085350970 @default.
- W4309739688 date "2022-11-18" @default.
- W4309739688 modified "2023-09-25" @default.
- W4309739688 title "Amharic Speech Search Using Text Word Query Based on Automatic Sentence-like Segmentation" @default.
- W4309739688 cites W114193738 @default.
- W4309739688 cites W134911624 @default.
- W4309739688 cites W1486939005 @default.
- W4309739688 cites W2012026885 @default.
- W4309739688 cites W2097051852 @default.
- W4309739688 cites W2152753389 @default.
- W4309739688 cites W2195354 @default.
- W4309739688 cites W2689696018 @default.
- W4309739688 cites W2770362844 @default.
- W4309739688 cites W2782146591 @default.
- W4309739688 cites W2809793882 @default.
- W4309739688 cites W2942574155 @default.
- W4309739688 cites W3100732527 @default.
- W4309739688 cites W3111398107 @default.
- W4309739688 cites W3119953805 @default.
- W4309739688 cites W3175595715 @default.
- W4309739688 cites W3199491502 @default.
- W4309739688 cites W4205107597 @default.
- W4309739688 cites W4238136833 @default.
- W4309739688 doi "https://doi.org/10.3390/app122211727" @default.
- W4309739688 hasPublicationYear "2022" @default.
- W4309739688 type Work @default.
- W4309739688 citedByCount "0" @default.
- W4309739688 crossrefType "journal-article" @default.
- W4309739688 hasAuthorship W4309739688A5004912976 @default.
- W4309739688 hasAuthorship W4309739688A5007418167 @default.
- W4309739688 hasAuthorship W4309739688A5076128522 @default.
- W4309739688 hasAuthorship W4309739688A5085350970 @default.
- W4309739688 hasBestOaLocation W43097396881 @default.
- W4309739688 hasConcept C14999030 @default.
- W4309739688 hasConcept C154945302 @default.
- W4309739688 hasConcept C155635449 @default.
- W4309739688 hasConcept C157968479 @default.
- W4309739688 hasConcept C204321447 @default.
- W4309739688 hasConcept C207030507 @default.
- W4309739688 hasConcept C2776224158 @default.
- W4309739688 hasConcept C2777530160 @default.
- W4309739688 hasConcept C2780900699 @default.
- W4309739688 hasConcept C28490314 @default.
- W4309739688 hasConcept C40969351 @default.
- W4309739688 hasConcept C41008148 @default.
- W4309739688 hasConcept C54953205 @default.
- W4309739688 hasConcept C61328038 @default.
- W4309739688 hasConcept C89600930 @default.
- W4309739688 hasConcept C91863865 @default.
- W4309739688 hasConcept C98501671 @default.
- W4309739688 hasConceptScore W4309739688C14999030 @default.
- W4309739688 hasConceptScore W4309739688C154945302 @default.
- W4309739688 hasConceptScore W4309739688C155635449 @default.
- W4309739688 hasConceptScore W4309739688C157968479 @default.
- W4309739688 hasConceptScore W4309739688C204321447 @default.
- W4309739688 hasConceptScore W4309739688C207030507 @default.
- W4309739688 hasConceptScore W4309739688C2776224158 @default.
- W4309739688 hasConceptScore W4309739688C2777530160 @default.
- W4309739688 hasConceptScore W4309739688C2780900699 @default.
- W4309739688 hasConceptScore W4309739688C28490314 @default.
- W4309739688 hasConceptScore W4309739688C40969351 @default.
- W4309739688 hasConceptScore W4309739688C41008148 @default.
- W4309739688 hasConceptScore W4309739688C54953205 @default.
- W4309739688 hasConceptScore W4309739688C61328038 @default.
- W4309739688 hasConceptScore W4309739688C89600930 @default.
- W4309739688 hasConceptScore W4309739688C91863865 @default.
- W4309739688 hasConceptScore W4309739688C98501671 @default.
- W4309739688 hasIssue "22" @default.
- W4309739688 hasLocation W43097396881 @default.
- W4309739688 hasLocation W43097396882 @default.
- W4309739688 hasOpenAccess W4309739688 @default.
- W4309739688 hasPrimaryLocation W43097396881 @default.
- W4309739688 hasRelatedWork W1799027130 @default.
- W4309739688 hasRelatedWork W2122924390 @default.
- W4309739688 hasRelatedWork W2157598242 @default.
- W4309739688 hasRelatedWork W2171351754 @default.
- W4309739688 hasRelatedWork W2376203252 @default.
- W4309739688 hasRelatedWork W2784059283 @default.
- W4309739688 hasRelatedWork W3089379469 @default.
- W4309739688 hasRelatedWork W3112480982 @default.
- W4309739688 hasRelatedWork W3151376046 @default.
- W4309739688 hasRelatedWork W4309739688 @default.
- W4309739688 hasVolume "12" @default.
- W4309739688 isParatext "false" @default.
- W4309739688 isRetracted "false" @default.
- W4309739688 workType "article" @default.