Matches in SemOpenAlex for { <https://semopenalex.org/work/W4310343134> ?p ?o ?g. }
Showing items 1 to 60 of 60 with 100 items per page.
- W4310343134 abstract "Abstract One of the main concerns of researchers in writing scientific texts such as articles and theses is their correctness in terms of spelling since the presence of spelling errors in these texts is unacceptable. This problem, like many natural language processing problems, is highly dependent on the structure and grammar of the language. Persian language is a challenging language in this area due to the presence of homophonic and dotted letters. In addition, many Arabic terms have entered this language. These words and terms have introduced the challenges of correcting Arabic spelling errors into Persian and created a complex combination. Moreover, due to the fact that many Persian speakers are Muslim, the Arabic content of the holy Qur'an has also found its way into Persian texts in such a way that today there are many Islamic texts with mixed Persian and Arabic content, and there is a great need for a tool that can correct bilingual Arabic and Persian spelling errors. In this work, an approach based on machine learning and an unsupervised algorithm is proposed which is designed based on N-gram language models. The data used here consists of about 220,000 sentences with mixed Arabic and Persian content, from which N-grams are made. The language model benefits from a statistical model derived from the probability of N-grams frequencies to score the possible candidates for the erroneous word and choose the best one. In order to evaluate the proposed method, test data has been prepared for Persian-Islamic content, in which spelling errors have been generated manually. The results of the evaluations show a significant improvement compared to similar tools in the Persian language." @default.
- W4310343134 created "2022-12-09" @default.
- W4310343134 creator A5017101746 @default.
- W4310343134 creator A5032412334 @default.
- W4310343134 creator A5052685401 @default.
- W4310343134 creator A5053976236 @default.
- W4310343134 date "2022-11-29" @default.
- W4310343134 modified "2023-10-14" @default.
- W4310343134 title "An unsupervised approach for bilingual Arabic and Persian spell correction using N-gram based Language models" @default.
- W4310343134 cites W2108458913 @default.
- W4310343134 doi "https://doi.org/10.21203/rs.3.rs-2308869/v1" @default.
- W4310343134 hasPublicationYear "2022" @default.
- W4310343134 type Work @default.
- W4310343134 citedByCount "0" @default.
- W4310343134 crossrefType "posted-content" @default.
- W4310343134 hasAuthorship W4310343134A5017101746 @default.
- W4310343134 hasAuthorship W4310343134A5032412334 @default.
- W4310343134 hasAuthorship W4310343134A5052685401 @default.
- W4310343134 hasAuthorship W4310343134A5053976236 @default.
- W4310343134 hasBestOaLocation W43103431341 @default.
- W4310343134 hasConcept C11413529 @default.
- W4310343134 hasConcept C117884012 @default.
- W4310343134 hasConcept C137293760 @default.
- W4310343134 hasConcept C138885662 @default.
- W4310343134 hasConcept C154945302 @default.
- W4310343134 hasConcept C204321447 @default.
- W4310343134 hasConcept C26022165 @default.
- W4310343134 hasConcept C2776527531 @default.
- W4310343134 hasConcept C2777801307 @default.
- W4310343134 hasConcept C41008148 @default.
- W4310343134 hasConcept C41895202 @default.
- W4310343134 hasConcept C55439883 @default.
- W4310343134 hasConceptScore W4310343134C11413529 @default.
- W4310343134 hasConceptScore W4310343134C117884012 @default.
- W4310343134 hasConceptScore W4310343134C137293760 @default.
- W4310343134 hasConceptScore W4310343134C138885662 @default.
- W4310343134 hasConceptScore W4310343134C154945302 @default.
- W4310343134 hasConceptScore W4310343134C204321447 @default.
- W4310343134 hasConceptScore W4310343134C26022165 @default.
- W4310343134 hasConceptScore W4310343134C2776527531 @default.
- W4310343134 hasConceptScore W4310343134C2777801307 @default.
- W4310343134 hasConceptScore W4310343134C41008148 @default.
- W4310343134 hasConceptScore W4310343134C41895202 @default.
- W4310343134 hasConceptScore W4310343134C55439883 @default.
- W4310343134 hasLocation W43103431341 @default.
- W4310343134 hasOpenAccess W4310343134 @default.
- W4310343134 hasPrimaryLocation W43103431341 @default.
- W4310343134 hasRelatedWork W2008468404 @default.
- W4310343134 hasRelatedWork W2057384730 @default.
- W4310343134 hasRelatedWork W2081295016 @default.
- W4310343134 hasRelatedWork W2132221452 @default.
- W4310343134 hasRelatedWork W2147879411 @default.
- W4310343134 hasRelatedWork W2250909759 @default.
- W4310343134 hasRelatedWork W2532616038 @default.
- W4310343134 hasRelatedWork W2624072012 @default.
- W4310343134 hasRelatedWork W2787311093 @default.
- W4310343134 hasRelatedWork W4307474317 @default.
- W4310343134 isParatext "false" @default.
- W4310343134 isRetracted "false" @default.
- W4310343134 workType "article" @default.
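
The abstract recorded above describes scoring candidate corrections for an erroneous word with probabilities derived from N-gram frequencies and choosing the best one. A minimal sketch of that general idea follows. It is not the authors' implementation: the abstract does not state the N-gram order, the smoothing scheme, or how candidates are generated, so the bigram model, add-one smoothing, single-edit candidate generation, and all function names here are assumptions made for illustration. A toy Latin-alphabet corpus stands in for the roughly 220,000 mixed Arabic and Persian sentences.

```python
from collections import Counter

def train_bigrams(sentences):
    """Count unigrams and bigrams over whitespace-tokenized sentences."""
    unigrams, bigrams = Counter(), Counter()
    for sentence in sentences:
        tokens = ["<s>"] + sentence.split() + ["</s>"]
        unigrams.update(tokens)
        bigrams.update(zip(tokens, tokens[1:]))
    return unigrams, bigrams

def bigram_prob(prev, word, unigrams, bigrams, vocab_size):
    """Add-one smoothed estimate of P(word | prev) (smoothing is an assumption)."""
    return (bigrams[(prev, word)] + 1) / (unigrams[prev] + vocab_size)

def edits1(word, alphabet):
    """All strings one edit away: delete, transpose, replace, insert."""
    splits = [(word[:i], word[i:]) for i in range(len(word) + 1)]
    deletes = [l + r[1:] for l, r in splits if r]
    transposes = [l + r[1] + r[0] + r[2:] for l, r in splits if len(r) > 1]
    replaces = [l + c + r[1:] for l, r in splits if r for c in alphabet]
    inserts = [l + c + r for l, r in splits for c in alphabet]
    return set(deletes + transposes + replaces + inserts)

def correct(prev, word, nxt, unigrams, bigrams, alphabet):
    """Return the in-vocabulary candidate that best fits its left and right context."""
    vocab = set(unigrams)
    if word in vocab:
        return word  # known words are left unchanged in this sketch
    candidates = edits1(word, alphabet) & vocab
    if not candidates:
        return word  # nothing within one edit; give up
    v = len(vocab)

    def score(candidate):
        # Product of the two bigram probabilities that involve the candidate.
        return (bigram_prob(prev, candidate, unigrams, bigrams, v)
                * bigram_prob(candidate, nxt, unigrams, bigrams, v))

    return max(candidates, key=score)

# Hypothetical toy data, for illustration only.
ALPHABET = "abcdefghijklmnopqrstuvwxyz"
CORPUS = ["the cat sat on the mat", "the cat ate the fish", "a cot is a bed"]
UNIGRAMS, BIGRAMS = train_bigrams(CORPUS)
print(correct("the", "cxt", "sat", UNIGRAMS, BIGRAMS, ALPHABET))
```

Here the misspelling "cxt" has two in-vocabulary candidates, "cat" and "cot", and the surrounding context decides between them. For Arabic and Persian text the alphabet string would need to cover both scripts, and a real system would likely weight edits between visually or phonetically similar letters, which this sketch does not attempt.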