Matches in SemOpenAlex for { <https://semopenalex.org/work/W4387095890> ?p ?o ?g. }
Showing items 1 to 98 of
98
with 100 items per page.
- W4387095890 endingPage "121767" @default.
- W4387095890 startingPage "121767" @default.
- W4387095890 abstract "Automatic aggregation of similar words into semantically related groups (or clusters) is of interest to many natural language processing (NLP) applications. Extracting semantically related words and quasi-synonyms from text is a relatively new research area for the under-resourced Arabic language. Previous attempts addressed the problem of single-word term extraction. However, the absence of multiword terms (MWTs) dictionary, ontology, and semantic network makes extracting and identifying high-quality MWTs for the Arabic language a challenging problem. The main goal of this study is to extract corpus-based and coherent MWTs in the form of bigram and trigram sequences as an adequate representation of syntactic and semantic clusters. Therefore, this study contributes to this problem by implementing an algorithm named SEWAR, which uses the FastText algorithm to extract high-quality MWTs from a medical health corpus in Arabic. If put into practice, SEWAR can provide a suitable and helpful solution to classify and direct medical health questions to proper medical practitioners without human intervention. In addition, SEWAR can be applied to plenty of NLP tasks, such as information retrieval, question answering, and text summarization. Three metrics were used to assess the extracted MWTs; the pointwise mutual information (PMI), the cosine similarity, and the clustering purity measure. The results were promising and encouraging to generalize and apply SEWAR to extract MWTs from any Arabic corpus." @default.
- W4387095890 created "2023-09-28" @default.
- W4387095890 creator A5015245485 @default.
- W4387095890 creator A5033278658 @default.
- W4387095890 date "2024-03-01" @default.
- W4387095890 modified "2023-10-09" @default.
- W4387095890 title "SEWAR: A corpus-based N-gram approach for extracting semantically-related words from arabic medical corpus" @default.
- W4387095890 cites W1270133036 @default.
- W4387095890 cites W1577842164 @default.
- W4387095890 cites W1908119927 @default.
- W4387095890 cites W1960685374 @default.
- W4387095890 cites W1977766834 @default.
- W4387095890 cites W1979325497 @default.
- W4387095890 cites W2020362627 @default.
- W4387095890 cites W2026487812 @default.
- W4387095890 cites W2033937535 @default.
- W4387095890 cites W2034184318 @default.
- W4387095890 cites W2041717583 @default.
- W4387095890 cites W2049107599 @default.
- W4387095890 cites W2050641287 @default.
- W4387095890 cites W2058990119 @default.
- W4387095890 cites W2078693345 @default.
- W4387095890 cites W2132885068 @default.
- W4387095890 cites W2134141008 @default.
- W4387095890 cites W2148867178 @default.
- W4387095890 cites W2171313960 @default.
- W4387095890 cites W2250539671 @default.
- W4387095890 cites W2493916176 @default.
- W4387095890 cites W2772528510 @default.
- W4387095890 cites W2804450002 @default.
- W4387095890 cites W2888039742 @default.
- W4387095890 cites W2889492109 @default.
- W4387095890 cites W2903875997 @default.
- W4387095890 cites W2911338114 @default.
- W4387095890 cites W3010889635 @default.
- W4387095890 cites W3036520662 @default.
- W4387095890 cites W3084495208 @default.
- W4387095890 cites W3112113220 @default.
- W4387095890 cites W3120375251 @default.
- W4387095890 cites W3122116274 @default.
- W4387095890 cites W3198812502 @default.
- W4387095890 cites W3203949288 @default.
- W4387095890 cites W4281630578 @default.
- W4387095890 cites W4283458907 @default.
- W4387095890 doi "https://doi.org/10.1016/j.eswa.2023.121767" @default.
- W4387095890 hasPublicationYear "2024" @default.
- W4387095890 type Work @default.
- W4387095890 citedByCount "0" @default.
- W4387095890 crossrefType "journal-article" @default.
- W4387095890 hasAuthorship W4387095890A5015245485 @default.
- W4387095890 hasAuthorship W4387095890A5033278658 @default.
- W4387095890 hasConcept C108757681 @default.
- W4387095890 hasConcept C117884012 @default.
- W4387095890 hasConcept C137293760 @default.
- W4387095890 hasConcept C137546455 @default.
- W4387095890 hasConcept C138885662 @default.
- W4387095890 hasConcept C154945302 @default.
- W4387095890 hasConcept C170858558 @default.
- W4387095890 hasConcept C204321447 @default.
- W4387095890 hasConcept C23123220 @default.
- W4387095890 hasConcept C2780762811 @default.
- W4387095890 hasConcept C41008148 @default.
- W4387095890 hasConcept C41895202 @default.
- W4387095890 hasConcept C73555534 @default.
- W4387095890 hasConcept C96455323 @default.
- W4387095890 hasConceptScore W4387095890C108757681 @default.
- W4387095890 hasConceptScore W4387095890C117884012 @default.
- W4387095890 hasConceptScore W4387095890C137293760 @default.
- W4387095890 hasConceptScore W4387095890C137546455 @default.
- W4387095890 hasConceptScore W4387095890C138885662 @default.
- W4387095890 hasConceptScore W4387095890C154945302 @default.
- W4387095890 hasConceptScore W4387095890C170858558 @default.
- W4387095890 hasConceptScore W4387095890C204321447 @default.
- W4387095890 hasConceptScore W4387095890C23123220 @default.
- W4387095890 hasConceptScore W4387095890C2780762811 @default.
- W4387095890 hasConceptScore W4387095890C41008148 @default.
- W4387095890 hasConceptScore W4387095890C41895202 @default.
- W4387095890 hasConceptScore W4387095890C73555534 @default.
- W4387095890 hasConceptScore W4387095890C96455323 @default.
- W4387095890 hasLocation W43870958901 @default.
- W4387095890 hasOpenAccess W4387095890 @default.
- W4387095890 hasPrimaryLocation W43870958901 @default.
- W4387095890 hasRelatedWork W2164394510 @default.
- W4387095890 hasRelatedWork W2250909759 @default.
- W4387095890 hasRelatedWork W2463816369 @default.
- W4387095890 hasRelatedWork W2921680427 @default.
- W4387095890 hasRelatedWork W2940857995 @default.
- W4387095890 hasRelatedWork W2950765678 @default.
- W4387095890 hasRelatedWork W4288374102 @default.
- W4387095890 hasRelatedWork W4327499987 @default.
- W4387095890 hasRelatedWork W7593531 @default.
- W4387095890 hasRelatedWork W2917105722 @default.
- W4387095890 hasVolume "238" @default.
- W4387095890 isParatext "false" @default.
- W4387095890 isRetracted "false" @default.
- W4387095890 workType "article" @default.