Matches in SemOpenAlex for { <https://semopenalex.org/work/W3003039250> ?p ?o ?g. }
Showing items 1 to 100 of
100
with 100 items per page.
- W3003039250 endingPage "59" @default.
- W3003039250 startingPage "59" @default.
- W3003039250 abstract "Arabic is one of the most semantically and syntactically complex languages in the world. A key challenging issue in text mining is text summarization, so we propose an unsupervised score-based method which combines the vector space model, continuous bag of words (CBOW), clustering, and a statistically-based method. The problems with multidocument text summarization are the noisy data, redundancy, diminished readability, and sentence incoherency. In this study, we adopt a preprocessing strategy to solve the noise problem and use the word2vec model for two purposes, first, to map the words to fixed-length vectors and, second, to obtain the semantic relationship between each vector based on the dimensions. Similarly, we use a k-means algorithm for two purposes: (1) Selecting the distinctive documents and tokenizing these documents to sentences, and (2) using another iteration of the k-means algorithm to select the key sentences based on the similarity metric to overcome the redundancy problem and generate the initial summary. Lastly, we use weighted principal component analysis (W-PCA) to map the sentences’ encoded weights based on a list of features. This selects the highest set of weights, which relates to important sentences for solving incoherency and readability problems. We adopted Recall-Oriented Understudy for Gisting Evaluation (ROUGE) as an evaluation measure to examine our proposed technique and compare it with state-of-the-art methods. Finally, an experiment on the Essex Arabic Summaries Corpus (EASC) using the ROUGE-1 and ROUGE-2 metrics showed promising results in comparison with existing methods." @default.
- W3003039250 created "2020-01-30" @default.
- W3003039250 creator A5009944817 @default.
- W3003039250 creator A5026248036 @default.
- W3003039250 creator A5053926147 @default.
- W3003039250 creator A5073043891 @default.
- W3003039250 date "2020-01-23" @default.
- W3003039250 modified "2023-10-07" @default.
- W3003039250 title "Multidocument Arabic Text Summarization Based on Clustering and Word2Vec to Reduce Redundancy" @default.
- W3003039250 cites W1955311981 @default.
- W3003039250 cites W1984364982 @default.
- W3003039250 cites W2470507356 @default.
- W3003039250 cites W2606838650 @default.
- W3003039250 cites W2724394450 @default.
- W3003039250 cites W2767277178 @default.
- W3003039250 cites W2767888789 @default.
- W3003039250 cites W2792089754 @default.
- W3003039250 cites W2800318991 @default.
- W3003039250 cites W2803026002 @default.
- W3003039250 cites W2890305344 @default.
- W3003039250 cites W2896070522 @default.
- W3003039250 cites W2901513145 @default.
- W3003039250 cites W2901772388 @default.
- W3003039250 cites W2905505354 @default.
- W3003039250 cites W2909602489 @default.
- W3003039250 cites W2922083656 @default.
- W3003039250 cites W2936565617 @default.
- W3003039250 cites W2947047992 @default.
- W3003039250 cites W2994289934 @default.
- W3003039250 cites W3209302286 @default.
- W3003039250 doi "https://doi.org/10.3390/info11020059" @default.
- W3003039250 hasPublicationYear "2020" @default.
- W3003039250 type Work @default.
- W3003039250 sameAs 3003039250 @default.
- W3003039250 citedByCount "24" @default.
- W3003039250 countsByYear W30030392502020 @default.
- W3003039250 countsByYear W30030392502021 @default.
- W3003039250 countsByYear W30030392502022 @default.
- W3003039250 countsByYear W30030392502023 @default.
- W3003039250 crossrefType "journal-article" @default.
- W3003039250 hasAuthorship W3003039250A5009944817 @default.
- W3003039250 hasAuthorship W3003039250A5026248036 @default.
- W3003039250 hasAuthorship W3003039250A5053926147 @default.
- W3003039250 hasAuthorship W3003039250A5073043891 @default.
- W3003039250 hasBestOaLocation W30030392501 @default.
- W3003039250 hasConcept C111919701 @default.
- W3003039250 hasConcept C124101348 @default.
- W3003039250 hasConcept C152124472 @default.
- W3003039250 hasConcept C153180895 @default.
- W3003039250 hasConcept C154945302 @default.
- W3003039250 hasConcept C170858558 @default.
- W3003039250 hasConcept C199360897 @default.
- W3003039250 hasConcept C204321447 @default.
- W3003039250 hasConcept C2776461190 @default.
- W3003039250 hasConcept C2777530160 @default.
- W3003039250 hasConcept C2778121359 @default.
- W3003039250 hasConcept C2778143727 @default.
- W3003039250 hasConcept C41008148 @default.
- W3003039250 hasConcept C41608201 @default.
- W3003039250 hasConcept C73555534 @default.
- W3003039250 hasConcept C89686163 @default.
- W3003039250 hasConceptScore W3003039250C111919701 @default.
- W3003039250 hasConceptScore W3003039250C124101348 @default.
- W3003039250 hasConceptScore W3003039250C152124472 @default.
- W3003039250 hasConceptScore W3003039250C153180895 @default.
- W3003039250 hasConceptScore W3003039250C154945302 @default.
- W3003039250 hasConceptScore W3003039250C170858558 @default.
- W3003039250 hasConceptScore W3003039250C199360897 @default.
- W3003039250 hasConceptScore W3003039250C204321447 @default.
- W3003039250 hasConceptScore W3003039250C2776461190 @default.
- W3003039250 hasConceptScore W3003039250C2777530160 @default.
- W3003039250 hasConceptScore W3003039250C2778121359 @default.
- W3003039250 hasConceptScore W3003039250C2778143727 @default.
- W3003039250 hasConceptScore W3003039250C41008148 @default.
- W3003039250 hasConceptScore W3003039250C41608201 @default.
- W3003039250 hasConceptScore W3003039250C73555534 @default.
- W3003039250 hasConceptScore W3003039250C89686163 @default.
- W3003039250 hasFunder F4320335595 @default.
- W3003039250 hasIssue "2" @default.
- W3003039250 hasLocation W30030392501 @default.
- W3003039250 hasLocation W30030392502 @default.
- W3003039250 hasOpenAccess W3003039250 @default.
- W3003039250 hasPrimaryLocation W30030392501 @default.
- W3003039250 hasRelatedWork W135583706 @default.
- W3003039250 hasRelatedWork W2081830265 @default.
- W3003039250 hasRelatedWork W2185451889 @default.
- W3003039250 hasRelatedWork W2347941600 @default.
- W3003039250 hasRelatedWork W2401226416 @default.
- W3003039250 hasRelatedWork W2773738819 @default.
- W3003039250 hasRelatedWork W3003039250 @default.
- W3003039250 hasRelatedWork W3015015255 @default.
- W3003039250 hasRelatedWork W4288796577 @default.
- W3003039250 hasRelatedWork W4294839263 @default.
- W3003039250 hasVolume "11" @default.
- W3003039250 isParatext "false" @default.
- W3003039250 isRetracted "false" @default.
- W3003039250 magId "3003039250" @default.
- W3003039250 workType "article" @default.