Matches in SemOpenAlex for { <https://semopenalex.org/work/W2597485909> ?p ?o ?g. }
Showing items 1 to 93 of 93 with 100 items per page.
- W2597485909 endingPage "804" @default.
- W2597485909 startingPage "794" @default.
- W2597485909 abstract "One key issue in text mining and natural language processing is how to effectively represent documents using numerical vectors. One classical model is the Bag-of-Words (BoW). In a BoW-based vector representation of a document, each element denotes the normalized number of occurrences of a basis term in the document. To count the number of occurrences of a basis term, BoW conducts exact word matching, which can be regarded as a hard mapping from words to the basis term. BoW representation suffers from its intrinsic extreme sparsity, high dimensionality, and inability to capture high-level semantic meanings behind text data. To address the aforementioned issues, we propose a new document representation method named fuzzy Bag-of-Words (FBoW) in this paper. FBoW adopts a fuzzy mapping based on semantic correlation among words quantified by cosine similarity measures between word embeddings. Since word semantic matching instead of exact word string matching is used, the FBoW could encode more semantics into the numerical representation. In addition, we propose to use word clusters instead of individual words as basis terms and develop fuzzy Bag-of-WordClusters (FBoWC) models. Three variants under the framework of FBoWC are proposed based on three different similarity measures between word clusters and words, which are named $\text{FBoWC}_{\rm mean}$, $\text{FBoWC}_{\rm max}$, and $\text{FBoWC}_{\rm min}$, respectively. Document representations learned by the proposed FBoW and FBoWC are dense and able to encode high-level semantics. The task of document categorization is used to evaluate the performance of learned representation by the proposed FBoW and FBoWC methods. The results on seven real-world document classification datasets in comparison with six document representation learning methods have shown that our methods FBoW and FBoWC achieve the highest classification accuracies." @default.
- W2597485909 created "2017-04-07" @default.
- W2597485909 creator A5066203581 @default.
- W2597485909 creator A5086031494 @default.
- W2597485909 date "2018-04-01" @default.
- W2597485909 modified "2023-10-16" @default.
- W2597485909 title "Fuzzy Bag-of-Words Model for Document Representation" @default.
- W2597485909 cites W1010415138 @default.
- W2597485909 cites W1530780135 @default.
- W2597485909 cites W1832693441 @default.
- W2597485909 cites W1965667542 @default.
- W2597485909 cites W1978394996 @default.
- W2597485909 cites W1984047604 @default.
- W2597485909 cites W1988920861 @default.
- W2597485909 cites W2019207508 @default.
- W2597485909 cites W2025168209 @default.
- W2597485909 cites W2054759324 @default.
- W2597485909 cites W2065691025 @default.
- W2597485909 cites W2076028064 @default.
- W2597485909 cites W2097089247 @default.
- W2597485909 cites W2105499314 @default.
- W2597485909 cites W2120615054 @default.
- W2597485909 cites W2129250947 @default.
- W2597485909 cites W2129777964 @default.
- W2597485909 cites W2134731454 @default.
- W2597485909 cites W2140321362 @default.
- W2597485909 cites W2141738323 @default.
- W2597485909 cites W2144211451 @default.
- W2597485909 cites W2146434190 @default.
- W2597485909 cites W2153635508 @default.
- W2597485909 cites W2157331557 @default.
- W2597485909 cites W2158323699 @default.
- W2597485909 cites W2158997610 @default.
- W2597485909 cites W2163922914 @default.
- W2597485909 cites W2167428023 @default.
- W2597485909 cites W2342693253 @default.
- W2597485909 cites W2550132532 @default.
- W2597485909 cites W4205184193 @default.
- W2597485909 cites W4318422521 @default.
- W2597485909 doi "https://doi.org/10.1109/tfuzz.2017.2690222" @default.
- W2597485909 hasPublicationYear "2018" @default.
- W2597485909 type Work @default.
- W2597485909 sameAs 2597485909 @default.
- W2597485909 citedByCount "143" @default.
- W2597485909 countsByYear W25974859092017 @default.
- W2597485909 countsByYear W25974859092018 @default.
- W2597485909 countsByYear W25974859092019 @default.
- W2597485909 countsByYear W25974859092020 @default.
- W2597485909 countsByYear W25974859092021 @default.
- W2597485909 countsByYear W25974859092022 @default.
- W2597485909 countsByYear W25974859092023 @default.
- W2597485909 crossrefType "journal-article" @default.
- W2597485909 hasAuthorship W2597485909A5066203581 @default.
- W2597485909 hasAuthorship W2597485909A5086031494 @default.
- W2597485909 hasConcept C153180895 @default.
- W2597485909 hasConcept C154945302 @default.
- W2597485909 hasConcept C17744445 @default.
- W2597485909 hasConcept C199539241 @default.
- W2597485909 hasConcept C204321447 @default.
- W2597485909 hasConcept C2776359362 @default.
- W2597485909 hasConcept C41008148 @default.
- W2597485909 hasConcept C58166 @default.
- W2597485909 hasConcept C94625758 @default.
- W2597485909 hasConceptScore W2597485909C153180895 @default.
- W2597485909 hasConceptScore W2597485909C154945302 @default.
- W2597485909 hasConceptScore W2597485909C17744445 @default.
- W2597485909 hasConceptScore W2597485909C199539241 @default.
- W2597485909 hasConceptScore W2597485909C204321447 @default.
- W2597485909 hasConceptScore W2597485909C2776359362 @default.
- W2597485909 hasConceptScore W2597485909C41008148 @default.
- W2597485909 hasConceptScore W2597485909C58166 @default.
- W2597485909 hasConceptScore W2597485909C94625758 @default.
- W2597485909 hasIssue "2" @default.
- W2597485909 hasLocation W25974859091 @default.
- W2597485909 hasOpenAccess W2597485909 @default.
- W2597485909 hasPrimaryLocation W25974859091 @default.
- W2597485909 hasRelatedWork W1552159754 @default.
- W2597485909 hasRelatedWork W2033914206 @default.
- W2597485909 hasRelatedWork W2131420137 @default.
- W2597485909 hasRelatedWork W2148757832 @default.
- W2597485909 hasRelatedWork W2293457016 @default.
- W2597485909 hasRelatedWork W2368651715 @default.
- W2597485909 hasRelatedWork W2611614995 @default.
- W2597485909 hasRelatedWork W2789919619 @default.
- W2597485909 hasRelatedWork W3107474891 @default.
- W2597485909 hasRelatedWork W3169305685 @default.
- W2597485909 hasVolume "26" @default.
- W2597485909 isParatext "false" @default.
- W2597485909 isRetracted "false" @default.
- W2597485909 magId "2597485909" @default.
- W2597485909 workType "article" @default.
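
The abstract above describes FBoW as replacing exact word matching with a fuzzy mapping based on cosine similarity between word embeddings, and FBoWC as using word clusters as basis terms with mean, max, or min similarity. The following is a minimal sketch of that idea, not the authors' implementation. Everything named here (`cosine`, `membership`, `fbow`, `fbowc`, the toy embeddings) is hypothetical, and two details are assumptions not stated in the abstract: negative similarities are clipped to zero, and a word's similarity to a cluster is aggregated over the cluster's member words. Count normalization is omitted for brevity.

```python
import math
from collections import Counter
from statistics import fmean


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def membership(word, term, emb):
    """Fuzzy membership of `word` in basis `term`.

    Assumption: negative cosine similarities are clipped to 0.
    """
    return max(0.0, cosine(emb[word], emb[term]))


def fbow(tokens, basis_terms, emb):
    """One component per basis term: counts weighted by fuzzy membership."""
    counts = Counter(t for t in tokens if t in emb)
    return [
        sum(n * membership(w, term, emb) for w, n in counts.items())
        for term in basis_terms
    ]


def fbowc(tokens, clusters, emb, agg=fmean):
    """One component per word cluster.

    Assumption: word-to-cluster similarity is `agg` (fmean, max, or min)
    over the word's memberships in each cluster member.
    """
    counts = Counter(t for t in tokens if t in emb)
    return [
        sum(n * agg([membership(w, m, emb) for m in cluster])
            for w, n in counts.items())
        for cluster in clusters
    ]


# Toy 2-D embeddings, invented purely for illustration.
toy_emb = {"cat": [1.0, 0.0], "kitten": [0.8, 0.6], "car": [0.0, 1.0]}
doc = ["kitten", "kitten", "car"]
clusters = [["cat", "kitten"], ["car"]]

doc_fbow = fbow(doc, ["cat", "car"], toy_emb)
doc_fbowc_max = fbowc(doc, clusters, toy_emb, agg=max)
```

With exact matching, the "cat" component of this toy document would be 0 because "cat" never appears in it. Under the fuzzy mapping it is positive, since "kitten" is close to "cat" in the toy embedding space. This is the sense in which the abstract calls the resulting representation dense.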