Matches in SemOpenAlex for { <https://semopenalex.org/work/W204086241> ?p ?o ?g. }
- W204086241 abstract "A great challenge of text mining arises from the increasingly large text datasets and the high dimensionality associated with natural language. In this research, a systematic study is conducted of six Dimension Reduction Techniques (DRT) in the context of the text clustering problem using three standard benchmark datasets. The methods considered include three feature transformation techiques, Independent Component Analysis (ICA), Latent Semantic Indexing (LSI), Random Projection (RP), and three feature selection techniques based on Document Frequency (DF ), mean TfIdf (TI) and Term Frequency Variance (TfV ). Experiments with the k-means clustering algorithm show that ICA and LSI are clearly superior to RP on all three datasets. Furthermore,it is shown that TI and TfV outperform DF for text clustering. Finally, experiments where a selection technique is followed by a transformation technique show that this combination can help substantially reduce the computational cost associated with the best transformation methods (ICA and LSI) while preserving clustering performance." @default.
- W204086241 created "2016-06-24" @default.
- W204086241 creator A5003699164 @default.
- W204086241 creator A5013989913 @default.
- W204086241 creator A5023699897 @default.
- W204086241 creator A5086618048 @default.
- W204086241 date "2005-01-01" @default.
- W204086241 modified "2023-10-02" @default.
- W204086241 title "Comparing and Combining Dimension Reduction Techniques for Efficient Text Clustering" @default.
- W204086241 cites W1488854477 @default.
- W204086241 cites W1557925196 @default.
- W204086241 cites W1599022437 @default.
- W204086241 cites W2006533296 @default.
- W204086241 cites W2053171205 @default.
- W204086241 cites W2063392856 @default.
- W204086241 cites W2072773380 @default.
- W204086241 cites W2089497633 @default.
- W204086241 cites W2107743791 @default.
- W204086241 cites W2123649031 @default.
- W204086241 cites W2147152072 @default.
- W204086241 cites W2435251607 @default.
- W204086241 cites W3139328003 @default.
- W204086241 hasPublicationYear "2005" @default.
- W204086241 type Work @default.
- W204086241 sameAs 204086241 @default.
- W204086241 citedByCount "34" @default.
- W204086241 countsByYear W2040862412012 @default.
- W204086241 countsByYear W2040862412014 @default.
- W204086241 countsByYear W2040862412015 @default.
- W204086241 countsByYear W2040862412016 @default.
- W204086241 countsByYear W2040862412017 @default.
- W204086241 countsByYear W2040862412018 @default.
- W204086241 countsByYear W2040862412021 @default.
- W204086241 countsByYear W2040862412022 @default.
- W204086241 crossrefType "journal-article" @default.
- W204086241 hasAuthorship W204086241A5003699164 @default.
- W204086241 hasAuthorship W204086241A5013989913 @default.
- W204086241 hasAuthorship W204086241A5023699897 @default.
- W204086241 hasAuthorship W204086241A5086618048 @default.
- W204086241 hasConcept C104317684 @default.
- W204086241 hasConcept C11413529 @default.
- W204086241 hasConcept C121332964 @default.
- W204086241 hasConcept C124101348 @default.
- W204086241 hasConcept C13280743 @default.
- W204086241 hasConcept C148483581 @default.
- W204086241 hasConcept C151730666 @default.
- W204086241 hasConcept C153180895 @default.
- W204086241 hasConcept C154945302 @default.
- W204086241 hasConcept C177937566 @default.
- W204086241 hasConcept C184509293 @default.
- W204086241 hasConcept C185592680 @default.
- W204086241 hasConcept C185798385 @default.
- W204086241 hasConcept C202444582 @default.
- W204086241 hasConcept C204241405 @default.
- W204086241 hasConcept C205649164 @default.
- W204086241 hasConcept C27438332 @default.
- W204086241 hasConcept C2777036070 @default.
- W204086241 hasConcept C2779343474 @default.
- W204086241 hasConcept C33676613 @default.
- W204086241 hasConcept C33923547 @default.
- W204086241 hasConcept C41008148 @default.
- W204086241 hasConcept C55493867 @default.
- W204086241 hasConcept C57493831 @default.
- W204086241 hasConcept C61797465 @default.
- W204086241 hasConcept C62520636 @default.
- W204086241 hasConcept C70518039 @default.
- W204086241 hasConcept C73555534 @default.
- W204086241 hasConcept C81758059 @default.
- W204086241 hasConcept C81917197 @default.
- W204086241 hasConcept C86803240 @default.
- W204086241 hasConceptScore W204086241C104317684 @default.
- W204086241 hasConceptScore W204086241C11413529 @default.
- W204086241 hasConceptScore W204086241C121332964 @default.
- W204086241 hasConceptScore W204086241C124101348 @default.
- W204086241 hasConceptScore W204086241C13280743 @default.
- W204086241 hasConceptScore W204086241C148483581 @default.
- W204086241 hasConceptScore W204086241C151730666 @default.
- W204086241 hasConceptScore W204086241C153180895 @default.
- W204086241 hasConceptScore W204086241C154945302 @default.
- W204086241 hasConceptScore W204086241C177937566 @default.
- W204086241 hasConceptScore W204086241C184509293 @default.
- W204086241 hasConceptScore W204086241C185592680 @default.
- W204086241 hasConceptScore W204086241C185798385 @default.
- W204086241 hasConceptScore W204086241C202444582 @default.
- W204086241 hasConceptScore W204086241C204241405 @default.
- W204086241 hasConceptScore W204086241C205649164 @default.
- W204086241 hasConceptScore W204086241C27438332 @default.
- W204086241 hasConceptScore W204086241C2777036070 @default.
- W204086241 hasConceptScore W204086241C2779343474 @default.
- W204086241 hasConceptScore W204086241C33676613 @default.
- W204086241 hasConceptScore W204086241C33923547 @default.
- W204086241 hasConceptScore W204086241C41008148 @default.
- W204086241 hasConceptScore W204086241C55493867 @default.
- W204086241 hasConceptScore W204086241C57493831 @default.
- W204086241 hasConceptScore W204086241C61797465 @default.
- W204086241 hasConceptScore W204086241C62520636 @default.
- W204086241 hasConceptScore W204086241C70518039 @default.
- W204086241 hasConceptScore W204086241C73555534 @default.
- W204086241 hasConceptScore W204086241C81758059 @default.
- W204086241 hasConceptScore W204086241C81917197 @default.