Matches in SemOpenAlex for { <https://semopenalex.org/work/W4214728210> ?p ?o ?g. }
- W4214728210 abstract "The continuing success of the Internet has led to an enormous rise in the volume of electronic text records. The strategies for grouping these records into coherent groups are increasingly important. Traditional text clustering methods are focused on statistical characteristics, with a syntactic rather than semantical concept used to do clustering. A new approach for collecting documentation based on textual similarities is presented in this paper. The method is accomplished by defining, tokenizing, and stopping text synopses from Wikipedia and IMDB datasets using the NLTK dictionary. Then, a vector space is created using TFIDF with the K-mean algorithm to carry out clustering. The results were shown as an interactive website." @default.
- W4214728210 created "2022-03-02" @default.
- W4214728210 creator A5009237217 @default.
- W4214728210 creator A5034896032 @default.
- W4214728210 creator A5058552366 @default.
- W4214728210 creator A5077340508 @default.
- W4214728210 creator A5081282313 @default.
- W4214728210 creator A5087996799 @default.
- W4214728210 date "2021-09-21" @default.
- W4214728210 modified "2023-09-23" @default.
- W4214728210 title "Design a Clustering Document based Semantic Similarity System using TFIDF and K-Mean" @default.
- W4214728210 cites W1972389051 @default.
- W4214728210 cites W1987971958 @default.
- W4214728210 cites W2019755661 @default.
- W4214728210 cites W2029219570 @default.
- W4214728210 cites W2058990119 @default.
- W4214728210 cites W2114077504 @default.
- W4214728210 cites W2121197635 @default.
- W4214728210 cites W2133517430 @default.
- W4214728210 cites W2145252566 @default.
- W4214728210 cites W2544075565 @default.
- W4214728210 cites W2557871122 @default.
- W4214728210 cites W2562566938 @default.
- W4214728210 cites W2603222250 @default.
- W4214728210 cites W2786906105 @default.
- W4214728210 cites W2902227688 @default.
- W4214728210 cites W2947573069 @default.
- W4214728210 cites W2947784524 @default.
- W4214728210 cites W2962678499 @default.
- W4214728210 cites W2976289207 @default.
- W4214728210 cites W2977294114 @default.
- W4214728210 cites W3009196982 @default.
- W4214728210 cites W3010077835 @default.
- W4214728210 cites W3010085308 @default.
- W4214728210 cites W3017398344 @default.
- W4214728210 cites W3019409765 @default.
- W4214728210 cites W3019913914 @default.
- W4214728210 cites W3104846106 @default.
- W4214728210 cites W3121947205 @default.
- W4214728210 cites W3133097824 @default.
- W4214728210 cites W3156576211 @default.
- W4214728210 cites W3167771724 @default.
- W4214728210 cites W3172412678 @default.
- W4214728210 cites W3184035605 @default.
- W4214728210 cites W3193488995 @default.
- W4214728210 cites W3195790488 @default.
- W4214728210 cites W3195993566 @default.
- W4214728210 cites W4232826983 @default.
- W4214728210 cites W4251745552 @default.
- W4214728210 cites W1550485927 @default.
- W4214728210 doi "https://doi.org/10.1109/iiceta51758.2021.9717942" @default.
- W4214728210 hasPublicationYear "2021" @default.
- W4214728210 type Work @default.
- W4214728210 citedByCount "2" @default.
- W4214728210 countsByYear W42147282102022 @default.
- W4214728210 countsByYear W42147282102023 @default.
- W4214728210 crossrefType "proceedings-article" @default.
- W4214728210 hasAuthorship W4214728210A5009237217 @default.
- W4214728210 hasAuthorship W4214728210A5034896032 @default.
- W4214728210 hasAuthorship W4214728210A5058552366 @default.
- W4214728210 hasAuthorship W4214728210A5077340508 @default.
- W4214728210 hasAuthorship W4214728210A5081282313 @default.
- W4214728210 hasAuthorship W4214728210A5087996799 @default.
- W4214728210 hasConcept C103278499 @default.
- W4214728210 hasConcept C110875604 @default.
- W4214728210 hasConcept C115961682 @default.
- W4214728210 hasConcept C121332964 @default.
- W4214728210 hasConcept C124101348 @default.
- W4214728210 hasConcept C136764020 @default.
- W4214728210 hasConcept C154945302 @default.
- W4214728210 hasConcept C177937566 @default.
- W4214728210 hasConcept C199360897 @default.
- W4214728210 hasConcept C204321447 @default.
- W4214728210 hasConcept C23123220 @default.
- W4214728210 hasConcept C41008148 @default.
- W4214728210 hasConcept C56666940 @default.
- W4214728210 hasConcept C61797465 @default.
- W4214728210 hasConcept C62520636 @default.
- W4214728210 hasConcept C73555534 @default.
- W4214728210 hasConcept C81758059 @default.
- W4214728210 hasConcept C89686163 @default.
- W4214728210 hasConceptScore W4214728210C103278499 @default.
- W4214728210 hasConceptScore W4214728210C110875604 @default.
- W4214728210 hasConceptScore W4214728210C115961682 @default.
- W4214728210 hasConceptScore W4214728210C121332964 @default.
- W4214728210 hasConceptScore W4214728210C124101348 @default.
- W4214728210 hasConceptScore W4214728210C136764020 @default.
- W4214728210 hasConceptScore W4214728210C154945302 @default.
- W4214728210 hasConceptScore W4214728210C177937566 @default.
- W4214728210 hasConceptScore W4214728210C199360897 @default.
- W4214728210 hasConceptScore W4214728210C204321447 @default.
- W4214728210 hasConceptScore W4214728210C23123220 @default.
- W4214728210 hasConceptScore W4214728210C41008148 @default.
- W4214728210 hasConceptScore W4214728210C56666940 @default.
- W4214728210 hasConceptScore W4214728210C61797465 @default.
- W4214728210 hasConceptScore W4214728210C62520636 @default.
- W4214728210 hasConceptScore W4214728210C73555534 @default.
- W4214728210 hasConceptScore W4214728210C81758059 @default.
- W4214728210 hasConceptScore W4214728210C89686163 @default.
- W4214728210 hasLocation W42147282101 @default.