Matches in SemOpenAlex for { <https://semopenalex.org/work/W2038021268> ?p ?o ?g. }
- W2038021268 endingPage "219" @default.
- W2038021268 startingPage "205" @default.
- W2038021268 abstract "This paper proposes a Wikipedia-based semantic similarity measurement method that is intended for real-world noisy short texts. Our method is a kind of explicit semantic analysis (ESA), which adds a bag of Wikipedia entities (Wikipedia pages) to a text as its semantic representation and uses the vector of entities for computing the semantic similarity. Adding related entities to a text, not a single word or phrase, is a challenging practical problem because it usually consists of several subproblems, e.g., key term extraction from texts, related entity finding for each key term, and weight aggregation of related entities. Our proposed method solves this aggregation problem using extended naive Bayes, a probabilistic weighting mechanism based on the Bayes' theorem. Our method is effective especially when the short text is semantically noisy, i.e., they contain some meaningless or misleading terms for estimating their main topic. Experimental results on Twitter message and Web snippet clustering revealed that our method outperformed ESA for noisy short texts. We also found that reducing the dimension of the vector to representative Wikipedia entities scarcely affected the performance while decreasing the vector size and hence the storage space and the processing time of computing the cosine similarity." @default.
- W2038021268 created "2016-06-24" @default.
- W2038021268 creator A5004881219 @default.
- W2038021268 creator A5010966769 @default.
- W2038021268 creator A5012025870 @default.
- W2038021268 creator A5063345953 @default.
- W2038021268 date "2015-06-01" @default.
- W2038021268 modified "2023-10-16" @default.
- W2038021268 title "Wikipedia-Based Semantic Similarity Measurements for Noisy Short Texts Using Extended Naive Bayes" @default.
- W2038021268 cites W1491314197 @default.
- W2038021268 cites W1646006088 @default.
- W2038021268 cites W1964209958 @default.
- W2038021268 cites W1971350260 @default.
- W2038021268 cites W1978394996 @default.
- W2038021268 cites W1992914835 @default.
- W2038021268 cites W2013579020 @default.
- W2038021268 cites W2080100102 @default.
- W2038021268 cites W2088314245 @default.
- W2038021268 cites W2095122172 @default.
- W2038021268 cites W2100341149 @default.
- W2038021268 cites W2103318667 @default.
- W2038021268 cites W2121184547 @default.
- W2038021268 cites W2123142779 @default.
- W2038021268 cites W2160382551 @default.
- W2038021268 cites W2161186120 @default.
- W2038021268 cites W2170682101 @default.
- W2038021268 cites W2171836785 @default.
- W2038021268 cites W2337480916 @default.
- W2038021268 cites W3216404684 @default.
- W2038021268 cites W4235505822 @default.
- W2038021268 doi "https://doi.org/10.1109/tetc.2015.2418716" @default.
- W2038021268 hasPublicationYear "2015" @default.
- W2038021268 type Work @default.
- W2038021268 sameAs 2038021268 @default.
- W2038021268 citedByCount "34" @default.
- W2038021268 countsByYear W20380212682016 @default.
- W2038021268 countsByYear W20380212682017 @default.
- W2038021268 countsByYear W20380212682018 @default.
- W2038021268 countsByYear W20380212682019 @default.
- W2038021268 countsByYear W20380212682020 @default.
- W2038021268 countsByYear W20380212682021 @default.
- W2038021268 countsByYear W20380212682022 @default.
- W2038021268 countsByYear W20380212682023 @default.
- W2038021268 crossrefType "journal-article" @default.
- W2038021268 hasAuthorship W2038021268A5004881219 @default.
- W2038021268 hasAuthorship W2038021268A5010966769 @default.
- W2038021268 hasAuthorship W2038021268A5012025870 @default.
- W2038021268 hasAuthorship W2038021268A5063345953 @default.
- W2038021268 hasConcept C103278499 @default.
- W2038021268 hasConcept C115961682 @default.
- W2038021268 hasConcept C12267149 @default.
- W2038021268 hasConcept C130318100 @default.
- W2038021268 hasConcept C154945302 @default.
- W2038021268 hasConcept C204321447 @default.
- W2038021268 hasConcept C23123220 @default.
- W2038021268 hasConcept C2524010 @default.
- W2038021268 hasConcept C26517878 @default.
- W2038021268 hasConcept C2777822670 @default.
- W2038021268 hasConcept C2780762811 @default.
- W2038021268 hasConcept C33923547 @default.
- W2038021268 hasConcept C38652104 @default.
- W2038021268 hasConcept C41008148 @default.
- W2038021268 hasConcept C52001869 @default.
- W2038021268 hasConcept C73555534 @default.
- W2038021268 hasConcept C89686163 @default.
- W2038021268 hasConcept C90805587 @default.
- W2038021268 hasConceptScore W2038021268C103278499 @default.
- W2038021268 hasConceptScore W2038021268C115961682 @default.
- W2038021268 hasConceptScore W2038021268C12267149 @default.
- W2038021268 hasConceptScore W2038021268C130318100 @default.
- W2038021268 hasConceptScore W2038021268C154945302 @default.
- W2038021268 hasConceptScore W2038021268C204321447 @default.
- W2038021268 hasConceptScore W2038021268C23123220 @default.
- W2038021268 hasConceptScore W2038021268C2524010 @default.
- W2038021268 hasConceptScore W2038021268C26517878 @default.
- W2038021268 hasConceptScore W2038021268C2777822670 @default.
- W2038021268 hasConceptScore W2038021268C2780762811 @default.
- W2038021268 hasConceptScore W2038021268C33923547 @default.
- W2038021268 hasConceptScore W2038021268C38652104 @default.
- W2038021268 hasConceptScore W2038021268C41008148 @default.
- W2038021268 hasConceptScore W2038021268C52001869 @default.
- W2038021268 hasConceptScore W2038021268C73555534 @default.
- W2038021268 hasConceptScore W2038021268C89686163 @default.
- W2038021268 hasConceptScore W2038021268C90805587 @default.
- W2038021268 hasFunder F4320320912 @default.
- W2038021268 hasIssue "2" @default.
- W2038021268 hasLocation W20380212681 @default.
- W2038021268 hasOpenAccess W2038021268 @default.
- W2038021268 hasPrimaryLocation W20380212681 @default.
- W2038021268 hasRelatedWork W1980104548 @default.
- W2038021268 hasRelatedWork W2038246283 @default.
- W2038021268 hasRelatedWork W2104631007 @default.
- W2038021268 hasRelatedWork W2151108588 @default.
- W2038021268 hasRelatedWork W2172323827 @default.
- W2038021268 hasRelatedWork W2349125667 @default.
- W2038021268 hasRelatedWork W2374872392 @default.
- W2038021268 hasRelatedWork W3113012686 @default.
- W2038021268 hasRelatedWork W3182591145 @default.