Matches in SemOpenAlex for { <https://semopenalex.org/work/W2169153112> ?p ?o ?g. }
Showing items 1 to 96 of
96
with 100 items per page.
- W2169153112 abstract "The vector space model (VSM) is a popular and widely applied model in information retrieval (IR). VSM creates vector spaces whose dimensionality is usually high (e.g., tens of thousands of terms). This may cause various problems, such as susceptibility to noise and difficulty in capturing the underlying semantic structure, which are commonly recognized as different aspects of the curse of dimensionality. In this paper, we investigate a novel aspect of the dimensionality curse, which is referred to as hubness and manifested by the tendency of some documents (called hubs) to be included in unexpectedly many search result lists. Hubness may impact VSM considerably since hubs can become obstinate results, irrelevant to a large number of queries, thus harming the performance of an IR system and the experience of its users. We analyze the origins of hubness, showing it is primarily a consequence of high (intrinsic) dimensionality of data, and not a result of other factors such as sparsity and skewness of the distribution of term frequencies. We describe the mechanisms through which hubness emerges by exploring the behavior of similarity measures in high-dimensional vector spaces. Our consideration begins with the classical VSM (tf-idf term weighting and cosine similarity), but the conclusions generalize to more advanced variations, such as Okapi BM25. Moreover, we explain why hubness may not be easily mitigated by dimensionality reduction, and propose a similarity adjustment scheme that takes into account the existence of hubs. Experimental results over real data indicate that significant improvement can be obtained through consideration of hubness." @default.
- W2169153112 created "2016-06-24" @default.
- W2169153112 creator A5000856508 @default.
- W2169153112 creator A5025192310 @default.
- W2169153112 creator A5047268960 @default.
- W2169153112 date "2010-07-19" @default.
- W2169153112 modified "2023-10-16" @default.
- W2169153112 title "On the existence of obstinate results in vector space models" @default.
- W2169153112 cites W1524345994 @default.
- W2169153112 cites W1979073566 @default.
- W2169153112 cites W2098006457 @default.
- W2169153112 cites W2136846911 @default.
- W2169153112 cites W2143204722 @default.
- W2169153112 cites W2155529673 @default.
- W2169153112 cites W2165612380 @default.
- W2169153112 cites W2168532736 @default.
- W2169153112 cites W4312512934 @default.
- W2169153112 doi "https://doi.org/10.1145/1835449.1835482" @default.
- W2169153112 hasPublicationYear "2010" @default.
- W2169153112 type Work @default.
- W2169153112 sameAs 2169153112 @default.
- W2169153112 citedByCount "73" @default.
- W2169153112 countsByYear W21691531122012 @default.
- W2169153112 countsByYear W21691531122013 @default.
- W2169153112 countsByYear W21691531122014 @default.
- W2169153112 countsByYear W21691531122015 @default.
- W2169153112 countsByYear W21691531122016 @default.
- W2169153112 countsByYear W21691531122017 @default.
- W2169153112 countsByYear W21691531122018 @default.
- W2169153112 countsByYear W21691531122019 @default.
- W2169153112 countsByYear W21691531122020 @default.
- W2169153112 countsByYear W21691531122021 @default.
- W2169153112 countsByYear W21691531122022 @default.
- W2169153112 crossrefType "proceedings-article" @default.
- W2169153112 hasAuthorship W2169153112A5000856508 @default.
- W2169153112 hasAuthorship W2169153112A5025192310 @default.
- W2169153112 hasAuthorship W2169153112A5047268960 @default.
- W2169153112 hasConcept C103278499 @default.
- W2169153112 hasConcept C111030470 @default.
- W2169153112 hasConcept C111919701 @default.
- W2169153112 hasConcept C11413529 @default.
- W2169153112 hasConcept C115961682 @default.
- W2169153112 hasConcept C121332964 @default.
- W2169153112 hasConcept C122342681 @default.
- W2169153112 hasConcept C124101348 @default.
- W2169153112 hasConcept C13336665 @default.
- W2169153112 hasConcept C149782125 @default.
- W2169153112 hasConcept C153180895 @default.
- W2169153112 hasConcept C154945302 @default.
- W2169153112 hasConcept C183115368 @default.
- W2169153112 hasConcept C24890656 @default.
- W2169153112 hasConcept C2524010 @default.
- W2169153112 hasConcept C2778572836 @default.
- W2169153112 hasConcept C2780762811 @default.
- W2169153112 hasConcept C33923547 @default.
- W2169153112 hasConcept C41008148 @default.
- W2169153112 hasConcept C70518039 @default.
- W2169153112 hasConcept C89686163 @default.
- W2169153112 hasConceptScore W2169153112C103278499 @default.
- W2169153112 hasConceptScore W2169153112C111030470 @default.
- W2169153112 hasConceptScore W2169153112C111919701 @default.
- W2169153112 hasConceptScore W2169153112C11413529 @default.
- W2169153112 hasConceptScore W2169153112C115961682 @default.
- W2169153112 hasConceptScore W2169153112C121332964 @default.
- W2169153112 hasConceptScore W2169153112C122342681 @default.
- W2169153112 hasConceptScore W2169153112C124101348 @default.
- W2169153112 hasConceptScore W2169153112C13336665 @default.
- W2169153112 hasConceptScore W2169153112C149782125 @default.
- W2169153112 hasConceptScore W2169153112C153180895 @default.
- W2169153112 hasConceptScore W2169153112C154945302 @default.
- W2169153112 hasConceptScore W2169153112C183115368 @default.
- W2169153112 hasConceptScore W2169153112C24890656 @default.
- W2169153112 hasConceptScore W2169153112C2524010 @default.
- W2169153112 hasConceptScore W2169153112C2778572836 @default.
- W2169153112 hasConceptScore W2169153112C2780762811 @default.
- W2169153112 hasConceptScore W2169153112C33923547 @default.
- W2169153112 hasConceptScore W2169153112C41008148 @default.
- W2169153112 hasConceptScore W2169153112C70518039 @default.
- W2169153112 hasConceptScore W2169153112C89686163 @default.
- W2169153112 hasLocation W21691531121 @default.
- W2169153112 hasOpenAccess W2169153112 @default.
- W2169153112 hasPrimaryLocation W21691531121 @default.
- W2169153112 hasRelatedWork W1587648452 @default.
- W2169153112 hasRelatedWork W2015538044 @default.
- W2169153112 hasRelatedWork W2294367205 @default.
- W2169153112 hasRelatedWork W2324974544 @default.
- W2169153112 hasRelatedWork W2605889996 @default.
- W2169153112 hasRelatedWork W2909011325 @default.
- W2169153112 hasRelatedWork W3011505626 @default.
- W2169153112 hasRelatedWork W3211035526 @default.
- W2169153112 hasRelatedWork W4205786072 @default.
- W2169153112 hasRelatedWork W4312339788 @default.
- W2169153112 isParatext "false" @default.
- W2169153112 isRetracted "false" @default.
- W2169153112 magId "2169153112" @default.
- W2169153112 workType "article" @default.