Matches in SemOpenAlex for { <https://semopenalex.org/work/W2000482994> ?p ?o ?g. }
- W2000482994 endingPage "349" @default.
- W2000482994 startingPage "338" @default.
- W2000482994 abstract "Similarity joins are important operations with a broad range of applications. In this paper, we study the problem of vector similarity join size estimation (VSJ). It is a generalization of the previously studied set similarity join size estimation (SSJ) problem and can handle more interesting cases such as TF-IDF vectors. One of the key challenges in similarity join size estimation is that the join size can change dramatically depending on the input similarity threshold. We propose a sampling based algorithm that uses Locality-Sensitive-Hashing (LSH). The proposed algorithm LSH-SS uses an LSH index to enable effective sampling even at high thresholds. We compare the proposed technique with random sampling and the state-of-the-art technique for SSJ (adapted to VSJ) and demonstrate LSH-SS offers more accurate estimates throughout the similarity threshold range and small variance using real-world data sets." @default.
- W2000482994 created "2016-06-24" @default.
- W2000482994 creator A5022928428 @default.
- W2000482994 creator A5049064713 @default.
- W2000482994 creator A5081157085 @default.
- W2000482994 date "2011-03-01" @default.
- W2000482994 modified "2023-09-23" @default.
- W2000482994 title "Similarity join size estimation using locality sensitive hashing" @default.
- W2000482994 cites W1968829657 @default.
- W2000482994 cites W2007069074 @default.
- W2000482994 cites W2012833704 @default.
- W2000482994 cites W2013092187 @default.
- W2000482994 cites W2090403603 @default.
- W2000482994 cites W2097776316 @default.
- W2000482994 cites W2113380734 @default.
- W2000482994 cites W2115215982 @default.
- W2000482994 cites W2121516976 @default.
- W2000482994 cites W2127675794 @default.
- W2000482994 cites W2147033904 @default.
- W2000482994 cites W2147717514 @default.
- W2000482994 cites W4237172715 @default.
- W2000482994 doi "https://doi.org/10.14778/1978665.1978666" @default.
- W2000482994 hasPublicationYear "2011" @default.
- W2000482994 type Work @default.
- W2000482994 sameAs 2000482994 @default.
- W2000482994 citedByCount "33" @default.
- W2000482994 countsByYear W20004829942012 @default.
- W2000482994 countsByYear W20004829942013 @default.
- W2000482994 countsByYear W20004829942014 @default.
- W2000482994 countsByYear W20004829942015 @default.
- W2000482994 countsByYear W20004829942016 @default.
- W2000482994 countsByYear W20004829942017 @default.
- W2000482994 countsByYear W20004829942018 @default.
- W2000482994 countsByYear W20004829942020 @default.
- W2000482994 countsByYear W20004829942021 @default.
- W2000482994 crossrefType "journal-article" @default.
- W2000482994 hasAuthorship W2000482994A5022928428 @default.
- W2000482994 hasAuthorship W2000482994A5049064713 @default.
- W2000482994 hasAuthorship W2000482994A5081157085 @default.
- W2000482994 hasBestOaLocation W20004829942 @default.
- W2000482994 hasConcept C103278499 @default.
- W2000482994 hasConcept C106131492 @default.
- W2000482994 hasConcept C11413529 @default.
- W2000482994 hasConcept C114614502 @default.
- W2000482994 hasConcept C115961682 @default.
- W2000482994 hasConcept C116738811 @default.
- W2000482994 hasConcept C124101348 @default.
- W2000482994 hasConcept C134306372 @default.
- W2000482994 hasConcept C140779682 @default.
- W2000482994 hasConcept C153180895 @default.
- W2000482994 hasConcept C154945302 @default.
- W2000482994 hasConcept C159985019 @default.
- W2000482994 hasConcept C177148314 @default.
- W2000482994 hasConcept C177264268 @default.
- W2000482994 hasConcept C192562407 @default.
- W2000482994 hasConcept C199360897 @default.
- W2000482994 hasConcept C204323151 @default.
- W2000482994 hasConcept C2776124973 @default.
- W2000482994 hasConcept C2778692605 @default.
- W2000482994 hasConcept C31972630 @default.
- W2000482994 hasConcept C33923547 @default.
- W2000482994 hasConcept C38652104 @default.
- W2000482994 hasConcept C41008148 @default.
- W2000482994 hasConcept C67388219 @default.
- W2000482994 hasConcept C74270461 @default.
- W2000482994 hasConcept C99138194 @default.
- W2000482994 hasConceptScore W2000482994C103278499 @default.
- W2000482994 hasConceptScore W2000482994C106131492 @default.
- W2000482994 hasConceptScore W2000482994C11413529 @default.
- W2000482994 hasConceptScore W2000482994C114614502 @default.
- W2000482994 hasConceptScore W2000482994C115961682 @default.
- W2000482994 hasConceptScore W2000482994C116738811 @default.
- W2000482994 hasConceptScore W2000482994C124101348 @default.
- W2000482994 hasConceptScore W2000482994C134306372 @default.
- W2000482994 hasConceptScore W2000482994C140779682 @default.
- W2000482994 hasConceptScore W2000482994C153180895 @default.
- W2000482994 hasConceptScore W2000482994C154945302 @default.
- W2000482994 hasConceptScore W2000482994C159985019 @default.
- W2000482994 hasConceptScore W2000482994C177148314 @default.
- W2000482994 hasConceptScore W2000482994C177264268 @default.
- W2000482994 hasConceptScore W2000482994C192562407 @default.
- W2000482994 hasConceptScore W2000482994C199360897 @default.
- W2000482994 hasConceptScore W2000482994C204323151 @default.
- W2000482994 hasConceptScore W2000482994C2776124973 @default.
- W2000482994 hasConceptScore W2000482994C2778692605 @default.
- W2000482994 hasConceptScore W2000482994C31972630 @default.
- W2000482994 hasConceptScore W2000482994C33923547 @default.
- W2000482994 hasConceptScore W2000482994C38652104 @default.
- W2000482994 hasConceptScore W2000482994C41008148 @default.
- W2000482994 hasConceptScore W2000482994C67388219 @default.
- W2000482994 hasConceptScore W2000482994C74270461 @default.
- W2000482994 hasConceptScore W2000482994C99138194 @default.
- W2000482994 hasIssue "6" @default.
- W2000482994 hasLocation W20004829941 @default.
- W2000482994 hasLocation W20004829942 @default.
- W2000482994 hasLocation W20004829943 @default.
- W2000482994 hasOpenAccess W2000482994 @default.
- W2000482994 hasPrimaryLocation W20004829941 @default.