Matches in SemOpenAlex for { <https://semopenalex.org/work/W2143996849> ?p ?o ?g. }
- W2143996849 endingPage "894" @default.
- W2143996849 startingPage "886" @default.
- W2143996849 abstract "MinHash and SimHash are the two widely adopted Locality Sensitive Hashing (LSH) algorithms for large-scale data processing applications. Deciding which LSH to use for a particular problem at hand is an important question, which has no clear answer in the existing literature. In this study, we provide a theoretical answer (validated by experiments) that MinHash virtually always outperforms SimHash when the data are binary, as common in practice such as search. The collision probability of MinHash is a function of resemblance similarity (R), while the collision probability of SimHash is a function of cosine similarity (S). To provide a common basis for comparison, we evaluate retrieval results in terms of S for both MinHash and SimHash. This evaluation is valid as we can prove that MinHash is a valid LSH with respect to S, by using a general inequality S 2" @default.
- W2143996849 created "2016-06-24" @default.
- W2143996849 creator A5024993683 @default.
- W2143996849 creator A5085993607 @default.
- W2143996849 date "2014-01-01" @default.
- W2143996849 modified "2023-09-25" @default.
- W2143996849 title "In Defense of MinHash Over SimHash" @default.
- W2143996849 cites W1480714498 @default.
- W2143996849 cites W1537946535 @default.
- W2143996849 cites W1978024959 @default.
- W2143996849 cites W1983645263 @default.
- W2143996849 cites W1985123706 @default.
- W2143996849 cites W1986482242 @default.
- W2143996849 cites W2012833704 @default.
- W2143996849 cites W2029852131 @default.
- W2143996849 cites W2047756776 @default.
- W2143996849 cites W2053377618 @default.
- W2143996849 cites W2081193615 @default.
- W2143996849 cites W2085922539 @default.
- W2143996849 cites W2100141119 @default.
- W2143996849 cites W2120031510 @default.
- W2143996849 cites W2126326837 @default.
- W2143996849 cites W2130484710 @default.
- W2143996849 cites W2132069633 @default.
- W2143996849 cites W2140431670 @default.
- W2143996849 cites W2142256417 @default.
- W2143996849 cites W2147017814 @default.
- W2143996849 cites W2147717514 @default.
- W2143996849 cites W2152228468 @default.
- W2143996849 cites W2157462866 @default.
- W2143996849 cites W2168467811 @default.
- W2143996849 cites W2169557227 @default.
- W2143996849 cites W2293597654 @default.
- W2143996849 cites W2397770138 @default.
- W2143996849 hasPublicationYear "2014" @default.
- W2143996849 type Work @default.
- W2143996849 sameAs 2143996849 @default.
- W2143996849 citedByCount "27" @default.
- W2143996849 countsByYear W21439968492014 @default.
- W2143996849 countsByYear W21439968492015 @default.
- W2143996849 countsByYear W21439968492016 @default.
- W2143996849 countsByYear W21439968492017 @default.
- W2143996849 countsByYear W21439968492018 @default.
- W2143996849 countsByYear W21439968492019 @default.
- W2143996849 countsByYear W21439968492020 @default.
- W2143996849 countsByYear W21439968492021 @default.
- W2143996849 crossrefType "proceedings-article" @default.
- W2143996849 hasAuthorship W2143996849A5024993683 @default.
- W2143996849 hasAuthorship W2143996849A5085993607 @default.
- W2143996849 hasConcept C103278499 @default.
- W2143996849 hasConcept C115961682 @default.
- W2143996849 hasConcept C14036430 @default.
- W2143996849 hasConcept C154945302 @default.
- W2143996849 hasConcept C23123220 @default.
- W2143996849 hasConcept C38652104 @default.
- W2143996849 hasConcept C41008148 @default.
- W2143996849 hasConcept C67388219 @default.
- W2143996849 hasConcept C74270461 @default.
- W2143996849 hasConcept C78458016 @default.
- W2143996849 hasConcept C86803240 @default.
- W2143996849 hasConcept C99138194 @default.
- W2143996849 hasConceptScore W2143996849C103278499 @default.
- W2143996849 hasConceptScore W2143996849C115961682 @default.
- W2143996849 hasConceptScore W2143996849C14036430 @default.
- W2143996849 hasConceptScore W2143996849C154945302 @default.
- W2143996849 hasConceptScore W2143996849C23123220 @default.
- W2143996849 hasConceptScore W2143996849C38652104 @default.
- W2143996849 hasConceptScore W2143996849C41008148 @default.
- W2143996849 hasConceptScore W2143996849C67388219 @default.
- W2143996849 hasConceptScore W2143996849C74270461 @default.
- W2143996849 hasConceptScore W2143996849C78458016 @default.
- W2143996849 hasConceptScore W2143996849C86803240 @default.
- W2143996849 hasConceptScore W2143996849C99138194 @default.
- W2143996849 hasLocation W21439968491 @default.
- W2143996849 hasOpenAccess W2143996849 @default.
- W2143996849 hasPrimaryLocation W21439968491 @default.
- W2143996849 hasRelatedWork W107173025 @default.
- W2143996849 hasRelatedWork W1502916507 @default.
- W2143996849 hasRelatedWork W1541459201 @default.
- W2143996849 hasRelatedWork W1583707981 @default.
- W2143996849 hasRelatedWork W1736726159 @default.
- W2143996849 hasRelatedWork W1991800036 @default.
- W2143996849 hasRelatedWork W1999092742 @default.
- W2143996849 hasRelatedWork W2012833704 @default.
- W2143996849 hasRelatedWork W2029852131 @default.
- W2143996849 hasRelatedWork W2081193615 @default.
- W2143996849 hasRelatedWork W2085922539 @default.
- W2143996849 hasRelatedWork W2097776316 @default.
- W2143996849 hasRelatedWork W2120031510 @default.
- W2143996849 hasRelatedWork W2126907894 @default.
- W2143996849 hasRelatedWork W2132069633 @default.
- W2143996849 hasRelatedWork W2145065594 @default.
- W2143996849 hasRelatedWork W2145349611 @default.
- W2143996849 hasRelatedWork W2147017814 @default.
- W2143996849 hasRelatedWork W2147717514 @default.
- W2143996849 hasRelatedWork W2162006472 @default.
- W2143996849 hasVolume "33" @default.
- W2143996849 isParatext "false" @default.
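
The abstract above compares the two hashing schemes through the similarity each one tracks: resemblance (R) for MinHash and cosine similarity (S) for SimHash. The following is a minimal sketch, not the authors' code, that computes both quantities for binary data represented as Python sets. The function names are hypothetical. The two collision-probability formulas are the standard ones for these schemes (MinHash collides with probability R; sign-random-projection SimHash collides with probability 1 - θ/π, where θ is the angle between the vectors) and are not stated in the truncated abstract.

```python
import math


def resemblance(a: set, b: set) -> float:
    """Resemblance (Jaccard) similarity R = |A ∩ B| / |A ∪ B| for binary data."""
    union = len(a | b)
    if union == 0:
        return 0.0  # convention chosen for this sketch: two empty sets score 0
    return len(a & b) / union


def cosine(a: set, b: set) -> float:
    """Cosine similarity S = |A ∩ B| / sqrt(|A| * |B|) for binary data."""
    if not a or not b:
        return 0.0  # convention chosen for this sketch
    return len(a & b) / math.sqrt(len(a) * len(b))


def minhash_collision_probability(a: set, b: set) -> float:
    """Probability that one MinHash value agrees for the two sets (equals R)."""
    return resemblance(a, b)


def simhash_collision_probability(a: set, b: set) -> float:
    """Probability that one SimHash sign bit agrees: 1 - theta / pi."""
    s = max(-1.0, min(1.0, cosine(a, b)))  # guard acos against rounding error
    return 1.0 - math.acos(s) / math.pi


if __name__ == "__main__":
    x = {1, 2, 3, 4}
    y = {3, 4, 5, 6, 7, 8}
    print("R =", resemblance(x, y))
    print("S =", cosine(x, y))
    print("MinHash collision probability =", minhash_collision_probability(x, y))
    print("SimHash collision probability =", simhash_collision_probability(x, y))
```

Because both functions take the same pair of sets, R and S can be tabulated side by side for any binary dataset, which is the common basis for comparison the abstract describes.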