Matches in SemOpenAlex for { <https://semopenalex.org/work/W2123241698> ?p ?o ?g. }
- W2123241698 endingPage "859" @default.
- W2123241698 startingPage "851" @default.
- W2123241698 abstract "This paper presents a simple and efficient algorithm for approximate dictionary matching designed for similarity measures such as cosine, Dice, Jaccard, and overlap coefficients. We propose this algorithm, called CPMerge, for the τ-overlap join of inverted lists. First we show that this task is solvable exactly by a τ-overlap join. Given inverted lists retrieved for a query, the algorithm collects fewer candidate strings and prunes unlikely candidates to efficiently find strings that satisfy the constraint of the τ-overlap join. We conducted experiments of approximate dictionary matching on three large-scale datasets that include person names, biomedical names, and general English words. The algorithm exhibited scalable performance on the datasets. For example, it retrieved strings in 1.1 ms from the string collection of Google Web1T unigrams (with cosine similarity and threshold 0.7)." @default.
- W2123241698 created "2016-06-24" @default.
- W2123241698 creator A5040725471 @default.
- W2123241698 creator A5066940046 @default.
- W2123241698 date "2010-08-23" @default.
- W2123241698 modified "2023-10-07" @default.
- W2123241698 title "Simple and Efficient Algorithm for Approximate Dictionary Matching" @default.
- W2123241698 cites W125979907 @default.
- W2123241698 cites W1578664299 @default.
- W2123241698 cites W1646278814 @default.
- W2123241698 cites W2012833704 @default.
- W2123241698 cites W2038276547 @default.
- W2123241698 cites W2085922539 @default.
- W2123241698 cites W2096565906 @default.
- W2123241698 cites W2096598900 @default.
- W2123241698 cites W2099370490 @default.
- W2123241698 cites W2107293766 @default.
- W2123241698 cites W2119057313 @default.
- W2123241698 cites W2121516976 @default.
- W2123241698 cites W2122056984 @default.
- W2123241698 cites W2127675794 @default.
- W2123241698 cites W2133331675 @default.
- W2123241698 cites W2145349611 @default.
- W2123241698 cites W2150916025 @default.
- W2123241698 cites W2153083979 @default.
- W2123241698 cites W2159491434 @default.
- W2123241698 cites W2161936973 @default.
- W2123241698 cites W2167847032 @default.
- W2123241698 cites W2169495281 @default.
- W2123241698 cites W22160234 @default.
- W2123241698 cites W46452414 @default.
- W2123241698 hasPublicationYear "2010" @default.
- W2123241698 type Work @default.
- W2123241698 sameAs 2123241698 @default.
- W2123241698 citedByCount "26" @default.
- W2123241698 countsByYear W21232416982012 @default.
- W2123241698 countsByYear W21232416982013 @default.
- W2123241698 countsByYear W21232416982014 @default.
- W2123241698 countsByYear W21232416982015 @default.
- W2123241698 countsByYear W21232416982016 @default.
- W2123241698 countsByYear W21232416982017 @default.
- W2123241698 countsByYear W21232416982019 @default.
- W2123241698 countsByYear W21232416982020 @default.
- W2123241698 countsByYear W21232416982021 @default.
- W2123241698 crossrefType "proceedings-article" @default.
- W2123241698 hasAuthorship W2123241698A5040725471 @default.
- W2123241698 hasAuthorship W2123241698A5066940046 @default.
- W2123241698 hasConcept C103278499 @default.
- W2123241698 hasConcept C105795698 @default.
- W2123241698 hasConcept C111472728 @default.
- W2123241698 hasConcept C11413529 @default.
- W2123241698 hasConcept C114614502 @default.
- W2123241698 hasConcept C115961682 @default.
- W2123241698 hasConcept C124101348 @default.
- W2123241698 hasConcept C138885662 @default.
- W2123241698 hasConcept C153180895 @default.
- W2123241698 hasConcept C154945302 @default.
- W2123241698 hasConcept C157486923 @default.
- W2123241698 hasConcept C165064840 @default.
- W2123241698 hasConcept C203519979 @default.
- W2123241698 hasConcept C22820288 @default.
- W2123241698 hasConcept C2524010 @default.
- W2123241698 hasConcept C2776036281 @default.
- W2123241698 hasConcept C2776124973 @default.
- W2123241698 hasConcept C2780586882 @default.
- W2123241698 hasConcept C2780762811 @default.
- W2123241698 hasConcept C32610155 @default.
- W2123241698 hasConcept C33923547 @default.
- W2123241698 hasConcept C37914503 @default.
- W2123241698 hasConcept C41008148 @default.
- W2123241698 hasConcept C44359876 @default.
- W2123241698 hasConcept C48044578 @default.
- W2123241698 hasConcept C68859911 @default.
- W2123241698 hasConcept C77088390 @default.
- W2123241698 hasConcept C7757238 @default.
- W2123241698 hasConcept C80444323 @default.
- W2123241698 hasConcept C87117476 @default.
- W2123241698 hasConceptScore W2123241698C103278499 @default.
- W2123241698 hasConceptScore W2123241698C105795698 @default.
- W2123241698 hasConceptScore W2123241698C111472728 @default.
- W2123241698 hasConceptScore W2123241698C11413529 @default.
- W2123241698 hasConceptScore W2123241698C114614502 @default.
- W2123241698 hasConceptScore W2123241698C115961682 @default.
- W2123241698 hasConceptScore W2123241698C124101348 @default.
- W2123241698 hasConceptScore W2123241698C138885662 @default.
- W2123241698 hasConceptScore W2123241698C153180895 @default.
- W2123241698 hasConceptScore W2123241698C154945302 @default.
- W2123241698 hasConceptScore W2123241698C157486923 @default.
- W2123241698 hasConceptScore W2123241698C165064840 @default.
- W2123241698 hasConceptScore W2123241698C203519979 @default.
- W2123241698 hasConceptScore W2123241698C22820288 @default.
- W2123241698 hasConceptScore W2123241698C2524010 @default.
- W2123241698 hasConceptScore W2123241698C2776036281 @default.
- W2123241698 hasConceptScore W2123241698C2776124973 @default.
- W2123241698 hasConceptScore W2123241698C2780586882 @default.
- W2123241698 hasConceptScore W2123241698C2780762811 @default.
- W2123241698 hasConceptScore W2123241698C32610155 @default.
- W2123241698 hasConceptScore W2123241698C33923547 @default.