Matches in SemOpenAlex for { <https://semopenalex.org/work/W2125980212> ?p ?o ?g. }
- W2125980212 abstract "String similarity join that finds similar string pairs between two string sets is an essential operation in many applications, and has attracted significant attention recently in the database community. A significant challenge in similarity join is to implement an effective fuzzy match operation to find all similar string pairs which may not match exactly. In this paper, we propose a new similarity metric, called “fuzzy token matching based similarity”, which extends token-based similarity functions (e.g., Jaccard similarity and Cosine similarity) by allowing fuzzy match between two tokens. We study the problem of similarity join using this new similarity metric and present a signature-based method to address this problem. We propose new signature schemes and develop effective pruning techniques to improve the performance. Experimental results show that our approach achieves high efficiency and result quality, and significantly outperforms state-of-the-art methods." @default.
- W2125980212 created "2016-06-24" @default.
- W2125980212 creator A5066141913 @default.
- W2125980212 creator A5066934565 @default.
- W2125980212 creator A5083848380 @default.
- W2125980212 date "2011-04-01" @default.
- W2125980212 modified "2023-10-18" @default.
- W2125980212 title "Fast-join: An efficient method for fuzzy token matching based string similarity join" @default.
- W2125980212 cites W1974995373 @default.
- W2125980212 cites W2054693333 @default.
- W2125980212 cites W2072173758 @default.
- W2125980212 cites W2097184821 @default.
- W2125980212 cites W2097776316 @default.
- W2125980212 cites W2099370490 @default.
- W2125980212 cites W2105423800 @default.
- W2125980212 cites W2105436061 @default.
- W2125980212 cites W2115214414 @default.
- W2125980212 cites W2115500858 @default.
- W2125980212 cites W2121516976 @default.
- W2125980212 cites W2127675794 @default.
- W2125980212 cites W2131815873 @default.
- W2125980212 cites W2148148676 @default.
- W2125980212 cites W2150916025 @default.
- W2125980212 cites W2151930506 @default.
- W2125980212 cites W2166739719 @default.
- W2125980212 cites W2167847032 @default.
- W2125980212 doi "https://doi.org/10.1109/icde.2011.5767865" @default.
- W2125980212 hasPublicationYear "2011" @default.
- W2125980212 type Work @default.
- W2125980212 sameAs 2125980212 @default.
- W2125980212 citedByCount "114" @default.
- W2125980212 countsByYear W21259802122012 @default.
- W2125980212 countsByYear W21259802122013 @default.
- W2125980212 countsByYear W21259802122014 @default.
- W2125980212 countsByYear W21259802122015 @default.
- W2125980212 countsByYear W21259802122016 @default.
- W2125980212 countsByYear W21259802122017 @default.
- W2125980212 countsByYear W21259802122018 @default.
- W2125980212 countsByYear W21259802122019 @default.
- W2125980212 countsByYear W21259802122020 @default.
- W2125980212 countsByYear W21259802122021 @default.
- W2125980212 countsByYear W21259802122022 @default.
- W2125980212 countsByYear W21259802122023 @default.
- W2125980212 crossrefType "proceedings-article" @default.
- W2125980212 hasAuthorship W2125980212A5066141913 @default.
- W2125980212 hasAuthorship W2125980212A5066934565 @default.
- W2125980212 hasAuthorship W2125980212A5083848380 @default.
- W2125980212 hasBestOaLocation W21259802122 @default.
- W2125980212 hasConcept C103278499 @default.
- W2125980212 hasConcept C105795698 @default.
- W2125980212 hasConcept C108010975 @default.
- W2125980212 hasConcept C114614502 @default.
- W2125980212 hasConcept C115961682 @default.
- W2125980212 hasConcept C116738811 @default.
- W2125980212 hasConcept C124101348 @default.
- W2125980212 hasConcept C153180895 @default.
- W2125980212 hasConcept C154945302 @default.
- W2125980212 hasConcept C157486923 @default.
- W2125980212 hasConcept C165064840 @default.
- W2125980212 hasConcept C188805328 @default.
- W2125980212 hasConcept C203519979 @default.
- W2125980212 hasConcept C22820288 @default.
- W2125980212 hasConcept C2776124973 @default.
- W2125980212 hasConcept C2780762811 @default.
- W2125980212 hasConcept C32610155 @default.
- W2125980212 hasConcept C33923547 @default.
- W2125980212 hasConcept C37914503 @default.
- W2125980212 hasConcept C38652104 @default.
- W2125980212 hasConcept C41008148 @default.
- W2125980212 hasConcept C44359876 @default.
- W2125980212 hasConcept C48145219 @default.
- W2125980212 hasConcept C6557445 @default.
- W2125980212 hasConcept C68859911 @default.
- W2125980212 hasConcept C7757238 @default.
- W2125980212 hasConcept C80444323 @default.
- W2125980212 hasConcept C86803240 @default.
- W2125980212 hasConcept C99138194 @default.
- W2125980212 hasConceptScore W2125980212C103278499 @default.
- W2125980212 hasConceptScore W2125980212C105795698 @default.
- W2125980212 hasConceptScore W2125980212C108010975 @default.
- W2125980212 hasConceptScore W2125980212C114614502 @default.
- W2125980212 hasConceptScore W2125980212C115961682 @default.
- W2125980212 hasConceptScore W2125980212C116738811 @default.
- W2125980212 hasConceptScore W2125980212C124101348 @default.
- W2125980212 hasConceptScore W2125980212C153180895 @default.
- W2125980212 hasConceptScore W2125980212C154945302 @default.
- W2125980212 hasConceptScore W2125980212C157486923 @default.
- W2125980212 hasConceptScore W2125980212C165064840 @default.
- W2125980212 hasConceptScore W2125980212C188805328 @default.
- W2125980212 hasConceptScore W2125980212C203519979 @default.
- W2125980212 hasConceptScore W2125980212C22820288 @default.
- W2125980212 hasConceptScore W2125980212C2776124973 @default.
- W2125980212 hasConceptScore W2125980212C2780762811 @default.
- W2125980212 hasConceptScore W2125980212C32610155 @default.
- W2125980212 hasConceptScore W2125980212C33923547 @default.
- W2125980212 hasConceptScore W2125980212C37914503 @default.
- W2125980212 hasConceptScore W2125980212C38652104 @default.
- W2125980212 hasConceptScore W2125980212C41008148 @default.
- W2125980212 hasConceptScore W2125980212C44359876 @default.
- W2125980212 hasConceptScore W2125980212C48145219 @default.