Matches in SemOpenAlex for { <https://semopenalex.org/work/W2133331675> ?p ?o ?g. }
- W2133331675 abstract "Data collections often have inconsistencies that arise for a variety of reasons, and it is desirable to be able to identify and resolve them efficiently. Similarity queries are commonly used in data cleaning for matching similar data. In this work we concentrate on the following problem of approximate string matching based on edit distance: given a collection of strings, how can we find those strings similar to a given string, or the strings in another collection whose similarity exceeds some threshold? We propose an NFA-based (nondeterministic finite-state automaton) method for effective approximate string search. We model strings as a trie and construct an NFA on top of the trie. We identify the similar strings by running the NFA based on tree automata theory. Moreover, we propose a grouped trie to further improve the performance of similarity search by incorporating some effective pruning techniques. We have implemented our method, and the experimental results show that our approach achieves high performance and outperforms the existing state-of-the-art methods by orders of magnitude." @default.
- W2133331675 created "2016-06-24" @default.
- W2133331675 creator A5022852316 @default.
- W2133331675 creator A5032973972 @default.
- W2133331675 creator A5034776912 @default.
- W2133331675 creator A5041232333 @default.
- W2133331675 date "2008-07-01" @default.
- W2133331675 modified "2023-09-23" @default.
- W2133331675 title "Effective Indices for Efficient Approximate String Search and Similarity Join" @default.
- W2133331675 cites W125979907 @default.
- W2133331675 cites W1974995373 @default.
- W2133331675 cites W1986404066 @default.
- W2133331675 cites W2001496424 @default.
- W2133331675 cites W2043481183 @default.
- W2133331675 cites W2055858190 @default.
- W2133331675 cites W2096598900 @default.
- W2133331675 cites W2097776316 @default.
- W2133331675 cites W2103014446 @default.
- W2133331675 cites W2105423800 @default.
- W2133331675 cites W2107412086 @default.
- W2133331675 cites W2116690618 @default.
- W2133331675 cites W2119057313 @default.
- W2133331675 cites W2121516976 @default.
- W2133331675 cites W2127675794 @default.
- W2133331675 cites W2130130225 @default.
- W2133331675 cites W2147909264 @default.
- W2133331675 cites W2161936973 @default.
- W2133331675 cites W2162102353 @default.
- W2133331675 cites W2164501930 @default.
- W2133331675 cites W2167439683 @default.
- W2133331675 cites W2167847032 @default.
- W2133331675 cites W2169844574 @default.
- W2133331675 cites W2337480916 @default.
- W2133331675 cites W2998852864 @default.
- W2133331675 doi "https://doi.org/10.1109/waim.2008.17" @default.
- W2133331675 hasPublicationYear "2008" @default.
- W2133331675 type Work @default.
- W2133331675 sameAs 2133331675 @default.
- W2133331675 citedByCount "4" @default.
- W2133331675 countsByYear W21333316752012 @default.
- W2133331675 crossrefType "proceedings-article" @default.
- W2133331675 hasAuthorship W2133331675A5022852316 @default.
- W2133331675 hasAuthorship W2133331675A5032973972 @default.
- W2133331675 hasAuthorship W2133331675A5034776912 @default.
- W2133331675 hasAuthorship W2133331675A5041232333 @default.
- W2133331675 hasConcept C103278499 @default.
- W2133331675 hasConcept C108010975 @default.
- W2133331675 hasConcept C112505250 @default.
- W2133331675 hasConcept C11413529 @default.
- W2133331675 hasConcept C115961682 @default.
- W2133331675 hasConcept C116248031 @default.
- W2133331675 hasConcept C125583679 @default.
- W2133331675 hasConcept C154945302 @default.
- W2133331675 hasConcept C157486923 @default.
- W2133331675 hasConcept C158008952 @default.
- W2133331675 hasConcept C162319229 @default.
- W2133331675 hasConcept C167822520 @default.
- W2133331675 hasConcept C176181172 @default.
- W2133331675 hasConcept C190290938 @default.
- W2133331675 hasConcept C199360897 @default.
- W2133331675 hasConcept C207024777 @default.
- W2133331675 hasConcept C22820288 @default.
- W2133331675 hasConcept C32610155 @default.
- W2133331675 hasConcept C33923547 @default.
- W2133331675 hasConcept C37914503 @default.
- W2133331675 hasConcept C41008148 @default.
- W2133331675 hasConcept C44359876 @default.
- W2133331675 hasConcept C6557445 @default.
- W2133331675 hasConcept C68859911 @default.
- W2133331675 hasConcept C7757238 @default.
- W2133331675 hasConcept C80444323 @default.
- W2133331675 hasConcept C86803240 @default.
- W2133331675 hasConceptScore W2133331675C103278499 @default.
- W2133331675 hasConceptScore W2133331675C108010975 @default.
- W2133331675 hasConceptScore W2133331675C112505250 @default.
- W2133331675 hasConceptScore W2133331675C11413529 @default.
- W2133331675 hasConceptScore W2133331675C115961682 @default.
- W2133331675 hasConceptScore W2133331675C116248031 @default.
- W2133331675 hasConceptScore W2133331675C125583679 @default.
- W2133331675 hasConceptScore W2133331675C154945302 @default.
- W2133331675 hasConceptScore W2133331675C157486923 @default.
- W2133331675 hasConceptScore W2133331675C158008952 @default.
- W2133331675 hasConceptScore W2133331675C162319229 @default.
- W2133331675 hasConceptScore W2133331675C167822520 @default.
- W2133331675 hasConceptScore W2133331675C176181172 @default.
- W2133331675 hasConceptScore W2133331675C190290938 @default.
- W2133331675 hasConceptScore W2133331675C199360897 @default.
- W2133331675 hasConceptScore W2133331675C207024777 @default.
- W2133331675 hasConceptScore W2133331675C22820288 @default.
- W2133331675 hasConceptScore W2133331675C32610155 @default.
- W2133331675 hasConceptScore W2133331675C33923547 @default.
- W2133331675 hasConceptScore W2133331675C37914503 @default.
- W2133331675 hasConceptScore W2133331675C41008148 @default.
- W2133331675 hasConceptScore W2133331675C44359876 @default.
- W2133331675 hasConceptScore W2133331675C6557445 @default.
- W2133331675 hasConceptScore W2133331675C68859911 @default.
- W2133331675 hasConceptScore W2133331675C7757238 @default.
- W2133331675 hasConceptScore W2133331675C80444323 @default.
- W2133331675 hasConceptScore W2133331675C86803240 @default.
- W2133331675 hasLocation W21333316751 @default.