Matches in SemOpenAlex for { <https://semopenalex.org/work/W2024651132> ?p ?o ?g. }
- W2024651132 endingPage "42" @default.
- W2024651132 startingPage "1" @default.
- W2024651132 abstract "A string-similarity measure quantifies the similarity between two text strings for approximate string matching or comparison. For example, the strings “Sam” and “Samuel” can be considered to be similar. Most existing work that computes the similarity of two strings only considers syntactic similarities, for example, number of common words or q -grams. While this is indeed an indicator of similarity, there are many important cases where syntactically-different strings can represent the same real-world object. For example, “Bill” is a short form of “William,” and “Database Management Systems” can be abbreviated as “DBMS.” Given a collection of predefined synonyms, the purpose of this article is to explore such existing knowledge to effectively evaluate the similarity between two strings and efficiently perform similarity searches and joins, thereby boosting the quality of approximate string matching. In particular, we first present an expansion-based framework to measure string similarities efficiently while considering synonyms. We then study efficient algorithms for similarity searches and joins by proposing two novel indexes, called SI-trees and QP-trees, which combine signature-filtering and length-filtering strategies. In order to improve the efficiency of our algorithms, we develop an estimator to estimate the size of candidates to enable an online selection of signature filters. This estimator provides strong low-error, high-confidence guarantees while requiring only logarithmic space and time costs, thus making our method attractive both in theory and in practice. Finally, the experimental results from a comprehensive study of the algorithms with three real datasets verify the effectiveness and efficiency of our approaches." @default.
- W2024651132 created "2016-06-24" @default.
- W2024651132 creator A5010903591 @default.
- W2024651132 creator A5018627557 @default.
- W2024651132 creator A5032447166 @default.
- W2024651132 creator A5036384670 @default.
- W2024651132 creator A5046597133 @default.
- W2024651132 date "2015-10-23" @default.
- W2024651132 modified "2023-09-24" @default.
- W2024651132 title "Boosting the Quality of Approximate String Matching by Synonyms" @default.
- W2024651132 cites W1527957260 @default.
- W2024651132 cites W1821184416 @default.
- W2024651132 cites W1973001156 @default.
- W2024651132 cites W1978394996 @default.
- W2024651132 cites W1995437102 @default.
- W2024651132 cites W2000482994 @default.
- W2024651132 cites W2025051251 @default.
- W2024651132 cites W2031250218 @default.
- W2024651132 cites W2037594241 @default.
- W2024651132 cites W2044163187 @default.
- W2024651132 cites W2064379477 @default.
- W2024651132 cites W2065259291 @default.
- W2024651132 cites W2072173758 @default.
- W2024651132 cites W2095368471 @default.
- W2024651132 cites W2097776316 @default.
- W2024651132 cites W2100548092 @default.
- W2024651132 cites W2105423800 @default.
- W2024651132 cites W2115215982 @default.
- W2024651132 cites W2121269638 @default.
- W2024651132 cites W2121516976 @default.
- W2024651132 cites W2127675794 @default.
- W2024651132 cites W2133627190 @default.
- W2024651132 cites W2147033904 @default.
- W2024651132 cites W2150916025 @default.
- W2024651132 cites W2152565070 @default.
- W2024651132 cites W2162592052 @default.
- W2024651132 cites W2163550466 @default.
- W2024651132 cites W2163993443 @default.
- W2024651132 cites W2164456230 @default.
- W2024651132 cites W2167847032 @default.
- W2024651132 doi "https://doi.org/10.1145/2818177" @default.
- W2024651132 hasPublicationYear "2015" @default.
- W2024651132 type Work @default.
- W2024651132 sameAs 2024651132 @default.
- W2024651132 citedByCount "8" @default.
- W2024651132 countsByYear W20246511322016 @default.
- W2024651132 countsByYear W20246511322017 @default.
- W2024651132 countsByYear W20246511322018 @default.
- W2024651132 countsByYear W20246511322019 @default.
- W2024651132 countsByYear W20246511322020 @default.
- W2024651132 crossrefType "journal-article" @default.
- W2024651132 hasAuthorship W2024651132A5010903591 @default.
- W2024651132 hasAuthorship W2024651132A5018627557 @default.
- W2024651132 hasAuthorship W2024651132A5032447166 @default.
- W2024651132 hasAuthorship W2024651132A5036384670 @default.
- W2024651132 hasAuthorship W2024651132A5046597133 @default.
- W2024651132 hasBestOaLocation W20246511322 @default.
- W2024651132 hasConcept C103278499 @default.
- W2024651132 hasConcept C105795698 @default.
- W2024651132 hasConcept C115961682 @default.
- W2024651132 hasConcept C122280245 @default.
- W2024651132 hasConcept C12267149 @default.
- W2024651132 hasConcept C124101348 @default.
- W2024651132 hasConcept C154945302 @default.
- W2024651132 hasConcept C157486923 @default.
- W2024651132 hasConcept C160446489 @default.
- W2024651132 hasConcept C165064840 @default.
- W2024651132 hasConcept C185429906 @default.
- W2024651132 hasConcept C199360897 @default.
- W2024651132 hasConcept C22820288 @default.
- W2024651132 hasConcept C2778692605 @default.
- W2024651132 hasConcept C32610155 @default.
- W2024651132 hasConcept C33923547 @default.
- W2024651132 hasConcept C37914503 @default.
- W2024651132 hasConcept C41008148 @default.
- W2024651132 hasConcept C46686674 @default.
- W2024651132 hasConcept C55851704 @default.
- W2024651132 hasConcept C68859911 @default.
- W2024651132 hasConcept C7757238 @default.
- W2024651132 hasConcept C80444323 @default.
- W2024651132 hasConceptScore W2024651132C103278499 @default.
- W2024651132 hasConceptScore W2024651132C105795698 @default.
- W2024651132 hasConceptScore W2024651132C115961682 @default.
- W2024651132 hasConceptScore W2024651132C122280245 @default.
- W2024651132 hasConceptScore W2024651132C12267149 @default.
- W2024651132 hasConceptScore W2024651132C124101348 @default.
- W2024651132 hasConceptScore W2024651132C154945302 @default.
- W2024651132 hasConceptScore W2024651132C157486923 @default.
- W2024651132 hasConceptScore W2024651132C160446489 @default.
- W2024651132 hasConceptScore W2024651132C165064840 @default.
- W2024651132 hasConceptScore W2024651132C185429906 @default.
- W2024651132 hasConceptScore W2024651132C199360897 @default.
- W2024651132 hasConceptScore W2024651132C22820288 @default.
- W2024651132 hasConceptScore W2024651132C2778692605 @default.
- W2024651132 hasConceptScore W2024651132C32610155 @default.
- W2024651132 hasConceptScore W2024651132C33923547 @default.
- W2024651132 hasConceptScore W2024651132C37914503 @default.
- W2024651132 hasConceptScore W2024651132C41008148 @default.