Matches in SemOpenAlex for { <https://semopenalex.org/work/W2095520363> ?p ?o ?g. }
Showing items 1 to 78 of
78
with 100 items per page.
- W2095520363 abstract "Consider a universe of items, each of which is associated with a weight, and a database consisting of subsets of these items. Given a query set, a weighted set similarity query identifies either (i) all sets in the database whose normalized similarity to the query set is above a pre-specified threshold, or (ii) the sets in the database with the k highest similarity values to the query set. Weighted set similarity queries are useful in applications like data cleaning and integration for finding approximate matches in the presence of typographical mistakes, multiple formatting conventions, transformation errors, etc. We show that this problem has semantic properties that can be exploited to design index structures that support efficient algorithms for answering queries; these algorithms can achieve arbitrarily stronger pruning than the family of Threshold Algorithms. We describe how these index structures can beefficiently updated using lazy propagation in a way that gives strict guarantees on the quality of subsequent query answers. Finally, we illustrate that our proposed ideas work well in practice for real datasets." @default.
- W2095520363 created "2016-06-24" @default.
- W2095520363 creator A5088315797 @default.
- W2095520363 date "2009-03-01" @default.
- W2095520363 modified "2023-09-24" @default.
- W2095520363 title "Weighted Set Similarity: Queries and Updates" @default.
- W2095520363 doi "https://doi.org/10.1109/icde.2009.179" @default.
- W2095520363 hasPublicationYear "2009" @default.
- W2095520363 type Work @default.
- W2095520363 sameAs 2095520363 @default.
- W2095520363 citedByCount "0" @default.
- W2095520363 crossrefType "proceedings-article" @default.
- W2095520363 hasAuthorship W2095520363A5088315797 @default.
- W2095520363 hasConcept C103278499 @default.
- W2095520363 hasConcept C108010975 @default.
- W2095520363 hasConcept C111919701 @default.
- W2095520363 hasConcept C115961682 @default.
- W2095520363 hasConcept C124101348 @default.
- W2095520363 hasConcept C136764020 @default.
- W2095520363 hasConcept C154945302 @default.
- W2095520363 hasConcept C157692150 @default.
- W2095520363 hasConcept C177264268 @default.
- W2095520363 hasConcept C199360897 @default.
- W2095520363 hasConcept C23123220 @default.
- W2095520363 hasConcept C2777382242 @default.
- W2095520363 hasConcept C41008148 @default.
- W2095520363 hasConcept C4969071 @default.
- W2095520363 hasConcept C6557445 @default.
- W2095520363 hasConcept C75165309 @default.
- W2095520363 hasConcept C80444323 @default.
- W2095520363 hasConcept C86803240 @default.
- W2095520363 hasConcept C88006597 @default.
- W2095520363 hasConceptScore W2095520363C103278499 @default.
- W2095520363 hasConceptScore W2095520363C108010975 @default.
- W2095520363 hasConceptScore W2095520363C111919701 @default.
- W2095520363 hasConceptScore W2095520363C115961682 @default.
- W2095520363 hasConceptScore W2095520363C124101348 @default.
- W2095520363 hasConceptScore W2095520363C136764020 @default.
- W2095520363 hasConceptScore W2095520363C154945302 @default.
- W2095520363 hasConceptScore W2095520363C157692150 @default.
- W2095520363 hasConceptScore W2095520363C177264268 @default.
- W2095520363 hasConceptScore W2095520363C199360897 @default.
- W2095520363 hasConceptScore W2095520363C23123220 @default.
- W2095520363 hasConceptScore W2095520363C2777382242 @default.
- W2095520363 hasConceptScore W2095520363C41008148 @default.
- W2095520363 hasConceptScore W2095520363C4969071 @default.
- W2095520363 hasConceptScore W2095520363C6557445 @default.
- W2095520363 hasConceptScore W2095520363C75165309 @default.
- W2095520363 hasConceptScore W2095520363C80444323 @default.
- W2095520363 hasConceptScore W2095520363C86803240 @default.
- W2095520363 hasConceptScore W2095520363C88006597 @default.
- W2095520363 hasLocation W20955203631 @default.
- W2095520363 hasOpenAccess W2095520363 @default.
- W2095520363 hasPrimaryLocation W20955203631 @default.
- W2095520363 hasRelatedWork W1541513621 @default.
- W2095520363 hasRelatedWork W1604177764 @default.
- W2095520363 hasRelatedWork W1980659138 @default.
- W2095520363 hasRelatedWork W1982840277 @default.
- W2095520363 hasRelatedWork W2005730467 @default.
- W2095520363 hasRelatedWork W2020919487 @default.
- W2095520363 hasRelatedWork W2028105135 @default.
- W2095520363 hasRelatedWork W2111971349 @default.
- W2095520363 hasRelatedWork W2112257487 @default.
- W2095520363 hasRelatedWork W2113875810 @default.
- W2095520363 hasRelatedWork W2118271908 @default.
- W2095520363 hasRelatedWork W2124604144 @default.
- W2095520363 hasRelatedWork W2136999141 @default.
- W2095520363 hasRelatedWork W2154892898 @default.
- W2095520363 hasRelatedWork W2157340125 @default.
- W2095520363 hasRelatedWork W2169344281 @default.
- W2095520363 hasRelatedWork W2183317059 @default.
- W2095520363 hasRelatedWork W2804989710 @default.
- W2095520363 hasRelatedWork W2893228100 @default.
- W2095520363 hasRelatedWork W2981409090 @default.
- W2095520363 isParatext "false" @default.
- W2095520363 isRetracted "false" @default.
- W2095520363 magId "2095520363" @default.
- W2095520363 workType "article" @default.