Matches in SemOpenAlex for { <https://semopenalex.org/work/W2024605621> ?p ?o ?g. }
- W2024605621 abstract "Declarative data quality has been an active research topic. The fundamental principle behind a declarative approach to data quality is the use of declarative statements to realize data quality primitives on top of any relational data source. A primary advantage of such an approach is the ease of use and integration with existing applications. Over the last few years, several similarity predicates have been proposed for common quality primitives (approximate selections, joins, etc.) and have been fully expressed using declarative SQL statements. In this paper we propose new similarity predicates along with their declarative realization, based on notions of probabilistic information retrieval. In particular, we show how language models and hidden Markov models can be utilized as similarity predicates for data quality and present their full declarative instantiation. We also show how other scoring methods from information retrieval can be utilized in a similar setting. We then present full declarative specifications of previously proposed similarity predicates in the literature, grouping them into classes according to their primary characteristics. Finally, we present a thorough performance and accuracy study comparing a large number of similarity predicates for data cleaning operations. We quantify both their runtime performance and their accuracy for several types of common quality problems encountered in operational databases." @default.
- W2024605621 created "2016-06-24" @default.
- W2024605621 creator A5035257754 @default.
- W2024605621 creator A5068065546 @default.
- W2024605621 creator A5074420391 @default.
- W2024605621 creator A5077370881 @default.
- W2024605621 creator A5088315797 @default.
- W2024605621 date "2007-06-11" @default.
- W2024605621 modified "2023-09-24" @default.
- W2024605621 title "Benchmarking declarative approximate selection predicates" @default.
- W2024605621 cites W1612155886 @default.
- W2024605621 cites W2007682403 @default.
- W2024605621 cites W2024932032 @default.
- W2024605621 cites W2081193615 @default.
- W2024605621 cites W2095368471 @default.
- W2024605621 cites W2105423800 @default.
- W2024605621 cites W2116544254 @default.
- W2024605621 cites W2121516976 @default.
- W2024605621 cites W2125838338 @default.
- W2024605621 cites W2127675794 @default.
- W2024605621 cites W4206765718 @default.
- W2024605621 cites W4251369550 @default.
- W2024605621 doi "https://doi.org/10.1145/1247480.1247521" @default.
- W2024605621 hasPublicationYear "2007" @default.
- W2024605621 type Work @default.
- W2024605621 sameAs 2024605621 @default.
- W2024605621 citedByCount "58" @default.
- W2024605621 countsByYear W20246056212012 @default.
- W2024605621 countsByYear W20246056212013 @default.
- W2024605621 countsByYear W20246056212014 @default.
- W2024605621 countsByYear W20246056212015 @default.
- W2024605621 countsByYear W20246056212016 @default.
- W2024605621 countsByYear W20246056212017 @default.
- W2024605621 countsByYear W20246056212018 @default.
- W2024605621 countsByYear W20246056212019 @default.
- W2024605621 countsByYear W20246056212020 @default.
- W2024605621 countsByYear W20246056212023 @default.
- W2024605621 crossrefType "proceedings-article" @default.
- W2024605621 hasAuthorship W2024605621A5035257754 @default.
- W2024605621 hasAuthorship W2024605621A5068065546 @default.
- W2024605621 hasAuthorship W2024605621A5074420391 @default.
- W2024605621 hasAuthorship W2024605621A5077370881 @default.
- W2024605621 hasAuthorship W2024605621A5088315797 @default.
- W2024605621 hasBestOaLocation W20246056212 @default.
- W2024605621 hasConcept C103278499 @default.
- W2024605621 hasConcept C111472728 @default.
- W2024605621 hasConcept C115961682 @default.
- W2024605621 hasConcept C124101348 @default.
- W2024605621 hasConcept C138885662 @default.
- W2024605621 hasConcept C144133560 @default.
- W2024605621 hasConcept C146206909 @default.
- W2024605621 hasConcept C148230440 @default.
- W2024605621 hasConcept C154945302 @default.
- W2024605621 hasConcept C162853370 @default.
- W2024605621 hasConcept C199360897 @default.
- W2024605621 hasConcept C204321447 @default.
- W2024605621 hasConcept C23123220 @default.
- W2024605621 hasConcept C2778692605 @default.
- W2024605621 hasConcept C2779530757 @default.
- W2024605621 hasConcept C34165917 @default.
- W2024605621 hasConcept C41008148 @default.
- W2024605621 hasConcept C50033165 @default.
- W2024605621 hasConcept C510870499 @default.
- W2024605621 hasConcept C80444323 @default.
- W2024605621 hasConcept C81917197 @default.
- W2024605621 hasConcept C86251818 @default.
- W2024605621 hasConceptScore W2024605621C103278499 @default.
- W2024605621 hasConceptScore W2024605621C111472728 @default.
- W2024605621 hasConceptScore W2024605621C115961682 @default.
- W2024605621 hasConceptScore W2024605621C124101348 @default.
- W2024605621 hasConceptScore W2024605621C138885662 @default.
- W2024605621 hasConceptScore W2024605621C144133560 @default.
- W2024605621 hasConceptScore W2024605621C146206909 @default.
- W2024605621 hasConceptScore W2024605621C148230440 @default.
- W2024605621 hasConceptScore W2024605621C154945302 @default.
- W2024605621 hasConceptScore W2024605621C162853370 @default.
- W2024605621 hasConceptScore W2024605621C199360897 @default.
- W2024605621 hasConceptScore W2024605621C204321447 @default.
- W2024605621 hasConceptScore W2024605621C23123220 @default.
- W2024605621 hasConceptScore W2024605621C2778692605 @default.
- W2024605621 hasConceptScore W2024605621C2779530757 @default.
- W2024605621 hasConceptScore W2024605621C34165917 @default.
- W2024605621 hasConceptScore W2024605621C41008148 @default.
- W2024605621 hasConceptScore W2024605621C50033165 @default.
- W2024605621 hasConceptScore W2024605621C510870499 @default.
- W2024605621 hasConceptScore W2024605621C80444323 @default.
- W2024605621 hasConceptScore W2024605621C81917197 @default.
- W2024605621 hasConceptScore W2024605621C86251818 @default.
- W2024605621 hasLocation W20246056211 @default.
- W2024605621 hasLocation W20246056212 @default.
- W2024605621 hasOpenAccess W2024605621 @default.
- W2024605621 hasPrimaryLocation W20246056211 @default.
- W2024605621 hasRelatedWork W1980794066 @default.
- W2024605621 hasRelatedWork W1998043846 @default.
- W2024605621 hasRelatedWork W2024605621 @default.
- W2024605621 hasRelatedWork W2103608058 @default.
- W2024605621 hasRelatedWork W2131653381 @default.
- W2024605621 hasRelatedWork W2740990710 @default.
- W2024605621 hasRelatedWork W2902695556 @default.
- W2024605621 hasRelatedWork W2951208224 @default.