Matches in SemOpenAlex for { <https://semopenalex.org/work/W2945244600> ?p ?o ?g. }
- W2945244600 abstract "The need for increased privacy protection in data linkage has driven the development of privacy-preserving record linkage (PPRL) techniques. A popular technique using Bloom filters with cryptographic analyses, modifications, and hashing variations to optimise privacy has been the focus of much research in this area. With few applications of Bloom filters within a probabilistic framework, there is limited information on whether approximate matches between Bloom filtered fields can improve linkage quality.In this study, we evaluate the effectiveness of three approximate comparison methods for Bloom filters within the context of the Fellegi-Sunter model of recording linkage: Sørensen-Dice coefficient, Jaccard similarity and Hamming distance.Using synthetic datasets with introduced errors to simulate datasets with a range of data quality and a large real-world administrative health dataset, the research estimated partial weight curves for converting similarity scores (for each approximate comparison method) to partial weights at both field and dataset level. Deduplication linkages were run on each dataset using these partial weight curves. This was to compare the resulting quality of the approximate comparison techniques with linkages using simple cut-off similarity values and only exact matching.Linkages using approximate comparisons produced significantly better quality results than those using exact comparisons only. Field level partial weight curves for a specific dataset produced the best quality results. The Sørensen-Dice coefficient and Jaccard similarity produced the most consistent results across a spectrum of synthetic and real-world datasets.The use of Bloom filter similarity comparisons for probabilistic record linkage can produce linkage quality results which are comparable to Jaro-Winkler string similarities with unencrypted linkages. Probabilistic linkages using Bloom filters benefit significantly from the use of similarity comparisons, with partial weight curves producing the best results, even when not optimised for that particular dataset." @default.
- W2945244600 created "2019-05-29" @default.
- W2945244600 creator A5026306258 @default.
- W2945244600 creator A5055284401 @default.
- W2945244600 creator A5068893829 @default.
- W2945244600 creator A5076521685 @default.
- W2945244600 date "2019-05-23" @default.
- W2945244600 modified "2023-09-23" @default.
- W2945244600 title "Evaluation of approximate comparison methods on Bloom filters for probabilistic linkage" @default.
- W2945244600 cites W132217605 @default.
- W2945244600 cites W1490908269 @default.
- W2945244600 cites W1570596343 @default.
- W2945244600 cites W1901061060 @default.
- W2945244600 cites W1946416073 @default.
- W2945244600 cites W1964584862 @default.
- W2945244600 cites W2041936211 @default.
- W2945244600 cites W2080291321 @default.
- W2945244600 cites W2141965543 @default.
- W2945244600 cites W2143477326 @default.
- W2945244600 cites W2267992110 @default.
- W2945244600 cites W2560610202 @default.
- W2945244600 cites W2601957977 @default.
- W2945244600 cites W2622721620 @default.
- W2945244600 cites W2736113892 @default.
- W2945244600 doi "https://doi.org/10.23889/ijpds.v4i1.1095" @default.
- W2945244600 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/7482522" @default.
- W2945244600 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/32935029" @default.
- W2945244600 hasPublicationYear "2019" @default.
- W2945244600 type Work @default.
- W2945244600 sameAs 2945244600 @default.
- W2945244600 citedByCount "5" @default.
- W2945244600 countsByYear W29452446002019 @default.
- W2945244600 countsByYear W29452446002020 @default.
- W2945244600 countsByYear W29452446002023 @default.
- W2945244600 crossrefType "journal-article" @default.
- W2945244600 hasAuthorship W2945244600A5026306258 @default.
- W2945244600 hasAuthorship W2945244600A5055284401 @default.
- W2945244600 hasAuthorship W2945244600A5068893829 @default.
- W2945244600 hasAuthorship W2945244600A5076521685 @default.
- W2945244600 hasBestOaLocation W29452446001 @default.
- W2945244600 hasConcept C103278499 @default.
- W2945244600 hasConcept C104317684 @default.
- W2945244600 hasConcept C106131492 @default.
- W2945244600 hasConcept C11413529 @default.
- W2945244600 hasConcept C115961682 @default.
- W2945244600 hasConcept C124101348 @default.
- W2945244600 hasConcept C147224247 @default.
- W2945244600 hasConcept C151730666 @default.
- W2945244600 hasConcept C153180895 @default.
- W2945244600 hasConcept C154945302 @default.
- W2945244600 hasConcept C185592680 @default.
- W2945244600 hasConcept C202444582 @default.
- W2945244600 hasConcept C203519979 @default.
- W2945244600 hasConcept C2779343474 @default.
- W2945244600 hasConcept C31266012 @default.
- W2945244600 hasConcept C31972630 @default.
- W2945244600 hasConcept C33923547 @default.
- W2945244600 hasConcept C38652104 @default.
- W2945244600 hasConcept C41008148 @default.
- W2945244600 hasConcept C49937458 @default.
- W2945244600 hasConcept C55493867 @default.
- W2945244600 hasConcept C67388219 @default.
- W2945244600 hasConcept C74270461 @default.
- W2945244600 hasConcept C86803240 @default.
- W2945244600 hasConcept C9652623 @default.
- W2945244600 hasConcept C99138194 @default.
- W2945244600 hasConceptScore W2945244600C103278499 @default.
- W2945244600 hasConceptScore W2945244600C104317684 @default.
- W2945244600 hasConceptScore W2945244600C106131492 @default.
- W2945244600 hasConceptScore W2945244600C11413529 @default.
- W2945244600 hasConceptScore W2945244600C115961682 @default.
- W2945244600 hasConceptScore W2945244600C124101348 @default.
- W2945244600 hasConceptScore W2945244600C147224247 @default.
- W2945244600 hasConceptScore W2945244600C151730666 @default.
- W2945244600 hasConceptScore W2945244600C153180895 @default.
- W2945244600 hasConceptScore W2945244600C154945302 @default.
- W2945244600 hasConceptScore W2945244600C185592680 @default.
- W2945244600 hasConceptScore W2945244600C202444582 @default.
- W2945244600 hasConceptScore W2945244600C203519979 @default.
- W2945244600 hasConceptScore W2945244600C2779343474 @default.
- W2945244600 hasConceptScore W2945244600C31266012 @default.
- W2945244600 hasConceptScore W2945244600C31972630 @default.
- W2945244600 hasConceptScore W2945244600C33923547 @default.
- W2945244600 hasConceptScore W2945244600C38652104 @default.
- W2945244600 hasConceptScore W2945244600C41008148 @default.
- W2945244600 hasConceptScore W2945244600C49937458 @default.
- W2945244600 hasConceptScore W2945244600C55493867 @default.
- W2945244600 hasConceptScore W2945244600C67388219 @default.
- W2945244600 hasConceptScore W2945244600C74270461 @default.
- W2945244600 hasConceptScore W2945244600C86803240 @default.
- W2945244600 hasConceptScore W2945244600C9652623 @default.
- W2945244600 hasConceptScore W2945244600C99138194 @default.
- W2945244600 hasLocation W29452446001 @default.
- W2945244600 hasLocation W29452446002 @default.
- W2945244600 hasLocation W29452446003 @default.
- W2945244600 hasLocation W29452446004 @default.
- W2945244600 hasOpenAccess W2945244600 @default.
- W2945244600 hasPrimaryLocation W29452446001 @default.
- W2945244600 hasRelatedWork W105850746 @default.
- W2945244600 hasRelatedWork W11084441 @default.