Matches in SemOpenAlex for { <https://semopenalex.org/work/W2963313478> ?p ?o ?g. }
- W2963313478 abstract "Databases often contain corrupted, degraded, and noisy data with duplicate entries across and within each database. Such problems arise in citations, medical databases, genetics, human rights databases, and a variety of other applied settings. The target of statistical inference can be viewed as an unsupervised problem of determining the edges of a bipartite graph that links the observed records to unobserved latent entities. Bayesian approaches provide attractive benefits, naturally providing uncertainty quantification via posterior probabilities. We propose a novel record linkage approach based on empirical Bayesian principles. Specifically, the empirical Bayesian-type step consists of taking the empirical distribution function of the data as the prior for the latent entities. This approach improves on the earlier HB approach not only by avoiding the prior specification problem but also by allowing both categorical and string-valued variables. Our extension to string-valued variables also involves the proposal of a new probabilistic mechanism by which observed record values for string fields can deviate from the values of their associated latent entities. Categorical fields that deviate from their corresponding true value are simply drawn from the empirical distribution function. We apply our proposed methodology to a simulated data set of German names and an Italian household survey on income and wealth, showing our method performs favorably compared to several standard methods in the literature. We also consider the robustness of our methods to changes in the hyper-parameters." @default.
- W2963313478 created "2019-07-30" @default.
- W2963313478 creator A5048743195 @default.
- W2963313478 date "2015-12-01" @default.
- W2963313478 modified "2023-09-26" @default.
- W2963313478 title "Entity Resolution with Empirically Motivated Priors" @default.
- W2963313478 cites W1518784700 @default.
- W2963313478 cites W1536860849 @default.
- W2963313478 cites W1724849505 @default.
- W2963313478 cites W1854015338 @default.
- W2963313478 cites W1964879903 @default.
- W2963313478 cites W1975723136 @default.
- W2963313478 cites W2013909137 @default.
- W2963313478 cites W2017353792 @default.
- W2963313478 cites W2047464260 @default.
- W2963313478 cites W2049855118 @default.
- W2963313478 cites W2053870252 @default.
- W2963313478 cites W2055341013 @default.
- W2963313478 cites W2073471108 @default.
- W2963313478 cites W2080099271 @default.
- W2963313478 cites W2102763740 @default.
- W2963313478 cites W2162337786 @default.
- W2963313478 cites W2479390720 @default.
- W2963313478 cites W2911964244 @default.
- W2963313478 cites W3099006712 @default.
- W2963313478 cites W3104667342 @default.
- W2963313478 cites W4249214388 @default.
- W2963313478 cites W4254148532 @default.
- W2963313478 cites W4293052541 @default.
- W2963313478 doi "https://doi.org/10.1214/15-ba965si" @default.
- W2963313478 hasPublicationYear "2015" @default.
- W2963313478 type Work @default.
- W2963313478 sameAs 2963313478 @default.
- W2963313478 citedByCount "50" @default.
- W2963313478 countsByYear W29633134782014 @default.
- W2963313478 countsByYear W29633134782015 @default.
- W2963313478 countsByYear W29633134782016 @default.
- W2963313478 countsByYear W29633134782017 @default.
- W2963313478 countsByYear W29633134782018 @default.
- W2963313478 countsByYear W29633134782019 @default.
- W2963313478 countsByYear W29633134782020 @default.
- W2963313478 countsByYear W29633134782021 @default.
- W2963313478 countsByYear W29633134782022 @default.
- W2963313478 countsByYear W29633134782023 @default.
- W2963313478 crossrefType "journal-article" @default.
- W2963313478 hasAuthorship W2963313478A5048743195 @default.
- W2963313478 hasBestOaLocation W29633134781 @default.
- W2963313478 hasConcept C104317684 @default.
- W2963313478 hasConcept C105795698 @default.
- W2963313478 hasConcept C107673813 @default.
- W2963313478 hasConcept C119857082 @default.
- W2963313478 hasConcept C124101348 @default.
- W2963313478 hasConcept C149782125 @default.
- W2963313478 hasConcept C154945302 @default.
- W2963313478 hasConcept C160234255 @default.
- W2963313478 hasConcept C177769412 @default.
- W2963313478 hasConcept C185592680 @default.
- W2963313478 hasConcept C2776214188 @default.
- W2963313478 hasConcept C33923547 @default.
- W2963313478 hasConcept C41008148 @default.
- W2963313478 hasConcept C5274069 @default.
- W2963313478 hasConcept C55493867 @default.
- W2963313478 hasConcept C63479239 @default.
- W2963313478 hasConcept C98385598 @default.
- W2963313478 hasConceptScore W2963313478C104317684 @default.
- W2963313478 hasConceptScore W2963313478C105795698 @default.
- W2963313478 hasConceptScore W2963313478C107673813 @default.
- W2963313478 hasConceptScore W2963313478C119857082 @default.
- W2963313478 hasConceptScore W2963313478C124101348 @default.
- W2963313478 hasConceptScore W2963313478C149782125 @default.
- W2963313478 hasConceptScore W2963313478C154945302 @default.
- W2963313478 hasConceptScore W2963313478C160234255 @default.
- W2963313478 hasConceptScore W2963313478C177769412 @default.
- W2963313478 hasConceptScore W2963313478C185592680 @default.
- W2963313478 hasConceptScore W2963313478C2776214188 @default.
- W2963313478 hasConceptScore W2963313478C33923547 @default.
- W2963313478 hasConceptScore W2963313478C41008148 @default.
- W2963313478 hasConceptScore W2963313478C5274069 @default.
- W2963313478 hasConceptScore W2963313478C55493867 @default.
- W2963313478 hasConceptScore W2963313478C63479239 @default.
- W2963313478 hasConceptScore W2963313478C98385598 @default.
- W2963313478 hasIssue "4" @default.
- W2963313478 hasLocation W29633134781 @default.
- W2963313478 hasLocation W29633134782 @default.
- W2963313478 hasLocation W29633134783 @default.
- W2963313478 hasOpenAccess W2963313478 @default.
- W2963313478 hasPrimaryLocation W29633134781 @default.
- W2963313478 hasRelatedWork W1498794143 @default.
- W2963313478 hasRelatedWork W1581738021 @default.
- W2963313478 hasRelatedWork W1860485833 @default.
- W2963313478 hasRelatedWork W2081397010 @default.
- W2963313478 hasRelatedWork W2511279186 @default.
- W2963313478 hasRelatedWork W2899026863 @default.
- W2963313478 hasRelatedWork W2963058055 @default.
- W2963313478 hasRelatedWork W3143447564 @default.
- W2963313478 hasRelatedWork W4280548097 @default.
- W2963313478 hasRelatedWork W4297744796 @default.
- W2963313478 hasVolume "10" @default.
- W2963313478 isParatext "false" @default.
- W2963313478 isRetracted "false" @default.