Matches in SemOpenAlex for { <https://semopenalex.org/work/W3089905937> ?p ?o ?g. }
Showing items 1 to 79 of
79
with 100 items per page.
- W3089905937 endingPage "239" @default.
- W3089905937 startingPage "224" @default.
- W3089905937 abstract "Entity resolution (ER) is becoming an increasingly important task across many domains (e.g., official statistics, human rights, medicine, etc.), where databases contain duplications of entities that need to be removed for later inferential and prediction tasks. Motivated by scaling to large data sets and providing uncertainty propagation, we propose a generalized approach to the blocking and ER pipeline which consists of two steps. First, a probabilistic blocking step, where we consider that of [], which is ER record in its own right. Its usage for blocking allows one to reduce the comparison space greatly, providing overlapping blocks for any ER method in the literature. Second, the probabilistic blocking step is passed to any ER method, where one can evaluate uncertainty propagation depending on the ER task. We consider that of [], which is a joint Bayesian method of both blocking and ER, that provides a joint posterior distribution regarding both the blocking and ER, and scales to large datasets, however, it does it a slower rate than when used in tandem with []. Through simulation and empirical studies, we show that our proposed methodology outperforms [, ] when used in isolation of each other. It produces reliable estimates of the underlying linkage structure and the number of true entities in each dataset. Furthermore, it produces an approximate posterior distribution and preserves transitive closures of the linkages." @default.
- W3089905937 created "2020-10-08" @default.
- W3089905937 creator A5048743195 @default.
- W3089905937 creator A5083198627 @default.
- W3089905937 date "2020-01-01" @default.
- W3089905937 modified "2023-10-09" @default.
- W3089905937 title "Probabilistic Blocking and Distributed Bayesian Entity Resolution" @default.
- W3089905937 cites W1981519674 @default.
- W3089905937 cites W1993637408 @default.
- W3089905937 cites W2017353792 @default.
- W3089905937 cites W2047464260 @default.
- W3089905937 cites W2073471108 @default.
- W3089905937 cites W2088008685 @default.
- W3089905937 cites W2105929850 @default.
- W3089905937 cites W2272596129 @default.
- W3089905937 cites W2920239148 @default.
- W3089905937 cites W2963313478 @default.
- W3089905937 cites W2984730910 @default.
- W3089905937 cites W3104667342 @default.
- W3089905937 cites W3121361745 @default.
- W3089905937 cites W4242744113 @default.
- W3089905937 doi "https://doi.org/10.1007/978-3-030-57521-2_16" @default.
- W3089905937 hasPublicationYear "2020" @default.
- W3089905937 type Work @default.
- W3089905937 sameAs 3089905937 @default.
- W3089905937 citedByCount "3" @default.
- W3089905937 countsByYear W30899059372022 @default.
- W3089905937 crossrefType "book-chapter" @default.
- W3089905937 hasAuthorship W3089905937A5048743195 @default.
- W3089905937 hasAuthorship W3089905937A5083198627 @default.
- W3089905937 hasConcept C105795698 @default.
- W3089905937 hasConcept C107673813 @default.
- W3089905937 hasConcept C11413529 @default.
- W3089905937 hasConcept C124101348 @default.
- W3089905937 hasConcept C144745244 @default.
- W3089905937 hasConcept C154945302 @default.
- W3089905937 hasConcept C162324750 @default.
- W3089905937 hasConcept C18653775 @default.
- W3089905937 hasConcept C187736073 @default.
- W3089905937 hasConcept C2780451532 @default.
- W3089905937 hasConcept C31258907 @default.
- W3089905937 hasConcept C33923547 @default.
- W3089905937 hasConcept C41008148 @default.
- W3089905937 hasConcept C49937458 @default.
- W3089905937 hasConcept C80444323 @default.
- W3089905937 hasConceptScore W3089905937C105795698 @default.
- W3089905937 hasConceptScore W3089905937C107673813 @default.
- W3089905937 hasConceptScore W3089905937C11413529 @default.
- W3089905937 hasConceptScore W3089905937C124101348 @default.
- W3089905937 hasConceptScore W3089905937C144745244 @default.
- W3089905937 hasConceptScore W3089905937C154945302 @default.
- W3089905937 hasConceptScore W3089905937C162324750 @default.
- W3089905937 hasConceptScore W3089905937C18653775 @default.
- W3089905937 hasConceptScore W3089905937C187736073 @default.
- W3089905937 hasConceptScore W3089905937C2780451532 @default.
- W3089905937 hasConceptScore W3089905937C31258907 @default.
- W3089905937 hasConceptScore W3089905937C33923547 @default.
- W3089905937 hasConceptScore W3089905937C41008148 @default.
- W3089905937 hasConceptScore W3089905937C49937458 @default.
- W3089905937 hasConceptScore W3089905937C80444323 @default.
- W3089905937 hasLocation W30899059371 @default.
- W3089905937 hasOpenAccess W3089905937 @default.
- W3089905937 hasPrimaryLocation W30899059371 @default.
- W3089905937 hasRelatedWork W1497573972 @default.
- W3089905937 hasRelatedWork W1846253165 @default.
- W3089905937 hasRelatedWork W1965371215 @default.
- W3089905937 hasRelatedWork W2071659383 @default.
- W3089905937 hasRelatedWork W2108990487 @default.
- W3089905937 hasRelatedWork W2124122503 @default.
- W3089905937 hasRelatedWork W2126435977 @default.
- W3089905937 hasRelatedWork W2126932387 @default.
- W3089905937 hasRelatedWork W2353762239 @default.
- W3089905937 hasRelatedWork W2392835431 @default.
- W3089905937 isParatext "false" @default.
- W3089905937 isRetracted "false" @default.
- W3089905937 magId "3089905937" @default.
- W3089905937 workType "book-chapter" @default.