Matches in SemOpenAlex for { <https://semopenalex.org/work/W2541780901> ?p ?o ?g. }
Showing items 1 to 95 of
95
with 100 items per page.
- W2541780901 endingPage "1425" @default.
- W2541780901 startingPage "1417" @default.
- W2541780901 abstract "Most generative models for clustering implicitly assume that the number of data points in each cluster grows linearly with the total number of data points. Finite mixture models, Dirichlet process mixture models, and Pitman--Yor process mixture models make this assumption, as do all other infinitely exchangeable clustering models. However, for some applications, this assumption is inappropriate. For example, when performing entity resolution, the size of each cluster should be unrelated to the size of the data set, and each cluster should contain a negligible fraction of the total number of data points. These applications require models that yield clusters whose sizes grow sublinearly with the size of the data set. We address this requirement by defining the microclustering property and introducing a new class of models that can exhibit this property. We compare models within this class to two commonly used clustering models using four entity-resolution data sets." @default.
- W2541780901 created "2016-11-04" @default.
- W2541780901 creator A5010112180 @default.
- W2541780901 creator A5044818494 @default.
- W2541780901 creator A5046348432 @default.
- W2541780901 creator A5048743195 @default.
- W2541780901 creator A5062491445 @default.
- W2541780901 creator A5067934767 @default.
- W2541780901 date "2016-10-31" @default.
- W2541780901 modified "2023-10-18" @default.
- W2541780901 title "Flexible Models for Microclustering with Application to Entity Resolution" @default.
- W2541780901 hasPublicationYear "2016" @default.
- W2541780901 type Work @default.
- W2541780901 sameAs 2541780901 @default.
- W2541780901 citedByCount "12" @default.
- W2541780901 countsByYear W25417809012017 @default.
- W2541780901 countsByYear W25417809012018 @default.
- W2541780901 countsByYear W25417809012019 @default.
- W2541780901 countsByYear W25417809012020 @default.
- W2541780901 countsByYear W25417809012021 @default.
- W2541780901 crossrefType "proceedings-article" @default.
- W2541780901 hasAuthorship W2541780901A5010112180 @default.
- W2541780901 hasAuthorship W2541780901A5044818494 @default.
- W2541780901 hasAuthorship W2541780901A5046348432 @default.
- W2541780901 hasAuthorship W2541780901A5048743195 @default.
- W2541780901 hasAuthorship W2541780901A5062491445 @default.
- W2541780901 hasAuthorship W2541780901A5067934767 @default.
- W2541780901 hasConcept C111472728 @default.
- W2541780901 hasConcept C124101348 @default.
- W2541780901 hasConcept C138268822 @default.
- W2541780901 hasConcept C138885662 @default.
- W2541780901 hasConcept C154945302 @default.
- W2541780901 hasConcept C164866538 @default.
- W2541780901 hasConcept C177264268 @default.
- W2541780901 hasConcept C189950617 @default.
- W2541780901 hasConcept C199360897 @default.
- W2541780901 hasConcept C2776214188 @default.
- W2541780901 hasConcept C2777212361 @default.
- W2541780901 hasConcept C2781280628 @default.
- W2541780901 hasConcept C33923547 @default.
- W2541780901 hasConcept C41008148 @default.
- W2541780901 hasConcept C58489278 @default.
- W2541780901 hasConcept C61224824 @default.
- W2541780901 hasConcept C67186912 @default.
- W2541780901 hasConcept C73555534 @default.
- W2541780901 hasConcept C77088390 @default.
- W2541780901 hasConceptScore W2541780901C111472728 @default.
- W2541780901 hasConceptScore W2541780901C124101348 @default.
- W2541780901 hasConceptScore W2541780901C138268822 @default.
- W2541780901 hasConceptScore W2541780901C138885662 @default.
- W2541780901 hasConceptScore W2541780901C154945302 @default.
- W2541780901 hasConceptScore W2541780901C164866538 @default.
- W2541780901 hasConceptScore W2541780901C177264268 @default.
- W2541780901 hasConceptScore W2541780901C189950617 @default.
- W2541780901 hasConceptScore W2541780901C199360897 @default.
- W2541780901 hasConceptScore W2541780901C2776214188 @default.
- W2541780901 hasConceptScore W2541780901C2777212361 @default.
- W2541780901 hasConceptScore W2541780901C2781280628 @default.
- W2541780901 hasConceptScore W2541780901C33923547 @default.
- W2541780901 hasConceptScore W2541780901C41008148 @default.
- W2541780901 hasConceptScore W2541780901C58489278 @default.
- W2541780901 hasConceptScore W2541780901C61224824 @default.
- W2541780901 hasConceptScore W2541780901C67186912 @default.
- W2541780901 hasConceptScore W2541780901C73555534 @default.
- W2541780901 hasConceptScore W2541780901C77088390 @default.
- W2541780901 hasLocation W25417809011 @default.
- W2541780901 hasOpenAccess W2541780901 @default.
- W2541780901 hasPrimaryLocation W25417809011 @default.
- W2541780901 hasRelatedWork W108299917 @default.
- W2541780901 hasRelatedWork W1540572177 @default.
- W2541780901 hasRelatedWork W1547612978 @default.
- W2541780901 hasRelatedWork W1603563029 @default.
- W2541780901 hasRelatedWork W1981519674 @default.
- W2541780901 hasRelatedWork W2017353792 @default.
- W2541780901 hasRelatedWork W2047464260 @default.
- W2541780901 hasRelatedWork W2049855118 @default.
- W2541780901 hasRelatedWork W2053870252 @default.
- W2541780901 hasRelatedWork W2073471108 @default.
- W2541780901 hasRelatedWork W2094173157 @default.
- W2541780901 hasRelatedWork W2105929850 @default.
- W2541780901 hasRelatedWork W2182691810 @default.
- W2541780901 hasRelatedWork W2272596129 @default.
- W2541780901 hasRelatedWork W2610877833 @default.
- W2541780901 hasRelatedWork W2963313478 @default.
- W2541780901 hasRelatedWork W3104667342 @default.
- W2541780901 hasRelatedWork W312286408 @default.
- W2541780901 hasRelatedWork W98050876 @default.
- W2541780901 hasRelatedWork W2908287046 @default.
- W2541780901 hasVolume "29" @default.
- W2541780901 isParatext "false" @default.
- W2541780901 isRetracted "false" @default.
- W2541780901 magId "2541780901" @default.
- W2541780901 workType "article" @default.