Matches in SemOpenAlex for { <https://semopenalex.org/work/W4294651292> ?p ?o ?g. }
Showing items 1 to 67 of 67 with 100 items per page.
- W4294651292 abstract "Some of the greatest advances in web search have come from leveraging socio-economic properties of online user behavior. Past advances include PageRank, anchor text, hubs-authorities, and TF-IDF. In this paper, we investigate another socio-economic property that, to our knowledge, has not yet been exploited: sites that create lists of entities, such as IMDB and Netflix, have an incentive to avoid gratuitous duplicates. We leverage this property to resolve entities across the different web sites, and find that we can obtain substantial improvements in resolution accuracy. This improvement in accuracy also translates into robustness, which often reduces the amount of training data that must be labeled for comparing entities across many sites. Furthermore, the technique provides robustness when resolving sites that have some duplicates, even without first removing these duplicates. We present algorithms with very strong precision and recall, and show that max weight matching, while appearing to be a natural choice, turns out to have poor performance in some situations. The presented techniques are now being used in the back-end entity resolution system at a major Internet search engine." @default.
- W4294651292 created "2022-09-05" @default.
- W4294651292 creator A5025344952 @default.
- W4294651292 creator A5034065279 @default.
- W4294651292 creator A5078824132 @default.
- W4294651292 date "2011-08-30" @default.
- W4294651292 modified "2023-10-18" @default.
- W4294651292 title "Improving Entity Resolution with Global Constraints" @default.
- W4294651292 doi "https://doi.org/10.48550/arxiv.1108.6016" @default.
- W4294651292 hasPublicationYear "2011" @default.
- W4294651292 type Work @default.
- W4294651292 citedByCount "0" @default.
- W4294651292 crossrefType "posted-content" @default.
- W4294651292 hasAuthorship W4294651292A5025344952 @default.
- W4294651292 hasAuthorship W4294651292A5034065279 @default.
- W4294651292 hasAuthorship W4294651292A5078824132 @default.
- W4294651292 hasBestOaLocation W42946512921 @default.
- W4294651292 hasConcept C104317684 @default.
- W4294651292 hasConcept C110875604 @default.
- W4294651292 hasConcept C124101348 @default.
- W4294651292 hasConcept C136764020 @default.
- W4294651292 hasConcept C153083717 @default.
- W4294651292 hasConcept C154945302 @default.
- W4294651292 hasConcept C162324750 @default.
- W4294651292 hasConcept C175444787 @default.
- W4294651292 hasConcept C185592680 @default.
- W4294651292 hasConcept C23123220 @default.
- W4294651292 hasConcept C2779172887 @default.
- W4294651292 hasConcept C29122968 @default.
- W4294651292 hasConcept C41008148 @default.
- W4294651292 hasConcept C55493867 @default.
- W4294651292 hasConcept C63479239 @default.
- W4294651292 hasConcept C81669768 @default.
- W4294651292 hasConcept C97854310 @default.
- W4294651292 hasConceptScore W4294651292C104317684 @default.
- W4294651292 hasConceptScore W4294651292C110875604 @default.
- W4294651292 hasConceptScore W4294651292C124101348 @default.
- W4294651292 hasConceptScore W4294651292C136764020 @default.
- W4294651292 hasConceptScore W4294651292C153083717 @default.
- W4294651292 hasConceptScore W4294651292C154945302 @default.
- W4294651292 hasConceptScore W4294651292C162324750 @default.
- W4294651292 hasConceptScore W4294651292C175444787 @default.
- W4294651292 hasConceptScore W4294651292C185592680 @default.
- W4294651292 hasConceptScore W4294651292C23123220 @default.
- W4294651292 hasConceptScore W4294651292C2779172887 @default.
- W4294651292 hasConceptScore W4294651292C29122968 @default.
- W4294651292 hasConceptScore W4294651292C41008148 @default.
- W4294651292 hasConceptScore W4294651292C55493867 @default.
- W4294651292 hasConceptScore W4294651292C63479239 @default.
- W4294651292 hasConceptScore W4294651292C81669768 @default.
- W4294651292 hasConceptScore W4294651292C97854310 @default.
- W4294651292 hasLocation W42946512921 @default.
- W4294651292 hasOpenAccess W4294651292 @default.
- W4294651292 hasPrimaryLocation W42946512921 @default.
- W4294651292 hasRelatedWork W1571112163 @default.
- W4294651292 hasRelatedWork W1989785885 @default.
- W4294651292 hasRelatedWork W2001121861 @default.
- W4294651292 hasRelatedWork W2102585996 @default.
- W4294651292 hasRelatedWork W2350178533 @default.
- W4294651292 hasRelatedWork W2350230178 @default.
- W4294651292 hasRelatedWork W2477087992 @default.
- W4294651292 hasRelatedWork W2805117467 @default.
- W4294651292 hasRelatedWork W83344948 @default.
- W4294651292 hasRelatedWork W2593674131 @default.
- W4294651292 isParatext "false" @default.
- W4294651292 isRetracted "false" @default.
- W4294651292 workType "article" @default.