Matches in SemOpenAlex for { <https://semopenalex.org/work/W2811187344> ?p ?o ?g. }
- W2811187344 abstract "Consider the following problem: given a database of records indexed by names (e.g., names of companies, restaurants, businesses, or universities) and a new name, determine whether the new name is in the database, and if so, which record it refers to. This problem is an instance of the record linkage problem and is challenging because people do not consistently use the official name, but use abbreviations, synonyms, different orders of terms, different spellings of terms, and short forms of terms, and the name can contain typos or spacing issues. We provide a probabilistic model using relational logistic regression to find the probability of each record in the database being the desired record for a given query and find the best record(s) with respect to the probabilities. Building on term-matching and translational approaches for search, our model addresses many of the aforementioned challenges and provides good results when existing baselines fail. Using the probabilities output by the model, we can automate the search process for the portion of queries whose desired documents get a probability higher than a trust threshold. We evaluate our model on a large real-world dataset from a telecommunications company and compare it to several state-of-the-art baselines. The obtained results show that our model is a promising probabilistic model for record linkage for names. We also test whether the knowledge learned by our model on one domain can be effectively transferred to a new domain. For this purpose, we test our model on an unseen test set from the business names of the secondString dataset. Promising results show that our model can be effectively applied to unseen datasets. Finally, we study the sensitivity of our model to the statistics of datasets." @default.
- W2811187344 created "2018-07-10" @default.
- W2811187344 creator A5044532113 @default.
- W2811187344 creator A5063661444 @default.
- W2811187344 creator A5065918431 @default.
- W2811187344 date "2018-06-26" @default.
- W2811187344 modified "2023-09-27" @default.
- W2811187344 title "Record Linkage to Match Customer Names: A Probabilistic Approach." @default.
- W2811187344 cites W1547612978 @default.
- W2811187344 cites W1569123402 @default.
- W2811187344 cites W1880262756 @default.
- W2811187344 cites W1966443646 @default.
- W2811187344 cites W1978394996 @default.
- W2811187344 cites W2010392031 @default.
- W2811187344 cites W2024770506 @default.
- W2811187344 cites W2024932032 @default.
- W2811187344 cites W2028742638 @default.
- W2811187344 cites W2073471108 @default.
- W2811187344 cites W2082718666 @default.
- W2811187344 cites W2085099553 @default.
- W2811187344 cites W2102350406 @default.
- W2811187344 cites W2107743791 @default.
- W2811187344 cites W2115924763 @default.
- W2811187344 cites W2123561513 @default.
- W2811187344 cites W2136189984 @default.
- W2811187344 cites W2139688392 @default.
- W2811187344 cites W2147152072 @default.
- W2811187344 cites W2155482025 @default.
- W2811187344 cites W2161936973 @default.
- W2811187344 cites W2168190036 @default.
- W2811187344 cites W2464810161 @default.
- W2811187344 cites W2611099133 @default.
- W2811187344 cites W2737905854 @default.
- W2811187344 cites W3048357673 @default.
- W2811187344 hasPublicationYear "2018" @default.
- W2811187344 type Work @default.
- W2811187344 sameAs 2811187344 @default.
- W2811187344 citedByCount "0" @default.
- W2811187344 crossrefType "posted-content" @default.
- W2811187344 hasAuthorship W2811187344A5044532113 @default.
- W2811187344 hasAuthorship W2811187344A5063661444 @default.
- W2811187344 hasAuthorship W2811187344A5065918431 @default.
- W2811187344 hasConcept C104317684 @default.
- W2811187344 hasConcept C105795698 @default.
- W2811187344 hasConcept C114289077 @default.
- W2811187344 hasConcept C124101348 @default.
- W2811187344 hasConcept C134306372 @default.
- W2811187344 hasConcept C142210648 @default.
- W2811187344 hasConcept C144024400 @default.
- W2811187344 hasConcept C149923435 @default.
- W2811187344 hasConcept C154945302 @default.
- W2811187344 hasConcept C165064840 @default.
- W2811187344 hasConcept C177264268 @default.
- W2811187344 hasConcept C185592680 @default.
- W2811187344 hasConcept C199360897 @default.
- W2811187344 hasConcept C23123220 @default.
- W2811187344 hasConcept C2908647359 @default.
- W2811187344 hasConcept C31266012 @default.
- W2811187344 hasConcept C33923547 @default.
- W2811187344 hasConcept C36503486 @default.
- W2811187344 hasConcept C41008148 @default.
- W2811187344 hasConcept C49937458 @default.
- W2811187344 hasConcept C55493867 @default.
- W2811187344 hasConcept C77088390 @default.
- W2811187344 hasConceptScore W2811187344C104317684 @default.
- W2811187344 hasConceptScore W2811187344C105795698 @default.
- W2811187344 hasConceptScore W2811187344C114289077 @default.
- W2811187344 hasConceptScore W2811187344C124101348 @default.
- W2811187344 hasConceptScore W2811187344C134306372 @default.
- W2811187344 hasConceptScore W2811187344C142210648 @default.
- W2811187344 hasConceptScore W2811187344C144024400 @default.
- W2811187344 hasConceptScore W2811187344C149923435 @default.
- W2811187344 hasConceptScore W2811187344C154945302 @default.
- W2811187344 hasConceptScore W2811187344C165064840 @default.
- W2811187344 hasConceptScore W2811187344C177264268 @default.
- W2811187344 hasConceptScore W2811187344C185592680 @default.
- W2811187344 hasConceptScore W2811187344C199360897 @default.
- W2811187344 hasConceptScore W2811187344C23123220 @default.
- W2811187344 hasConceptScore W2811187344C2908647359 @default.
- W2811187344 hasConceptScore W2811187344C31266012 @default.
- W2811187344 hasConceptScore W2811187344C33923547 @default.
- W2811187344 hasConceptScore W2811187344C36503486 @default.
- W2811187344 hasConceptScore W2811187344C41008148 @default.
- W2811187344 hasConceptScore W2811187344C49937458 @default.
- W2811187344 hasConceptScore W2811187344C55493867 @default.
- W2811187344 hasConceptScore W2811187344C77088390 @default.
- W2811187344 hasLocation W28111873441 @default.
- W2811187344 hasOpenAccess W2811187344 @default.
- W2811187344 hasPrimaryLocation W28111873441 @default.
- W2811187344 hasRelatedWork W1540269031 @default.
- W2811187344 hasRelatedWork W198982522 @default.
- W2811187344 hasRelatedWork W2004305018 @default.
- W2811187344 hasRelatedWork W2015895266 @default.
- W2811187344 hasRelatedWork W2039188080 @default.
- W2811187344 hasRelatedWork W2041706320 @default.
- W2811187344 hasRelatedWork W2060417289 @default.
- W2811187344 hasRelatedWork W2106675345 @default.
- W2811187344 hasRelatedWork W2171450492 @default.
- W2811187344 hasRelatedWork W2793798730 @default.
- W2811187344 hasRelatedWork W2795099423 @default.