Matches in SemOpenAlex for { <https://semopenalex.org/work/W3115123517> ?p ?o ?g. }
- W3115123517 endingPage "3743" @default.
- W3115123517 startingPage "3735" @default.
- W3115123517 abstract "In the context of big data, data sharing between different institutions can not only reduce the cost of information collection greatly but also benefit for obtaining analysis results effectively and efficiently. Record linkage is the task of locating records that refer to the same entity from heterogeneous data sources. In the last decades, extensive researches on alphabet-based record linkages have been carried out, among which the Fellegi-Sunter model extended by Winkler has outperformed others. However, it is still a challenge to perform record linkage on Chinese-character-based datasets. In this article, two set-based methods (Cosine similarity and Dice similarity) were introduced firstly, and then the similarity of Chinese characters was quantified based on an adapted encoding technique which exploits the information of both the shape and the pronunciation of Chinese character. A new method entitled Hybrid similarity was proposed in the next part, which is the combination of the character transformation technique (SoundShape Code) and Dice similarity. Finally, we performed the aforementioned methods on the simulated datasets, and each method was evaluated by counting the number of misclassified record pairs and the computational time. The results demonstrated that our Hybrid similarity method outperformed others in reducing the number of misclassified pairs with a relatively low computational cost." @default.
- W3115123517 created "2021-01-05" @default.
- W3115123517 creator A5039266973 @default.
- W3115123517 creator A5040821202 @default.
- W3115123517 creator A5056549452 @default.
- W3115123517 date "2021-01-01" @default.
- W3115123517 modified "2023-10-18" @default.
- W3115123517 title "String Comparators for Chinese-Characters-Based Record Linkages" @default.
- W3115123517 cites W1514828683 @default.
- W3115123517 cites W1964584862 @default.
- W3115123517 cites W2004918398 @default.
- W3115123517 cites W2024018837 @default.
- W3115123517 cites W2028251458 @default.
- W3115123517 cites W2087483562 @default.
- W3115123517 cites W2101508813 @default.
- W3115123517 cites W2102249999 @default.
- W3115123517 cites W2128600649 @default.
- W3115123517 cites W2561506904 @default.
- W3115123517 cites W2759366113 @default.
- W3115123517 cites W2998278577 @default.
- W3115123517 cites W4230502578 @default.
- W3115123517 cites W4242744113 @default.
- W3115123517 doi "https://doi.org/10.1109/access.2020.3047927" @default.
- W3115123517 hasPublicationYear "2021" @default.
- W3115123517 type Work @default.
- W3115123517 sameAs 3115123517 @default.
- W3115123517 citedByCount "3" @default.
- W3115123517 countsByYear W31151235172021 @default.
- W3115123517 countsByYear W31151235172022 @default.
- W3115123517 crossrefType "journal-article" @default.
- W3115123517 hasAuthorship W3115123517A5039266973 @default.
- W3115123517 hasAuthorship W3115123517A5040821202 @default.
- W3115123517 hasAuthorship W3115123517A5056549452 @default.
- W3115123517 hasBestOaLocation W31151235171 @default.
- W3115123517 hasConcept C103278499 @default.
- W3115123517 hasConcept C115961682 @default.
- W3115123517 hasConcept C124101348 @default.
- W3115123517 hasConcept C142210648 @default.
- W3115123517 hasConcept C144024400 @default.
- W3115123517 hasConcept C149923435 @default.
- W3115123517 hasConcept C151730666 @default.
- W3115123517 hasConcept C153180895 @default.
- W3115123517 hasConcept C154945302 @default.
- W3115123517 hasConcept C157486923 @default.
- W3115123517 hasConcept C162324750 @default.
- W3115123517 hasConcept C177264268 @default.
- W3115123517 hasConcept C187736073 @default.
- W3115123517 hasConcept C199360897 @default.
- W3115123517 hasConcept C204321447 @default.
- W3115123517 hasConcept C22029948 @default.
- W3115123517 hasConcept C2524010 @default.
- W3115123517 hasConcept C2779343474 @default.
- W3115123517 hasConcept C2780451532 @default.
- W3115123517 hasConcept C2780762811 @default.
- W3115123517 hasConcept C2780861071 @default.
- W3115123517 hasConcept C2908647359 @default.
- W3115123517 hasConcept C33923547 @default.
- W3115123517 hasConcept C37914503 @default.
- W3115123517 hasConcept C41008148 @default.
- W3115123517 hasConcept C44359876 @default.
- W3115123517 hasConcept C86803240 @default.
- W3115123517 hasConceptScore W3115123517C103278499 @default.
- W3115123517 hasConceptScore W3115123517C115961682 @default.
- W3115123517 hasConceptScore W3115123517C124101348 @default.
- W3115123517 hasConceptScore W3115123517C142210648 @default.
- W3115123517 hasConceptScore W3115123517C144024400 @default.
- W3115123517 hasConceptScore W3115123517C149923435 @default.
- W3115123517 hasConceptScore W3115123517C151730666 @default.
- W3115123517 hasConceptScore W3115123517C153180895 @default.
- W3115123517 hasConceptScore W3115123517C154945302 @default.
- W3115123517 hasConceptScore W3115123517C157486923 @default.
- W3115123517 hasConceptScore W3115123517C162324750 @default.
- W3115123517 hasConceptScore W3115123517C177264268 @default.
- W3115123517 hasConceptScore W3115123517C187736073 @default.
- W3115123517 hasConceptScore W3115123517C199360897 @default.
- W3115123517 hasConceptScore W3115123517C204321447 @default.
- W3115123517 hasConceptScore W3115123517C22029948 @default.
- W3115123517 hasConceptScore W3115123517C2524010 @default.
- W3115123517 hasConceptScore W3115123517C2779343474 @default.
- W3115123517 hasConceptScore W3115123517C2780451532 @default.
- W3115123517 hasConceptScore W3115123517C2780762811 @default.
- W3115123517 hasConceptScore W3115123517C2780861071 @default.
- W3115123517 hasConceptScore W3115123517C2908647359 @default.
- W3115123517 hasConceptScore W3115123517C33923547 @default.
- W3115123517 hasConceptScore W3115123517C37914503 @default.
- W3115123517 hasConceptScore W3115123517C41008148 @default.
- W3115123517 hasConceptScore W3115123517C44359876 @default.
- W3115123517 hasConceptScore W3115123517C86803240 @default.
- W3115123517 hasFunder F4320321001 @default.
- W3115123517 hasFunder F4320335787 @default.
- W3115123517 hasLocation W31151235171 @default.
- W3115123517 hasLocation W31151235172 @default.
- W3115123517 hasOpenAccess W3115123517 @default.
- W3115123517 hasPrimaryLocation W31151235171 @default.
- W3115123517 hasRelatedWork W2007540612 @default.
- W3115123517 hasRelatedWork W2009559548 @default.
- W3115123517 hasRelatedWork W2016385589 @default.
- W3115123517 hasRelatedWork W2054882906 @default.