Matches in SemOpenAlex for { <https://semopenalex.org/work/W2017167478> ?p ?o ?g. }
- W2017167478 endingPage "29" @default.
- W2017167478 startingPage "1" @default.
- W2017167478 abstract "Real-time Entity Resolution (ER) is the process of matching query records in subsecond time with records in a database that represent the same real-world entity. Indexing techniques are generally used to efficiently extract a set of candidate records from the database that are similar to a query record, and that are to be compared with the query record in more detail. The sorted neighborhood indexing method, which sorts a database and compares records within a sliding window, has been successfully used for ER of large static databases. However, because it is based on static sorted arrays and is designed for batch ER that resolves all records in a database rather than resolving those relating to a single query record, this technique is not suitable for real-time ER on dynamic databases that are constantly updated. We propose a tree-based technique that facilitates dynamic indexing based on the sorted neighborhood method, which can be used for real-time ER, and investigate both static and adaptive window approaches. We propose an approach to reduce query matching times by precalculating the similarities between attribute values stored in neighboring tree nodes. We also propose a multitree solution where different sorting keys are used to reduce the effects of errors and variations in attribute values on matching quality by building several distinct index trees. We experimentally evaluate our proposed techniques on large real datasets, as well as on synthetic data with different data quality characteristics. Our results show that as the index grows, no appreciable increase occurs in both record insertion and query times, and that using multiple trees gives noticeable improvements on matching quality with only a small increase in query time. Compared to earlier indexing techniques for real-time ER, our approach achieves significantly reduced indexing and query matching times while maintaining high matching accuracy." @default.
- W2017167478 created "2016-06-24" @default.
- W2017167478 creator A5019675124 @default.
- W2017167478 creator A5022945960 @default.
- W2017167478 creator A5075528828 @default.
- W2017167478 creator A5080206968 @default.
- W2017167478 date "2015-10-23" @default.
- W2017167478 modified "2023-09-26" @default.
- W2017167478 title "Dynamic Sorted Neighborhood Indexing for Real-Time Entity Resolution" @default.
- W2017167478 cites W1155226818 @default.
- W2017167478 cites W1612155886 @default.
- W2017167478 cites W2024386211 @default.
- W2017167478 cites W2024770506 @default.
- W2017167478 cites W2031250218 @default.
- W2017167478 cites W2036216970 @default.
- W2017167478 cites W2039789840 @default.
- W2017167478 cites W2044280769 @default.
- W2017167478 cites W2077044633 @default.
- W2017167478 cites W2087966340 @default.
- W2017167478 cites W2102763740 @default.
- W2017167478 cites W2108991785 @default.
- W2017167478 cites W2109834209 @default.
- W2017167478 cites W2131967083 @default.
- W2017167478 cites W2139646386 @default.
- W2017167478 cites W2140789797 @default.
- W2017167478 cites W2148524305 @default.
- W2017167478 cites W2150228342 @default.
- W2017167478 cites W2155631901 @default.
- W2017167478 cites W2161694911 @default.
- W2017167478 cites W2166988329 @default.
- W2017167478 cites W2169024178 @default.
- W2017167478 cites W2261544779 @default.
- W2017167478 cites W3045341009 @default.
- W2017167478 cites W4242744113 @default.
- W2017167478 cites W69311973 @default.
- W2017167478 doi "https://doi.org/10.1145/2816821" @default.
- W2017167478 hasPublicationYear "2015" @default.
- W2017167478 type Work @default.
- W2017167478 sameAs 2017167478 @default.
- W2017167478 citedByCount "15" @default.
- W2017167478 countsByYear W20171674782016 @default.
- W2017167478 countsByYear W20171674782018 @default.
- W2017167478 countsByYear W20171674782019 @default.
- W2017167478 countsByYear W20171674782020 @default.
- W2017167478 countsByYear W20171674782021 @default.
- W2017167478 countsByYear W20171674782022 @default.
- W2017167478 countsByYear W20171674782023 @default.
- W2017167478 crossrefType "journal-article" @default.
- W2017167478 hasAuthorship W2017167478A5019675124 @default.
- W2017167478 hasAuthorship W2017167478A5022945960 @default.
- W2017167478 hasAuthorship W2017167478A5075528828 @default.
- W2017167478 hasAuthorship W2017167478A5080206968 @default.
- W2017167478 hasBestOaLocation W20171674782 @default.
- W2017167478 hasConcept C102392041 @default.
- W2017167478 hasConcept C105795698 @default.
- W2017167478 hasConcept C111696304 @default.
- W2017167478 hasConcept C111919701 @default.
- W2017167478 hasConcept C113174947 @default.
- W2017167478 hasConcept C11413529 @default.
- W2017167478 hasConcept C124101348 @default.
- W2017167478 hasConcept C134306372 @default.
- W2017167478 hasConcept C165064840 @default.
- W2017167478 hasConcept C177264268 @default.
- W2017167478 hasConcept C199360897 @default.
- W2017167478 hasConcept C23123220 @default.
- W2017167478 hasConcept C2778751112 @default.
- W2017167478 hasConcept C33923547 @default.
- W2017167478 hasConcept C41008148 @default.
- W2017167478 hasConcept C59276292 @default.
- W2017167478 hasConcept C75165309 @default.
- W2017167478 hasConcept C77088390 @default.
- W2017167478 hasConceptScore W2017167478C102392041 @default.
- W2017167478 hasConceptScore W2017167478C105795698 @default.
- W2017167478 hasConceptScore W2017167478C111696304 @default.
- W2017167478 hasConceptScore W2017167478C111919701 @default.
- W2017167478 hasConceptScore W2017167478C113174947 @default.
- W2017167478 hasConceptScore W2017167478C11413529 @default.
- W2017167478 hasConceptScore W2017167478C124101348 @default.
- W2017167478 hasConceptScore W2017167478C134306372 @default.
- W2017167478 hasConceptScore W2017167478C165064840 @default.
- W2017167478 hasConceptScore W2017167478C177264268 @default.
- W2017167478 hasConceptScore W2017167478C199360897 @default.
- W2017167478 hasConceptScore W2017167478C23123220 @default.
- W2017167478 hasConceptScore W2017167478C2778751112 @default.
- W2017167478 hasConceptScore W2017167478C33923547 @default.
- W2017167478 hasConceptScore W2017167478C41008148 @default.
- W2017167478 hasConceptScore W2017167478C59276292 @default.
- W2017167478 hasConceptScore W2017167478C75165309 @default.
- W2017167478 hasConceptScore W2017167478C77088390 @default.
- W2017167478 hasFunder F4320334704 @default.
- W2017167478 hasIssue "4" @default.
- W2017167478 hasLocation W20171674781 @default.
- W2017167478 hasLocation W20171674782 @default.
- W2017167478 hasLocation W20171674783 @default.
- W2017167478 hasOpenAccess W2017167478 @default.
- W2017167478 hasPrimaryLocation W20171674781 @default.
- W2017167478 hasRelatedWork W1487822255 @default.
- W2017167478 hasRelatedWork W1605381316 @default.