Matches in SemOpenAlex for { <https://semopenalex.org/work/W2023608004> ?p ?o ?g. }
Showing items 1 to 70 of
70
with 100 items per page.
- W2023608004 abstract "State-of-the-art in duplicate detection in semi-structured data obtains significant improvement by exploiting the schema-related knowledge. Such schema-bound duplicate detection approaches, however, have severe limitations when dealing with multi-sourced, heterogeneous, high-velocity data streams. In this paper, we propose a novel context-aware duplicate detection system which is workload- and complexity-aware, and is adaptable to the underlying computing platform. The system operates in schema-oblivious manner, and relies upon information theory based heuristic and data shaping technique for efficient, and scalable duplicate detection in multi-sourced, heterogeneous data sets. Experiments with real-world data sets show speed up of up to 8X over state of-the-art schemes, while maintaining upto 92 percent accuracy. In addition, our data shaping technique for GPGPU processing speeds up the duplicate detection throughput by up to two orders of magnitude." @default.
- W2023608004 created "2016-06-24" @default.
- W2023608004 creator A5052170781 @default.
- W2023608004 creator A5087267932 @default.
- W2023608004 date "2014-06-01" @default.
- W2023608004 modified "2023-09-26" @default.
- W2023608004 title "Context-Aware Duplicate Detection in Semi-structured Data Streams" @default.
- W2023608004 cites W1610496399 @default.
- W2023608004 cites W1845040079 @default.
- W2023608004 cites W1976541661 @default.
- W2023608004 cites W1978478796 @default.
- W2023608004 cites W1978510388 @default.
- W2023608004 cites W1979954747 @default.
- W2023608004 cites W2001496424 @default.
- W2023608004 cites W2012546939 @default.
- W2023608004 cites W2047709159 @default.
- W2023608004 cites W2065290081 @default.
- W2023608004 cites W2084043637 @default.
- W2023608004 cites W2157468491 @default.
- W2023608004 cites W2164501930 @default.
- W2023608004 cites W54366487 @default.
- W2023608004 cites W92980254 @default.
- W2023608004 doi "https://doi.org/10.1109/services.2014.46" @default.
- W2023608004 hasPublicationYear "2014" @default.
- W2023608004 type Work @default.
- W2023608004 sameAs 2023608004 @default.
- W2023608004 citedByCount "1" @default.
- W2023608004 countsByYear W20236080042018 @default.
- W2023608004 crossrefType "proceedings-article" @default.
- W2023608004 hasAuthorship W2023608004A5052170781 @default.
- W2023608004 hasAuthorship W2023608004A5087267932 @default.
- W2023608004 hasConcept C111919701 @default.
- W2023608004 hasConcept C124101348 @default.
- W2023608004 hasConcept C154945302 @default.
- W2023608004 hasConcept C173801870 @default.
- W2023608004 hasConcept C23123220 @default.
- W2023608004 hasConcept C2778476105 @default.
- W2023608004 hasConcept C41008148 @default.
- W2023608004 hasConcept C48044578 @default.
- W2023608004 hasConcept C52146309 @default.
- W2023608004 hasConcept C77088390 @default.
- W2023608004 hasConcept C89198739 @default.
- W2023608004 hasConceptScore W2023608004C111919701 @default.
- W2023608004 hasConceptScore W2023608004C124101348 @default.
- W2023608004 hasConceptScore W2023608004C154945302 @default.
- W2023608004 hasConceptScore W2023608004C173801870 @default.
- W2023608004 hasConceptScore W2023608004C23123220 @default.
- W2023608004 hasConceptScore W2023608004C2778476105 @default.
- W2023608004 hasConceptScore W2023608004C41008148 @default.
- W2023608004 hasConceptScore W2023608004C48044578 @default.
- W2023608004 hasConceptScore W2023608004C52146309 @default.
- W2023608004 hasConceptScore W2023608004C77088390 @default.
- W2023608004 hasConceptScore W2023608004C89198739 @default.
- W2023608004 hasLocation W20236080041 @default.
- W2023608004 hasOpenAccess W2023608004 @default.
- W2023608004 hasPrimaryLocation W20236080041 @default.
- W2023608004 hasRelatedWork W2059591939 @default.
- W2023608004 hasRelatedWork W2074496324 @default.
- W2023608004 hasRelatedWork W2136701769 @default.
- W2023608004 hasRelatedWork W2138675498 @default.
- W2023608004 hasRelatedWork W2383597676 @default.
- W2023608004 hasRelatedWork W2383698455 @default.
- W2023608004 hasRelatedWork W2786293878 @default.
- W2023608004 hasRelatedWork W568669090 @default.
- W2023608004 hasRelatedWork W2183743873 @default.
- W2023608004 hasRelatedWork W2585167211 @default.
- W2023608004 isParatext "false" @default.
- W2023608004 isRetracted "false" @default.
- W2023608004 magId "2023608004" @default.
- W2023608004 workType "article" @default.