Matches in SemOpenAlex for { <https://semopenalex.org/work/W2783000183> ?p ?o ?g. }
Showing items 1 to 83 of 83 with 100 items per page.
- W2783000183 endingPage "270" @default.
- W2783000183 startingPage "255" @default.
- W2783000183 abstract "Set similarity join is a core operation for text data integration, cleaning, and mining. Previous research work on improving the performance of set similarity joins mostly focused on sequential, CPU-based algorithms. Main optimizations of such algorithms exploit high threshold values and the underlying data characteristics to derive efficient filters. In this article, we investigate strategies to accelerate set similarity join by exploiting massive parallelism available in modern Graphics Processing Units (GPUs). We develop two new parallel set similarity join algorithms, which implement an inverted index on the GPU to quickly identify similar sets. The first algorithm, called gSSJoin, does not rely on any filtering scheme and, thus, exhibits much better robustness to variations of threshold values and data distributions. Moreover, gSSJoin adopts a load balancing strategy to evenly distribute the similarity calculations among the GPU's threads. The second algorithm, called sf-gSSJoin, applies a block division scheme for dealing with large datasets that do not fit in GPU memory. This scheme also enables a substantial reduction of the comparison space by discarding entire blocks based on size limits. We present variants of both algorithms for a multi-GPU platform to further exploit task parallelism in addition to data parallelism. Experimental evaluation on real-world datasets shows that we obtain up to 109x and 19.5x speedups over state-of-the-art algorithms for CPU and GPU, respectively." @default.
- W2783000183 created "2018-01-26" @default.
- W2783000183 creator A5033070469 @default.
- W2783000183 creator A5033478006 @default.
- W2783000183 creator A5082063263 @default.
- W2783000183 creator A5089604257 @default.
- W2783000183 date "2017-12-08" @default.
- W2783000183 modified "2023-09-23" @default.
- W2783000183 title "Fast Parallel Set Similarity Joins on Many-core Architectures" @default.
- W2783000183 hasPublicationYear "2017" @default.
- W2783000183 type Work @default.
- W2783000183 sameAs 2783000183 @default.
- W2783000183 citedByCount "2" @default.
- W2783000183 countsByYear W27830001832020 @default.
- W2783000183 countsByYear W27830001832021 @default.
- W2783000183 crossrefType "journal-article" @default.
- W2783000183 hasAuthorship W2783000183A5033070469 @default.
- W2783000183 hasAuthorship W2783000183A5033478006 @default.
- W2783000183 hasAuthorship W2783000183A5082063263 @default.
- W2783000183 hasAuthorship W2783000183A5089604257 @default.
- W2783000183 hasConcept C104317684 @default.
- W2783000183 hasConcept C11413529 @default.
- W2783000183 hasConcept C173608175 @default.
- W2783000183 hasConcept C177264268 @default.
- W2783000183 hasConcept C185592680 @default.
- W2783000183 hasConcept C199360897 @default.
- W2783000183 hasConcept C202491316 @default.
- W2783000183 hasConcept C2778692605 @default.
- W2783000183 hasConcept C2779851693 @default.
- W2783000183 hasConcept C2781172179 @default.
- W2783000183 hasConcept C41008148 @default.
- W2783000183 hasConcept C42992933 @default.
- W2783000183 hasConcept C55493867 @default.
- W2783000183 hasConcept C61483411 @default.
- W2783000183 hasConcept C63479239 @default.
- W2783000183 hasConcept C78766204 @default.
- W2783000183 hasConceptScore W2783000183C104317684 @default.
- W2783000183 hasConceptScore W2783000183C11413529 @default.
- W2783000183 hasConceptScore W2783000183C173608175 @default.
- W2783000183 hasConceptScore W2783000183C177264268 @default.
- W2783000183 hasConceptScore W2783000183C185592680 @default.
- W2783000183 hasConceptScore W2783000183C199360897 @default.
- W2783000183 hasConceptScore W2783000183C202491316 @default.
- W2783000183 hasConceptScore W2783000183C2778692605 @default.
- W2783000183 hasConceptScore W2783000183C2779851693 @default.
- W2783000183 hasConceptScore W2783000183C2781172179 @default.
- W2783000183 hasConceptScore W2783000183C41008148 @default.
- W2783000183 hasConceptScore W2783000183C42992933 @default.
- W2783000183 hasConceptScore W2783000183C55493867 @default.
- W2783000183 hasConceptScore W2783000183C61483411 @default.
- W2783000183 hasConceptScore W2783000183C63479239 @default.
- W2783000183 hasConceptScore W2783000183C78766204 @default.
- W2783000183 hasIssue "3" @default.
- W2783000183 hasLocation W27830001831 @default.
- W2783000183 hasOpenAccess W2783000183 @default.
- W2783000183 hasPrimaryLocation W27830001831 @default.
- W2783000183 hasRelatedWork W1735605371 @default.
- W2783000183 hasRelatedWork W1983582642 @default.
- W2783000183 hasRelatedWork W2063257142 @default.
- W2783000183 hasRelatedWork W2181749554 @default.
- W2783000183 hasRelatedWork W2241531243 @default.
- W2783000183 hasRelatedWork W2263559358 @default.
- W2783000183 hasRelatedWork W2295891025 @default.
- W2783000183 hasRelatedWork W2495231312 @default.
- W2783000183 hasRelatedWork W2625370688 @default.
- W2783000183 hasRelatedWork W2767167788 @default.
- W2783000183 hasRelatedWork W2793629884 @default.
- W2783000183 hasRelatedWork W2890532424 @default.
- W2783000183 hasRelatedWork W2907991499 @default.
- W2783000183 hasRelatedWork W2939089006 @default.
- W2783000183 hasRelatedWork W2955310615 @default.
- W2783000183 hasRelatedWork W2955403352 @default.
- W2783000183 hasRelatedWork W3106449537 @default.
- W2783000183 hasRelatedWork W3157025177 @default.
- W2783000183 hasRelatedWork W3193046301 @default.
- W2783000183 hasRelatedWork W3196777856 @default.
- W2783000183 hasVolume "8" @default.
- W2783000183 isParatext "false" @default.
- W2783000183 isRetracted "false" @default.
- W2783000183 magId "2783000183" @default.
- W2783000183 workType "article" @default.