Matches in SemOpenAlex for { <https://semopenalex.org/work/W3036697701> ?p ?o ?g. }
Showing items 1 to 67 of
67
with 100 items per page.
- W3036697701 abstract "The combination of powerful parallel frameworks and on-demand commodity hardware in distributed computing has made both analytics and decision support systems canonical to enterprises of all sizes. The unprecedented volumes of data stacked by companies present challenges to process analytical queries efficiently. This data is often organised as star schema, in which star join and group-by are ubiquitous and expensive operations. Although parallel frameworks such as Apache Spark facilitate join and group-by, the implementation can only process two tables at a time and fail to handle the excessive network communication, disk spills and multiple scans of data. In this paper, we present Distributed ATrie Group Join (DATGJ), a fast distributed star join and group-by algorithm for column-stores. DATGJ uses divide and broadcast-based joining technique where the fact table columns are partitioned equally and fast hash table (FHT) for each dimension table are broadcasted. This technique helps it avoid cross communication between workers and disk spills. DATGJ performs a single scan of partitioned fact table columns and use FHT to join data. FHT uses Robin Hood hashing with the upper limit on number of probes and achieve significant speed up during join. DATGJ performs group-by and aggregation leveraging progressive materialisation and realising grouping attributes as a tree shaped deterministic finite automation known as Aggregate Trie or ATrie. We evaluated our algorithm using Star Schema Benchmark (SSBM) to show that it is 1.5X to 6X faster than the most prominent approaches while having zero data shuffle and consistently perform well with addition of resources and in memory-constrained scenarios." @default.
- W3036697701 created "2020-06-25" @default.
- W3036697701 creator A5026035491 @default.
- W3036697701 creator A5065353209 @default.
- W3036697701 creator A5089591405 @default.
- W3036697701 date "2020-01-01" @default.
- W3036697701 modified "2023-09-25" @default.
- W3036697701 title "Distributed ATrie Group Join: Towards Zero Network Cost" @default.
- W3036697701 doi "https://doi.org/10.1109/access.2020.3003269" @default.
- W3036697701 hasPublicationYear "2020" @default.
- W3036697701 type Work @default.
- W3036697701 sameAs 3036697701 @default.
- W3036697701 citedByCount "0" @default.
- W3036697701 crossrefType "journal-article" @default.
- W3036697701 hasAuthorship W3036697701A5026035491 @default.
- W3036697701 hasAuthorship W3036697701A5065353209 @default.
- W3036697701 hasAuthorship W3036697701A5089591405 @default.
- W3036697701 hasBestOaLocation W30366977011 @default.
- W3036697701 hasConcept C114614502 @default.
- W3036697701 hasConcept C120314980 @default.
- W3036697701 hasConcept C173608175 @default.
- W3036697701 hasConcept C199360897 @default.
- W3036697701 hasConcept C2776124973 @default.
- W3036697701 hasConcept C2778692605 @default.
- W3036697701 hasConcept C2780224649 @default.
- W3036697701 hasConcept C33923547 @default.
- W3036697701 hasConcept C41008148 @default.
- W3036697701 hasConcept C42812 @default.
- W3036697701 hasConcept C44871818 @default.
- W3036697701 hasConcept C534932454 @default.
- W3036697701 hasConcept C67388219 @default.
- W3036697701 hasConcept C68339613 @default.
- W3036697701 hasConcept C80444323 @default.
- W3036697701 hasConcept C99138194 @default.
- W3036697701 hasConceptScore W3036697701C114614502 @default.
- W3036697701 hasConceptScore W3036697701C120314980 @default.
- W3036697701 hasConceptScore W3036697701C173608175 @default.
- W3036697701 hasConceptScore W3036697701C199360897 @default.
- W3036697701 hasConceptScore W3036697701C2776124973 @default.
- W3036697701 hasConceptScore W3036697701C2778692605 @default.
- W3036697701 hasConceptScore W3036697701C2780224649 @default.
- W3036697701 hasConceptScore W3036697701C33923547 @default.
- W3036697701 hasConceptScore W3036697701C41008148 @default.
- W3036697701 hasConceptScore W3036697701C42812 @default.
- W3036697701 hasConceptScore W3036697701C44871818 @default.
- W3036697701 hasConceptScore W3036697701C534932454 @default.
- W3036697701 hasConceptScore W3036697701C67388219 @default.
- W3036697701 hasConceptScore W3036697701C68339613 @default.
- W3036697701 hasConceptScore W3036697701C80444323 @default.
- W3036697701 hasConceptScore W3036697701C99138194 @default.
- W3036697701 hasLocation W30366977011 @default.
- W3036697701 hasOpenAccess W3036697701 @default.
- W3036697701 hasPrimaryLocation W30366977011 @default.
- W3036697701 hasRelatedWork W12527996 @default.
- W3036697701 hasRelatedWork W13345235 @default.
- W3036697701 hasRelatedWork W14046218 @default.
- W3036697701 hasRelatedWork W14135646 @default.
- W3036697701 hasRelatedWork W2929374 @default.
- W3036697701 hasRelatedWork W3450589 @default.
- W3036697701 hasRelatedWork W5574817 @default.
- W3036697701 hasRelatedWork W7146201 @default.
- W3036697701 hasRelatedWork W800331 @default.
- W3036697701 hasRelatedWork W221245 @default.
- W3036697701 isParatext "false" @default.
- W3036697701 isRetracted "false" @default.
- W3036697701 magId "3036697701" @default.
- W3036697701 workType "article" @default.