Matches in SemOpenAlex for { <https://semopenalex.org/work/W2766504241> ?p ?o ?g. }
- W2766504241 endingPage "2072" @default.
- W2766504241 startingPage "2061" @default.
- W2766504241 abstract "Linear algebra operations are at the core of many Machine Learning (ML) programs. At the same time, a considerable amount of the effort for solving data analytics problems is spent in data preparation. As a result, end-to-end ML pipelines often consist of ( i ) relational operators used for joining the input data, ( ii ) user defined functions used for feature extraction and vectorization, and ( iii ) linear algebra operators used for model training and cross-validation. Often, these pipelines need to scale out to large datasets. In this case, these pipelines are usually implemented on top of dataflow engines like Hadoop, Spark, or Flink. These dataflow engines implement relational operators on row-partitioned datasets. However, efficient linear algebra operators use block-partitioned matrices. As a result, pipelines combining both kinds of operators require rather expensive changes to the physical representation, in particular re-partitioning steps. In this paper, we investigate the potential of reducing shuffling costs by fusing relational and linear algebra operations into specialized physical operators. We present BlockJoin , a distributed join algorithm which directly produces block-partitioned results. To minimize shuffling costs, BlockJoin applies database techniques known from columnar processing, such as index-joins and late materialization, in the context of parallel dataflow engines. Our experimental evaluation shows speedups up to 6× and the skew resistance of BlockJoin compared to state-of-the-art pipelines implemented in Spark." @default.
- W2766504241 created "2017-11-10" @default.
- W2766504241 creator A5002030730 @default.
- W2766504241 creator A5002413906 @default.
- W2766504241 creator A5002932353 @default.
- W2766504241 creator A5028425403 @default.
- W2766504241 creator A5072690436 @default.
- W2766504241 date "2017-09-01" @default.
- W2766504241 modified "2023-10-02" @default.
- W2766504241 title "Blockjoin" @default.
- W2766504241 cites W1485587488 @default.
- W2766504241 cites W1947869163 @default.
- W2766504241 cites W1970372442 @default.
- W2766504241 cites W1993226606 @default.
- W2766504241 cites W1993433750 @default.
- W2766504241 cites W2003515726 @default.
- W2766504241 cites W2005112390 @default.
- W2766504241 cites W2014830756 @default.
- W2766504241 cites W2017383881 @default.
- W2766504241 cites W2032775418 @default.
- W2766504241 cites W2037168816 @default.
- W2766504241 cites W2058178853 @default.
- W2766504241 cites W2061601738 @default.
- W2766504241 cites W2064366207 @default.
- W2766504241 cites W2064619860 @default.
- W2766504241 cites W2075620950 @default.
- W2766504241 cites W2078945459 @default.
- W2766504241 cites W2102458936 @default.
- W2766504241 cites W2111708605 @default.
- W2766504241 cites W2116926680 @default.
- W2766504241 cites W2143108641 @default.
- W2766504241 cites W2146183750 @default.
- W2766504241 cites W2146635036 @default.
- W2766504241 cites W2167927436 @default.
- W2766504241 cites W2216541755 @default.
- W2766504241 cites W2411006959 @default.
- W2766504241 cites W2435648513 @default.
- W2766504241 cites W2535724050 @default.
- W2766504241 cites W2547190417 @default.
- W2766504241 cites W2998715488 @default.
- W2766504241 cites W3142730227 @default.
- W2766504241 cites W4238675359 @default.
- W2766504241 doi "https://doi.org/10.14778/3151106.3151110" @default.
- W2766504241 hasPublicationYear "2017" @default.
- W2766504241 type Work @default.
- W2766504241 sameAs 2766504241 @default.
- W2766504241 citedByCount "10" @default.
- W2766504241 countsByYear W27665042412018 @default.
- W2766504241 countsByYear W27665042412019 @default.
- W2766504241 countsByYear W27665042412020 @default.
- W2766504241 crossrefType "journal-article" @default.
- W2766504241 hasAuthorship W2766504241A5002030730 @default.
- W2766504241 hasAuthorship W2766504241A5002413906 @default.
- W2766504241 hasAuthorship W2766504241A5002932353 @default.
- W2766504241 hasAuthorship W2766504241A5028425403 @default.
- W2766504241 hasAuthorship W2766504241A5072690436 @default.
- W2766504241 hasConcept C127413603 @default.
- W2766504241 hasConcept C139352143 @default.
- W2766504241 hasConcept C151730666 @default.
- W2766504241 hasConcept C167927819 @default.
- W2766504241 hasConcept C173608175 @default.
- W2766504241 hasConcept C175309249 @default.
- W2766504241 hasConcept C199360897 @default.
- W2766504241 hasConcept C2524010 @default.
- W2766504241 hasConcept C2777210771 @default.
- W2766504241 hasConcept C2778692605 @default.
- W2766504241 hasConcept C2779343474 @default.
- W2766504241 hasConcept C2781215313 @default.
- W2766504241 hasConcept C33923547 @default.
- W2766504241 hasConcept C41008148 @default.
- W2766504241 hasConcept C80444323 @default.
- W2766504241 hasConcept C86803240 @default.
- W2766504241 hasConcept C87717796 @default.
- W2766504241 hasConcept C96324660 @default.
- W2766504241 hasConceptScore W2766504241C127413603 @default.
- W2766504241 hasConceptScore W2766504241C139352143 @default.
- W2766504241 hasConceptScore W2766504241C151730666 @default.
- W2766504241 hasConceptScore W2766504241C167927819 @default.
- W2766504241 hasConceptScore W2766504241C173608175 @default.
- W2766504241 hasConceptScore W2766504241C175309249 @default.
- W2766504241 hasConceptScore W2766504241C199360897 @default.
- W2766504241 hasConceptScore W2766504241C2524010 @default.
- W2766504241 hasConceptScore W2766504241C2777210771 @default.
- W2766504241 hasConceptScore W2766504241C2778692605 @default.
- W2766504241 hasConceptScore W2766504241C2779343474 @default.
- W2766504241 hasConceptScore W2766504241C2781215313 @default.
- W2766504241 hasConceptScore W2766504241C33923547 @default.
- W2766504241 hasConceptScore W2766504241C41008148 @default.
- W2766504241 hasConceptScore W2766504241C80444323 @default.
- W2766504241 hasConceptScore W2766504241C86803240 @default.
- W2766504241 hasConceptScore W2766504241C87717796 @default.
- W2766504241 hasConceptScore W2766504241C96324660 @default.
- W2766504241 hasIssue "13" @default.
- W2766504241 hasLocation W27665042411 @default.
- W2766504241 hasOpenAccess W2766504241 @default.
- W2766504241 hasPrimaryLocation W27665042411 @default.
- W2766504241 hasRelatedWork W1572523360 @default.
- W2766504241 hasRelatedWork W2047588290 @default.