Matches in SemOpenAlex for { <https://semopenalex.org/work/W4283709512> ?p ?o ?g. }
Showing items 1 to 95 of
95
with 100 items per page.
- W4283709512 endingPage "6554" @default.
- W4283709512 startingPage "6554" @default.
- W4283709512 abstract "In the era of data deluge, Big Data gradually offers numerous opportunities, but also poses significant challenges to conventional data processing and analysis methods. MapReduce has become a prominent parallel and distributed programming model for efficiently handling such massive datasets. One of the most elementary and extensive operations in MapReduce is the join operation. These joins have become ever more complex and expensive in the context of skewed data, in which some common join keys appear with a greater frequency than others. Some of the reduction tasks processing these join keys will finish later than others; thus, the benefits of parallel computation become meaningless. Some studies on the problem of skew joins have been conducted, but an adequate and systematic comparison in the Spark environment has not been presented. They have only provided experimental tests, so there is still a shortage of representations of mathematical models on which skew-join algorithms can be compared. This study is, therefore, designed to provide the theoretical and practical basics for evaluating skew-join strategies for large-scale datasets with MapReduce and Spark—both analytically with cost models and practically with experiments. The objectives of the study are, first, to present the implementation of prominent skew-join algorithms in Spark, second, to evaluate the algorithms by using cost models and experiments, and third, to show the advantages and disadvantages of each one and to recommend strategies for the better use of skew joins in Spark." @default.
- W4283709512 created "2022-06-30" @default.
- W4283709512 creator A5007514576 @default.
- W4283709512 creator A5041262440 @default.
- W4283709512 creator A5056921599 @default.
- W4283709512 creator A5072446309 @default.
- W4283709512 date "2022-06-28" @default.
- W4283709512 modified "2023-09-30" @default.
- W4283709512 title "Comparative Analysis of Skew-Join Strategies for Large-Scale Datasets with MapReduce and Spark" @default.
- W4283709512 cites W1969186495 @default.
- W4283709512 cites W2061601738 @default.
- W4283709512 cites W2075620950 @default.
- W4283709512 cites W2080131844 @default.
- W4283709512 cites W2121456247 @default.
- W4283709512 cites W2143377704 @default.
- W4283709512 cites W2173213060 @default.
- W4283709512 cites W2226599271 @default.
- W4283709512 cites W2277471751 @default.
- W4283709512 cites W2463897726 @default.
- W4283709512 cites W2537849534 @default.
- W4283709512 cites W2900972923 @default.
- W4283709512 cites W2972203418 @default.
- W4283709512 cites W3103210770 @default.
- W4283709512 doi "https://doi.org/10.3390/app12136554" @default.
- W4283709512 hasPublicationYear "2022" @default.
- W4283709512 type Work @default.
- W4283709512 citedByCount "1" @default.
- W4283709512 countsByYear W42837095122023 @default.
- W4283709512 crossrefType "journal-article" @default.
- W4283709512 hasAuthorship W4283709512A5007514576 @default.
- W4283709512 hasAuthorship W4283709512A5041262440 @default.
- W4283709512 hasAuthorship W4283709512A5056921599 @default.
- W4283709512 hasAuthorship W4283709512A5072446309 @default.
- W4283709512 hasBestOaLocation W42837095121 @default.
- W4283709512 hasConcept C11413529 @default.
- W4283709512 hasConcept C114614502 @default.
- W4283709512 hasConcept C121332964 @default.
- W4283709512 hasConcept C124101348 @default.
- W4283709512 hasConcept C151730666 @default.
- W4283709512 hasConcept C199360897 @default.
- W4283709512 hasConcept C2776124973 @default.
- W4283709512 hasConcept C2778692605 @default.
- W4283709512 hasConcept C2778755073 @default.
- W4283709512 hasConcept C2779343474 @default.
- W4283709512 hasConcept C2781215313 @default.
- W4283709512 hasConcept C33923547 @default.
- W4283709512 hasConcept C41008148 @default.
- W4283709512 hasConcept C43711488 @default.
- W4283709512 hasConcept C45374587 @default.
- W4283709512 hasConcept C62520636 @default.
- W4283709512 hasConcept C75684735 @default.
- W4283709512 hasConcept C76155785 @default.
- W4283709512 hasConcept C77088390 @default.
- W4283709512 hasConcept C86803240 @default.
- W4283709512 hasConceptScore W4283709512C11413529 @default.
- W4283709512 hasConceptScore W4283709512C114614502 @default.
- W4283709512 hasConceptScore W4283709512C121332964 @default.
- W4283709512 hasConceptScore W4283709512C124101348 @default.
- W4283709512 hasConceptScore W4283709512C151730666 @default.
- W4283709512 hasConceptScore W4283709512C199360897 @default.
- W4283709512 hasConceptScore W4283709512C2776124973 @default.
- W4283709512 hasConceptScore W4283709512C2778692605 @default.
- W4283709512 hasConceptScore W4283709512C2778755073 @default.
- W4283709512 hasConceptScore W4283709512C2779343474 @default.
- W4283709512 hasConceptScore W4283709512C2781215313 @default.
- W4283709512 hasConceptScore W4283709512C33923547 @default.
- W4283709512 hasConceptScore W4283709512C41008148 @default.
- W4283709512 hasConceptScore W4283709512C43711488 @default.
- W4283709512 hasConceptScore W4283709512C45374587 @default.
- W4283709512 hasConceptScore W4283709512C62520636 @default.
- W4283709512 hasConceptScore W4283709512C75684735 @default.
- W4283709512 hasConceptScore W4283709512C76155785 @default.
- W4283709512 hasConceptScore W4283709512C77088390 @default.
- W4283709512 hasConceptScore W4283709512C86803240 @default.
- W4283709512 hasIssue "13" @default.
- W4283709512 hasLocation W42837095121 @default.
- W4283709512 hasLocation W42837095122 @default.
- W4283709512 hasOpenAccess W4283709512 @default.
- W4283709512 hasPrimaryLocation W42837095121 @default.
- W4283709512 hasRelatedWork W1608248699 @default.
- W4283709512 hasRelatedWork W2088799466 @default.
- W4283709512 hasRelatedWork W2141636073 @default.
- W4283709512 hasRelatedWork W2159860101 @default.
- W4283709512 hasRelatedWork W2949447338 @default.
- W4283709512 hasRelatedWork W3110410193 @default.
- W4283709512 hasRelatedWork W4283709512 @default.
- W4283709512 hasRelatedWork W575918167 @default.
- W4283709512 hasRelatedWork W2185115783 @default.
- W4283709512 hasRelatedWork W2592917304 @default.
- W4283709512 hasVolume "12" @default.
- W4283709512 isParatext "false" @default.
- W4283709512 isRetracted "false" @default.
- W4283709512 workType "article" @default.