Matches in SemOpenAlex for { <https://semopenalex.org/work/W2011421331> ?p ?o ?g. }
Showing items 1 to 97 of
97
with 100 items per page.
- W2011421331 abstract "MapReduce has become quite popular to analyse very large datasets. Nevertheless, users typically have to run their MapReduce jobs over the whole dataset every time the dataset is appended by new records. Some researchers have proposed to reuse the intermediate data produced by previous MapReduce jobs. However, existing works still have to read the whole dataset in order to identify which parts of the dataset changed. Furthermore, storing intermediate results is not suitable in some cases, because it can lead to a very high storage overhead. In this paper, we propose Itchy, a MapReduce-based system that employes a set of different techniques to efficiently deal with growing datasets. Itchy uses an optimizer to automatically choose the right technique to process a MapReduce job. The beauty of Itchy is that it does not have to read the whole dataset again to deal with new records. In more detail, Itchy keeps track of the provenance of intermediate results in order to selectively recompute intermediate results as required. But, if intermediate results are small or the computational cost of map functions is high, Itchy can automatically start storing intermediate results rather than the provenance information. Additionally, Itchy also supports the option of directly merging outputs from several jobs in cases where MapReduce jobs allow for such kind of processing. We evaluate Itchy using two different benchmarks and compare it with Hadoop and Incoop. The results show the superiority of Itchy over both baseline systems for processing incremental jobs. In terms of job runtime, Itchy is more than one order of magnitude faster than Hadoop (up to ~41 times faster) and Incoop (up to ~11 times faster)." @default.
- W2011421331 created "2016-06-24" @default.
- W2011421331 creator A5056475951 @default.
- W2011421331 creator A5075498978 @default.
- W2011421331 creator A5091334588 @default.
- W2011421331 date "2013-06-01" @default.
- W2011421331 modified "2023-09-26" @default.
- W2011421331 title "Elephant, Do Not Forget Everything! Efficient Processing of Growing Datasets" @default.
- W2011421331 cites W135267584 @default.
- W2011421331 cites W1845494277 @default.
- W2011421331 cites W1861377444 @default.
- W2011421331 cites W1963763518 @default.
- W2011421331 cites W1978162812 @default.
- W2011421331 cites W1981420413 @default.
- W2011421331 cites W1993892970 @default.
- W2011421331 cites W2010929544 @default.
- W2011421331 cites W2035543557 @default.
- W2011421331 cites W2091765165 @default.
- W2011421331 cites W2119528150 @default.
- W2011421331 cites W2129743596 @default.
- W2011421331 cites W2142031898 @default.
- W2011421331 cites W2144002928 @default.
- W2011421331 cites W2156546605 @default.
- W2011421331 cites W2163669329 @default.
- W2011421331 cites W2173213060 @default.
- W2011421331 cites W2206204485 @default.
- W2011421331 doi "https://doi.org/10.1109/cloud.2013.67" @default.
- W2011421331 hasPublicationYear "2013" @default.
- W2011421331 type Work @default.
- W2011421331 sameAs 2011421331 @default.
- W2011421331 citedByCount "4" @default.
- W2011421331 countsByYear W20114213312015 @default.
- W2011421331 countsByYear W20114213312016 @default.
- W2011421331 countsByYear W20114213312017 @default.
- W2011421331 crossrefType "proceedings-article" @default.
- W2011421331 hasAuthorship W2011421331A5056475951 @default.
- W2011421331 hasAuthorship W2011421331A5075498978 @default.
- W2011421331 hasAuthorship W2011421331A5091334588 @default.
- W2011421331 hasConcept C10138342 @default.
- W2011421331 hasConcept C111919701 @default.
- W2011421331 hasConcept C124101348 @default.
- W2011421331 hasConcept C162324750 @default.
- W2011421331 hasConcept C173608175 @default.
- W2011421331 hasConcept C177264268 @default.
- W2011421331 hasConcept C182306322 @default.
- W2011421331 hasConcept C18903297 @default.
- W2011421331 hasConcept C199360897 @default.
- W2011421331 hasConcept C206588197 @default.
- W2011421331 hasConcept C23123220 @default.
- W2011421331 hasConcept C2779960059 @default.
- W2011421331 hasConcept C41008148 @default.
- W2011421331 hasConcept C77088390 @default.
- W2011421331 hasConcept C86803240 @default.
- W2011421331 hasConcept C98045186 @default.
- W2011421331 hasConceptScore W2011421331C10138342 @default.
- W2011421331 hasConceptScore W2011421331C111919701 @default.
- W2011421331 hasConceptScore W2011421331C124101348 @default.
- W2011421331 hasConceptScore W2011421331C162324750 @default.
- W2011421331 hasConceptScore W2011421331C173608175 @default.
- W2011421331 hasConceptScore W2011421331C177264268 @default.
- W2011421331 hasConceptScore W2011421331C182306322 @default.
- W2011421331 hasConceptScore W2011421331C18903297 @default.
- W2011421331 hasConceptScore W2011421331C199360897 @default.
- W2011421331 hasConceptScore W2011421331C206588197 @default.
- W2011421331 hasConceptScore W2011421331C23123220 @default.
- W2011421331 hasConceptScore W2011421331C2779960059 @default.
- W2011421331 hasConceptScore W2011421331C41008148 @default.
- W2011421331 hasConceptScore W2011421331C77088390 @default.
- W2011421331 hasConceptScore W2011421331C86803240 @default.
- W2011421331 hasConceptScore W2011421331C98045186 @default.
- W2011421331 hasLocation W20114213311 @default.
- W2011421331 hasOpenAccess W2011421331 @default.
- W2011421331 hasPrimaryLocation W20114213311 @default.
- W2011421331 hasRelatedWork W2023220927 @default.
- W2011421331 hasRelatedWork W2091804750 @default.
- W2011421331 hasRelatedWork W2125883268 @default.
- W2011421331 hasRelatedWork W2140404742 @default.
- W2011421331 hasRelatedWork W2161832851 @default.
- W2011421331 hasRelatedWork W2334966306 @default.
- W2011421331 hasRelatedWork W2531045075 @default.
- W2011421331 hasRelatedWork W2541967753 @default.
- W2011421331 hasRelatedWork W2610047507 @default.
- W2011421331 hasRelatedWork W2730660683 @default.
- W2011421331 hasRelatedWork W2731426225 @default.
- W2011421331 hasRelatedWork W2754715639 @default.
- W2011421331 hasRelatedWork W2787107741 @default.
- W2011421331 hasRelatedWork W2912802084 @default.
- W2011421331 hasRelatedWork W2953036606 @default.
- W2011421331 hasRelatedWork W3150104100 @default.
- W2011421331 hasRelatedWork W3161378647 @default.
- W2011421331 hasRelatedWork W3175717605 @default.
- W2011421331 hasRelatedWork W2560079483 @default.
- W2011421331 hasRelatedWork W2775140700 @default.
- W2011421331 isParatext "false" @default.
- W2011421331 isRetracted "false" @default.
- W2011421331 magId "2011421331" @default.
- W2011421331 workType "article" @default.