Matches in SemOpenAlex for { <https://semopenalex.org/work/W2102294813> ?p ?o ?g. }
- W2102294813 endingPage "585" @default.
- W2102294813 startingPage "575" @default.
- W2102294813 abstract "Hadoop has become an attractive platform for large-scale data analytics. In this paper, we identify a major performance bottleneck of Hadoop: its lack of ability to colocate related data on the same set of nodes. To overcome this bottleneck, we introduce CoHadoop, a lightweight extension of Hadoop that allows applications to control where data are stored. In contrast to previous approaches, CoHadoop retains the flexibility of Hadoop in that it does not require users to convert their data to a certain format (e.g., a relational database or a specific file format). Instead, applications give hints to CoHadoop that some set of files are related and may be processed jointly; CoHadoop then tries to colocate these files for improved efficiency. Our approach is designed such that the strong fault tolerance properties of Hadoop are retained. Colocation can be used to improve the efficiency of many operations, including indexing, grouping, aggregation, columnar storage, joins, and sessionization. We conducted a detailed study of joins and sessionization in the context of log processing---a common use case for Hadoop---, and propose efficient map-only algorithms that exploit colocated data partitions. In our experiments, we observed that CoHadoop outperforms both plain Hadoop and previous work. In particular, our approach not only performs better than repartition-based algorithms, but also outperforms map-only algorithms that do exploit data partitioning but not colocation. 8." @default.
- W2102294813 created "2016-06-24" @default.
- W2102294813 creator A5021751187 @default.
- W2102294813 creator A5041227178 @default.
- W2102294813 creator A5060550621 @default.
- W2102294813 creator A5070638387 @default.
- W2102294813 creator A5074322354 @default.
- W2102294813 creator A5090562336 @default.
- W2102294813 date "2011-06-01" @default.
- W2102294813 modified "2023-10-09" @default.
- W2102294813 title "CoHadoop" @default.
- W2102294813 cites W2010279913 @default.
- W2102294813 cites W2043099794 @default.
- W2102294813 cites W2044490410 @default.
- W2102294813 cites W2098935637 @default.
- W2102294813 cites W2110086534 @default.
- W2102294813 cites W2114303224 @default.
- W2102294813 cites W2157270375 @default.
- W2102294813 cites W3005237218 @default.
- W2102294813 doi "https://doi.org/10.14778/2002938.2002943" @default.
- W2102294813 hasPublicationYear "2011" @default.
- W2102294813 type Work @default.
- W2102294813 sameAs 2102294813 @default.
- W2102294813 citedByCount "201" @default.
- W2102294813 countsByYear W21022948132012 @default.
- W2102294813 countsByYear W21022948132013 @default.
- W2102294813 countsByYear W21022948132014 @default.
- W2102294813 countsByYear W21022948132015 @default.
- W2102294813 countsByYear W21022948132016 @default.
- W2102294813 countsByYear W21022948132017 @default.
- W2102294813 countsByYear W21022948132018 @default.
- W2102294813 countsByYear W21022948132019 @default.
- W2102294813 countsByYear W21022948132020 @default.
- W2102294813 countsByYear W21022948132021 @default.
- W2102294813 countsByYear W21022948132022 @default.
- W2102294813 crossrefType "journal-article" @default.
- W2102294813 hasAuthorship W2102294813A5021751187 @default.
- W2102294813 hasAuthorship W2102294813A5041227178 @default.
- W2102294813 hasAuthorship W2102294813A5060550621 @default.
- W2102294813 hasAuthorship W2102294813A5070638387 @default.
- W2102294813 hasAuthorship W2102294813A5074322354 @default.
- W2102294813 hasAuthorship W2102294813A5090562336 @default.
- W2102294813 hasConcept C105795698 @default.
- W2102294813 hasConcept C124101348 @default.
- W2102294813 hasConcept C149635348 @default.
- W2102294813 hasConcept C151730666 @default.
- W2102294813 hasConcept C165696696 @default.
- W2102294813 hasConcept C177264268 @default.
- W2102294813 hasConcept C199360897 @default.
- W2102294813 hasConcept C23123220 @default.
- W2102294813 hasConcept C2778692605 @default.
- W2102294813 hasConcept C2779343474 @default.
- W2102294813 hasConcept C2780513914 @default.
- W2102294813 hasConcept C2780598303 @default.
- W2102294813 hasConcept C33923547 @default.
- W2102294813 hasConcept C38652104 @default.
- W2102294813 hasConcept C41008148 @default.
- W2102294813 hasConcept C75165309 @default.
- W2102294813 hasConcept C75684735 @default.
- W2102294813 hasConcept C77088390 @default.
- W2102294813 hasConcept C86803240 @default.
- W2102294813 hasConceptScore W2102294813C105795698 @default.
- W2102294813 hasConceptScore W2102294813C124101348 @default.
- W2102294813 hasConceptScore W2102294813C149635348 @default.
- W2102294813 hasConceptScore W2102294813C151730666 @default.
- W2102294813 hasConceptScore W2102294813C165696696 @default.
- W2102294813 hasConceptScore W2102294813C177264268 @default.
- W2102294813 hasConceptScore W2102294813C199360897 @default.
- W2102294813 hasConceptScore W2102294813C23123220 @default.
- W2102294813 hasConceptScore W2102294813C2778692605 @default.
- W2102294813 hasConceptScore W2102294813C2779343474 @default.
- W2102294813 hasConceptScore W2102294813C2780513914 @default.
- W2102294813 hasConceptScore W2102294813C2780598303 @default.
- W2102294813 hasConceptScore W2102294813C33923547 @default.
- W2102294813 hasConceptScore W2102294813C38652104 @default.
- W2102294813 hasConceptScore W2102294813C41008148 @default.
- W2102294813 hasConceptScore W2102294813C75165309 @default.
- W2102294813 hasConceptScore W2102294813C75684735 @default.
- W2102294813 hasConceptScore W2102294813C77088390 @default.
- W2102294813 hasConceptScore W2102294813C86803240 @default.
- W2102294813 hasIssue "9" @default.
- W2102294813 hasLocation W21022948131 @default.
- W2102294813 hasOpenAccess W2102294813 @default.
- W2102294813 hasPrimaryLocation W21022948131 @default.
- W2102294813 hasRelatedWork W2118290651 @default.
- W2102294813 hasRelatedWork W2145189743 @default.
- W2102294813 hasRelatedWork W2153592001 @default.
- W2102294813 hasRelatedWork W2168002671 @default.
- W2102294813 hasRelatedWork W2350980814 @default.
- W2102294813 hasRelatedWork W2352031993 @default.
- W2102294813 hasRelatedWork W2734587838 @default.
- W2102294813 hasRelatedWork W2938333471 @default.
- W2102294813 hasRelatedWork W4200605018 @default.
- W2102294813 hasRelatedWork W980041598 @default.
- W2102294813 hasVolume "4" @default.
- W2102294813 isParatext "false" @default.
- W2102294813 isRetracted "false" @default.
- W2102294813 magId "2102294813" @default.