Matches in SemOpenAlex for { <https://semopenalex.org/work/W2808895360> ?p ?o ?g. }
Showing items 1 to 60 of 60 with 100 items per page.
- W2808895360 abstract "An abundance of data in many disciplines of science, engineering, national security, health care, and business has led to the emerging field of Big Data Analytics that run in a cloud computing environment. To process massive quantities of data in the cloud, developers leverage Data-Intensive Scalable Computing (DISC) systems such as Google's MapReduce, Hadoop, and Spark. Currently, developers do not have easy means to debug DISC applications. The use of cloud computing makes application development feel more like batch jobs and the nature of debugging is therefore post-mortem. Developers of big data applications write code that implements a data processing pipeline and test it on their local workstation with a small sample data, downloaded from a TB-scale data warehouse. They cross fingers and hope that the program works in the expensive production cloud. When a job fails or they get a suspicious result, data scientists spend hours guessing at the source of the error, digging through post-mortem logs. In such cases, the data scientists may want to pinpoint the root cause of errors by investigating a subset of corresponding input records. The vision of my work is to provide interactive, real-time and automated debugging services for big data processing programs in modern DISC systems with minimum performance impact. My work investigates the following research questions in the context of big data analytics: (1) What are the necessary debugging primitives for interactive big data processing? (2) What scalable fault localization algorithms are needed to help the user to localize and characterize the root causes of errors? (3) How can we improve testing efficiency during iterative development of DISC applications by reasoning the semantics of dataflow operators and user-defined functions used inside dataflow operators in tandem? 
To answer these questions, we synthesize and innovate ideas from software engineering, big data systems, and program analysis, and coordinate innovations across the software stack from the user-facing API all the way down to the systems infrastructure." @default.
- W2808895360 created "2018-06-29" @default.
- W2808895360 creator A5003747461 @default.
- W2808895360 date "2018-05-27" @default.
- W2808895360 modified "2023-09-23" @default.
- W2808895360 title "Interactive and automated debugging for big data analytics" @default.
- W2808895360 cites W1480909796 @default.
- W2808895360 cites W2036196659 @default.
- W2808895360 cites W2170224888 @default.
- W2808895360 cites W2384569204 @default.
- W2808895360 cites W2758552291 @default.
- W2808895360 cites W4247966332 @default.
- W2808895360 cites W4365786623 @default.
- W2808895360 doi "https://doi.org/10.1145/3183440.3190334" @default.
- W2808895360 hasPublicationYear "2018" @default.
- W2808895360 type Work @default.
- W2808895360 sameAs 2808895360 @default.
- W2808895360 citedByCount "3" @default.
- W2808895360 countsByYear W28088953602020 @default.
- W2808895360 countsByYear W28088953602021 @default.
- W2808895360 countsByYear W28088953602022 @default.
- W2808895360 crossrefType "proceedings-article" @default.
- W2808895360 hasAuthorship W2808895360A5003747461 @default.
- W2808895360 hasBestOaLocation W28088953601 @default.
- W2808895360 hasConcept C107457646 @default.
- W2808895360 hasConcept C115903868 @default.
- W2808895360 hasConcept C124101348 @default.
- W2808895360 hasConcept C168065819 @default.
- W2808895360 hasConcept C199360897 @default.
- W2808895360 hasConcept C2522767166 @default.
- W2808895360 hasConcept C41008148 @default.
- W2808895360 hasConcept C75684735 @default.
- W2808895360 hasConcept C79158427 @default.
- W2808895360 hasConceptScore W2808895360C107457646 @default.
- W2808895360 hasConceptScore W2808895360C115903868 @default.
- W2808895360 hasConceptScore W2808895360C124101348 @default.
- W2808895360 hasConceptScore W2808895360C168065819 @default.
- W2808895360 hasConceptScore W2808895360C199360897 @default.
- W2808895360 hasConceptScore W2808895360C2522767166 @default.
- W2808895360 hasConceptScore W2808895360C41008148 @default.
- W2808895360 hasConceptScore W2808895360C75684735 @default.
- W2808895360 hasConceptScore W2808895360C79158427 @default.
- W2808895360 hasFunder F4320306076 @default.
- W2808895360 hasLocation W28088953601 @default.
- W2808895360 hasOpenAccess W2808895360 @default.
- W2808895360 hasPrimaryLocation W28088953601 @default.
- W2808895360 hasRelatedWork W1498982577 @default.
- W2808895360 hasRelatedWork W1544341893 @default.
- W2808895360 hasRelatedWork W1578778518 @default.
- W2808895360 hasRelatedWork W1579177548 @default.
- W2808895360 hasRelatedWork W1587224678 @default.
- W2808895360 hasRelatedWork W1601811574 @default.
- W2808895360 hasRelatedWork W181842068 @default.
- W2808895360 hasRelatedWork W2025670560 @default.
- W2808895360 hasRelatedWork W2135396778 @default.
- W2808895360 hasRelatedWork W4234604123 @default.
- W2808895360 isParatext "false" @default.
- W2808895360 isRetracted "false" @default.
- W2808895360 magId "2808895360" @default.
- W2808895360 workType "article" @default.