Matches in SemOpenAlex for { <https://semopenalex.org/work/W2902303826> ?p ?o ?g. }
- W2902303826 abstract "We introduce a sampling framework to support approximate computing with estimated error bounds in Spark. Our framework allows sampling to be performed at the beginning of a sequence of multiple transformations ending in an aggregation operation. The framework constructs a data provenance tree as the computation proceeds, then combines the tree with multi-stage sampling and population estimation theories to compute error bounds for the aggregation. When information about output keys are available early, the framework can also use adaptive stratified reservoir sampling to avoid (or reduce) key losses in the final output and to achieve more consistent error bounds across popular and rare keys. Finally, the framework includes an algorithm to dynamically choose sampling rates to meet user specified constraints on the CDF of error bounds in the outputs. We have implemented a prototype of our framework called ApproxSpark, and used it to implement five approximate applications from different domains. Evaluation results show that ApproxSpark can (a) significantly reduce execution time if users can tolerate small amounts of uncertainties and, in many cases, loss of rare keys, and (b) automatically find sampling rates to meet user specified constraints on error bounds. We also explore and discuss extensively trade-offs between sampling rates, execution time, accuracy and key loss." @default.
- W2902303826 created "2018-12-11" @default.
- W2902303826 creator A5029037683 @default.
- W2902303826 creator A5032153685 @default.
- W2902303826 creator A5035928818 @default.
- W2902303826 creator A5072173713 @default.
- W2902303826 date "2018-12-05" @default.
- W2902303826 modified "2023-09-27" @default.
- W2902303826 title "Approximation with Error Bounds in Spark" @default.
- W2902303826 cites W1501500081 @default.
- W2902303826 cites W1561047078 @default.
- W2902303826 cites W1791348790 @default.
- W2902303826 cites W183063244 @default.
- W2902303826 cites W1854214752 @default.
- W2902303826 cites W1965706740 @default.
- W2902303826 cites W1982861695 @default.
- W2902303826 cites W1987034518 @default.
- W2902303826 cites W1987861412 @default.
- W2902303826 cites W2026844864 @default.
- W2902303826 cites W2038412523 @default.
- W2902303826 cites W2065196577 @default.
- W2902303826 cites W2071989194 @default.
- W2902303826 cites W2103201239 @default.
- W2902303826 cites W2103212156 @default.
- W2902303826 cites W2108646579 @default.
- W2902303826 cites W2119400430 @default.
- W2902303826 cites W2119885577 @default.
- W2902303826 cites W2123442489 @default.
- W2902303826 cites W2131166445 @default.
- W2902303826 cites W2131975293 @default.
- W2902303826 cites W2147869723 @default.
- W2902303826 cites W2149140091 @default.
- W2902303826 cites W2150915951 @default.
- W2902303826 cites W2152029707 @default.
- W2902303826 cites W2173213060 @default.
- W2902303826 cites W2179162132 @default.
- W2902303826 cites W2247317079 @default.
- W2902303826 cites W2265166184 @default.
- W2902303826 cites W2293019624 @default.
- W2902303826 cites W2295338537 @default.
- W2902303826 cites W2296677182 @default.
- W2902303826 cites W2319876296 @default.
- W2902303826 cites W2500111820 @default.
- W2902303826 cites W2516072053 @default.
- W2902303826 cites W2578054369 @default.
- W2902303826 cites W2772222064 @default.
- W2902303826 cites W2798277312 @default.
- W2902303826 cites W2941475075 @default.
- W2902303826 cites W2975606313 @default.
- W2902303826 doi "https://doi.org/10.7282/t3cn77js" @default.
- W2902303826 hasPublicationYear "2018" @default.
- W2902303826 type Work @default.
- W2902303826 sameAs 2902303826 @default.
- W2902303826 citedByCount "0" @default.
- W2902303826 crossrefType "posted-content" @default.
- W2902303826 hasAuthorship W2902303826A5029037683 @default.
- W2902303826 hasAuthorship W2902303826A5032153685 @default.
- W2902303826 hasAuthorship W2902303826A5035928818 @default.
- W2902303826 hasAuthorship W2902303826A5072173713 @default.
- W2902303826 hasConcept C105795698 @default.
- W2902303826 hasConcept C106131492 @default.
- W2902303826 hasConcept C113174947 @default.
- W2902303826 hasConcept C11413529 @default.
- W2902303826 hasConcept C123614077 @default.
- W2902303826 hasConcept C134306372 @default.
- W2902303826 hasConcept C140779682 @default.
- W2902303826 hasConcept C19499675 @default.
- W2902303826 hasConcept C199360897 @default.
- W2902303826 hasConcept C26517878 @default.
- W2902303826 hasConcept C2781215313 @default.
- W2902303826 hasConcept C2781395549 @default.
- W2902303826 hasConcept C31972630 @default.
- W2902303826 hasConcept C33923547 @default.
- W2902303826 hasConcept C38652104 @default.
- W2902303826 hasConcept C41008148 @default.
- W2902303826 hasConcept C45374587 @default.
- W2902303826 hasConcept C49898467 @default.
- W2902303826 hasConceptScore W2902303826C105795698 @default.
- W2902303826 hasConceptScore W2902303826C106131492 @default.
- W2902303826 hasConceptScore W2902303826C113174947 @default.
- W2902303826 hasConceptScore W2902303826C11413529 @default.
- W2902303826 hasConceptScore W2902303826C123614077 @default.
- W2902303826 hasConceptScore W2902303826C134306372 @default.
- W2902303826 hasConceptScore W2902303826C140779682 @default.
- W2902303826 hasConceptScore W2902303826C19499675 @default.
- W2902303826 hasConceptScore W2902303826C199360897 @default.
- W2902303826 hasConceptScore W2902303826C26517878 @default.
- W2902303826 hasConceptScore W2902303826C2781215313 @default.
- W2902303826 hasConceptScore W2902303826C2781395549 @default.
- W2902303826 hasConceptScore W2902303826C31972630 @default.
- W2902303826 hasConceptScore W2902303826C33923547 @default.
- W2902303826 hasConceptScore W2902303826C38652104 @default.
- W2902303826 hasConceptScore W2902303826C41008148 @default.
- W2902303826 hasConceptScore W2902303826C45374587 @default.
- W2902303826 hasConceptScore W2902303826C49898467 @default.
- W2902303826 hasLocation W29023038261 @default.
- W2902303826 hasOpenAccess W2902303826 @default.
- W2902303826 hasPrimaryLocation W29023038261 @default.
- W2902303826 hasRelatedWork W103440557 @default.
- W2902303826 hasRelatedWork W1968829657 @default.