Matches in SemOpenAlex for { <https://semopenalex.org/work/W2247123384> ?p ?o ?g. }
Showing items 1 to 93 of 93 with 100 items per page.
- W2247123384 endingPage "2007" @default.
- W2247123384 startingPage "2004" @default.
- W2247123384 abstract "Analysts report spending upwards of 80% of their time on problems in data cleaning. The data cleaning process is inherently iterative, with evolving cleaning workflows that start with basic exploratory data analysis on small samples of dirty data, then refine analysis with more sophisticated/expensive cleaning operators (e.g., crowdsourcing), and finally apply the insights to a full dataset. While an analyst often knows at a logical level what operations need to be done, they often have to manage a large search space of physical operators and parameters. We present Wisteria, a system designed to support the iterative development and optimization of data cleaning workflows, especially ones that utilize the crowd. Wisteria separates logical operations from physical implementations, and driven by analyst feedback, suggests optimizations and/or replacements to the analyst's choice of physical implementation. We highlight research challenges in sampling, in-flight operator replacement, and crowdsourcing. We overview the system architecture and these techniques, then provide a demonstration designed to showcase how Wisteria can improve iterative data analysis and cleaning. The code is available at: http://www.sampleclean.org." @default.
- W2247123384 created "2016-06-24" @default.
- W2247123384 creator A5018480426 @default.
- W2247123384 creator A5037061685 @default.
- W2247123384 creator A5049016095 @default.
- W2247123384 creator A5066141913 @default.
- W2247123384 creator A5090643027 @default.
- W2247123384 date "2015-08-01" @default.
- W2247123384 modified "2023-09-26" @default.
- W2247123384 title "Wisteria" @default.
- W2247123384 cites W2032655922 @default.
- W2247123384 cites W2164187405 @default.
- W2247123384 cites W4291720518 @default.
- W2247123384 doi "https://doi.org/10.14778/2824032.2824122" @default.
- W2247123384 hasPublicationYear "2015" @default.
- W2247123384 type Work @default.
- W2247123384 sameAs 2247123384 @default.
- W2247123384 citedByCount "38" @default.
- W2247123384 countsByYear W22471233842015 @default.
- W2247123384 countsByYear W22471233842016 @default.
- W2247123384 countsByYear W22471233842017 @default.
- W2247123384 countsByYear W22471233842018 @default.
- W2247123384 countsByYear W22471233842019 @default.
- W2247123384 countsByYear W22471233842020 @default.
- W2247123384 countsByYear W22471233842021 @default.
- W2247123384 countsByYear W22471233842022 @default.
- W2247123384 crossrefType "journal-article" @default.
- W2247123384 hasAuthorship W2247123384A5018480426 @default.
- W2247123384 hasAuthorship W2247123384A5037061685 @default.
- W2247123384 hasAuthorship W2247123384A5049016095 @default.
- W2247123384 hasAuthorship W2247123384A5066141913 @default.
- W2247123384 hasAuthorship W2247123384A5090643027 @default.
- W2247123384 hasConcept C104317684 @default.
- W2247123384 hasConcept C115903868 @default.
- W2247123384 hasConcept C124101348 @default.
- W2247123384 hasConcept C136764020 @default.
- W2247123384 hasConcept C143587482 @default.
- W2247123384 hasConcept C158448853 @default.
- W2247123384 hasConcept C17020691 @default.
- W2247123384 hasConcept C177212765 @default.
- W2247123384 hasConcept C177264268 @default.
- W2247123384 hasConcept C185592680 @default.
- W2247123384 hasConcept C199360897 @default.
- W2247123384 hasConcept C2522767166 @default.
- W2247123384 hasConcept C26713055 @default.
- W2247123384 hasConcept C2776760102 @default.
- W2247123384 hasConcept C41008148 @default.
- W2247123384 hasConcept C55493867 @default.
- W2247123384 hasConcept C62230096 @default.
- W2247123384 hasConcept C77088390 @default.
- W2247123384 hasConcept C86339819 @default.
- W2247123384 hasConcept C98045186 @default.
- W2247123384 hasConceptScore W2247123384C104317684 @default.
- W2247123384 hasConceptScore W2247123384C115903868 @default.
- W2247123384 hasConceptScore W2247123384C124101348 @default.
- W2247123384 hasConceptScore W2247123384C136764020 @default.
- W2247123384 hasConceptScore W2247123384C143587482 @default.
- W2247123384 hasConceptScore W2247123384C158448853 @default.
- W2247123384 hasConceptScore W2247123384C17020691 @default.
- W2247123384 hasConceptScore W2247123384C177212765 @default.
- W2247123384 hasConceptScore W2247123384C177264268 @default.
- W2247123384 hasConceptScore W2247123384C185592680 @default.
- W2247123384 hasConceptScore W2247123384C199360897 @default.
- W2247123384 hasConceptScore W2247123384C2522767166 @default.
- W2247123384 hasConceptScore W2247123384C26713055 @default.
- W2247123384 hasConceptScore W2247123384C2776760102 @default.
- W2247123384 hasConceptScore W2247123384C41008148 @default.
- W2247123384 hasConceptScore W2247123384C55493867 @default.
- W2247123384 hasConceptScore W2247123384C62230096 @default.
- W2247123384 hasConceptScore W2247123384C77088390 @default.
- W2247123384 hasConceptScore W2247123384C86339819 @default.
- W2247123384 hasConceptScore W2247123384C98045186 @default.
- W2247123384 hasIssue "12" @default.
- W2247123384 hasLocation W22471233841 @default.
- W2247123384 hasOpenAccess W2247123384 @default.
- W2247123384 hasPrimaryLocation W22471233841 @default.
- W2247123384 hasRelatedWork W112161299 @default.
- W2247123384 hasRelatedWork W1703951723 @default.
- W2247123384 hasRelatedWork W2294860147 @default.
- W2247123384 hasRelatedWork W2754696427 @default.
- W2247123384 hasRelatedWork W2788632858 @default.
- W2247123384 hasRelatedWork W2803870934 @default.
- W2247123384 hasRelatedWork W2945404560 @default.
- W2247123384 hasRelatedWork W3089325901 @default.
- W2247123384 hasRelatedWork W3205688938 @default.
- W2247123384 hasRelatedWork W3211370011 @default.
- W2247123384 hasVolume "8" @default.
- W2247123384 isParatext "false" @default.
- W2247123384 isRetracted "false" @default.
- W2247123384 magId "2247123384" @default.
- W2247123384 workType "article" @default.
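
Each item above follows the same shape: subject, predicate, object, then the graph name after `@`. As a minimal sketch, assuming that line shape holds for every item, the listing could be parsed into tuples and grouped by predicate as follows. The names `parse_line` and `group_by_predicate` are hypothetical helpers introduced for illustration and are not part of SemOpenAlex.

```python
import re
from collections import defaultdict

# Assumed item shape: "- <subject> <predicate> <object> @<graph>."
# The object may be a quoted literal containing spaces, so it is matched greedily.
LINE_RE = re.compile(r'^-\s+(\S+)\s+(\S+)\s+(.*)\s+@(\S+)\.$')


def parse_line(line):
    """Return (subject, predicate, object, graph), or None if the line does not match."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    subject, predicate, obj, graph = match.groups()
    # Drop the surrounding double quotes from literal values.
    if len(obj) >= 2 and obj.startswith('"') and obj.endswith('"'):
        obj = obj[1:-1]
    return subject, predicate, obj, graph


def group_by_predicate(lines):
    """Map each predicate to the list of its objects, skipping non-matching lines."""
    grouped = defaultdict(list)
    for line in lines:
        parsed = parse_line(line)
        if parsed is not None:
            _, predicate, obj, _ = parsed
            grouped[predicate].append(obj)
    return dict(grouped)


sample = [
    '- W2247123384 title "Wisteria" @default.',
    '- W2247123384 cites W2032655922 @default.',
    '- W2247123384 cites W2164187405 @default.',
    'Showing items 1 to 93 of 93 with 100 items per page.',
]
grouped = group_by_predicate(sample)
print(grouped["title"])
print(grouped["cites"])
```

Header lines such as the "Showing items" line do not match the pattern and are skipped rather than raising an error.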