Matches in SemOpenAlex for { <https://semopenalex.org/work/W2897056153> ?p ?o ?g. }
- W2897056153 abstract "Data preprocessing techniques are devoted to correcting or alleviating errors in data. Discretization and feature selection are two of the most widely used data preprocessing techniques. Although we can find many proposals for static Big Data preprocessing, there is little research devoted to the continuous Big Data problem. Apache Flink is a recent and novel Big Data framework, following the MapReduce paradigm, focused on distributed stream and batch data processing. In this paper we propose a data stream library for Big Data preprocessing, named DPASF, under Apache Flink. We have implemented six of the most popular data preprocessing algorithms, three for discretization and the rest for feature selection. The algorithms have been tested using two Big Data datasets. Experimental results show that preprocessing can not only reduce the size of the data, but also maintain or even improve the original accuracy in a short time. DPASF contains useful algorithms when dealing with Big Data data streams. The preprocessing algorithms included in the library are able to tackle Big Datasets efficiently and to correct imperfections in the data." @default.
- W2897056153 created "2018-10-26" @default.
- W2897056153 creator A5023173574 @default.
- W2897056153 creator A5045016749 @default.
- W2897056153 creator A5052686664 @default.
- W2897056153 creator A5078909305 @default.
- W2897056153 date "2018-10-14" @default.
- W2897056153 modified "2023-10-08" @default.
- W2897056153 title "DPASF: A Flink Library for Streaming Data preprocessing" @default.
- W2897056153 cites W1553806514 @default.
- W2897056153 cites W1570448133 @default.
- W2897056153 cites W1585387811 @default.
- W2897056153 cites W1661871015 @default.
- W2897056153 cites W2000180597 @default.
- W2897056153 cites W2032427901 @default.
- W2897056153 cites W2049092228 @default.
- W2897056153 cites W2110632376 @default.
- W2897056153 cites W2119885577 @default.
- W2897056153 cites W2126623642 @default.
- W2897056153 cites W2136051823 @default.
- W2897056153 cites W2170120409 @default.
- W2897056153 cites W2173440868 @default.
- W2897056153 cites W2206299551 @default.
- W2897056153 cites W2524620548 @default.
- W2897056153 cites W2537734429 @default.
- W2897056153 cites W2566979091 @default.
- W2897056153 cites W2588336250 @default.
- W2897056153 cites W2792744510 @default.
- W2897056153 cites W2805172956 @default.
- W2897056153 cites W2901263649 @default.
- W2897056153 cites W3097993951 @default.
- W2897056153 hasPublicationYear "2018" @default.
- W2897056153 type Work @default.
- W2897056153 sameAs 2897056153 @default.
- W2897056153 citedByCount "0" @default.
- W2897056153 crossrefType "posted-content" @default.
- W2897056153 hasAuthorship W2897056153A5023173574 @default.
- W2897056153 hasAuthorship W2897056153A5045016749 @default.
- W2897056153 hasAuthorship W2897056153A5052686664 @default.
- W2897056153 hasAuthorship W2897056153A5078909305 @default.
- W2897056153 hasConcept C10551718 @default.
- W2897056153 hasConcept C124101348 @default.
- W2897056153 hasConcept C134306372 @default.
- W2897056153 hasConcept C138885662 @default.
- W2897056153 hasConcept C148483581 @default.
- W2897056153 hasConcept C154945302 @default.
- W2897056153 hasConcept C2776401178 @default.
- W2897056153 hasConcept C2778484313 @default.
- W2897056153 hasConcept C33923547 @default.
- W2897056153 hasConcept C34736171 @default.
- W2897056153 hasConcept C41008148 @default.
- W2897056153 hasConcept C41895202 @default.
- W2897056153 hasConcept C73000952 @default.
- W2897056153 hasConcept C75684735 @default.
- W2897056153 hasConcept C76155785 @default.
- W2897056153 hasConcept C81917197 @default.
- W2897056153 hasConcept C89198739 @default.
- W2897056153 hasConceptScore W2897056153C10551718 @default.
- W2897056153 hasConceptScore W2897056153C124101348 @default.
- W2897056153 hasConceptScore W2897056153C134306372 @default.
- W2897056153 hasConceptScore W2897056153C138885662 @default.
- W2897056153 hasConceptScore W2897056153C148483581 @default.
- W2897056153 hasConceptScore W2897056153C154945302 @default.
- W2897056153 hasConceptScore W2897056153C2776401178 @default.
- W2897056153 hasConceptScore W2897056153C2778484313 @default.
- W2897056153 hasConceptScore W2897056153C33923547 @default.
- W2897056153 hasConceptScore W2897056153C34736171 @default.
- W2897056153 hasConceptScore W2897056153C41008148 @default.
- W2897056153 hasConceptScore W2897056153C41895202 @default.
- W2897056153 hasConceptScore W2897056153C73000952 @default.
- W2897056153 hasConceptScore W2897056153C75684735 @default.
- W2897056153 hasConceptScore W2897056153C76155785 @default.
- W2897056153 hasConceptScore W2897056153C81917197 @default.
- W2897056153 hasConceptScore W2897056153C89198739 @default.
- W2897056153 hasLocation W28970561531 @default.
- W2897056153 hasOpenAccess W2897056153 @default.
- W2897056153 hasPrimaryLocation W28970561531 @default.
- W2897056153 hasRelatedWork W197852767 @default.
- W2897056153 hasRelatedWork W2051243911 @default.
- W2897056153 hasRelatedWork W2205964444 @default.
- W2897056153 hasRelatedWork W2578264933 @default.
- W2897056153 hasRelatedWork W2580983921 @default.
- W2897056153 hasRelatedWork W2604456060 @default.
- W2897056153 hasRelatedWork W2735382913 @default.
- W2897056153 hasRelatedWork W2739512903 @default.
- W2897056153 hasRelatedWork W2755594639 @default.
- W2897056153 hasRelatedWork W2783114103 @default.
- W2897056153 hasRelatedWork W2783605012 @default.
- W2897056153 hasRelatedWork W2794873313 @default.
- W2897056153 hasRelatedWork W2915409139 @default.
- W2897056153 hasRelatedWork W2966607353 @default.
- W2897056153 hasRelatedWork W2994977393 @default.
- W2897056153 hasRelatedWork W3004196126 @default.
- W2897056153 hasRelatedWork W3007185378 @default.
- W2897056153 hasRelatedWork W3118189354 @default.
- W2897056153 hasRelatedWork W3131475429 @default.
- W2897056153 hasRelatedWork W3195937315 @default.
- W2897056153 isParatext "false" @default.
- W2897056153 isRetracted "false" @default.
- W2897056153 magId "2897056153" @default.