Matches in SemOpenAlex for { <https://semopenalex.org/work/W2943772075> ?p ?o ?g. }
Showing items 1 to 56 of
56
with 100 items per page.
- W2943772075 abstract "Data quality problems arise frequently when data is integrated from disparate sources. In the context of Big Data applications, data quality is becoming more important because of the unprecedented volume, large variety, and high velocity. The challenges caused by volume and velocity of Big Data have been addressed by many research projects and commercial solutions and can be partially solved by modern, scalable data management systems. However, variety remains to be a daunting challenge for Big Data Integration and requires also special methods for data quality management. Variety (or heterogeneity) exists at several levels: at the instance level, the same entity might be described with different attributes; at the schema level, the data is structured with various schemas; but also at the level of the modeling language, different data models can be used (e.g., relational, XML, or a document-oriented JSON representation). This might lead to data quality issues such as consistency, understandability, or completeness.The heterogeneity of data sources in the Big Data Era requires new integration approaches which can handle the large volume and speed of the generated data as well as the variety and quality of the data. Traditional ‘schema first’ approaches as in the relational world with data warehouse systems and ETL (Extract-Transform-Load) processes are inappropriate for a flexible and dynamically changing data management landscape. The requirement for pre-defined, explicit schemas is a limitation which has drawn interest of many developers and researchers to NoSQL data management systems as these systems should provide data management features for a high amount of schema-less data. Nevertheless, a one-size-fits-all Big Data system is unlikely to solve all the challenges which are required from data management systems today. Instead, multiple classes of systems, optimized for specific requirements or hardware platforms, will co-exist in a data management landscape.Thus, heterogeneity and data quality are challenges for many Big Data applications. While in some applications, a limited data quality for individual data items does not cause serious problems when a huge amount of data is aggregated, data quality problems in data sources are often revealed by the integration of these sources with other information. Data quality has been coined as ‘fitness for use’; thus, if data is used in another context than originally planned, data quality might become an issue. Similar observations have been also made for data warehouses which lead to a separate research area about data warehouse quality.The workshop QDB 2016 aims at discussing recent advances and challenges on data quality management in database systems, and focuses especially on problems in related to Big Data Integration and Big Data Quality. The workshop will provide a forum for the presentation of research results, a panel discussion, and an attractive keynote speaker." @default.
- W2943772075 created "2019-05-09" @default.
- W2943772075 creator A5008944415 @default.
- W2943772075 creator A5043782050 @default.
- W2943772075 creator A5046074866 @default.
- W2943772075 creator A5049301997 @default.
- W2943772075 creator A5091872345 @default.
- W2943772075 date "2016-09-01" @default.
- W2943772075 modified "2023-10-16" @default.
- W2943772075 title "The 11th International Workshop on Quality in DataBases in conjunction with VLDB 2016" @default.
- W2943772075 hasPublicationYear "2016" @default.
- W2943772075 type Work @default.
- W2943772075 sameAs 2943772075 @default.
- W2943772075 citedByCount "1" @default.
- W2943772075 countsByYear W29437720752019 @default.
- W2943772075 crossrefType "other" @default.
- W2943772075 hasAuthorship W2943772075A5008944415 @default.
- W2943772075 hasAuthorship W2943772075A5043782050 @default.
- W2943772075 hasAuthorship W2943772075A5046074866 @default.
- W2943772075 hasAuthorship W2943772075A5049301997 @default.
- W2943772075 hasAuthorship W2943772075A5091872345 @default.
- W2943772075 hasConcept C121332964 @default.
- W2943772075 hasConcept C1276947 @default.
- W2943772075 hasConcept C23123220 @default.
- W2943772075 hasConcept C23549232 @default.
- W2943772075 hasConcept C2779530757 @default.
- W2943772075 hasConcept C41008148 @default.
- W2943772075 hasConcept C59656382 @default.
- W2943772075 hasConcept C62520636 @default.
- W2943772075 hasConcept C77088390 @default.
- W2943772075 hasConceptScore W2943772075C121332964 @default.
- W2943772075 hasConceptScore W2943772075C1276947 @default.
- W2943772075 hasConceptScore W2943772075C23123220 @default.
- W2943772075 hasConceptScore W2943772075C23549232 @default.
- W2943772075 hasConceptScore W2943772075C2779530757 @default.
- W2943772075 hasConceptScore W2943772075C41008148 @default.
- W2943772075 hasConceptScore W2943772075C59656382 @default.
- W2943772075 hasConceptScore W2943772075C62520636 @default.
- W2943772075 hasConceptScore W2943772075C77088390 @default.
- W2943772075 hasLocation W29437720751 @default.
- W2943772075 hasOpenAccess W2943772075 @default.
- W2943772075 hasPrimaryLocation W29437720751 @default.
- W2943772075 hasRelatedWork W1505216615 @default.
- W2943772075 hasRelatedWork W2086064646 @default.
- W2943772075 hasRelatedWork W2115485936 @default.
- W2943772075 hasRelatedWork W2119135658 @default.
- W2943772075 hasRelatedWork W2349174110 @default.
- W2943772075 hasRelatedWork W2357241418 @default.
- W2943772075 hasRelatedWork W2792377126 @default.
- W2943772075 hasRelatedWork W2943772075 @default.
- W2943772075 hasRelatedWork W3022131925 @default.
- W2943772075 hasRelatedWork W3025861905 @default.
- W2943772075 isParatext "false" @default.
- W2943772075 isRetracted "false" @default.
- W2943772075 magId "2943772075" @default.
- W2943772075 workType "other" @default.