Matches in SemOpenAlex for { <https://semopenalex.org/work/W2140609014> ?p ?o ?g. }
Showing items 1 to 85 of
85
with 100 items per page.
- W2140609014 abstract "Data design has been characterized as a process of arriving at a design that maximizes the information content of each piece of data (or equivalently, one that minimizes redundancy). Information content (or redundancy) is measured with respect to a prescribed model for the data, a model that is often expressed as a set of constraints. In this work, we consider the problem of doing data redesign in an environment where the prescribed model is unknown or incomplete. Specifically, we consider the problem of finding structural clues in an instance of data, an instance which may contain errors, missing values, and duplicate records. We propose a set of information-theoretic tools for finding structural summaries that are useful in characterizing the information content of the data, and ultimately useful in data design. We provide algorithms for creating these summaries over large, categorical data sets. We study the use of these summaries in one specific physical design task, that of ranking functional dependencies based on their data redundancy. We show how our ranking can be used by a physical data-design tool to find good vertical decompositions of a relation (decompositions that improve the information content of the design). We present an evaluation of the approach on real data sets." @default.
- W2140609014 created "2016-06-24" @default.
- W2140609014 creator A5000377124 @default.
- W2140609014 creator A5022619313 @default.
- W2140609014 creator A5066962453 @default.
- W2140609014 date "2004-06-13" @default.
- W2140609014 modified "2023-09-27" @default.
- W2140609014 title "Information-theoretic tools for mining database structure from large data sets" @default.
- W2140609014 cites W1551385575 @default.
- W2140609014 cites W1560541823 @default.
- W2140609014 cites W1971778458 @default.
- W2140609014 cites W1984566373 @default.
- W2140609014 cites W2012644520 @default.
- W2140609014 cites W2024770506 @default.
- W2140609014 cites W2039795745 @default.
- W2140609014 cites W2063601856 @default.
- W2140609014 cites W2067566391 @default.
- W2140609014 cites W2095897464 @default.
- W2140609014 cites W2098506386 @default.
- W2140609014 cites W2139086573 @default.
- W2140609014 cites W2166549982 @default.
- W2140609014 cites W2166559705 @default.
- W2140609014 cites W2293122579 @default.
- W2140609014 cites W4245927080 @default.
- W2140609014 doi "https://doi.org/10.1145/1007568.1007650" @default.
- W2140609014 hasPublicationYear "2004" @default.
- W2140609014 type Work @default.
- W2140609014 sameAs 2140609014 @default.
- W2140609014 citedByCount "54" @default.
- W2140609014 countsByYear W21406090142012 @default.
- W2140609014 countsByYear W21406090142013 @default.
- W2140609014 countsByYear W21406090142014 @default.
- W2140609014 countsByYear W21406090142015 @default.
- W2140609014 countsByYear W21406090142016 @default.
- W2140609014 countsByYear W21406090142017 @default.
- W2140609014 countsByYear W21406090142018 @default.
- W2140609014 countsByYear W21406090142019 @default.
- W2140609014 countsByYear W21406090142022 @default.
- W2140609014 crossrefType "proceedings-article" @default.
- W2140609014 hasAuthorship W2140609014A5000377124 @default.
- W2140609014 hasAuthorship W2140609014A5022619313 @default.
- W2140609014 hasAuthorship W2140609014A5066962453 @default.
- W2140609014 hasConcept C111919701 @default.
- W2140609014 hasConcept C119857082 @default.
- W2140609014 hasConcept C124101348 @default.
- W2140609014 hasConcept C152124472 @default.
- W2140609014 hasConcept C177264268 @default.
- W2140609014 hasConcept C189430467 @default.
- W2140609014 hasConcept C199360897 @default.
- W2140609014 hasConcept C23123220 @default.
- W2140609014 hasConcept C41008148 @default.
- W2140609014 hasConcept C5274069 @default.
- W2140609014 hasConcept C67186912 @default.
- W2140609014 hasConcept C7545210 @default.
- W2140609014 hasConcept C77088390 @default.
- W2140609014 hasConceptScore W2140609014C111919701 @default.
- W2140609014 hasConceptScore W2140609014C119857082 @default.
- W2140609014 hasConceptScore W2140609014C124101348 @default.
- W2140609014 hasConceptScore W2140609014C152124472 @default.
- W2140609014 hasConceptScore W2140609014C177264268 @default.
- W2140609014 hasConceptScore W2140609014C189430467 @default.
- W2140609014 hasConceptScore W2140609014C199360897 @default.
- W2140609014 hasConceptScore W2140609014C23123220 @default.
- W2140609014 hasConceptScore W2140609014C41008148 @default.
- W2140609014 hasConceptScore W2140609014C5274069 @default.
- W2140609014 hasConceptScore W2140609014C67186912 @default.
- W2140609014 hasConceptScore W2140609014C7545210 @default.
- W2140609014 hasConceptScore W2140609014C77088390 @default.
- W2140609014 hasLocation W21406090141 @default.
- W2140609014 hasOpenAccess W2140609014 @default.
- W2140609014 hasPrimaryLocation W21406090141 @default.
- W2140609014 hasRelatedWork W2041335144 @default.
- W2140609014 hasRelatedWork W2059050506 @default.
- W2140609014 hasRelatedWork W2086253379 @default.
- W2140609014 hasRelatedWork W2106322795 @default.
- W2140609014 hasRelatedWork W2163489736 @default.
- W2140609014 hasRelatedWork W2249011631 @default.
- W2140609014 hasRelatedWork W2351574773 @default.
- W2140609014 hasRelatedWork W4296070890 @default.
- W2140609014 hasRelatedWork W50774052 @default.
- W2140609014 hasRelatedWork W54129904 @default.
- W2140609014 isParatext "false" @default.
- W2140609014 isRetracted "false" @default.
- W2140609014 magId "2140609014" @default.
- W2140609014 workType "article" @default.