Matches in SemOpenAlex for { <https://semopenalex.org/work/W1567785390> ?p ?o ?g. }
Showing items 1 to 79 of
79
with 100 items per page.
- W1567785390 abstract "The proliferation of data sources both in the private and public domains (e.g., in enterprise environments and on the World-Wide Web) underscores the need for data integration systems. The purpose of a data integration system is to enable users to access data residing in multiple heterogenous sources through a uniform interface. Manual solutions for building such systems are not a viable option, especially when dealing with large-scale and complex applications. This dissertation studies the automation of building data integration systems. In particular, it addresses three key challenges that lie at the heart of any such system. The first challenge relates to the construction of wrappers for the unstructured sources. A source wrapper would ensure that the data in the underlying source is perceived as structured data by the other parts of the system. We particularly focus on sources containing data formatted as lists, and propose a new solution for extracting relational tables from them. The proposed solution is completely unsupervised and domain-independent. It is based on leveraging various sources of information, including a corpus of tens of millions of relational tables published by users on the Web. The second and third challenges are concerned with establishing semantic mappings across data sources. We first propose a new solution for discovering the correspondences across the elements of two schemas. Then, based on these simple correspondences, we propose another solution to discover more complex declarative mapping rules that can actually be used to transform data and queries across the two schemas. The key underpinning for these two solutions is that, unlike previous approaches, they both exploit the usage information extracted from database query logs. This work is the first to introduce the usage-based approach for establishing mappings across data sources. To evaluate our approaches, we conducted experiments using realistic data sets, such as real web lists for the wrapper construction work; and schemas and query logs from the retail and life sciences domains for the work on semantic mappings. The experimental results have verified the effectiveness and applicability of our proposed approaches." @default.
- W1567785390 created "2016-06-24" @default.
- W1567785390 creator A5074464488 @default.
- W1567785390 creator A5089912733 @default.
- W1567785390 date "2010-01-01" @default.
- W1567785390 modified "2023-09-27" @default.
- W1567785390 title "Leveraging external user-generated information for large-scale data integration" @default.
- W1567785390 hasPublicationYear "2010" @default.
- W1567785390 type Work @default.
- W1567785390 sameAs 1567785390 @default.
- W1567785390 citedByCount "0" @default.
- W1567785390 crossrefType "journal-article" @default.
- W1567785390 hasAuthorship W1567785390A5074464488 @default.
- W1567785390 hasAuthorship W1567785390A5089912733 @default.
- W1567785390 hasConcept C113843644 @default.
- W1567785390 hasConcept C120665830 @default.
- W1567785390 hasConcept C121332964 @default.
- W1567785390 hasConcept C124101348 @default.
- W1567785390 hasConcept C129307140 @default.
- W1567785390 hasConcept C134306372 @default.
- W1567785390 hasConcept C157915830 @default.
- W1567785390 hasConcept C173608175 @default.
- W1567785390 hasConcept C192209626 @default.
- W1567785390 hasConcept C23123220 @default.
- W1567785390 hasConcept C2522767166 @default.
- W1567785390 hasConcept C26517878 @default.
- W1567785390 hasConcept C33923547 @default.
- W1567785390 hasConcept C36503486 @default.
- W1567785390 hasConcept C38652104 @default.
- W1567785390 hasConcept C41008148 @default.
- W1567785390 hasConcept C5655090 @default.
- W1567785390 hasConcept C61871575 @default.
- W1567785390 hasConcept C72634772 @default.
- W1567785390 hasConceptScore W1567785390C113843644 @default.
- W1567785390 hasConceptScore W1567785390C120665830 @default.
- W1567785390 hasConceptScore W1567785390C121332964 @default.
- W1567785390 hasConceptScore W1567785390C124101348 @default.
- W1567785390 hasConceptScore W1567785390C129307140 @default.
- W1567785390 hasConceptScore W1567785390C134306372 @default.
- W1567785390 hasConceptScore W1567785390C157915830 @default.
- W1567785390 hasConceptScore W1567785390C173608175 @default.
- W1567785390 hasConceptScore W1567785390C192209626 @default.
- W1567785390 hasConceptScore W1567785390C23123220 @default.
- W1567785390 hasConceptScore W1567785390C2522767166 @default.
- W1567785390 hasConceptScore W1567785390C26517878 @default.
- W1567785390 hasConceptScore W1567785390C33923547 @default.
- W1567785390 hasConceptScore W1567785390C36503486 @default.
- W1567785390 hasConceptScore W1567785390C38652104 @default.
- W1567785390 hasConceptScore W1567785390C41008148 @default.
- W1567785390 hasConceptScore W1567785390C5655090 @default.
- W1567785390 hasConceptScore W1567785390C61871575 @default.
- W1567785390 hasConceptScore W1567785390C72634772 @default.
- W1567785390 hasLocation W15677853901 @default.
- W1567785390 hasOpenAccess W1567785390 @default.
- W1567785390 hasPrimaryLocation W15677853901 @default.
- W1567785390 hasRelatedWork W133949188 @default.
- W1567785390 hasRelatedWork W1516908788 @default.
- W1567785390 hasRelatedWork W1532511805 @default.
- W1567785390 hasRelatedWork W1579035201 @default.
- W1567785390 hasRelatedWork W1590548082 @default.
- W1567785390 hasRelatedWork W165101881 @default.
- W1567785390 hasRelatedWork W1854286394 @default.
- W1567785390 hasRelatedWork W2010600533 @default.
- W1567785390 hasRelatedWork W2020487741 @default.
- W1567785390 hasRelatedWork W2064043226 @default.
- W1567785390 hasRelatedWork W2064208417 @default.
- W1567785390 hasRelatedWork W2097668987 @default.
- W1567785390 hasRelatedWork W2296376838 @default.
- W1567785390 hasRelatedWork W2297784942 @default.
- W1567785390 hasRelatedWork W2407149353 @default.
- W1567785390 hasRelatedWork W2426026510 @default.
- W1567785390 hasRelatedWork W2611541234 @default.
- W1567785390 hasRelatedWork W2962688206 @default.
- W1567785390 hasRelatedWork W3012662999 @default.
- W1567785390 hasRelatedWork W561294517 @default.
- W1567785390 isParatext "false" @default.
- W1567785390 isRetracted "false" @default.
- W1567785390 magId "1567785390" @default.
- W1567785390 workType "article" @default.