Matches in SemOpenAlex for { <https://semopenalex.org/work/W2904132383> ?p ?o ?g. }
- W2904132383 abstract "We consider the problem of creating a navigation structure that allows a user to most effectively navigate a data lake. We define an organization as a graph that contains nodes representing sets of attributes within a data lake and edges indicating subset relationships among nodes. We present a new probabilistic model of how users interact with an organization and define the likelihood of a user finding a table using the organization. We propose the data lake organization problem as the problem of finding an organization that maximizes the expected probability of discovering tables by navigating an organization. We propose an approximate algorithm for the data lake organization problem. We show the effectiveness of the algorithm on both real data lakes containing data from open data portals and on benchmarks that emulate the observed characteristics of real data lakes. Through a formal user study, we show that navigation can help users discover relevant tables that cannot be found by keyword search. In addition, in our study, 42% of users preferred the use of navigation and 58% preferred keyword search, suggesting these are complementary and both useful modalities for data discovery in data lakes. Our experiments show that data lake organizations take into account the data lake distribution and outperform an existing hand-curated taxonomy and a common baseline organization." @default.
- W2904132383 created "2018-12-22" @default.
- W2904132383 creator A5012572863 @default.
- W2904132383 creator A5013934423 @default.
- W2904132383 creator A5022619313 @default.
- W2904132383 creator A5044982088 @default.
- W2904132383 creator A5081282434 @default.
- W2904132383 date "2018-12-17" @default.
- W2904132383 modified "2023-09-27" @default.
- W2904132383 title "Optimizing Organizations for Navigating Data Lakes." @default.
- W2904132383 cites W1532325895 @default.
- W2904132383 cites W1604957788 @default.
- W2904132383 cites W1969621019 @default.
- W2904132383 cites W1976101401 @default.
- W2904132383 cites W1983305952 @default.
- W2904132383 cites W2022166150 @default.
- W2904132383 cites W2029344051 @default.
- W2904132383 cites W2066806792 @default.
- W2904132383 cites W2092364718 @default.
- W2904132383 cites W2105481243 @default.
- W2904132383 cites W2108223890 @default.
- W2904132383 cites W2111869785 @default.
- W2904132383 cites W2139902660 @default.
- W2904132383 cites W2140116426 @default.
- W2904132383 cites W2144108169 @default.
- W2904132383 cites W2144452830 @default.
- W2904132383 cites W2144731007 @default.
- W2904132383 cites W2153579005 @default.
- W2904132383 cites W2155734303 @default.
- W2904132383 cites W2160605923 @default.
- W2904132383 cites W2216189112 @default.
- W2904132383 cites W2250539671 @default.
- W2904132383 cites W2252211299 @default.
- W2904132383 cites W2403920869 @default.
- W2904132383 cites W2533904613 @default.
- W2904132383 cites W2585438896 @default.
- W2904132383 cites W2624356872 @default.
- W2904132383 cites W2750991217 @default.
- W2904132383 cites W2795089200 @default.
- W2904132383 cites W2795302121 @default.
- W2904132383 cites W2798664493 @default.
- W2904132383 cites W2810954846 @default.
- W2904132383 cites W2923400109 @default.
- W2904132383 cites W2926805670 @default.
- W2904132383 cites W2951621897 @default.
- W2904132383 cites W2963174348 @default.
- W2904132383 cites W3102476541 @default.
- W2904132383 cites W38703128 @default.
- W2904132383 cites W574700118 @default.
- W2904132383 hasPublicationYear "2018" @default.
- W2904132383 type Work @default.
- W2904132383 sameAs 2904132383 @default.
- W2904132383 citedByCount "1" @default.
- W2904132383 countsByYear W29041323832019 @default.
- W2904132383 crossrefType "posted-content" @default.
- W2904132383 hasAuthorship W2904132383A5012572863 @default.
- W2904132383 hasAuthorship W2904132383A5013934423 @default.
- W2904132383 hasAuthorship W2904132383A5022619313 @default.
- W2904132383 hasAuthorship W2904132383A5044982088 @default.
- W2904132383 hasAuthorship W2904132383A5081282434 @default.
- W2904132383 hasConcept C111368507 @default.
- W2904132383 hasConcept C124101348 @default.
- W2904132383 hasConcept C12725497 @default.
- W2904132383 hasConcept C127313418 @default.
- W2904132383 hasConcept C132525143 @default.
- W2904132383 hasConcept C154945302 @default.
- W2904132383 hasConcept C23123220 @default.
- W2904132383 hasConcept C2522767166 @default.
- W2904132383 hasConcept C41008148 @default.
- W2904132383 hasConcept C45235069 @default.
- W2904132383 hasConcept C49937458 @default.
- W2904132383 hasConcept C80444323 @default.
- W2904132383 hasConceptScore W2904132383C111368507 @default.
- W2904132383 hasConceptScore W2904132383C124101348 @default.
- W2904132383 hasConceptScore W2904132383C12725497 @default.
- W2904132383 hasConceptScore W2904132383C127313418 @default.
- W2904132383 hasConceptScore W2904132383C132525143 @default.
- W2904132383 hasConceptScore W2904132383C154945302 @default.
- W2904132383 hasConceptScore W2904132383C23123220 @default.
- W2904132383 hasConceptScore W2904132383C2522767166 @default.
- W2904132383 hasConceptScore W2904132383C41008148 @default.
- W2904132383 hasConceptScore W2904132383C45235069 @default.
- W2904132383 hasConceptScore W2904132383C49937458 @default.
- W2904132383 hasConceptScore W2904132383C80444323 @default.
- W2904132383 hasLocation W29041323831 @default.
- W2904132383 hasOpenAccess W2904132383 @default.
- W2904132383 hasPrimaryLocation W29041323831 @default.
- W2904132383 hasRelatedWork W1974124545 @default.
- W2904132383 hasRelatedWork W2000623017 @default.
- W2904132383 hasRelatedWork W2026532078 @default.
- W2904132383 hasRelatedWork W2097427044 @default.
- W2904132383 hasRelatedWork W2114079787 @default.
- W2904132383 hasRelatedWork W2188345419 @default.
- W2904132383 hasRelatedWork W2211257151 @default.
- W2904132383 hasRelatedWork W2292074274 @default.
- W2904132383 hasRelatedWork W2330506098 @default.
- W2904132383 hasRelatedWork W2402159371 @default.
- W2904132383 hasRelatedWork W25677450 @default.
- W2904132383 hasRelatedWork W2742144031 @default.
- W2904132383 hasRelatedWork W2753426599 @default.