Matches in SemOpenAlex for { <https://semopenalex.org/work/W3204189803> ?p ?o ?g. }
Showing items 1 to 91 of
91
with 100 items per page.
- W3204189803 abstract "Abstract Metadata management constitutes a key prerequisite for enterprises as they engage in data analytics and governance. Today, however, the context of data is often only manually documented by subject matter experts, and lacks completeness and reliability due to the complex nature of data pipelines. Thus, collecting data lineage—describing the origin, structure, and dependencies of data—in an automated fashion increases quality of provided metadata and reduces manual effort, making it critical for the development and operation of data pipelines. In our practice report, we propose an end-to-end solution that digests lineage via (Py‑)Spark execution plans. We build upon the open-source component Spline , allowing us to reliably consume lineage metadata and identify interdependencies. We map the digested data into an expandable data model, enabling us to extract graph structures for both coarse- and fine-grained data lineage. Lastly, our solution visualizes the extracted data lineage via a modern web app, and integrates with BMW Group’s soon-to-be open-sourced Cloud Data Hub." @default.
- W3204189803 created "2021-10-11" @default.
- W3204189803 creator A5030002526 @default.
- W3204189803 creator A5053708680 @default.
- W3204189803 creator A5056446358 @default.
- W3204189803 creator A5058968588 @default.
- W3204189803 date "2021-10-04" @default.
- W3204189803 modified "2023-09-26" @default.
- W3204189803 title "Collecting and visualizing data lineage of Spark jobs" @default.
- W3204189803 cites W1977690765 @default.
- W3204189803 cites W2006926885 @default.
- W3204189803 cites W2071353749 @default.
- W3204189803 cites W2691976780 @default.
- W3204189803 cites W2743230100 @default.
- W3204189803 cites W2753460696 @default.
- W3204189803 cites W2766945679 @default.
- W3204189803 cites W2805350738 @default.
- W3204189803 cites W2889473110 @default.
- W3204189803 cites W2948105215 @default.
- W3204189803 cites W3093530239 @default.
- W3204189803 cites W3100284210 @default.
- W3204189803 cites W3138843571 @default.
- W3204189803 cites W4232424318 @default.
- W3204189803 doi "https://doi.org/10.1007/s13222-021-00387-7" @default.
- W3204189803 hasPublicationYear "2021" @default.
- W3204189803 type Work @default.
- W3204189803 sameAs 3204189803 @default.
- W3204189803 citedByCount "0" @default.
- W3204189803 crossrefType "journal-article" @default.
- W3204189803 hasAuthorship W3204189803A5030002526 @default.
- W3204189803 hasAuthorship W3204189803A5053708680 @default.
- W3204189803 hasAuthorship W3204189803A5056446358 @default.
- W3204189803 hasAuthorship W3204189803A5058968588 @default.
- W3204189803 hasBestOaLocation W32041898031 @default.
- W3204189803 hasConcept C104317684 @default.
- W3204189803 hasConcept C111919701 @default.
- W3204189803 hasConcept C124101348 @default.
- W3204189803 hasConcept C132525143 @default.
- W3204189803 hasConcept C136764020 @default.
- W3204189803 hasConcept C151730666 @default.
- W3204189803 hasConcept C153048206 @default.
- W3204189803 hasConcept C185592680 @default.
- W3204189803 hasConcept C199360897 @default.
- W3204189803 hasConcept C2522767166 @default.
- W3204189803 hasConcept C2776817793 @default.
- W3204189803 hasConcept C2779343474 @default.
- W3204189803 hasConcept C2781215313 @default.
- W3204189803 hasConcept C41008148 @default.
- W3204189803 hasConcept C55493867 @default.
- W3204189803 hasConcept C77088390 @default.
- W3204189803 hasConcept C79974875 @default.
- W3204189803 hasConcept C80444323 @default.
- W3204189803 hasConcept C86803240 @default.
- W3204189803 hasConcept C93518851 @default.
- W3204189803 hasConceptScore W3204189803C104317684 @default.
- W3204189803 hasConceptScore W3204189803C111919701 @default.
- W3204189803 hasConceptScore W3204189803C124101348 @default.
- W3204189803 hasConceptScore W3204189803C132525143 @default.
- W3204189803 hasConceptScore W3204189803C136764020 @default.
- W3204189803 hasConceptScore W3204189803C151730666 @default.
- W3204189803 hasConceptScore W3204189803C153048206 @default.
- W3204189803 hasConceptScore W3204189803C185592680 @default.
- W3204189803 hasConceptScore W3204189803C199360897 @default.
- W3204189803 hasConceptScore W3204189803C2522767166 @default.
- W3204189803 hasConceptScore W3204189803C2776817793 @default.
- W3204189803 hasConceptScore W3204189803C2779343474 @default.
- W3204189803 hasConceptScore W3204189803C2781215313 @default.
- W3204189803 hasConceptScore W3204189803C41008148 @default.
- W3204189803 hasConceptScore W3204189803C55493867 @default.
- W3204189803 hasConceptScore W3204189803C77088390 @default.
- W3204189803 hasConceptScore W3204189803C79974875 @default.
- W3204189803 hasConceptScore W3204189803C80444323 @default.
- W3204189803 hasConceptScore W3204189803C86803240 @default.
- W3204189803 hasConceptScore W3204189803C93518851 @default.
- W3204189803 hasLocation W32041898031 @default.
- W3204189803 hasOpenAccess W3204189803 @default.
- W3204189803 hasPrimaryLocation W32041898031 @default.
- W3204189803 hasRelatedWork W1492937784 @default.
- W3204189803 hasRelatedWork W2348772280 @default.
- W3204189803 hasRelatedWork W2354642172 @default.
- W3204189803 hasRelatedWork W2361349944 @default.
- W3204189803 hasRelatedWork W2365137963 @default.
- W3204189803 hasRelatedWork W2365800529 @default.
- W3204189803 hasRelatedWork W2381351160 @default.
- W3204189803 hasRelatedWork W2385987419 @default.
- W3204189803 hasRelatedWork W2914587905 @default.
- W3204189803 hasRelatedWork W3214396836 @default.
- W3204189803 isParatext "false" @default.
- W3204189803 isRetracted "false" @default.
- W3204189803 magId "3204189803" @default.
- W3204189803 workType "article" @default.