Matches in SemOpenAlex for { <https://semopenalex.org/work/W4322010080> ?p ?o ?g. }
Showing items 1 to 66 of
66
with 100 items per page.
- W4322010080 abstract "Computational notebooks (e.g. Jupyter notebook) are a popular choice for interactive scientific computing to convey descriptive information together with executable source code. The user can annotate the scientific development of the work, the methods applied, describe ancillary data or the analysis of results, with text, illustrations, figures, and equations. Such ‘executable’ documents provide a paradigm shift in scientific writing, where not only the science is described, but the actual computation and source code are openly available and can be reproduced and validated.Therefore, it is of paramount importance to preserve these documents. A unique and persistent identification (PID) is essential together with providing enough information to execute the source code. Generating a PID for a Jupyter notebook is not technically challenging. We can automatically collect system and run-time information and, with a guided workflow for the user, assemble a rich set of metadata. The collected information allows us to recreate the computational environment and run the source code, which in return (theoretically) should produce the same results as published.The importance of providing a rich set of metadata for all digital objects in a human readable and machine actionable form is well understood and widely accepted as necessity for reproducibility, traceability, and provenance. This is reflected in the FAIR principles (Wilkinson, https://doi.org/10.1038/sdata.2016.18) which are regarded as gold standard by many scientific communities.Pimentel et al. (https://doi.org/10.1109/MSR.2019.00077) analysed over 800’000 Jupyter notebooks from GitHub. 24 % executed without errors and only 4 % produced the same results. The likelihood to successfully compile and run a decade old source code is slim. Long term support for well established operating systems varies between 5 to 10 years, user software support is usually shorter and looking at free and open-source repositories there is often no support (or best effort) offered.We present an approach to safely reproduce the computational environment in the future with a focus on long-term availability. Instead of trying to reinstall the computational environment based on the stored metadata, we propose to archive the docker image, the user space (user installed packages) and finally the source code. Recreating the system in this way is more like restoring a backup, where backup is the equivalent of an entire computer system. It does not solve all the problems but removes a great deal of complexity and uncertainty.Though there are shortcomings in our approach, we believe our solution will lower the threshold for scientists to provide rich meta data, code and results attached to a publication that can be reproduced in the far future." @default.
- W4322010080 created "2023-02-26" @default.
- W4322010080 creator A5003358895 @default.
- W4322010080 creator A5008141221 @default.
- W4322010080 creator A5051425888 @default.
- W4322010080 creator A5080362093 @default.
- W4322010080 creator A5091697030 @default.
- W4322010080 creator A5017598158 @default.
- W4322010080 date "2023-05-15" @default.
- W4322010080 modified "2023-09-29" @default.
- W4322010080 title "Long-term Reproducibility for Jupyter Notebook" @default.
- W4322010080 doi "https://doi.org/10.5194/egusphere-egu23-9235" @default.
- W4322010080 hasPublicationYear "2023" @default.
- W4322010080 type Work @default.
- W4322010080 citedByCount "0" @default.
- W4322010080 crossrefType "posted-content" @default.
- W4322010080 hasAuthorship W4322010080A5003358895 @default.
- W4322010080 hasAuthorship W4322010080A5008141221 @default.
- W4322010080 hasAuthorship W4322010080A5017598158 @default.
- W4322010080 hasAuthorship W4322010080A5051425888 @default.
- W4322010080 hasAuthorship W4322010080A5080362093 @default.
- W4322010080 hasAuthorship W4322010080A5091697030 @default.
- W4322010080 hasConcept C115903868 @default.
- W4322010080 hasConcept C136764020 @default.
- W4322010080 hasConcept C153876917 @default.
- W4322010080 hasConcept C160145156 @default.
- W4322010080 hasConcept C177212765 @default.
- W4322010080 hasConcept C177264268 @default.
- W4322010080 hasConcept C199360897 @default.
- W4322010080 hasConcept C23123220 @default.
- W4322010080 hasConcept C2776760102 @default.
- W4322010080 hasConcept C41008148 @default.
- W4322010080 hasConcept C43126263 @default.
- W4322010080 hasConcept C51929080 @default.
- W4322010080 hasConcept C77088390 @default.
- W4322010080 hasConcept C93518851 @default.
- W4322010080 hasConceptScore W4322010080C115903868 @default.
- W4322010080 hasConceptScore W4322010080C136764020 @default.
- W4322010080 hasConceptScore W4322010080C153876917 @default.
- W4322010080 hasConceptScore W4322010080C160145156 @default.
- W4322010080 hasConceptScore W4322010080C177212765 @default.
- W4322010080 hasConceptScore W4322010080C177264268 @default.
- W4322010080 hasConceptScore W4322010080C199360897 @default.
- W4322010080 hasConceptScore W4322010080C23123220 @default.
- W4322010080 hasConceptScore W4322010080C2776760102 @default.
- W4322010080 hasConceptScore W4322010080C41008148 @default.
- W4322010080 hasConceptScore W4322010080C43126263 @default.
- W4322010080 hasConceptScore W4322010080C51929080 @default.
- W4322010080 hasConceptScore W4322010080C77088390 @default.
- W4322010080 hasConceptScore W4322010080C93518851 @default.
- W4322010080 hasLocation W43220100801 @default.
- W4322010080 hasOpenAccess W4322010080 @default.
- W4322010080 hasPrimaryLocation W43220100801 @default.
- W4322010080 hasRelatedWork W1965294778 @default.
- W4322010080 hasRelatedWork W2146118316 @default.
- W4322010080 hasRelatedWork W2494932349 @default.
- W4322010080 hasRelatedWork W2557057023 @default.
- W4322010080 hasRelatedWork W2591061147 @default.
- W4322010080 hasRelatedWork W2748872428 @default.
- W4322010080 hasRelatedWork W3107917592 @default.
- W4322010080 hasRelatedWork W4244608052 @default.
- W4322010080 hasRelatedWork W4310364134 @default.
- W4322010080 hasRelatedWork W4376312175 @default.
- W4322010080 isParatext "false" @default.
- W4322010080 isRetracted "false" @default.
- W4322010080 workType "article" @default.