Matches in SemOpenAlex for { <https://semopenalex.org/work/W4365211652> ?p ?o ?g. }
Showing items 1 to 69 of
69
with 100 items per page.
- W4365211652 abstract "Nowadays, numerous industries have exceptional demand for skills in data science, such as data analysis, data mining, and machine learning. The computational notebook (e.g., Jupyter Notebook) is a well-known data science tool adopted in practice. Kaggle and GitHub are two platforms where data science communities are used for knowledge-sharing, skill-practicing, and collaboration. While tutorials and guidelines for novice data science are available on both platforms, there is a low number of Jupyter Notebooks that received high numbers of votes from the community. The high-voted notebook is considered well-documented, easy to understand, and applies the best data science and software engineering practices. In this research, we aim to understand the characteristics of high-voted Jupyter Notebooks on Kaggle and the popular Jupyter Notebooks for data science projects on GitHub. We plan to mine and analyse the Jupyter Notebooks on both platforms. We will perform exploratory analytics, data visualization, and feature importances to understand the overall structure of these notebooks and to identify common patterns and best-practice features separating the low-voted and high-voted notebooks. Upon the completion of this research, the discovered insights can be applied as training guidelines for aspiring data scientists and machine learning practitioners looking to improve their performance from novice ranking Jupyter Notebook on Kaggle to a deployable project on GitHub." @default.
- W4365211652 created "2023-04-13" @default.
- W4365211652 creator A5014020830 @default.
- W4365211652 creator A5023694183 @default.
- W4365211652 creator A5029010598 @default.
- W4365211652 creator A5030198911 @default.
- W4365211652 creator A5033078494 @default.
- W4365211652 creator A5059000011 @default.
- W4365211652 creator A5060634928 @default.
- W4365211652 creator A5091820517 @default.
- W4365211652 date "2023-04-11" @default.
- W4365211652 modified "2023-09-28" @default.
- W4365211652 title "Mining the Characteristics of Jupyter Notebooks in Data Science Projects" @default.
- W4365211652 doi "https://doi.org/10.48550/arxiv.2304.05325" @default.
- W4365211652 hasPublicationYear "2023" @default.
- W4365211652 type Work @default.
- W4365211652 citedByCount "0" @default.
- W4365211652 crossrefType "posted-content" @default.
- W4365211652 hasAuthorship W4365211652A5014020830 @default.
- W4365211652 hasAuthorship W4365211652A5023694183 @default.
- W4365211652 hasAuthorship W4365211652A5029010598 @default.
- W4365211652 hasAuthorship W4365211652A5030198911 @default.
- W4365211652 hasAuthorship W4365211652A5033078494 @default.
- W4365211652 hasAuthorship W4365211652A5059000011 @default.
- W4365211652 hasAuthorship W4365211652A5060634928 @default.
- W4365211652 hasAuthorship W4365211652A5091820517 @default.
- W4365211652 hasBestOaLocation W43652116521 @default.
- W4365211652 hasConcept C111919701 @default.
- W4365211652 hasConcept C120894424 @default.
- W4365211652 hasConcept C124101348 @default.
- W4365211652 hasConcept C136764020 @default.
- W4365211652 hasConcept C172367668 @default.
- W4365211652 hasConcept C175801342 @default.
- W4365211652 hasConcept C199360897 @default.
- W4365211652 hasConcept C2522767166 @default.
- W4365211652 hasConcept C2777904410 @default.
- W4365211652 hasConcept C36464697 @default.
- W4365211652 hasConcept C41008148 @default.
- W4365211652 hasConcept C79158427 @default.
- W4365211652 hasConcept C83283714 @default.
- W4365211652 hasConceptScore W4365211652C111919701 @default.
- W4365211652 hasConceptScore W4365211652C120894424 @default.
- W4365211652 hasConceptScore W4365211652C124101348 @default.
- W4365211652 hasConceptScore W4365211652C136764020 @default.
- W4365211652 hasConceptScore W4365211652C172367668 @default.
- W4365211652 hasConceptScore W4365211652C175801342 @default.
- W4365211652 hasConceptScore W4365211652C199360897 @default.
- W4365211652 hasConceptScore W4365211652C2522767166 @default.
- W4365211652 hasConceptScore W4365211652C2777904410 @default.
- W4365211652 hasConceptScore W4365211652C36464697 @default.
- W4365211652 hasConceptScore W4365211652C41008148 @default.
- W4365211652 hasConceptScore W4365211652C79158427 @default.
- W4365211652 hasConceptScore W4365211652C83283714 @default.
- W4365211652 hasLocation W43652116521 @default.
- W4365211652 hasOpenAccess W4365211652 @default.
- W4365211652 hasPrimaryLocation W43652116521 @default.
- W4365211652 hasRelatedWork W2128934137 @default.
- W4365211652 hasRelatedWork W2148525144 @default.
- W4365211652 hasRelatedWork W2176435512 @default.
- W4365211652 hasRelatedWork W2996464640 @default.
- W4365211652 hasRelatedWork W3012440071 @default.
- W4365211652 hasRelatedWork W3215716519 @default.
- W4365211652 hasRelatedWork W4306809819 @default.
- W4365211652 hasRelatedWork W4308350523 @default.
- W4365211652 hasRelatedWork W4312934005 @default.
- W4365211652 hasRelatedWork W4321121411 @default.
- W4365211652 isParatext "false" @default.
- W4365211652 isRetracted "false" @default.
- W4365211652 workType "article" @default.