Matches in SemOpenAlex for { <https://semopenalex.org/work/W3180830570> ?p ?o ?g. }
- W3180830570 endingPage "107262" @default.
- W3180830570 startingPage "107262" @default.
- W3180830570 abstract "Materials discovery via machine learning has become an increasingly popular method due to its ability to rapidly predict materials properties in a time-efficient and low-cost manner. However, one limitation in this field is the lack of benchmark datasets, particularly those that encompass the size, tasks, material systems, and data modalities present in the materials informatics literature. This makes it difficult to identify optimal machine learning model choices including algorithm, model architecture, data splitting, and data featurization for a given task. Here, we attempt to address this lack of benchmark datasets by assembling a unique repository of 50 different datasets for materials properties. The data contains both experimental and computational data, data suited for regression as well as classification, sizes ranging from 12 to 6354 samples, and materials systems spanning the diversity of materials research. Data were extracted from 16 publications. In addition to cleaning the data where necessary, each dataset was split into train, validation, and test splits. For datasets with more than 100 values, train-val-test splits were created, either with a 5-fold or 10-fold cross-validation method, depending on what each respective paper did in their studies. Datasets with less than 100 values had train-test splits created using the Leave-One-Out cross-validation method. These benchmark data can serve as a basis for a more diverse benchmark dataset in the future to further improve their effectiveness in the comparison of machine learning models." @default.
- W3180830570 created "2021-07-19" @default.
- W3180830570 creator A5003301534 @default.
- W3180830570 creator A5047086331 @default.
- W3180830570 creator A5050610772 @default.
- W3180830570 date "2021-08-01" @default.
- W3180830570 modified "2023-10-13" @default.
- W3180830570 title "Benchmark datasets incorporating diverse tasks, sample sizes, material systems, and data heterogeneity for materials informatics" @default.
- W3180830570 cites W3100220443 @default.
- W3180830570 doi "https://doi.org/10.1016/j.dib.2021.107262" @default.
- W3180830570 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/8319566" @default.
- W3180830570 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/34345637" @default.
- W3180830570 hasPublicationYear "2021" @default.
- W3180830570 type Work @default.
- W3180830570 sameAs 3180830570 @default.
- W3180830570 citedByCount "7" @default.
- W3180830570 countsByYear W31808305702021 @default.
- W3180830570 countsByYear W31808305702022 @default.
- W3180830570 countsByYear W31808305702023 @default.
- W3180830570 crossrefType "journal-article" @default.
- W3180830570 hasAuthorship W3180830570A5003301534 @default.
- W3180830570 hasAuthorship W3180830570A5047086331 @default.
- W3180830570 hasAuthorship W3180830570A5050610772 @default.
- W3180830570 hasBestOaLocation W31808305701 @default.
- W3180830570 hasConcept C105795698 @default.
- W3180830570 hasConcept C119857082 @default.
- W3180830570 hasConcept C124101348 @default.
- W3180830570 hasConcept C13280743 @default.
- W3180830570 hasConcept C138816342 @default.
- W3180830570 hasConcept C144024400 @default.
- W3180830570 hasConcept C144133560 @default.
- W3180830570 hasConcept C145642194 @default.
- W3180830570 hasConcept C154945302 @default.
- W3180830570 hasConcept C158518442 @default.
- W3180830570 hasConcept C159110408 @default.
- W3180830570 hasConcept C162324750 @default.
- W3180830570 hasConcept C162853370 @default.
- W3180830570 hasConcept C16910744 @default.
- W3180830570 hasConcept C185592680 @default.
- W3180830570 hasConcept C185798385 @default.
- W3180830570 hasConcept C187736073 @default.
- W3180830570 hasConcept C198531522 @default.
- W3180830570 hasConcept C199360897 @default.
- W3180830570 hasConcept C202444582 @default.
- W3180830570 hasConcept C205649164 @default.
- W3180830570 hasConcept C2779903281 @default.
- W3180830570 hasConcept C2780451532 @default.
- W3180830570 hasConcept C33923547 @default.
- W3180830570 hasConcept C36289849 @default.
- W3180830570 hasConcept C41008148 @default.
- W3180830570 hasConcept C43617362 @default.
- W3180830570 hasConcept C45804977 @default.
- W3180830570 hasConcept C55037315 @default.
- W3180830570 hasConcept C62085286 @default.
- W3180830570 hasConcept C71924100 @default.
- W3180830570 hasConcept C86251818 @default.
- W3180830570 hasConcept C9652623 @default.
- W3180830570 hasConceptScore W3180830570C105795698 @default.
- W3180830570 hasConceptScore W3180830570C119857082 @default.
- W3180830570 hasConceptScore W3180830570C124101348 @default.
- W3180830570 hasConceptScore W3180830570C13280743 @default.
- W3180830570 hasConceptScore W3180830570C138816342 @default.
- W3180830570 hasConceptScore W3180830570C144024400 @default.
- W3180830570 hasConceptScore W3180830570C144133560 @default.
- W3180830570 hasConceptScore W3180830570C145642194 @default.
- W3180830570 hasConceptScore W3180830570C154945302 @default.
- W3180830570 hasConceptScore W3180830570C158518442 @default.
- W3180830570 hasConceptScore W3180830570C159110408 @default.
- W3180830570 hasConceptScore W3180830570C162324750 @default.
- W3180830570 hasConceptScore W3180830570C162853370 @default.
- W3180830570 hasConceptScore W3180830570C16910744 @default.
- W3180830570 hasConceptScore W3180830570C185592680 @default.
- W3180830570 hasConceptScore W3180830570C185798385 @default.
- W3180830570 hasConceptScore W3180830570C187736073 @default.
- W3180830570 hasConceptScore W3180830570C198531522 @default.
- W3180830570 hasConceptScore W3180830570C199360897 @default.
- W3180830570 hasConceptScore W3180830570C202444582 @default.
- W3180830570 hasConceptScore W3180830570C205649164 @default.
- W3180830570 hasConceptScore W3180830570C2779903281 @default.
- W3180830570 hasConceptScore W3180830570C2780451532 @default.
- W3180830570 hasConceptScore W3180830570C33923547 @default.
- W3180830570 hasConceptScore W3180830570C36289849 @default.
- W3180830570 hasConceptScore W3180830570C41008148 @default.
- W3180830570 hasConceptScore W3180830570C43617362 @default.
- W3180830570 hasConceptScore W3180830570C45804977 @default.
- W3180830570 hasConceptScore W3180830570C55037315 @default.
- W3180830570 hasConceptScore W3180830570C62085286 @default.
- W3180830570 hasConceptScore W3180830570C71924100 @default.
- W3180830570 hasConceptScore W3180830570C86251818 @default.
- W3180830570 hasConceptScore W3180830570C9652623 @default.
- W3180830570 hasFunder F4320306076 @default.
- W3180830570 hasFunder F4320310050 @default.
- W3180830570 hasLocation W31808305701 @default.
- W3180830570 hasLocation W31808305702 @default.
- W3180830570 hasOpenAccess W3180830570 @default.
- W3180830570 hasPrimaryLocation W31808305701 @default.
- W3180830570 hasRelatedWork W1980585784 @default.
- W3180830570 hasRelatedWork W1999506221 @default.