Matches in SemOpenAlex for { <https://semopenalex.org/work/W3157889448> ?p ?o ?g. }
- W3157889448 abstract "Datasets in the Natural Sciences are often curated with the goal of aiding scientific understanding and hence may not always be in a form that facilitates the application of machine learning. In this paper, we identify three trends within the fields of chemical reaction prediction and synthesis design that require a change in direction. First, the manner in which reaction datasets are split into reactants and reagents encourages testing models in an unrealistically generous manner. Second, we highlight the prevalence of mislabelled data, and suggest that the focus should be on outlier removal rather than data fitting only. Lastly, we discuss the problem of reagent prediction, in addition to reactant prediction, in order to solve the full synthesis design problem, highlighting the mismatch between what machine learning solves and what a lab chemist would need. Our critiques are also relevant to the burgeoning field of using machine learning to accelerate progress in experimental Natural Sciences, where datasets are often split in a biased way, are highly noisy, and contextual variables that are not evident from the data strongly influence the outcome of experiments." @default.
- W3157889448 created "2021-05-10" @default.
- W3157889448 creator A5028051805 @default.
- W3157889448 creator A5071183925 @default.
- W3157889448 creator A5086142353 @default.
- W3157889448 date "2018-11-21" @default.
- W3157889448 modified "2023-09-24" @default.
- W3157889448 title "Dataset Bias in the Natural Sciences: A Case Study in Chemical Reaction Prediction and Synthesis Design" @default.
- W3157889448 cites W1576808520 @default.
- W3157889448 cites W2061343703 @default.
- W3157889448 cites W2069118567 @default.
- W3157889448 cites W2107844279 @default.
- W3157889448 cites W2115904656 @default.
- W3157889448 cites W2151554678 @default.
- W3157889448 cites W2153772742 @default.
- W3157889448 cites W2325811289 @default.
- W3157889448 cites W2551217916 @default.
- W3157889448 cites W2600383743 @default.
- W3157889448 cites W2747592475 @default.
- W3157889448 cites W2755801529 @default.
- W3157889448 cites W2763220183 @default.
- W3157889448 cites W2769423117 @default.
- W3157889448 cites W2769756736 @default.
- W3157889448 cites W2786722833 @default.
- W3157889448 cites W2786785157 @default.
- W3157889448 cites W2804182511 @default.
- W3157889448 cites W2805177834 @default.
- W3157889448 cites W2806351858 @default.
- W3157889448 cites W2806448755 @default.
- W3157889448 cites W2809158999 @default.
- W3157889448 cites W2886049025 @default.
- W3157889448 cites W2891063262 @default.
- W3157889448 cites W2891868449 @default.
- W3157889448 cites W29374554 @default.
- W3157889448 cites W2963215859 @default.
- W3157889448 cites W2963396480 @default.
- W3157889448 cites W2963445908 @default.
- W3157889448 cites W2963477006 @default.
- W3157889448 cites W2963676163 @default.
- W3157889448 cites W3098269892 @default.
- W3157889448 cites W3100751385 @default.
- W3157889448 cites W3104508774 @default.
- W3157889448 cites W3104956673 @default.
- W3157889448 cites W3200832534 @default.
- W3157889448 doi "https://doi.org/10.26434/chemrxiv.7366973.v1" @default.
- W3157889448 hasPublicationYear "2018" @default.
- W3157889448 type Work @default.
- W3157889448 sameAs 3157889448 @default.
- W3157889448 citedByCount "9" @default.
- W3157889448 countsByYear W31578894482018 @default.
- W3157889448 countsByYear W31578894482019 @default.
- W3157889448 countsByYear W31578894482020 @default.
- W3157889448 countsByYear W31578894482021 @default.
- W3157889448 crossrefType "posted-content" @default.
- W3157889448 hasAuthorship W3157889448A5028051805 @default.
- W3157889448 hasAuthorship W3157889448A5071183925 @default.
- W3157889448 hasAuthorship W3157889448A5086142353 @default.
- W3157889448 hasBestOaLocation W31578894482 @default.
- W3157889448 hasConcept C119857082 @default.
- W3157889448 hasConcept C154945302 @default.
- W3157889448 hasConcept C166957645 @default.
- W3157889448 hasConcept C178790620 @default.
- W3157889448 hasConcept C185592680 @default.
- W3157889448 hasConcept C202444582 @default.
- W3157889448 hasConcept C2522767166 @default.
- W3157889448 hasConcept C2776608160 @default.
- W3157889448 hasConcept C2779714115 @default.
- W3157889448 hasConcept C33923547 @default.
- W3157889448 hasConcept C41008148 @default.
- W3157889448 hasConcept C79337645 @default.
- W3157889448 hasConcept C95457728 @default.
- W3157889448 hasConcept C9652623 @default.
- W3157889448 hasConceptScore W3157889448C119857082 @default.
- W3157889448 hasConceptScore W3157889448C154945302 @default.
- W3157889448 hasConceptScore W3157889448C166957645 @default.
- W3157889448 hasConceptScore W3157889448C178790620 @default.
- W3157889448 hasConceptScore W3157889448C185592680 @default.
- W3157889448 hasConceptScore W3157889448C202444582 @default.
- W3157889448 hasConceptScore W3157889448C2522767166 @default.
- W3157889448 hasConceptScore W3157889448C2776608160 @default.
- W3157889448 hasConceptScore W3157889448C2779714115 @default.
- W3157889448 hasConceptScore W3157889448C33923547 @default.
- W3157889448 hasConceptScore W3157889448C41008148 @default.
- W3157889448 hasConceptScore W3157889448C79337645 @default.
- W3157889448 hasConceptScore W3157889448C95457728 @default.
- W3157889448 hasConceptScore W3157889448C9652623 @default.
- W3157889448 hasLocation W31578894481 @default.
- W3157889448 hasLocation W31578894482 @default.
- W3157889448 hasOpenAccess W3157889448 @default.
- W3157889448 hasPrimaryLocation W31578894481 @default.
- W3157889448 hasRelatedWork W2961085424 @default.
- W3157889448 hasRelatedWork W3046775127 @default.
- W3157889448 hasRelatedWork W4285260836 @default.
- W3157889448 hasRelatedWork W4286629047 @default.
- W3157889448 hasRelatedWork W4288754364 @default.
- W3157889448 hasRelatedWork W4306321456 @default.
- W3157889448 hasRelatedWork W4306674287 @default.
- W3157889448 hasRelatedWork W4308734192 @default.
- W3157889448 hasRelatedWork W4312831135 @default.
- W3157889448 hasRelatedWork W4224009465 @default.