Matches in SemOpenAlex for { <https://semopenalex.org/work/W3043767409> ?p ?o ?g. }
- W3043767409 endingPage "4950" @default.
- W3043767409 startingPage "4938" @default.
- W3043767409 abstract "Recent advances in theoretical thermochemistry have allowed the study of small organic and bio-organic molecules with high accuracy. However, applications to larger molecules are still impeded by the steep scaling problem of highly accurate quantum mechanical (QM) methods, forcing the use of approximate, more cost-effective methods at a greatly reduced accuracy. One of the most successful strategies to mitigate this error is the use of systematic error-cancellation schemes, in which highly accurate QM calculations can be performed on small portions of the molecule to construct corrections to an approximate method. Herein, we build on ideas from fragmentation and error-cancellation to introduce a new family of molecular descriptors for machine learning modeled after the Connectivity-Based Hierarchy (CBH) of generalized isodesmic reaction schemes. The best performing descriptor ML(CBH-2) is constructed from fragments preserving only the immediate connectivity of all heavy (non-H) atoms of a molecule along with overlapping regions of fragments in accordance with the inclusion–exclusion principle. Our proposed approach offers a simple, chemically intuitive grouping of atoms, tuned with an optimal amount of error-cancellation, and outperforms previous structure-based descriptors using a much smaller input vector length. For a wide variety of density functionals, DFT+ΔML(CBH-2) models, trained on a set of small- to medium-sized organic HCNOSCl-containing molecules, achieved an out-of-sample MAE within 0.5 kcal/mol and 2σ (95%) confidence interval of <1.5 kcal/mol compared to accurate G4 reference values at DFT cost." @default.
- W3043767409 created "2020-07-23" @default.
- W3043767409 creator A5011183754 @default.
- W3043767409 creator A5071011528 @default.
- W3043767409 date "2020-07-17" @default.
- W3043767409 modified "2023-09-28" @default.
- W3043767409 title "Effective Molecular Descriptors for Chemical Accuracy at DFT Cost: Fragmentation, Error-Cancellation, and Machine Learning" @default.
- W3043767409 cites W126191579 @default.
- W3043767409 cites W1531674615 @default.
- W3043767409 cites W1901616594 @default.
- W3043767409 cites W1964189755 @default.
- W3043767409 cites W1965503042 @default.
- W3043767409 cites W1971044734 @default.
- W3043767409 cites W2005491208 @default.
- W3043767409 cites W2033428036 @default.
- W3043767409 cites W2035807924 @default.
- W3043767409 cites W2039814685 @default.
- W3043767409 cites W2041404118 @default.
- W3043767409 cites W2048382000 @default.
- W3043767409 cites W2051146701 @default.
- W3043767409 cites W2055462837 @default.
- W3043767409 cites W2056007663 @default.
- W3043767409 cites W2063589866 @default.
- W3043767409 cites W2075711588 @default.
- W3043767409 cites W2077528747 @default.
- W3043767409 cites W2080635178 @default.
- W3043767409 cites W2101698639 @default.
- W3043767409 cites W2103176800 @default.
- W3043767409 cites W2104489082 @default.
- W3043767409 cites W2104631894 @default.
- W3043767409 cites W2117307259 @default.
- W3043767409 cites W2153693853 @default.
- W3043767409 cites W2158081577 @default.
- W3043767409 cites W2320294097 @default.
- W3043767409 cites W2321679921 @default.
- W3043767409 cites W2322993209 @default.
- W3043767409 cites W2324136134 @default.
- W3043767409 cites W2328402136 @default.
- W3043767409 cites W2331862130 @default.
- W3043767409 cites W2347129741 @default.
- W3043767409 cites W2519019522 @default.
- W3043767409 cites W2582187633 @default.
- W3043767409 cites W2610370425 @default.
- W3043767409 cites W2620906374 @default.
- W3043767409 cites W2639728117 @default.
- W3043767409 cites W2726770424 @default.
- W3043767409 cites W2776049733 @default.
- W3043767409 cites W2778051509 @default.
- W3043767409 cites W2787363292 @default.
- W3043767409 cites W2790808809 @default.
- W3043767409 cites W2792348590 @default.
- W3043767409 cites W2794704841 @default.
- W3043767409 cites W2799620402 @default.
- W3043767409 cites W2801991413 @default.
- W3043767409 cites W2805566720 @default.
- W3043767409 cites W2884430236 @default.
- W3043767409 cites W2886916841 @default.
- W3043767409 cites W2937576411 @default.
- W3043767409 cites W2950128007 @default.
- W3043767409 cites W2951642668 @default.
- W3043767409 cites W2954088480 @default.
- W3043767409 cites W2955551200 @default.
- W3043767409 cites W2963784900 @default.
- W3043767409 cites W2969507301 @default.
- W3043767409 cites W2970235642 @default.
- W3043767409 cites W2980026765 @default.
- W3043767409 cites W3005889887 @default.
- W3043767409 cites W3100859786 @default.
- W3043767409 cites W3101744125 @default.
- W3043767409 cites W3104541550 @default.
- W3043767409 cites W3104705366 @default.
- W3043767409 cites W4211025511 @default.
- W3043767409 doi "https://doi.org/10.1021/acs.jctc.0c00236" @default.
- W3043767409 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/32678593" @default.
- W3043767409 hasPublicationYear "2020" @default.
- W3043767409 type Work @default.
- W3043767409 sameAs 3043767409 @default.
- W3043767409 citedByCount "16" @default.
- W3043767409 countsByYear W30437674092021 @default.
- W3043767409 countsByYear W30437674092022 @default.
- W3043767409 countsByYear W30437674092023 @default.
- W3043767409 crossrefType "journal-article" @default.
- W3043767409 hasAuthorship W3043767409A5011183754 @default.
- W3043767409 hasAuthorship W3043767409A5071011528 @default.
- W3043767409 hasConcept C111919701 @default.
- W3043767409 hasConcept C11413529 @default.
- W3043767409 hasConcept C121332964 @default.
- W3043767409 hasConcept C144557053 @default.
- W3043767409 hasConcept C147597530 @default.
- W3043767409 hasConcept C147789679 @default.
- W3043767409 hasConcept C152365726 @default.
- W3043767409 hasConcept C177264268 @default.
- W3043767409 hasConcept C185592680 @default.
- W3043767409 hasConcept C186060115 @default.
- W3043767409 hasConcept C191015642 @default.
- W3043767409 hasConcept C199360897 @default.
- W3043767409 hasConcept C2524010 @default.
- W3043767409 hasConcept C29563950 @default.