Matches in SemOpenAlex for { <https://semopenalex.org/work/W2549115005> ?p ?o ?g. }
- W2549115005 endingPage "937" @default.
- W2549115005 startingPage "911" @default.
- W2549115005 abstract "The increasing availability of large collections of chemical structures and associated experimental data provides an opportunity to build robust QSAR models for applications in different fields. One common concern is the quality of both the chemical structure information and associated experimental data. Here we describe the development of an automated KNIME workflow to curate and correct errors in the structure and identity of chemicals using the publicly available PHYSPROP physicochemical properties and environmental fate datasets. The workflow first assembles structure-identity pairs using up to four provided chemical identifiers, including chemical name, CASRNs, SMILES, and MolBlock. Problems detected included errors and mismatches in chemical structure formats, identifiers and various structure validation issues, including hypervalency and stereochemistry descriptions. Subsequently, a machine learning procedure was applied to evaluate the impact of this curation process. The performance of QSAR models built on only the highest-quality subset of the original dataset was compared with the larger curated and corrected dataset. The latter showed statistically improved predictive performance. The final workflow was used to curate the full list of PHYSPROP datasets, and is being made publicly available for further usage and integration by the scientific community." @default.
- W2549115005 created "2016-11-30" @default.
- W2549115005 creator A5026992703 @default.
- W2549115005 creator A5054612487 @default.
- W2549115005 creator A5056766192 @default.
- W2549115005 creator A5062051745 @default.
- W2549115005 creator A5072822177 @default.
- W2549115005 date "2016-11-01" @default.
- W2549115005 modified "2023-10-02" @default.
- W2549115005 title "An automated curation procedure for addressing chemical errors and inconsistencies in public datasets used in QSAR modelling" @default.
- W2549115005 cites W143696470 @default.
- W2549115005 cites W1508604947 @default.
- W2549115005 cites W17944005 @default.
- W2549115005 cites W1835740130 @default.
- W2549115005 cites W1969378311 @default.
- W2549115005 cites W1978722911 @default.
- W2549115005 cites W1983464445 @default.
- W2549115005 cites W1999596182 @default.
- W2549115005 cites W1999638776 @default.
- W2549115005 cites W2001179019 @default.
- W2549115005 cites W2020552613 @default.
- W2549115005 cites W2024203393 @default.
- W2549115005 cites W2039609876 @default.
- W2549115005 cites W2041610798 @default.
- W2549115005 cites W2042719208 @default.
- W2549115005 cites W2049911450 @default.
- W2549115005 cites W2053246404 @default.
- W2549115005 cites W2054716083 @default.
- W2549115005 cites W2055972008 @default.
- W2549115005 cites W2057069496 @default.
- W2549115005 cites W2063396347 @default.
- W2549115005 cites W2066527689 @default.
- W2549115005 cites W2066657428 @default.
- W2549115005 cites W2068950612 @default.
- W2549115005 cites W2073503722 @default.
- W2549115005 cites W2073511756 @default.
- W2549115005 cites W2074166357 @default.
- W2549115005 cites W2079434695 @default.
- W2549115005 cites W2089578131 @default.
- W2549115005 cites W2090790364 @default.
- W2549115005 cites W2099071242 @default.
- W2549115005 cites W2103674568 @default.
- W2549115005 cites W2111966146 @default.
- W2549115005 cites W2159887157 @default.
- W2549115005 cites W2169016202 @default.
- W2549115005 cites W2266664921 @default.
- W2549115005 cites W2394108223 @default.
- W2549115005 cites W2473190403 @default.
- W2549115005 cites W342897826 @default.
- W2549115005 doi "https://doi.org/10.1080/1062936x.2016.1253611" @default.
- W2549115005 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/27885862" @default.
- W2549115005 hasPublicationYear "2016" @default.
- W2549115005 type Work @default.
- W2549115005 sameAs 2549115005 @default.
- W2549115005 citedByCount "82" @default.
- W2549115005 countsByYear W25491150052017 @default.
- W2549115005 countsByYear W25491150052018 @default.
- W2549115005 countsByYear W25491150052019 @default.
- W2549115005 countsByYear W25491150052020 @default.
- W2549115005 countsByYear W25491150052021 @default.
- W2549115005 countsByYear W25491150052022 @default.
- W2549115005 countsByYear W25491150052023 @default.
- W2549115005 crossrefType "journal-article" @default.
- W2549115005 hasAuthorship W2549115005A5026992703 @default.
- W2549115005 hasAuthorship W2549115005A5054612487 @default.
- W2549115005 hasAuthorship W2549115005A5056766192 @default.
- W2549115005 hasAuthorship W2549115005A5062051745 @default.
- W2549115005 hasAuthorship W2549115005A5072822177 @default.
- W2549115005 hasConcept C111472728 @default.
- W2549115005 hasConcept C111919701 @default.
- W2549115005 hasConcept C119857082 @default.
- W2549115005 hasConcept C124101348 @default.
- W2549115005 hasConcept C138885662 @default.
- W2549115005 hasConcept C154504017 @default.
- W2549115005 hasConcept C164126121 @default.
- W2549115005 hasConcept C177212765 @default.
- W2549115005 hasConcept C199360897 @default.
- W2549115005 hasConcept C23123220 @default.
- W2549115005 hasConcept C2779530757 @default.
- W2549115005 hasConcept C41008148 @default.
- W2549115005 hasConcept C77088390 @default.
- W2549115005 hasConcept C91632574 @default.
- W2549115005 hasConcept C98045186 @default.
- W2549115005 hasConceptScore W2549115005C111472728 @default.
- W2549115005 hasConceptScore W2549115005C111919701 @default.
- W2549115005 hasConceptScore W2549115005C119857082 @default.
- W2549115005 hasConceptScore W2549115005C124101348 @default.
- W2549115005 hasConceptScore W2549115005C138885662 @default.
- W2549115005 hasConceptScore W2549115005C154504017 @default.
- W2549115005 hasConceptScore W2549115005C164126121 @default.
- W2549115005 hasConceptScore W2549115005C177212765 @default.
- W2549115005 hasConceptScore W2549115005C199360897 @default.
- W2549115005 hasConceptScore W2549115005C23123220 @default.
- W2549115005 hasConceptScore W2549115005C2779530757 @default.
- W2549115005 hasConceptScore W2549115005C41008148 @default.
- W2549115005 hasConceptScore W2549115005C77088390 @default.
- W2549115005 hasConceptScore W2549115005C91632574 @default.
- W2549115005 hasConceptScore W2549115005C98045186 @default.