Matches in SemOpenAlex for { <https://semopenalex.org/work/W4313237598> ?p ?o ?g. }
- W4313237598 abstract "In the training of predictive models using high-dimensional genomic data, multiple studies' worth of data are often combined to increase sample size and improve generalizability. A drawback of this approach is that there may be different sets of features measured in each study due to variations in expression measurement platform or technology. It is often common practice to work only with the intersection of features measured in common across all studies, which results in the blind discarding of potentially useful feature information that is measured in individual or subsets of studies.We characterize the loss in predictive performance incurred by using only the intersection of feature information available across all studies when training predictors using gene expression data from microarray and sequencing datasets. We study the properties of linear and polynomial regression for imputing discarded features and demonstrate improvements in the external performance of prediction functions through simulation and in gene expression data collected on breast cancer patients. To improve this process, we propose a pairwise strategy that applies any imputation algorithm to two studies at a time and averages imputed features across pairs. We demonstrate that the pairwise strategy is preferable to first merging all datasets together and imputing any resulting missing features. Finally, we provide insights on which subsets of intersected and study-specific features should be used so that missing-feature imputation best promotes cross-study replicability.The code is available at https://github.com/YujieWuu/Pairwise_imputation.Supplementary information is available at Bioinformatics online." @default.
- W4313237598 created "2023-01-06" @default.
- W4313237598 creator A5001511264 @default.
- W4313237598 creator A5023356194 @default.
- W4313237598 creator A5049679555 @default.
- W4313237598 date "2022-12-28" @default.
- W4313237598 modified "2023-09-26" @default.
- W4313237598 title "A pairwise strategy for imputing predictive features when combining multiple datasets" @default.
- W4313237598 cites W1571010171 @default.
- W4313237598 cites W1999907667 @default.
- W4313237598 cites W2007176121 @default.
- W4313237598 cites W2007601283 @default.
- W4313237598 cites W2041413731 @default.
- W4313237598 cites W2064208261 @default.
- W4313237598 cites W2073755222 @default.
- W4313237598 cites W2077676209 @default.
- W4313237598 cites W2096561439 @default.
- W4313237598 cites W2098421558 @default.
- W4313237598 cites W2123035720 @default.
- W4313237598 cites W2128957040 @default.
- W4313237598 cites W2128985829 @default.
- W4313237598 cites W2134932622 @default.
- W4313237598 cites W2150791259 @default.
- W4313237598 cites W2152667098 @default.
- W4313237598 cites W2160572762 @default.
- W4313237598 cites W2163563958 @default.
- W4313237598 cites W2498672755 @default.
- W4313237598 cites W2534588488 @default.
- W4313237598 cites W2778455075 @default.
- W4313237598 cites W2787894218 @default.
- W4313237598 cites W2789960424 @default.
- W4313237598 cites W2949662834 @default.
- W4313237598 cites W2968856704 @default.
- W4313237598 cites W2989954384 @default.
- W4313237598 cites W3107095885 @default.
- W4313237598 doi "https://doi.org/10.1093/bioinformatics/btac839" @default.
- W4313237598 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/36576001" @default.
- W4313237598 hasPublicationYear "2022" @default.
- W4313237598 type Work @default.
- W4313237598 citedByCount "0" @default.
- W4313237598 crossrefType "journal-article" @default.
- W4313237598 hasAuthorship W4313237598A5001511264 @default.
- W4313237598 hasAuthorship W4313237598A5023356194 @default.
- W4313237598 hasAuthorship W4313237598A5049679555 @default.
- W4313237598 hasBestOaLocation W43132375981 @default.
- W4313237598 hasConcept C105795698 @default.
- W4313237598 hasConcept C111919701 @default.
- W4313237598 hasConcept C119857082 @default.
- W4313237598 hasConcept C124101348 @default.
- W4313237598 hasConcept C138885662 @default.
- W4313237598 hasConcept C153180895 @default.
- W4313237598 hasConcept C154945302 @default.
- W4313237598 hasConcept C184898388 @default.
- W4313237598 hasConcept C27158222 @default.
- W4313237598 hasConcept C2776401178 @default.
- W4313237598 hasConcept C33923547 @default.
- W4313237598 hasConcept C41008148 @default.
- W4313237598 hasConcept C41895202 @default.
- W4313237598 hasConcept C43126263 @default.
- W4313237598 hasConcept C45804977 @default.
- W4313237598 hasConcept C58041806 @default.
- W4313237598 hasConcept C83546350 @default.
- W4313237598 hasConcept C9357733 @default.
- W4313237598 hasConceptScore W4313237598C105795698 @default.
- W4313237598 hasConceptScore W4313237598C111919701 @default.
- W4313237598 hasConceptScore W4313237598C119857082 @default.
- W4313237598 hasConceptScore W4313237598C124101348 @default.
- W4313237598 hasConceptScore W4313237598C138885662 @default.
- W4313237598 hasConceptScore W4313237598C153180895 @default.
- W4313237598 hasConceptScore W4313237598C154945302 @default.
- W4313237598 hasConceptScore W4313237598C184898388 @default.
- W4313237598 hasConceptScore W4313237598C27158222 @default.
- W4313237598 hasConceptScore W4313237598C2776401178 @default.
- W4313237598 hasConceptScore W4313237598C33923547 @default.
- W4313237598 hasConceptScore W4313237598C41008148 @default.
- W4313237598 hasConceptScore W4313237598C41895202 @default.
- W4313237598 hasConceptScore W4313237598C43126263 @default.
- W4313237598 hasConceptScore W4313237598C45804977 @default.
- W4313237598 hasConceptScore W4313237598C58041806 @default.
- W4313237598 hasConceptScore W4313237598C83546350 @default.
- W4313237598 hasConceptScore W4313237598C9357733 @default.
- W4313237598 hasIssue "1" @default.
- W4313237598 hasLocation W43132375981 @default.
- W4313237598 hasLocation W43132375982 @default.
- W4313237598 hasLocation W43132375983 @default.
- W4313237598 hasOpenAccess W4313237598 @default.
- W4313237598 hasPrimaryLocation W43132375981 @default.
- W4313237598 hasRelatedWork W119228667 @default.
- W4313237598 hasRelatedWork W2541565311 @default.
- W4313237598 hasRelatedWork W2751555317 @default.
- W4313237598 hasRelatedWork W2784019465 @default.
- W4313237598 hasRelatedWork W2979641641 @default.
- W4313237598 hasRelatedWork W3049453136 @default.
- W4313237598 hasRelatedWork W4200631471 @default.
- W4313237598 hasRelatedWork W4294082001 @default.
- W4313237598 hasRelatedWork W4312712358 @default.
- W4313237598 hasRelatedWork W4225958631 @default.
- W4313237598 hasVolume "39" @default.
- W4313237598 isParatext "false" @default.
- W4313237598 isRetracted "false" @default.