Matches in SemOpenAlex for { <https://semopenalex.org/work/W4200324349> ?p ?o ?g. }
- W4200324349 abstract "Background: Polygenic risk score (PRS) analyses are now routinely applied in biomedical research, with great hope that they will aid in our understanding of disease aetiology and contribute to personalized medicine. The continued growth of multi-cohort genome-wide association studies (GWASs) and large-scale biobank projects has provided researchers with a wealth of GWAS summary statistics and individual-level data suitable for performing PRS analyses. However, as the size of these studies increases, the risk of inter-cohort sample overlap and close relatedness increases. Ideally, sample overlap would be identified and removed directly, but this is typically not possible due to privacy laws or consent agreements. This sample overlap, whether known or not, is a major problem in PRS analyses because it can lead to inflation of type 1 error and, thus, erroneous conclusions in published work. Results: Here, for the first time, we report the scale of the sample overlap problem for PRS analyses by generating known sample overlap across sub-samples of the UK Biobank data, which we then use to produce GWAS and target data to mimic the effects of inter-cohort sample overlap. We demonstrate that inter-cohort overlap results in a significant and often substantial inflation in the observed PRS-trait association, coefficient of determination (R²) and false-positive rate. This inflation can be high even when the absolute number of overlapping individuals is small if this makes up a notable fraction of the target sample. We develop and introduce EraSOR (Erase Sample Overlap and Relatedness), software for adjusting inflation in PRS prediction and association statistics in the presence of sample overlap or close relatedness between the GWAS and target samples.
A key component of the EraSOR approach is inference of the degree of sample overlap from the intercept of a bivariate LD score regression applied to the GWAS and target data, making it powered in settings where both have sample sizes over 1,000 individuals. Through extensive benchmarking using UK Biobank and HapGen2 simulated genotype-phenotype data, we demonstrate that PRSs calculated using EraSOR-adjusted GWAS summary statistics are robust to inter-cohort overlap in a wide range of realistic scenarios and are even robust to high levels of residual genetic and environmental stratification. Conclusion: The results of all PRS analyses for which sample overlap cannot be definitively ruled out should be considered with caution, given the high type 1 error observed in the presence of even low overlap between base and target cohorts. Given the strong performance of EraSOR in eliminating inflation caused by sample overlap in PRS studies with large (>5k) target samples, we recommend that EraSOR be used in all future such PRS studies to mitigate the potential effects of inter-cohort overlap and close relatedness." @default.
- W4200324349 created "2021-12-31" @default.
- W4200324349 creator A5004028660 @default.
- W4200324349 creator A5024575588 @default.
- W4200324349 creator A5060391039 @default.
- W4200324349 creator A5075815918 @default.
- W4200324349 date "2021-12-13" @default.
- W4200324349 modified "2023-10-01" @default.
- W4200324349 title "EraSOR: Erase Sample Overlap in polygenic score analyses" @default.
- W4200324349 cites W1971141173 @default.
- W4200324349 cites W1997338841 @default.
- W4200324349 cites W2082704080 @default.
- W4200324349 cites W2099085143 @default.
- W4200324349 cites W2102381029 @default.
- W4200324349 cites W2104549677 @default.
- W4200324349 cites W2110048796 @default.
- W4200324349 cites W2131803107 @default.
- W4200324349 cites W2133520037 @default.
- W4200324349 cites W2153118028 @default.
- W4200324349 cites W2153860431 @default.
- W4200324349 cites W2171705978 @default.
- W4200324349 cites W2195783463 @default.
- W4200324349 cites W2587257410 @default.
- W4200324349 cites W2590902687 @default.
- W4200324349 cites W2605897695 @default.
- W4200324349 cites W2609984900 @default.
- W4200324349 cites W2810042136 @default.
- W4200324349 cites W2899876723 @default.
- W4200324349 cites W2901952863 @default.
- W4200324349 cites W2951349772 @default.
- W4200324349 cites W2952883732 @default.
- W4200324349 cites W2958615822 @default.
- W4200324349 cites W3045441625 @default.
- W4200324349 cites W3111897279 @default.
- W4200324349 doi "https://doi.org/10.1101/2021.12.10.472164" @default.
- W4200324349 hasPublicationYear "2021" @default.
- W4200324349 type Work @default.
- W4200324349 citedByCount "4" @default.
- W4200324349 countsByYear W42003243492022 @default.
- W4200324349 crossrefType "posted-content" @default.
- W4200324349 hasAuthorship W4200324349A5004028660 @default.
- W4200324349 hasAuthorship W4200324349A5024575588 @default.
- W4200324349 hasAuthorship W4200324349A5060391039 @default.
- W4200324349 hasAuthorship W4200324349A5075815918 @default.
- W4200324349 hasBestOaLocation W42003243491 @default.
- W4200324349 hasConcept C104317684 @default.
- W4200324349 hasConcept C105795698 @default.
- W4200324349 hasConcept C106208931 @default.
- W4200324349 hasConcept C106934330 @default.
- W4200324349 hasConcept C116567970 @default.
- W4200324349 hasConcept C129848803 @default.
- W4200324349 hasConcept C135763542 @default.
- W4200324349 hasConcept C144024400 @default.
- W4200324349 hasConcept C149782125 @default.
- W4200324349 hasConcept C149923435 @default.
- W4200324349 hasConcept C153209595 @default.
- W4200324349 hasConcept C185592680 @default.
- W4200324349 hasConcept C198531522 @default.
- W4200324349 hasConcept C199360897 @default.
- W4200324349 hasConcept C32792767 @default.
- W4200324349 hasConcept C33923547 @default.
- W4200324349 hasConcept C41008148 @default.
- W4200324349 hasConcept C43617362 @default.
- W4200324349 hasConcept C54355233 @default.
- W4200324349 hasConcept C71924100 @default.
- W4200324349 hasConcept C72563966 @default.
- W4200324349 hasConcept C86803240 @default.
- W4200324349 hasConceptScore W4200324349C104317684 @default.
- W4200324349 hasConceptScore W4200324349C105795698 @default.
- W4200324349 hasConceptScore W4200324349C106208931 @default.
- W4200324349 hasConceptScore W4200324349C106934330 @default.
- W4200324349 hasConceptScore W4200324349C116567970 @default.
- W4200324349 hasConceptScore W4200324349C129848803 @default.
- W4200324349 hasConceptScore W4200324349C135763542 @default.
- W4200324349 hasConceptScore W4200324349C144024400 @default.
- W4200324349 hasConceptScore W4200324349C149782125 @default.
- W4200324349 hasConceptScore W4200324349C149923435 @default.
- W4200324349 hasConceptScore W4200324349C153209595 @default.
- W4200324349 hasConceptScore W4200324349C185592680 @default.
- W4200324349 hasConceptScore W4200324349C198531522 @default.
- W4200324349 hasConceptScore W4200324349C199360897 @default.
- W4200324349 hasConceptScore W4200324349C32792767 @default.
- W4200324349 hasConceptScore W4200324349C33923547 @default.
- W4200324349 hasConceptScore W4200324349C41008148 @default.
- W4200324349 hasConceptScore W4200324349C43617362 @default.
- W4200324349 hasConceptScore W4200324349C54355233 @default.
- W4200324349 hasConceptScore W4200324349C71924100 @default.
- W4200324349 hasConceptScore W4200324349C72563966 @default.
- W4200324349 hasConceptScore W4200324349C86803240 @default.
- W4200324349 hasLocation W42003243491 @default.
- W4200324349 hasOpenAccess W4200324349 @default.
- W4200324349 hasPrimaryLocation W42003243491 @default.
- W4200324349 hasRelatedWork W2172032168 @default.
- W4200324349 hasRelatedWork W2290081135 @default.
- W4200324349 hasRelatedWork W2971863267 @default.
- W4200324349 hasRelatedWork W3005194147 @default.
- W4200324349 hasRelatedWork W3022782703 @default.
- W4200324349 hasRelatedWork W3108703221 @default.
- W4200324349 hasRelatedWork W3205199184 @default.
- W4200324349 hasRelatedWork W4200324349 @default.
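
Each row above follows the pattern in the header: subject, predicate (`?p`), object (`?o`), and graph (`?g`, here always `default`). As a minimal sketch of how such rows could be split back into those four parts, assuming the row layout seen in this listing rather than any documented SemOpenAlex export format, the following Python uses a hypothetical `parse_row` helper and a hypothetical `Row` tuple:

```python
import re
from typing import NamedTuple, Optional


class Row(NamedTuple):
    """One match row; the field names are illustrative, not SemOpenAlex terms."""
    subject: str
    predicate: str
    obj: str
    graph: str


# Assumed layout, inferred from this listing only:
#   "- <subject> <predicate> <object> @<graph>."
# DOTALL lets a quoted object (such as the abstract) span several lines.
ROW_RE = re.compile(r'^-\s+(\S+)\s+(\S+)\s+(.*?)\s*@(\w+)\.\s*$', re.DOTALL)


def parse_row(line: str) -> Optional[Row]:
    """Split one listing row into its parts, or return None if it does not fit."""
    m = ROW_RE.match(line.strip())
    if m is None:
        return None
    subject, predicate, obj, graph = m.groups()
    # Quoted objects are literals; drop the surrounding quotes.
    if len(obj) >= 2 and obj[0] == '"' and obj[-1] == '"':
        obj = obj[1:-1]
    return Row(subject, predicate, obj, graph)


rows = [
    '- W4200324349 cites W1971141173 @default.',
    '- W4200324349 title "EraSOR: Erase Sample Overlap in polygenic score analyses" @default.',
    '- W4200324349 citedByCount "4" @default.',
]
parsed = [parse_row(r) for r in rows]
```

Grouping the parsed rows by `predicate` would then give, for example, all `cites` targets of the work. The header line itself does not start with `- `, so `parse_row` returns `None` for it.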