Matches in SemOpenAlex for { <https://semopenalex.org/work/W2616931575> ?p ?o ?g. }
- W2616931575 abstract "Abstract Genome wide association studies (GWAS) rely on microarrays, or more recently mapping of whole-genome sequencing reads, to genotype individuals. The reliance on prior sequencing of a reference genome for the organism on which the association study is to be performed limits the scope of association studies, and also precludes the identification of differences between cases and controls outside of the reference. We present an alignment free method for association studies that is based on counting k-mers in sequencing reads, testing for associations directly between k-mers and the trait of interest, and local assembly of the statistically significant k-mers to identify sequence differences. Results with simulated data and an analysis of the 1000 genomes data provide a proof of principle for the approach. In a pairwise comparison of the Toscani in Italia (TSI) and the Yoruba in Ibadan, Nigeria (YRI) populations we find that sequences identified by our method largely agree with results obtained using standard GWAS based on variant calling from mapped reads. However unlike standard GWAS, we find that our method identifies associations with structural variations and sites not present in the reference genome revealing sequences absent from the human reference genome. We also analyze data from the Bengali from Bangladesh (BEB) population to explore possible genetic basis of high rate of mortality due to cardiovascular diseases (CVD) among South Asians and find significant differences in frequencies of a number of non-synonymous variants in genes linked to CVDs between BEB and TSI samples, including the site rs1042034, which has been associated with higher risk of CVDs previously, and the nearby rs676210 in the Apolipoprotein B (ApoB) gene. Author Summary We present a method for associating regions in genomes to traits or diseases. The method is based on finding differences in frequencies of short strings of letters in sequencing reads and do not require reads to be aligned to a reference genome. This makes it applicable to study of organisms with no or incomplete reference genomes. We test our method with simulated data and sequencing data from the 1000 genomes project and find agreement with the conventional approach based on alignment to a reference genome. In addition, our method finds associations with sequences not in reference genomes and reveals sequences missing from the human reference genome. We also explore high rates of mortality due to cardiovascular diseases among South Asians and find prevalence of variations in genes associated with heart diseases in samples from the Bengali from Bangladesh population including one that has been reported to be associated with early onset of cardiovascular diseases." @default.
- W2616931575 created "2017-06-05" @default.
- W2616931575 creator A5044103741 @default.
- W2616931575 creator A5067318904 @default.
- W2616931575 creator A5068063822 @default.
- W2616931575 creator A5080064706 @default.
- W2616931575 date "2017-05-23" @default.
- W2616931575 modified "2023-09-26" @default.
- W2616931575 title "Association Mapping from Sequencing Reads Using K-mers" @default.
- W2616931575 cites W1966822396 @default.
- W2616931575 cites W1979751471 @default.
- W2616931575 cites W2005287518 @default.
- W2616931575 cites W2010361633 @default.
- W2616931575 cites W2026518999 @default.
- W2616931575 cites W2028513990 @default.
- W2616931575 cites W2037959887 @default.
- W2616931575 cites W2044805330 @default.
- W2616931575 cites W2046801160 @default.
- W2616931575 cites W2065128082 @default.
- W2616931575 cites W2068780269 @default.
- W2616931575 cites W2084981198 @default.
- W2616931575 cites W2094890728 @default.
- W2616931575 cites W2096128575 @default.
- W2616931575 cites W2096791516 @default.
- W2616931575 cites W2106242560 @default.
- W2616931575 cites W2108234281 @default.
- W2616931575 cites W2110808585 @default.
- W2616931575 cites W2112749286 @default.
- W2616931575 cites W2112902627 @default.
- W2616931575 cites W2115801316 @default.
- W2616931575 cites W2122132285 @default.
- W2616931575 cites W2128016314 @default.
- W2616931575 cites W2136145671 @default.
- W2616931575 cites W2142138981 @default.
- W2616931575 cites W2149992227 @default.
- W2616931575 cites W2155121555 @default.
- W2616931575 cites W2156924697 @default.
- W2616931575 cites W2162033846 @default.
- W2616931575 cites W2162634302 @default.
- W2616931575 cites W2167969973 @default.
- W2616931575 cites W2167989424 @default.
- W2616931575 cites W2170797039 @default.
- W2616931575 cites W4236236547 @default.
- W2616931575 doi "https://doi.org/10.1101/141267" @default.
- W2616931575 hasPublicationYear "2017" @default.
- W2616931575 type Work @default.
- W2616931575 sameAs 2616931575 @default.
- W2616931575 citedByCount "5" @default.
- W2616931575 countsByYear W26169315752017 @default.
- W2616931575 countsByYear W26169315752018 @default.
- W2616931575 countsByYear W26169315752019 @default.
- W2616931575 crossrefType "posted-content" @default.
- W2616931575 hasAuthorship W2616931575A5044103741 @default.
- W2616931575 hasAuthorship W2616931575A5067318904 @default.
- W2616931575 hasAuthorship W2616931575A5068063822 @default.
- W2616931575 hasAuthorship W2616931575A5080064706 @default.
- W2616931575 hasBestOaLocation W26169315751 @default.
- W2616931575 hasConcept C104317684 @default.
- W2616931575 hasConcept C106208931 @default.
- W2616931575 hasConcept C135763542 @default.
- W2616931575 hasConcept C141231307 @default.
- W2616931575 hasConcept C153209595 @default.
- W2616931575 hasConcept C186413461 @default.
- W2616931575 hasConcept C192953774 @default.
- W2616931575 hasConcept C24432333 @default.
- W2616931575 hasConcept C2908647359 @default.
- W2616931575 hasConcept C51679486 @default.
- W2616931575 hasConcept C54355233 @default.
- W2616931575 hasConcept C70721500 @default.
- W2616931575 hasConcept C71924100 @default.
- W2616931575 hasConcept C86803240 @default.
- W2616931575 hasConcept C97425143 @default.
- W2616931575 hasConcept C99454951 @default.
- W2616931575 hasConceptScore W2616931575C104317684 @default.
- W2616931575 hasConceptScore W2616931575C106208931 @default.
- W2616931575 hasConceptScore W2616931575C135763542 @default.
- W2616931575 hasConceptScore W2616931575C141231307 @default.
- W2616931575 hasConceptScore W2616931575C153209595 @default.
- W2616931575 hasConceptScore W2616931575C186413461 @default.
- W2616931575 hasConceptScore W2616931575C192953774 @default.
- W2616931575 hasConceptScore W2616931575C24432333 @default.
- W2616931575 hasConceptScore W2616931575C2908647359 @default.
- W2616931575 hasConceptScore W2616931575C51679486 @default.
- W2616931575 hasConceptScore W2616931575C54355233 @default.
- W2616931575 hasConceptScore W2616931575C70721500 @default.
- W2616931575 hasConceptScore W2616931575C71924100 @default.
- W2616931575 hasConceptScore W2616931575C86803240 @default.
- W2616931575 hasConceptScore W2616931575C97425143 @default.
- W2616931575 hasConceptScore W2616931575C99454951 @default.
- W2616931575 hasLocation W26169315751 @default.
- W2616931575 hasLocation W26169315752 @default.
- W2616931575 hasLocation W26169315753 @default.
- W2616931575 hasLocation W26169315754 @default.
- W2616931575 hasOpenAccess W2616931575 @default.
- W2616931575 hasPrimaryLocation W26169315751 @default.
- W2616931575 hasRelatedWork W1522853405 @default.
- W2616931575 hasRelatedWork W1534187367 @default.
- W2616931575 hasRelatedWork W1747983814 @default.
- W2616931575 hasRelatedWork W1965921640 @default.
- W2616931575 hasRelatedWork W2123828287 @default.