Matches in SemOpenAlex for { <https://semopenalex.org/work/W3100743957> ?p ?o ?g. }
- W3100743957 abstract "Abstract Background Structural variants comprise diverse genomic arrangements including deletions, insertions, inversions, and translocations, which can generally be detected in humans through sequence comparison to the reference genome. Among structural variants, insertions are the least frequently identified variants, mainly due to ascertainment bias in the reference genome, lack of previous sequence knowledge, and low complexity of typical insertion sequences. Though recent developments in long-read sequencing deliver promise in annotating individual non-reference insertions, population-level catalogues on non-reference insertion variants have not been identified and the possible functional roles of these hidden variants remain elusive. Results To detect non-reference insertion variants, we developed a pipeline, InserTag, which generates non-reference contigs by local de novo assembly and then infers the full-sequence of insertion variants by tracing contigs from non-human primates and other human genome assemblies. Application of the pipeline to data from 2535 individuals of the 1000 Genomes Project helped identify 1696 non-reference insertion variants and re-classify the variants as retention of ancestral sequences or novel sequence insertions based on the ancestral state. Genotyping of the variants showed that individuals had, on average, 0.92-Mbp sequences missing from the reference genome, 92% of the variants were common (allele frequency > 5%) among human populations, and more than half of the variants were major alleles. Among human populations, African populations were the most divergent and had the most non-reference sequences, which was attributed to the greater prevalence of high-frequency insertion variants. The subsets of insertion variants were in high linkage disequilibrium with phenotype-associated SNPs and showed signals of recent continent-specific selection. Conclusions Non-reference insertion variants represent an important type of genetic variation in the human population, and our developed pipeline, InserTag, provides the frameworks for the detection and genotyping of non-reference sequences missing from human populations." @default.
- W3100743957 created "2020-11-23" @default.
- W3100743957 creator A5064605300 @default.
- W3100743957 creator A5077721033 @default.
- W3100743957 creator A5084165999 @default.
- W3100743957 creator A5085644660 @default.
- W3100743957 date "2020-11-13" @default.
- W3100743957 modified "2023-10-14" @default.
- W3100743957 title "Insertion variants missing in the human reference genome are widespread among human populations" @default.
- W3100743957 cites W1595360086 @default.
- W3100743957 cites W1816176398 @default.
- W3100743957 cites W1912672559 @default.
- W3100743957 cites W1946252112 @default.
- W3100743957 cites W1970470405 @default.
- W3100743957 cites W1976555450 @default.
- W3100743957 cites W1992436001 @default.
- W3100743957 cites W1997417518 @default.
- W3100743957 cites W2016943568 @default.
- W3100743957 cites W2025943989 @default.
- W3100743957 cites W2037331531 @default.
- W3100743957 cites W2041825325 @default.
- W3100743957 cites W2061539393 @default.
- W3100743957 cites W2065776802 @default.
- W3100743957 cites W2083870688 @default.
- W3100743957 cites W2084981198 @default.
- W3100743957 cites W2085459418 @default.
- W3100743957 cites W2086493366 @default.
- W3100743957 cites W2091687746 @default.
- W3100743957 cites W2092768543 @default.
- W3100743957 cites W2093529520 @default.
- W3100743957 cites W2097101746 @default.
- W3100743957 cites W2103441770 @default.
- W3100743957 cites W2107126369 @default.
- W3100743957 cites W2108234281 @default.
- W3100743957 cites W2116126626 @default.
- W3100743957 cites W2117716708 @default.
- W3100743957 cites W2120762351 @default.
- W3100743957 cites W2122592421 @default.
- W3100743957 cites W2124429242 @default.
- W3100743957 cites W2125341668 @default.
- W3100743957 cites W2136145671 @default.
- W3100743957 cites W2142642738 @default.
- W3100743957 cites W2143196035 @default.
- W3100743957 cites W2146005104 @default.
- W3100743957 cites W2160969485 @default.
- W3100743957 cites W2161633633 @default.
- W3100743957 cites W2165963439 @default.
- W3100743957 cites W2168784669 @default.
- W3100743957 cites W2208231423 @default.
- W3100743957 cites W2235737683 @default.
- W3100743957 cites W2323327415 @default.
- W3100743957 cites W2528792175 @default.
- W3100743957 cites W2594239943 @default.
- W3100743957 cites W2608920382 @default.
- W3100743957 cites W2750782486 @default.
- W3100743957 cites W2799524357 @default.
- W3100743957 cites W2883573930 @default.
- W3100743957 cites W2884270047 @default.
- W3100743957 cites W2901214479 @default.
- W3100743957 cites W2901303766 @default.
- W3100743957 cites W2908721251 @default.
- W3100743957 cites W2950337916 @default.
- W3100743957 cites W2964679279 @default.
- W3100743957 cites W2966929769 @default.
- W3100743957 cites W2972657082 @default.
- W3100743957 cites W2989655129 @default.
- W3100743957 cites W3032034316 @default.
- W3100743957 cites W93588716 @default.
- W3100743957 doi "https://doi.org/10.1186/s12915-020-00894-1" @default.
- W3100743957 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/7666470" @default.
- W3100743957 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/33187521" @default.
- W3100743957 hasPublicationYear "2020" @default.
- W3100743957 type Work @default.
- W3100743957 sameAs 3100743957 @default.
- W3100743957 citedByCount "6" @default.
- W3100743957 countsByYear W31007439572021 @default.
- W3100743957 countsByYear W31007439572022 @default.
- W3100743957 crossrefType "journal-article" @default.
- W3100743957 hasAuthorship W3100743957A5064605300 @default.
- W3100743957 hasAuthorship W3100743957A5077721033 @default.
- W3100743957 hasAuthorship W3100743957A5084165999 @default.
- W3100743957 hasAuthorship W3100743957A5085644660 @default.
- W3100743957 hasBestOaLocation W31007439571 @default.
- W3100743957 hasConcept C104317684 @default.
- W3100743957 hasConcept C119054055 @default.
- W3100743957 hasConcept C135763542 @default.
- W3100743957 hasConcept C141231307 @default.
- W3100743957 hasConcept C144024400 @default.
- W3100743957 hasConcept C149923435 @default.
- W3100743957 hasConcept C151020129 @default.
- W3100743957 hasConcept C153209595 @default.
- W3100743957 hasConcept C192953774 @default.
- W3100743957 hasConcept C197077220 @default.
- W3100743957 hasConcept C2778112365 @default.
- W3100743957 hasConcept C2908647359 @default.
- W3100743957 hasConcept C31467283 @default.
- W3100743957 hasConcept C54355233 @default.
- W3100743957 hasConcept C59582021 @default.
- W3100743957 hasConcept C70721500 @default.
- W3100743957 hasConcept C86803240 @default.