Matches in SemOpenAlex for { <https://semopenalex.org/work/W4313650079> ?p ?o ?g. }
Showing items 1 to 78 of
78
with 100 items per page.
- W4313650079 abstract "The Li & Stephens (LS) hidden Markov model (HMM) models the process of reconstructing a haplotype as a mosaic copy of haplotypes in a reference panel (haplotype threading). For small panels the probabilistic parameterization of LS enables modeling the uncertainties of such mosaics, and has been the foundational model for haplotype phasing and imputation. However, LS becomes inefficient when sample size is large (tens of thousands to millions), because of its linear time complexity ( O ( MN ), where M is the number of haplotypes and N is the number of sites in the panel). Recently the PBWT, an efficient data structure capturing the local haplotype matching among haplotypes, was proposed to offer fast methods for giving some optimal solution (Viterbi) to the LS HMM. But the solution space of the LS for large panels is still elusive. Previously we introduced the Minimal Positional Substring Cover (MPSC) problem as an alternative formulation of LS whose objective is to cover a query haplotype by a minimum number of segments from haplotypes in a reference panel. The MPSC formulation allows the generation of a haplotype threading in time constant to sample size ( O ( N )). This allows haplotype threading on very large biobank scale panels on which the LS model is infeasible. Here we present new results on the solution space of the MPSC by first identifying a property that any MPSC will have a set of required regions, and then proposing a MPSC graph. In addition, we derived a number of optimal algorithms for MPSC, including solution enumerations, the Length Maximal MPSC, and h -MPSC solutions. In doing so, our algorithms reveal the solution space of LS for large panels. Even though we only solved an extreme case of LS where the emission probability is 0, our algorithms can be made more robust by PBWT smoothing. We show that our method is informative in terms of revealing the characteristics of biobank-scale data sets and can improve genotype imputation." @default.
- W4313650079 created "2023-01-07" @default.
- W4313650079 creator A5012354604 @default.
- W4313650079 creator A5016396311 @default.
- W4313650079 creator A5083185181 @default.
- W4313650079 date "2023-01-06" @default.
- W4313650079 modified "2023-09-24" @default.
- W4313650079 title "Minimal Positional Substring Cover: A Haplotype Threading Alternative to Li & Stephens Model" @default.
- W4313650079 cites W1962960380 @default.
- W4313650079 cites W2147477044 @default.
- W4313650079 cites W2510973425 @default.
- W4313650079 cites W2529241974 @default.
- W4313650079 cites W2888599454 @default.
- W4313650079 cites W2895486342 @default.
- W4313650079 cites W2919831875 @default.
- W4313650079 cites W2941113916 @default.
- W4313650079 cites W2948091620 @default.
- W4313650079 cites W2952113787 @default.
- W4313650079 cites W2990059271 @default.
- W4313650079 cites W3017235832 @default.
- W4313650079 cites W3104299286 @default.
- W4313650079 cites W3197367468 @default.
- W4313650079 cites W4206475898 @default.
- W4313650079 cites W4283162782 @default.
- W4313650079 doi "https://doi.org/10.1101/2023.01.04.522803" @default.
- W4313650079 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/36711469" @default.
- W4313650079 hasPublicationYear "2023" @default.
- W4313650079 type Work @default.
- W4313650079 citedByCount "0" @default.
- W4313650079 crossrefType "posted-content" @default.
- W4313650079 hasAuthorship W4313650079A5012354604 @default.
- W4313650079 hasAuthorship W4313650079A5016396311 @default.
- W4313650079 hasAuthorship W4313650079A5083185181 @default.
- W4313650079 hasBestOaLocation W43136500791 @default.
- W4313650079 hasConcept C104317684 @default.
- W4313650079 hasConcept C11413529 @default.
- W4313650079 hasConcept C135763542 @default.
- W4313650079 hasConcept C162319229 @default.
- W4313650079 hasConcept C182407805 @default.
- W4313650079 hasConcept C197754878 @default.
- W4313650079 hasConcept C199360897 @default.
- W4313650079 hasConcept C33923547 @default.
- W4313650079 hasConcept C41008148 @default.
- W4313650079 hasConcept C55493867 @default.
- W4313650079 hasConcept C57273362 @default.
- W4313650079 hasConcept C60582962 @default.
- W4313650079 hasConcept C86803240 @default.
- W4313650079 hasConceptScore W4313650079C104317684 @default.
- W4313650079 hasConceptScore W4313650079C11413529 @default.
- W4313650079 hasConceptScore W4313650079C135763542 @default.
- W4313650079 hasConceptScore W4313650079C162319229 @default.
- W4313650079 hasConceptScore W4313650079C182407805 @default.
- W4313650079 hasConceptScore W4313650079C197754878 @default.
- W4313650079 hasConceptScore W4313650079C199360897 @default.
- W4313650079 hasConceptScore W4313650079C33923547 @default.
- W4313650079 hasConceptScore W4313650079C41008148 @default.
- W4313650079 hasConceptScore W4313650079C55493867 @default.
- W4313650079 hasConceptScore W4313650079C57273362 @default.
- W4313650079 hasConceptScore W4313650079C60582962 @default.
- W4313650079 hasConceptScore W4313650079C86803240 @default.
- W4313650079 hasLocation W43136500791 @default.
- W4313650079 hasLocation W43136500792 @default.
- W4313650079 hasLocation W43136500793 @default.
- W4313650079 hasOpenAccess W4313650079 @default.
- W4313650079 hasPrimaryLocation W43136500791 @default.
- W4313650079 hasRelatedWork W2148315173 @default.
- W4313650079 hasRelatedWork W2353601037 @default.
- W4313650079 hasRelatedWork W2355928363 @default.
- W4313650079 hasRelatedWork W2372590927 @default.
- W4313650079 hasRelatedWork W2374526264 @default.
- W4313650079 hasRelatedWork W2613777820 @default.
- W4313650079 hasRelatedWork W3139170180 @default.
- W4313650079 hasRelatedWork W4313313360 @default.
- W4313650079 hasRelatedWork W4313650079 @default.
- W4313650079 hasRelatedWork W4380591140 @default.
- W4313650079 isParatext "false" @default.
- W4313650079 isRetracted "false" @default.
- W4313650079 workType "article" @default.