Matches in SemOpenAlex for { <https://semopenalex.org/work/W3098179732> ?p ?o ?g. }
- W3098179732 abstract "Abstract The goal of strain-aware genome assembly is to reconstruct all individual haplotypes from a mixed sample at the strain level and to provide abundance estimates for the strains. Given that the use of a reference genome can introduce significant biases, de novo approaches are most suitable for this task. So far, reference-genome-independent assemblers have been shown to reconstruct haplotypes for mixed samples of limited complexity and genomes not exceeding 10000 bp in length. Here, we present VG-Flow, a de novo approach that enables full-length haplotype reconstruction from pre-assembled contigs of complex mixed samples. Our method increases contiguity of the input assembly and, at the same time, it performs haplotype abundance estimation. VG-Flow is the first approach to require polynomial, and not exponential runtime in terms of the underlying graphs. Since runtime increases only linearly in the length of the genomes in practice, it enables the reconstruction also of genomes that are longer by orders of magnitude, thereby establishing the first de novo solution to strain-aware full-length genome assembly applicable to bacterial sized genomes. VG-Flow is based on the flow variation graph as a novel concept that both captures all diversity present in the sample and enables to cast the central contig abundance estimation problem as a flow-like, polynomial time solvable optimization problem. As a consequence, we are in position to compute maximal-length haplotypes in terms of decomposing the resulting flow efficiently using a greedy algorithm, and obtain accurate frequency estimates for the reconstructed haplotypes through linear programming techniques. Benchmarking experiments show that our method outperforms state-of-the-art approaches on mixed samples from short genomes in terms of assembly accuracy as well as abundance estimation. Experiments on longer, bacterial sized genomes demonstrate that VG-Flow is the only current approach that can reconstruct full-length haplotypes from mixed samples at the strain level in human-affordable runtime." @default.
- W3098179732 created "2020-11-23" @default.
- W3098179732 creator A5000303828 @default.
- W3098179732 creator A5001148638 @default.
- W3098179732 creator A5044184077 @default.
- W3098179732 date "2019-05-24" @default.
- W3098179732 modified "2023-10-02" @default.
- W3098179732 title "Strain-aware assembly of genomes from mixed samples using flow variation graphs" @default.
- W3098179732 cites W1632601927 @default.
- W3098179732 cites W1993577356 @default.
- W3098179732 cites W2003574450 @default.
- W3098179732 cites W2010002522 @default.
- W3098179732 cites W2063797600 @default.
- W3098179732 cites W2065256070 @default.
- W3098179732 cites W2068916031 @default.
- W3098179732 cites W2073918065 @default.
- W3098179732 cites W2104005232 @default.
- W3098179732 cites W2107772251 @default.
- W3098179732 cites W2113679889 @default.
- W3098179732 cites W2120902911 @default.
- W3098179732 cites W2133459862 @default.
- W3098179732 cites W2134007392 @default.
- W3098179732 cites W2141458291 @default.
- W3098179732 cites W2142113273 @default.
- W3098179732 cites W2153918414 @default.
- W3098179732 cites W2163326395 @default.
- W3098179732 cites W2163903604 @default.
- W3098179732 cites W2164946254 @default.
- W3098179732 cites W2167377888 @default.
- W3098179732 cites W2236822143 @default.
- W3098179732 cites W2267659990 @default.
- W3098179732 cites W2323326409 @default.
- W3098179732 cites W2599417231 @default.
- W3098179732 cites W2603734366 @default.
- W3098179732 cites W2735337572 @default.
- W3098179732 cites W2736047874 @default.
- W3098179732 cites W2773939681 @default.
- W3098179732 cites W2789208418 @default.
- W3098179732 cites W2888300707 @default.
- W3098179732 cites W2949491511 @default.
- W3098179732 cites W2950774292 @default.
- W3098179732 cites W2952626522 @default.
- W3098179732 cites W3094571539 @default.
- W3098179732 cites W3106423944 @default.
- W3098179732 doi "https://doi.org/10.1101/645721" @default.
- W3098179732 hasPublicationYear "2019" @default.
- W3098179732 type Work @default.
- W3098179732 sameAs 3098179732 @default.
- W3098179732 citedByCount "11" @default.
- W3098179732 countsByYear W30981797322017 @default.
- W3098179732 countsByYear W30981797322018 @default.
- W3098179732 countsByYear W30981797322019 @default.
- W3098179732 countsByYear W30981797322020 @default.
- W3098179732 countsByYear W30981797322021 @default.
- W3098179732 countsByYear W30981797322022 @default.
- W3098179732 countsByYear W30981797322023 @default.
- W3098179732 crossrefType "posted-content" @default.
- W3098179732 hasAuthorship W3098179732A5000303828 @default.
- W3098179732 hasAuthorship W3098179732A5001148638 @default.
- W3098179732 hasAuthorship W3098179732A5044184077 @default.
- W3098179732 hasBestOaLocation W30981797321 @default.
- W3098179732 hasConcept C104317684 @default.
- W3098179732 hasConcept C105702510 @default.
- W3098179732 hasConcept C11413529 @default.
- W3098179732 hasConcept C135763542 @default.
- W3098179732 hasConcept C141231307 @default.
- W3098179732 hasConcept C150194340 @default.
- W3098179732 hasConcept C154945302 @default.
- W3098179732 hasConcept C162317418 @default.
- W3098179732 hasConcept C173801870 @default.
- W3098179732 hasConcept C18949551 @default.
- W3098179732 hasConcept C197754878 @default.
- W3098179732 hasConcept C2778022156 @default.
- W3098179732 hasConcept C41008148 @default.
- W3098179732 hasConcept C54355233 @default.
- W3098179732 hasConcept C59582021 @default.
- W3098179732 hasConcept C70721500 @default.
- W3098179732 hasConcept C86803240 @default.
- W3098179732 hasConceptScore W3098179732C104317684 @default.
- W3098179732 hasConceptScore W3098179732C105702510 @default.
- W3098179732 hasConceptScore W3098179732C11413529 @default.
- W3098179732 hasConceptScore W3098179732C135763542 @default.
- W3098179732 hasConceptScore W3098179732C141231307 @default.
- W3098179732 hasConceptScore W3098179732C150194340 @default.
- W3098179732 hasConceptScore W3098179732C154945302 @default.
- W3098179732 hasConceptScore W3098179732C162317418 @default.
- W3098179732 hasConceptScore W3098179732C173801870 @default.
- W3098179732 hasConceptScore W3098179732C18949551 @default.
- W3098179732 hasConceptScore W3098179732C197754878 @default.
- W3098179732 hasConceptScore W3098179732C2778022156 @default.
- W3098179732 hasConceptScore W3098179732C41008148 @default.
- W3098179732 hasConceptScore W3098179732C54355233 @default.
- W3098179732 hasConceptScore W3098179732C59582021 @default.
- W3098179732 hasConceptScore W3098179732C70721500 @default.
- W3098179732 hasConceptScore W3098179732C86803240 @default.
- W3098179732 hasLocation W30981797321 @default.
- W3098179732 hasLocation W30981797322 @default.
- W3098179732 hasLocation W30981797323 @default.
- W3098179732 hasLocation W30981797324 @default.
- W3098179732 hasLocation W30981797325 @default.