Matches in SemOpenAlex for { <https://semopenalex.org/work/W2062610273> ?p ?o ?g. }
- W2062610273 abstract "Deep shotgun sequencing on next generation sequencing (NGS) platforms has contributed significant amounts of data to enrich our understanding of genomes, transcriptomes, amplified single-cell genomes, and metagenomes. However, deep coverage variations in short-read data sets and high sequencing error rates of modern sequencers present new computational challenges in data interpretation, including mapping and de novo assembly. New lab techniques such as multiple displacement amplification (MDA) of single cells and sequence independent single primer amplification (SISPA) allow for sequencing of organisms that cannot be cultured, but generate highly variable coverage due to amplification biases. Here we introduce NeatFreq, a software tool that reduces a data set to more uniform coverage by clustering and selecting from reads binned by their median kmer frequency (RMKF) and uniqueness. Previous algorithms normalize read coverage based on RMKF, but do not include methods for the preferred selection of (1) extremely low coverage regions produced by extremely variable sequencing of random-primed products and (2) 2-sided paired-end sequences. The algorithm increases the incorporation of the most unique, lowest coverage, segments of a genome using an error-corrected data set. NeatFreq was applied to bacterial, viral plaque, and single-cell sequencing data. The algorithm showed an increase in the rate at which the most unique reads in a genome were included in the assembled consensus while also reducing the count of duplicative and erroneous contigs (strings of high confidence overlaps) in the deliverable consensus. The results obtained from conventional Overlap-Layout-Consensus (OLC) were compared to simulated multi-de Bruijn graph assembly alternatives trained for variable coverage input using sequence before and after normalization of coverage.
Coverage reduction was shown to increase processing speed and reduce memory requirements when using conventional bacterial assembly algorithms. The normalization of deep coverage spikes, which would otherwise inhibit consensus resolution, enables High Throughput Sequencing (HTS) assembly projects to consistently run to completion with existing assembly software. The NeatFreq software package is free, open source and available at https://github.com/bioh4x/NeatFreq ." @default.
- W2062610273 created "2016-06-24" @default.
- W2062610273 creator A5041386145 @default.
- W2062610273 creator A5046196541 @default.
- W2062610273 creator A5061018682 @default.
- W2062610273 creator A5064828949 @default.
- W2062610273 creator A5076389214 @default.
- W2062610273 creator A5089982542 @default.
- W2062610273 date "2014-11-19" @default.
- W2062610273 modified "2023-10-07" @default.
- W2062610273 title "NeatFreq: reference-free data reduction and coverage normalization for De Novo sequence assembly" @default.
- W2062610273 cites W1578516052 @default.
- W2062610273 cites W1968762427 @default.
- W2062610273 cites W1973862547 @default.
- W2062610273 cites W1979761600 @default.
- W2062610273 cites W1998663222 @default.
- W2062610273 cites W2009391851 @default.
- W2062610273 cites W2013935032 @default.
- W2062610273 cites W2036897871 @default.
- W2062610273 cites W2037444377 @default.
- W2062610273 cites W2041943305 @default.
- W2062610273 cites W2051397417 @default.
- W2062610273 cites W2091245185 @default.
- W2062610273 cites W2112515280 @default.
- W2062610273 cites W2113287691 @default.
- W2062610273 cites W2120902911 @default.
- W2062610273 cites W2122080645 @default.
- W2062610273 cites W2124451996 @default.
- W2062610273 cites W2127230663 @default.
- W2062610273 cites W2133956160 @default.
- W2062610273 cites W2141920662 @default.
- W2062610273 cites W2142749416 @default.
- W2062610273 cites W2146918217 @default.
- W2062610273 cites W2151017710 @default.
- W2062610273 cites W2156125289 @default.
- W2062610273 cites W2159591897 @default.
- W2062610273 cites W2169242924 @default.
- W2062610273 doi "https://doi.org/10.1186/s12859-014-0357-3" @default.
- W2062610273 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/4245761" @default.
- W2062610273 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/25407910" @default.
- W2062610273 hasPublicationYear "2014" @default.
- W2062610273 type Work @default.
- W2062610273 sameAs 2062610273 @default.
- W2062610273 citedByCount "16" @default.
- W2062610273 countsByYear W20626102732014 @default.
- W2062610273 countsByYear W20626102732015 @default.
- W2062610273 countsByYear W20626102732016 @default.
- W2062610273 countsByYear W20626102732017 @default.
- W2062610273 countsByYear W20626102732018 @default.
- W2062610273 countsByYear W20626102732019 @default.
- W2062610273 countsByYear W20626102732021 @default.
- W2062610273 countsByYear W20626102732022 @default.
- W2062610273 crossrefType "journal-article" @default.
- W2062610273 hasAuthorship W2062610273A5041386145 @default.
- W2062610273 hasAuthorship W2062610273A5046196541 @default.
- W2062610273 hasAuthorship W2062610273A5061018682 @default.
- W2062610273 hasAuthorship W2062610273A5064828949 @default.
- W2062610273 hasAuthorship W2062610273A5076389214 @default.
- W2062610273 hasAuthorship W2062610273A5089982542 @default.
- W2062610273 hasBestOaLocation W20626102731 @default.
- W2062610273 hasConcept C101985253 @default.
- W2062610273 hasConcept C104317684 @default.
- W2062610273 hasConcept C113425843 @default.
- W2062610273 hasConcept C11413529 @default.
- W2062610273 hasConcept C132525143 @default.
- W2062610273 hasConcept C132917006 @default.
- W2062610273 hasConcept C136886441 @default.
- W2062610273 hasConcept C141231307 @default.
- W2062610273 hasConcept C144024400 @default.
- W2062610273 hasConcept C150194340 @default.
- W2062610273 hasConcept C162317418 @default.
- W2062610273 hasConcept C16671776 @default.
- W2062610273 hasConcept C174749747 @default.
- W2062610273 hasConcept C18949551 @default.
- W2062610273 hasConcept C19165224 @default.
- W2062610273 hasConcept C192953774 @default.
- W2062610273 hasConcept C20218877 @default.
- W2062610273 hasConcept C41008148 @default.
- W2062610273 hasConcept C501734568 @default.
- W2062610273 hasConcept C51679486 @default.
- W2062610273 hasConcept C54355233 @default.
- W2062610273 hasConcept C59582021 @default.
- W2062610273 hasConcept C70721500 @default.
- W2062610273 hasConcept C80444323 @default.
- W2062610273 hasConcept C86803240 @default.
- W2062610273 hasConceptScore W2062610273C101985253 @default.
- W2062610273 hasConceptScore W2062610273C104317684 @default.
- W2062610273 hasConceptScore W2062610273C113425843 @default.
- W2062610273 hasConceptScore W2062610273C11413529 @default.
- W2062610273 hasConceptScore W2062610273C132525143 @default.
- W2062610273 hasConceptScore W2062610273C132917006 @default.
- W2062610273 hasConceptScore W2062610273C136886441 @default.
- W2062610273 hasConceptScore W2062610273C141231307 @default.
- W2062610273 hasConceptScore W2062610273C144024400 @default.
- W2062610273 hasConceptScore W2062610273C150194340 @default.
- W2062610273 hasConceptScore W2062610273C162317418 @default.
- W2062610273 hasConceptScore W2062610273C16671776 @default.
- W2062610273 hasConceptScore W2062610273C174749747 @default.
- W2062610273 hasConceptScore W2062610273C18949551 @default.
- W2062610273 hasConceptScore W2062610273C19165224 @default.