Matches in SemOpenAlex for { <https://semopenalex.org/work/W2894094021> ?p ?o ?g. }
- W2894094021 endingPage "3280" @default.
- W2894094021 startingPage "3273" @default.
- W2894094021 abstract "De Bruijn graphs are a common assembly data structure for sequencing datasets. But with the advances in sequencing technologies, assembling high coverage datasets has become a computational challenge. Read normalization, which removes redundancy in datasets, is widely applied to reduce resource requirements. Current normalization algorithms, though efficient, provide no guarantee to preserve important k-mers that form connections between regions in the graph.Here, normalization is phrased as a set multi-cover problem on reads and a heuristic algorithm, Optimized Read Normalization Algorithm (ORNA), is proposed. ORNA normalizes to the minimum number of reads required to retain all k-mers and their relative k-mer abundances from the original dataset. Hence, all connections from the original graph are preserved. ORNA was tested on various RNA-seq datasets with different coverage values. It was compared to the current normalization algorithms and was found to be performing better. Normalizing error corrected data allows for more accurate assemblies compared to the normalized uncorrected dataset. Further, an application is proposed in which multiple datasets are combined and normalized to predict novel transcripts that would have been missed otherwise. Finally, ORNA is a general purpose normalization algorithm that is fast and significantly reduces datasets with loss of assembly quality in between [1, 30]% depending on reduction stringency.ORNA is available at https://github.com/SchulzLab/ORNA.Supplementary data are available at Bioinformatics online." @default.
- W2894094021 created "2018-10-05" @default.
- W2894094021 creator A5080725960 @default.
- W2894094021 creator A5087263651 @default.
- W2894094021 date "2018-04-18" @default.
- W2894094021 modified "2023-10-14" @default.
- W2894094021 title "In silico read normalization using set multi-cover optimization" @default.
- W2894094021 cites W1954100204 @default.
- W2894094021 cites W1972924519 @default.
- W2894094021 cites W1993324779 @default.
- W2894094021 cites W2009735916 @default.
- W2894094021 cites W2011657487 @default.
- W2894094021 cites W2054841963 @default.
- W2894094021 cites W2057253402 @default.
- W2894094021 cites W2062610273 @default.
- W2894094021 cites W2074490119 @default.
- W2894094021 cites W2093456070 @default.
- W2894094021 cites W2096465161 @default.
- W2894094021 cites W2100441605 @default.
- W2894094021 cites W2104052851 @default.
- W2894094021 cites W2112888168 @default.
- W2894094021 cites W2126419817 @default.
- W2894094021 cites W2129949113 @default.
- W2894094021 cites W2136145671 @default.
- W2894094021 cites W2151462466 @default.
- W2894094021 cites W2152556446 @default.
- W2894094021 cites W2156841387 @default.
- W2894094021 cites W2163830511 @default.
- W2894094021 cites W2170747616 @default.
- W2894094021 cites W2198888083 @default.
- W2894094021 cites W2291017679 @default.
- W2894094021 cites W2344026279 @default.
- W2894094021 cites W2438121987 @default.
- W2894094021 cites W2592811885 @default.
- W2894094021 cites W2951899931 @default.
- W2894094021 cites W2952944477 @default.
- W2894094021 cites W4230266413 @default.
- W2894094021 cites W4233928114 @default.
- W2894094021 doi "https://doi.org/10.1093/bioinformatics/bty307" @default.
- W2894094021 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/6157080" @default.
- W2894094021 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/29912280" @default.
- W2894094021 hasPublicationYear "2018" @default.
- W2894094021 type Work @default.
- W2894094021 sameAs 2894094021 @default.
- W2894094021 citedByCount "6" @default.
- W2894094021 countsByYear W28940940212019 @default.
- W2894094021 countsByYear W28940940212020 @default.
- W2894094021 crossrefType "journal-article" @default.
- W2894094021 hasAuthorship W2894094021A5080725960 @default.
- W2894094021 hasAuthorship W2894094021A5087263651 @default.
- W2894094021 hasBestOaLocation W28940940211 @default.
- W2894094021 hasConcept C111919701 @default.
- W2894094021 hasConcept C11413529 @default.
- W2894094021 hasConcept C124101348 @default.
- W2894094021 hasConcept C132525143 @default.
- W2894094021 hasConcept C136886441 @default.
- W2894094021 hasConcept C144024400 @default.
- W2894094021 hasConcept C152124472 @default.
- W2894094021 hasConcept C153180895 @default.
- W2894094021 hasConcept C154945302 @default.
- W2894094021 hasConcept C162984825 @default.
- W2894094021 hasConcept C19165224 @default.
- W2894094021 hasConcept C20218877 @default.
- W2894094021 hasConcept C41008148 @default.
- W2894094021 hasConcept C80444323 @default.
- W2894094021 hasConceptScore W2894094021C111919701 @default.
- W2894094021 hasConceptScore W2894094021C11413529 @default.
- W2894094021 hasConceptScore W2894094021C124101348 @default.
- W2894094021 hasConceptScore W2894094021C132525143 @default.
- W2894094021 hasConceptScore W2894094021C136886441 @default.
- W2894094021 hasConceptScore W2894094021C144024400 @default.
- W2894094021 hasConceptScore W2894094021C152124472 @default.
- W2894094021 hasConceptScore W2894094021C153180895 @default.
- W2894094021 hasConceptScore W2894094021C154945302 @default.
- W2894094021 hasConceptScore W2894094021C162984825 @default.
- W2894094021 hasConceptScore W2894094021C19165224 @default.
- W2894094021 hasConceptScore W2894094021C20218877 @default.
- W2894094021 hasConceptScore W2894094021C41008148 @default.
- W2894094021 hasConceptScore W2894094021C80444323 @default.
- W2894094021 hasIssue "19" @default.
- W2894094021 hasLocation W28940940211 @default.
- W2894094021 hasLocation W28940940212 @default.
- W2894094021 hasLocation W28940940213 @default.
- W2894094021 hasLocation W28940940214 @default.
- W2894094021 hasLocation W28940940215 @default.
- W2894094021 hasLocation W28940940216 @default.
- W2894094021 hasOpenAccess W2894094021 @default.
- W2894094021 hasPrimaryLocation W28940940211 @default.
- W2894094021 hasRelatedWork W1558963043 @default.
- W2894094021 hasRelatedWork W1561336969 @default.
- W2894094021 hasRelatedWork W1566685310 @default.
- W2894094021 hasRelatedWork W2006956706 @default.
- W2894094021 hasRelatedWork W2054080489 @default.
- W2894094021 hasRelatedWork W2156481544 @default.
- W2894094021 hasRelatedWork W2183456362 @default.
- W2894094021 hasRelatedWork W2690313894 @default.
- W2894094021 hasRelatedWork W3097182452 @default.
- W2894094021 hasRelatedWork W4239030218 @default.