Matches in SemOpenAlex for { <https://semopenalex.org/work/W2161048357> ?p ?o ?g. }
- W2161048357 endingPage "i373" @default.
- W2161048357 startingPage "i367" @default.
- W2161048357 abstract "Sequence assembly is a difficult problem whose importance has grown again recently as the cost of sequencing has dramatically dropped. Most new sequence assembly software has started by building a de Bruijn graph, avoiding the overlap-based methods used previously because of the computational cost and complexity of these with very large numbers of short reads. Here, we show how to use suffix array-based methods that have formed the basis of recent very fast sequence mapping algorithms to find overlaps and generate assembly string graphs asymptotically faster than previously described algorithms. Standard overlap assembly methods have time complexity O(N^2), where N is the sum of the lengths of the reads. We use the Ferragina-Manzini index (FM-index) derived from the Burrows-Wheeler transform to find overlaps of length at least tau among a set of reads. As well as an approach that finds all overlaps then implements transitive reduction to produce a string graph, we show how to output directly only the irreducible overlaps, significantly shrinking memory requirements and reducing compute time to O(N), independent of depth. Overlap-based assembly methods naturally handle mixed length read sets, including capillary reads or long reads promised by the third generation sequencing technologies. The algorithms we present here pave the way for overlap-based assembly approaches to be developed that scale to whole vertebrate genome de novo assembly." @default.
- W2161048357 created "2016-06-24" @default.
- W2161048357 creator A5003935660 @default.
- W2161048357 creator A5082040351 @default.
- W2161048357 date "2010-06-06" @default.
- W2161048357 modified "2023-10-16" @default.
- W2161048357 title "Efficient construction of an assembly string graph using the FM-index" @default.
- W2161048357 cites W1966822396 @default.
- W2161048357 cites W1976682045 @default.
- W2161048357 cites W2023911006 @default.
- W2161048357 cites W2097341408 @default.
- W2161048357 cites W2103441770 @default.
- W2161048357 cites W2114133992 @default.
- W2161048357 cites W2118703123 @default.
- W2161048357 cites W2124985265 @default.
- W2161048357 cites W2136651963 @default.
- W2161048357 cites W2151017710 @default.
- W2161048357 cites W2156104322 @default.
- W2161048357 cites W2160969485 @default.
- W2161048357 cites W4247053599 @default.
- W2161048357 doi "https://doi.org/10.1093/bioinformatics/btq217" @default.
- W2161048357 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/2881401" @default.
- W2161048357 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/20529929" @default.
- W2161048357 hasPublicationYear "2010" @default.
- W2161048357 type Work @default.
- W2161048357 sameAs 2161048357 @default.
- W2161048357 citedByCount "226" @default.
- W2161048357 countsByYear W21610483572012 @default.
- W2161048357 countsByYear W21610483572013 @default.
- W2161048357 countsByYear W21610483572014 @default.
- W2161048357 countsByYear W21610483572015 @default.
- W2161048357 countsByYear W21610483572016 @default.
- W2161048357 countsByYear W21610483572017 @default.
- W2161048357 countsByYear W21610483572018 @default.
- W2161048357 countsByYear W21610483572019 @default.
- W2161048357 countsByYear W21610483572020 @default.
- W2161048357 countsByYear W21610483572021 @default.
- W2161048357 countsByYear W21610483572022 @default.
- W2161048357 countsByYear W21610483572023 @default.
- W2161048357 crossrefType "journal-article" @default.
- W2161048357 hasAuthorship W2161048357A5003935660 @default.
- W2161048357 hasAuthorship W2161048357A5082040351 @default.
- W2161048357 hasBestOaLocation W21610483571 @default.
- W2161048357 hasConcept C104317684 @default.
- W2161048357 hasConcept C11413529 @default.
- W2161048357 hasConcept C114614502 @default.
- W2161048357 hasConcept C132525143 @default.
- W2161048357 hasConcept C150194340 @default.
- W2161048357 hasConcept C157486923 @default.
- W2161048357 hasConcept C162317418 @default.
- W2161048357 hasConcept C162319229 @default.
- W2161048357 hasConcept C170320093 @default.
- W2161048357 hasConcept C177264268 @default.
- W2161048357 hasConcept C18949551 @default.
- W2161048357 hasConcept C199360897 @default.
- W2161048357 hasConcept C20218877 @default.
- W2161048357 hasConcept C2279292 @default.
- W2161048357 hasConcept C2777904410 @default.
- W2161048357 hasConcept C2778112365 @default.
- W2161048357 hasConcept C2781166958 @default.
- W2161048357 hasConcept C33923547 @default.
- W2161048357 hasConcept C37914503 @default.
- W2161048357 hasConcept C41008148 @default.
- W2161048357 hasConcept C51679486 @default.
- W2161048357 hasConcept C54355233 @default.
- W2161048357 hasConcept C552990157 @default.
- W2161048357 hasConcept C55493867 @default.
- W2161048357 hasConcept C80444323 @default.
- W2161048357 hasConcept C86803240 @default.
- W2161048357 hasConceptScore W2161048357C104317684 @default.
- W2161048357 hasConceptScore W2161048357C11413529 @default.
- W2161048357 hasConceptScore W2161048357C114614502 @default.
- W2161048357 hasConceptScore W2161048357C132525143 @default.
- W2161048357 hasConceptScore W2161048357C150194340 @default.
- W2161048357 hasConceptScore W2161048357C157486923 @default.
- W2161048357 hasConceptScore W2161048357C162317418 @default.
- W2161048357 hasConceptScore W2161048357C162319229 @default.
- W2161048357 hasConceptScore W2161048357C170320093 @default.
- W2161048357 hasConceptScore W2161048357C177264268 @default.
- W2161048357 hasConceptScore W2161048357C18949551 @default.
- W2161048357 hasConceptScore W2161048357C199360897 @default.
- W2161048357 hasConceptScore W2161048357C20218877 @default.
- W2161048357 hasConceptScore W2161048357C2279292 @default.
- W2161048357 hasConceptScore W2161048357C2777904410 @default.
- W2161048357 hasConceptScore W2161048357C2778112365 @default.
- W2161048357 hasConceptScore W2161048357C2781166958 @default.
- W2161048357 hasConceptScore W2161048357C33923547 @default.
- W2161048357 hasConceptScore W2161048357C37914503 @default.
- W2161048357 hasConceptScore W2161048357C41008148 @default.
- W2161048357 hasConceptScore W2161048357C51679486 @default.
- W2161048357 hasConceptScore W2161048357C54355233 @default.
- W2161048357 hasConceptScore W2161048357C552990157 @default.
- W2161048357 hasConceptScore W2161048357C55493867 @default.
- W2161048357 hasConceptScore W2161048357C80444323 @default.
- W2161048357 hasConceptScore W2161048357C86803240 @default.
- W2161048357 hasIssue "12" @default.
- W2161048357 hasLocation W21610483571 @default.
- W2161048357 hasLocation W21610483572 @default.
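
The abstract above describes using an FM-index to find overlaps of length at least tau among a set of reads. The following is a minimal sketch of that idea, not the paper's implementation: it builds the index naively by sorting suffixes (far slower than a real construction), reports every suffix-prefix match rather than only irreducible overlaps, and ignores reverse complements and containments. The function names `build_fm_index`, `extend`, and `find_overlaps`, and the use of `$` as a read separator, are assumptions made for illustration.

```python
def build_fm_index(reads):
    """Naive FM-index over reads, each prefixed by '$' (assumed absent from reads)."""
    text = "".join("$" + r for r in reads) + "\x00"  # "\x00" is the unique terminator
    sa = sorted(range(len(text)), key=lambda i: text[i:])  # naive suffix array
    bwt = [text[i - 1] for i in sa]  # character preceding each sorted suffix
    alphabet = sorted(set(text))
    # first_col[c]: number of characters in text strictly smaller than c
    first_col, total = {}, 0
    for c in alphabet:
        first_col[c] = total
        total += text.count(c)
    # occ[c][i]: number of occurrences of c in bwt[:i]
    occ = {c: [0] for c in alphabet}
    for ch in bwt:
        for c in alphabet:
            occ[c].append(occ[c][-1] + (ch == c))
    # starts: text position of each read's leading '$' -> read index
    starts, pos = {}, 0
    for j, r in enumerate(reads):
        starts[pos] = j
        pos += len(r) + 1
    return sa, first_col, occ, starts


def extend(interval, c, first_col, occ):
    """One backward-search step: interval for pattern P -> interval for c + P."""
    lo, hi = interval
    if c not in first_col:
        return (0, 0)
    return (first_col[c] + occ[c][lo], first_col[c] + occ[c][hi])


def find_overlaps(reads, tau):
    """Return sorted (x, y, length): a proper suffix of read x equals a prefix of read y."""
    sa, first_col, occ, starts = build_fm_index(reads)
    overlaps = []
    for x_id, x in enumerate(reads):
        interval = (0, len(sa))
        # Grow the matched suffix x[i:] one character at a time, right to left.
        for i in range(len(x) - 1, 0, -1):
            interval = extend(interval, x[i], first_col, occ)
            if interval[0] >= interval[1]:
                break
            length = len(x) - i
            if length < tau:
                continue
            # Prepending '$' keeps only occurrences that sit at the start of a read.
            lo, hi = extend(interval, "$", first_col, occ)
            for k in range(lo, hi):
                y_id = starts[sa[k]]
                if y_id != x_id:  # self-overlaps are skipped in this sketch
                    overlaps.append((x_id, y_id, length))
    return sorted(overlaps)


print(find_overlaps(["ACGTAC", "GTACGG", "ACGGTT"], 3))
```

Each read is searched once from its last character backwards, so the work per read is proportional to its length plus the number of overlaps reported; the direct output of irreducible overlaps mentioned in the abstract would need additional logic that this sketch does not attempt.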