Matches in SemOpenAlex for { <https://semopenalex.org/work/W2006291299> ?p ?o ?g. }
Showing items 1 to 78 of
78
with 100 items per page.
- W2006291299 endingPage "137" @default.
- W2006291299 startingPage "135" @default.
- W2006291299 abstract "<h3>Abstract</h3> Much of the DNA and RNA sequencing data available is in the form of high-throughput sequencing (HTS) reads and is currently unindexed by established sequence search databases. Recent succinct data structures for indexing both reference sequences and HTS data, along with associated metadata, have been based on either hashing or graph models, but many of these structures are static in nature, and thus, not well-suited as backends for dynamic databases. We propose a parallel construction method for and novel application of the <i>wavelet trie</i> as a dynamic data structure for compressing and indexing graph metadata. By developing an algorithm for merging wavelet tries, we are able to construct large tries in parallel by merging smaller tries constructed concurrently from batches of data. When compared against general compression algorithms and those developed specifically for graph colors (VARI and Rainbowfish), our method achieves compression ratios superior to gzip and VARI, converging to compression ratios of 6.5% to 2% on data sets constructed from over 600 virus genomes. While marginally worse than compression by bzip2 or Rainbowfish, this structure allows for both fast extension and query. We also found that additionally encoding graph topology metadata improved compression ratios, particularly on data sets consisting of several mutually-exclusive reference genomes. It was also observed that the compression ratio of wavelet tries grew sublinearly with the density of the annotation matrices. This work is a significant step towards implementing a dynamic data structure for indexing large annotated sequence data sets that supports fast query and update operations. At the time of writing, no established standard tool has filled this niche." @default.
- W2006291299 created "2016-06-24" @default.
- W2006291299 creator A5074686451 @default.
- W2006291299 date "2008-01-15" @default.
- W2006291299 modified "2023-10-14" @default.
- W2006291299 title "Delivering health care on US$19 per capita" @default.
- W2006291299 doi "https://doi.org/10.1503/cmaj.071795" @default.
- W2006291299 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/2174986" @default.
- W2006291299 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/18195276" @default.
- W2006291299 hasPublicationYear "2008" @default.
- W2006291299 type Work @default.
- W2006291299 sameAs 2006291299 @default.
- W2006291299 citedByCount "5" @default.
- W2006291299 crossrefType "journal-article" @default.
- W2006291299 hasAuthorship W2006291299A5074686451 @default.
- W2006291299 hasBestOaLocation W20062912991 @default.
- W2006291299 hasConcept C111919701 @default.
- W2006291299 hasConcept C11413529 @default.
- W2006291299 hasConcept C124101348 @default.
- W2006291299 hasConcept C127413603 @default.
- W2006291299 hasConcept C132525143 @default.
- W2006291299 hasConcept C154945302 @default.
- W2006291299 hasConcept C171146098 @default.
- W2006291299 hasConcept C23123220 @default.
- W2006291299 hasConcept C25797200 @default.
- W2006291299 hasConcept C2776321320 @default.
- W2006291299 hasConcept C38652104 @default.
- W2006291299 hasConcept C41008148 @default.
- W2006291299 hasConcept C511840579 @default.
- W2006291299 hasConcept C67388219 @default.
- W2006291299 hasConcept C75165309 @default.
- W2006291299 hasConcept C78548338 @default.
- W2006291299 hasConcept C80444323 @default.
- W2006291299 hasConcept C93518851 @default.
- W2006291299 hasConcept C99138194 @default.
- W2006291299 hasConceptScore W2006291299C111919701 @default.
- W2006291299 hasConceptScore W2006291299C11413529 @default.
- W2006291299 hasConceptScore W2006291299C124101348 @default.
- W2006291299 hasConceptScore W2006291299C127413603 @default.
- W2006291299 hasConceptScore W2006291299C132525143 @default.
- W2006291299 hasConceptScore W2006291299C154945302 @default.
- W2006291299 hasConceptScore W2006291299C171146098 @default.
- W2006291299 hasConceptScore W2006291299C23123220 @default.
- W2006291299 hasConceptScore W2006291299C25797200 @default.
- W2006291299 hasConceptScore W2006291299C2776321320 @default.
- W2006291299 hasConceptScore W2006291299C38652104 @default.
- W2006291299 hasConceptScore W2006291299C41008148 @default.
- W2006291299 hasConceptScore W2006291299C511840579 @default.
- W2006291299 hasConceptScore W2006291299C67388219 @default.
- W2006291299 hasConceptScore W2006291299C75165309 @default.
- W2006291299 hasConceptScore W2006291299C78548338 @default.
- W2006291299 hasConceptScore W2006291299C80444323 @default.
- W2006291299 hasConceptScore W2006291299C93518851 @default.
- W2006291299 hasConceptScore W2006291299C99138194 @default.
- W2006291299 hasIssue "2" @default.
- W2006291299 hasLocation W20062912991 @default.
- W2006291299 hasLocation W20062912992 @default.
- W2006291299 hasLocation W20062912993 @default.
- W2006291299 hasLocation W20062912994 @default.
- W2006291299 hasOpenAccess W2006291299 @default.
- W2006291299 hasPrimaryLocation W20062912991 @default.
- W2006291299 hasRelatedWork W1553682171 @default.
- W2006291299 hasRelatedWork W1566408939 @default.
- W2006291299 hasRelatedWork W2022953428 @default.
- W2006291299 hasRelatedWork W2141532373 @default.
- W2006291299 hasRelatedWork W2352031993 @default.
- W2006291299 hasRelatedWork W2357865405 @default.
- W2006291299 hasRelatedWork W2521251760 @default.
- W2006291299 hasRelatedWork W3102769546 @default.
- W2006291299 hasRelatedWork W1535698018 @default.
- W2006291299 hasRelatedWork W162007055 @default.
- W2006291299 hasVolume "178" @default.
- W2006291299 isParatext "false" @default.
- W2006291299 isRetracted "false" @default.
- W2006291299 magId "2006291299" @default.
- W2006291299 workType "article" @default.