Matches in SemOpenAlex for { <https://semopenalex.org/work/W4322008640> ?p ?o ?g. }
Showing items 1 to 83 of 83 with 100 items per page.
- W4322008640 abstract "Motivation: Huge data sets containing whole-genome sequences of bacterial strains are now commonplace and represent a rich and important resource for modern genomic epidemiology and metagenomics. To make efficient use of these data sets, indexing data structures that are both scalable and provide rapid query throughput are paramount. Results: Here, we present Themisto, a scalable colored k-mer index designed for large collections of microbial reference genomes, that works for both short and long read data. Themisto indexes 179 thousand Salmonella enterica genomes in 9 hours. The resulting index takes 142 gigabytes. In comparison, the best competing tools Metagraph and Bifrost were only able to index 11 thousand genomes in the same time. In pseudoalignment, these other tools were either an order of magnitude slower than Themisto, or used an order of magnitude more memory. Themisto also offers superior pseudoalignment quality, achieving a higher recall than previous methods on Nanopore read sets. Availability and implementation: Themisto is available and documented as a C++ package at https://github.com/algbio/themisto under the GPLv2 license. Contact: jarno.alanko@helsinki.fi Supplementary information: Supplementary data are available at Bioinformatics online." @default.
- W4322008640 created "2023-02-26" @default.
- W4322008640 creator A5021097696 @default.
- W4322008640 creator A5037583532 @default.
- W4322008640 creator A5067903348 @default.
- W4322008640 creator A5071986862 @default.
- W4322008640 date "2023-02-24" @default.
- W4322008640 modified "2023-10-01" @default.
- W4322008640 title "Themisto: a scalable colored k-mer index for sensitive pseudoalignment against hundreds of thousands of bacterial genomes" @default.
- W4322008640 cites W2144560237 @default.
- W4322008640 cites W2323326409 @default.
- W4322008640 cites W2912288210 @default.
- W4322008640 cites W2949087252 @default.
- W4322008640 cites W2950417030 @default.
- W4322008640 cites W2952379095 @default.
- W4322008640 cites W2963629592 @default.
- W4322008640 cites W2998284004 @default.
- W4322008640 cites W3007172120 @default.
- W4322008640 cites W3090493712 @default.
- W4322008640 cites W3118763530 @default.
- W4322008640 cites W3161732531 @default.
- W4322008640 cites W3207938624 @default.
- W4322008640 cites W3210751207 @default.
- W4322008640 cites W3210926305 @default.
- W4322008640 cites W3212599762 @default.
- W4322008640 cites W4226101760 @default.
- W4322008640 cites W4243730337 @default.
- W4322008640 cites W4281288662 @default.
- W4322008640 cites W4304084232 @default.
- W4322008640 cites W4307607783 @default.
- W4322008640 cites W4310699959 @default.
- W4322008640 cites W6247929 @default.
- W4322008640 doi "https://doi.org/10.1101/2023.02.24.529942" @default.
- W4322008640 hasPublicationYear "2023" @default.
- W4322008640 type Work @default.
- W4322008640 citedByCount "3" @default.
- W4322008640 countsByYear W43220086402023 @default.
- W4322008640 crossrefType "posted-content" @default.
- W4322008640 hasAuthorship W4322008640A5021097696 @default.
- W4322008640 hasAuthorship W4322008640A5037583532 @default.
- W4322008640 hasAuthorship W4322008640A5067903348 @default.
- W4322008640 hasAuthorship W4322008640A5071986862 @default.
- W4322008640 hasBestOaLocation W43220086401 @default.
- W4322008640 hasConcept C104317684 @default.
- W4322008640 hasConcept C124101348 @default.
- W4322008640 hasConcept C136764020 @default.
- W4322008640 hasConcept C141231307 @default.
- W4322008640 hasConcept C15151743 @default.
- W4322008640 hasConcept C2777382242 @default.
- W4322008640 hasConcept C41008148 @default.
- W4322008640 hasConcept C48044578 @default.
- W4322008640 hasConcept C54355233 @default.
- W4322008640 hasConcept C70721500 @default.
- W4322008640 hasConcept C77088390 @default.
- W4322008640 hasConcept C86803240 @default.
- W4322008640 hasConceptScore W4322008640C104317684 @default.
- W4322008640 hasConceptScore W4322008640C124101348 @default.
- W4322008640 hasConceptScore W4322008640C136764020 @default.
- W4322008640 hasConceptScore W4322008640C141231307 @default.
- W4322008640 hasConceptScore W4322008640C15151743 @default.
- W4322008640 hasConceptScore W4322008640C2777382242 @default.
- W4322008640 hasConceptScore W4322008640C41008148 @default.
- W4322008640 hasConceptScore W4322008640C48044578 @default.
- W4322008640 hasConceptScore W4322008640C54355233 @default.
- W4322008640 hasConceptScore W4322008640C70721500 @default.
- W4322008640 hasConceptScore W4322008640C77088390 @default.
- W4322008640 hasConceptScore W4322008640C86803240 @default.
- W4322008640 hasLocation W43220086401 @default.
- W4322008640 hasOpenAccess W4322008640 @default.
- W4322008640 hasPrimaryLocation W43220086401 @default.
- W4322008640 hasRelatedWork W2001037326 @default.
- W4322008640 hasRelatedWork W2001561488 @default.
- W4322008640 hasRelatedWork W2127106003 @default.
- W4322008640 hasRelatedWork W2340204314 @default.
- W4322008640 hasRelatedWork W3047585652 @default.
- W4322008640 hasRelatedWork W3090971826 @default.
- W4322008640 hasRelatedWork W3128872455 @default.
- W4322008640 hasRelatedWork W4224273931 @default.
- W4322008640 hasRelatedWork W4247428816 @default.
- W4322008640 hasRelatedWork W4313598112 @default.
- W4322008640 isParatext "false" @default.
- W4322008640 isRetracted "false" @default.
- W4322008640 workType "article" @default.