Matches in SemOpenAlex for { <https://semopenalex.org/work/W3157490637> ?p ?o ?g. }
- W3157490637 abstract "Owing to technological advances in ancient DNA, it is now possible to sequence viruses from the past to track down their origin and evolution. However, ancient DNA data is considerably more degraded and contaminated than modern data making the identification of ancient viral genomes particularly challenging. Several methods to characterise the modern microbiome (and, within this, the virome) have been developed. Many of them assign sequenced reads to specific taxa to characterise the organisms present in a sample of interest. While these existing tools are routinely used in modern data, their performance when applied to ancient virome data remains unknown. In this work, we conduct an extensive simulation study using public viral sequences to establish which tool is the most suitable for ancient virome studies. We compare the performance of four widely used classifiers, namely Centrifuge, Kraken2, DIAMOND and MetaPhlAn2, in correctly assigning sequencing reads to the corresponding viruses. To do so, we simulate reads by adding noise typical of ancient DNA to a randomly chosen set of publicly available viral sequences and to the human genome. We fragment the DNA into different lengths, add sequencing error and C to T and G to A deamination substitutions at the read termini. Then we measure the resulting precision and sensitivity for all classifiers. Across most simulations, 119 out of the 120 simulated viruses are recovered by Centrifuge, Kraken2 and DIAMOND in contrast to MetaPhlAn2 which recovers only around one third. While deamination damage has little impact on the performance of the classifiers, DIAMOND and Kraken2 cannot classify very short reads. For data with longer fragments, if precision is strongly favoured over sensitivity, DIAMOND performs best. However, since Centrifuge can handle short reads and since it achieves the highest sensitivity and precision at the species level, it is our recommended tool overall. Regardless of the tool used, our simulations indicate that, for ancient human studies, users should use strict filters to remove all reads of potential human origin. Finally, if the goal is to detect a specific virus, given the high variability observed among tested viral sequences, a simulation study to determine if a given tool can recover the virus of interest should be conducted prior to analysing real data." @default.
- W3157490637 created "2021-05-10" @default.
- W3157490637 creator A5003077340 @default.
- W3157490637 creator A5064314813 @default.
- W3157490637 creator A5086775525 @default.
- W3157490637 date "2021-04-30" @default.
- W3157490637 modified "2023-10-16" @default.
- W3157490637 title "Benchmarking metagenomics classifiers on ancient viral DNA: a simulation study" @default.
- W3157490637 cites W1703384511 @default.
- W3157490637 cites W1760976735 @default.
- W3157490637 cites W1973231394 @default.
- W3157490637 cites W2022327489 @default.
- W3157490637 cites W2024058880 @default.
- W3157490637 cites W2045204781 @default.
- W3157490637 cites W2045934762 @default.
- W3157490637 cites W2080585492 @default.
- W3157490637 cites W2108982334 @default.
- W3157490637 cites W2116895571 @default.
- W3157490637 cites W2124031503 @default.
- W3157490637 cites W2126557729 @default.
- W3157490637 cites W2131026689 @default.
- W3157490637 cites W2156330112 @default.
- W3157490637 cites W2157539385 @default.
- W3157490637 cites W2158322625 @default.
- W3157490637 cites W2159954944 @default.
- W3157490637 cites W2161488606 @default.
- W3157490637 cites W2170551349 @default.
- W3157490637 cites W2173732482 @default.
- W3157490637 cites W2542611942 @default.
- W3157490637 cites W2544029679 @default.
- W3157490637 cites W2560352004 @default.
- W3157490637 cites W2609869963 @default.
- W3157490637 cites W2747773696 @default.
- W3157490637 cites W2793484499 @default.
- W3157490637 cites W2801745197 @default.
- W3157490637 cites W2810852747 @default.
- W3157490637 cites W2884441543 @default.
- W3157490637 cites W2896061277 @default.
- W3157490637 cites W2902592841 @default.
- W3157490637 cites W2951160681 @default.
- W3157490637 cites W2952539125 @default.
- W3157490637 cites W2954021593 @default.
- W3157490637 cites W2990249310 @default.
- W3157490637 cites W2990618091 @default.
- W3157490637 doi "https://doi.org/10.1101/2021.04.30.442132" @default.
- W3157490637 hasPublicationYear "2021" @default.
- W3157490637 type Work @default.
- W3157490637 sameAs 3157490637 @default.
- W3157490637 citedByCount "3" @default.
- W3157490637 countsByYear W31574906372022 @default.
- W3157490637 crossrefType "posted-content" @default.
- W3157490637 hasAuthorship W3157490637A5003077340 @default.
- W3157490637 hasAuthorship W3157490637A5064314813 @default.
- W3157490637 hasAuthorship W3157490637A5086775525 @default.
- W3157490637 hasBestOaLocation W31574906371 @default.
- W3157490637 hasConcept C104317684 @default.
- W3157490637 hasConcept C116834253 @default.
- W3157490637 hasConcept C132917006 @default.
- W3157490637 hasConcept C141231307 @default.
- W3157490637 hasConcept C144024400 @default.
- W3157490637 hasConcept C149923435 @default.
- W3157490637 hasConcept C15151743 @default.
- W3157490637 hasConcept C189338579 @default.
- W3157490637 hasConcept C190743605 @default.
- W3157490637 hasConcept C2279292 @default.
- W3157490637 hasConcept C2908647359 @default.
- W3157490637 hasConcept C51679486 @default.
- W3157490637 hasConcept C54355233 @default.
- W3157490637 hasConcept C552990157 @default.
- W3157490637 hasConcept C59822182 @default.
- W3157490637 hasConcept C70721500 @default.
- W3157490637 hasConcept C86803240 @default.
- W3157490637 hasConcept C8769409 @default.
- W3157490637 hasConceptScore W3157490637C104317684 @default.
- W3157490637 hasConceptScore W3157490637C116834253 @default.
- W3157490637 hasConceptScore W3157490637C132917006 @default.
- W3157490637 hasConceptScore W3157490637C141231307 @default.
- W3157490637 hasConceptScore W3157490637C144024400 @default.
- W3157490637 hasConceptScore W3157490637C149923435 @default.
- W3157490637 hasConceptScore W3157490637C15151743 @default.
- W3157490637 hasConceptScore W3157490637C189338579 @default.
- W3157490637 hasConceptScore W3157490637C190743605 @default.
- W3157490637 hasConceptScore W3157490637C2279292 @default.
- W3157490637 hasConceptScore W3157490637C2908647359 @default.
- W3157490637 hasConceptScore W3157490637C51679486 @default.
- W3157490637 hasConceptScore W3157490637C54355233 @default.
- W3157490637 hasConceptScore W3157490637C552990157 @default.
- W3157490637 hasConceptScore W3157490637C59822182 @default.
- W3157490637 hasConceptScore W3157490637C70721500 @default.
- W3157490637 hasConceptScore W3157490637C86803240 @default.
- W3157490637 hasConceptScore W3157490637C8769409 @default.
- W3157490637 hasLocation W31574906371 @default.
- W3157490637 hasLocation W31574906372 @default.
- W3157490637 hasOpenAccess W3157490637 @default.
- W3157490637 hasPrimaryLocation W31574906371 @default.
- W3157490637 hasRelatedWork W2014982185 @default.
- W3157490637 hasRelatedWork W2935685975 @default.
- W3157490637 hasRelatedWork W2984906837 @default.
- W3157490637 hasRelatedWork W3023436389 @default.
- W3157490637 hasRelatedWork W3157490637 @default.