Matches in SemOpenAlex for { <https://semopenalex.org/work/W3005014603> ?p ?o ?g. }
- W3005014603 abstract "Background: Accurately identifying single-nucleotide polymorphisms (SNPs) from bacterial sequencing data is an essential requirement for using genomics to track transmission and predict important phenotypes such as antimicrobial resistance. However, most previous performance evaluations of SNP calling have been restricted to eukaryotic (human) data. Additionally, bacterial SNP calling requires choosing an appropriate reference genome to align reads to, which, together with the bioinformatic pipeline, affects the accuracy and completeness of a set of SNP calls obtained. This study evaluates the performance of 209 SNP-calling pipelines using a combination of simulated data from 254 strains of 10 clinically common bacteria and real data from environmentally sourced and genomically diverse isolates within the genera Citrobacter, Enterobacter, Escherichia, and Klebsiella. Results: We evaluated the performance of 209 SNP-calling pipelines, aligning reads to genomes of the same or a divergent strain. Irrespective of pipeline, a principal determinant of reliable SNP calling was reference genome selection. Across multiple taxa, there was a strong inverse relationship between pipeline sensitivity and precision, and the Mash distance (a proxy for average nucleotide divergence) between reads and reference genome. The effect was especially pronounced for diverse, recombinogenic bacteria such as Escherichia coli but less dominant for clonal species such as Mycobacterium tuberculosis. Conclusions: The accuracy of SNP calling for a given species is compromised by increasing intra-species diversity. When reads were aligned to the same genome from which they were sequenced, among the highest-performing pipelines was Novoalign/GATK. By contrast, when reads were aligned to particularly divergent genomes, the highest-performing pipelines often used the aligners NextGenMap or SMALT, and/or the variant callers LoFreq, mpileup, or Strelka." @default.
- W3005014603 created "2020-02-14" @default.
- W3005014603 creator A5001331694 @default.
- W3005014603 creator A5021299815 @default.
- W3005014603 creator A5026480684 @default.
- W3005014603 creator A5028335853 @default.
- W3005014603 creator A5030621715 @default.
- W3005014603 creator A5031551214 @default.
- W3005014603 creator A5055049959 @default.
- W3005014603 creator A5064800233 @default.
- W3005014603 creator A5071728473 @default.
- W3005014603 creator A5075264383 @default.
- W3005014603 date "2020-02-01" @default.
- W3005014603 modified "2023-10-15" @default.
- W3005014603 title "Genomic diversity affects the accuracy of bacterial single-nucleotide polymorphism–calling pipelines" @default.
- W3005014603 cites W1566773348 @default.
- W3005014603 cites W1578830280 @default.
- W3005014603 cites W1579111454 @default.
- W3005014603 cites W1711571047 @default.
- W3005014603 cites W1882576502 @default.
- W3005014603 cites W1959574505 @default.
- W3005014603 cites W1964477845 @default.
- W3005014603 cites W1964807436 @default.
- W3005014603 cites W1966966274 @default.
- W3005014603 cites W1968828904 @default.
- W3005014603 cites W1978813754 @default.
- W3005014603 cites W1981394448 @default.
- W3005014603 cites W1982579644 @default.
- W3005014603 cites W1982855075 @default.
- W3005014603 cites W1984174967 @default.
- W3005014603 cites W1988282697 @default.
- W3005014603 cites W2005271647 @default.
- W3005014603 cites W2007890464 @default.
- W3005014603 cites W2021724381 @default.
- W3005014603 cites W2022200628 @default.
- W3005014603 cites W2025648894 @default.
- W3005014603 cites W2029657346 @default.
- W3005014603 cites W2033167804 @default.
- W3005014603 cites W2050109119 @default.
- W3005014603 cites W2060965924 @default.
- W3005014603 cites W2075429769 @default.
- W3005014603 cites W2076124986 @default.
- W3005014603 cites W2076359272 @default.
- W3005014603 cites W2083870688 @default.
- W3005014603 cites W2093931624 @default.
- W3005014603 cites W2094614987 @default.
- W3005014603 cites W2095763520 @default.
- W3005014603 cites W2096094352 @default.
- W3005014603 cites W2101793321 @default.
- W3005014603 cites W2102278945 @default.
- W3005014603 cites W2103441770 @default.
- W3005014603 cites W2104240120 @default.
- W3005014603 cites W2108234281 @default.
- W3005014603 cites W2118442768 @default.
- W3005014603 cites W2119180969 @default.
- W3005014603 cites W2122673596 @default.
- W3005014603 cites W2124465358 @default.
- W3005014603 cites W2125418992 @default.
- W3005014603 cites W2125840395 @default.
- W3005014603 cites W2128524432 @default.
- W3005014603 cites W2129714591 @default.
- W3005014603 cites W2133212095 @default.
- W3005014603 cites W2140067143 @default.
- W3005014603 cites W2143007385 @default.
- W3005014603 cites W2146290346 @default.
- W3005014603 cites W2149753281 @default.
- W3005014603 cites W2152956782 @default.
- W3005014603 cites W2154468535 @default.
- W3005014603 cites W2158336776 @default.
- W3005014603 cites W2159954944 @default.
- W3005014603 cites W2161085554 @default.
- W3005014603 cites W2161815151 @default.
- W3005014603 cites W2166694026 @default.
- W3005014603 cites W2168133698 @default.
- W3005014603 cites W2168358002 @default.
- W3005014603 cites W2170551349 @default.
- W3005014603 cites W2171203723 @default.
- W3005014603 cites W2173732482 @default.
- W3005014603 cites W2190569576 @default.
- W3005014603 cites W2236299009 @default.
- W3005014603 cites W2273187858 @default.
- W3005014603 cites W2311203695 @default.
- W3005014603 cites W2337747100 @default.
- W3005014603 cites W2341468196 @default.
- W3005014603 cites W2464112039 @default.
- W3005014603 cites W2491124495 @default.
- W3005014603 cites W2509491391 @default.
- W3005014603 cites W2526975281 @default.
- W3005014603 cites W2536199001 @default.
- W3005014603 cites W2586283250 @default.
- W3005014603 cites W2590415818 @default.
- W3005014603 cites W2596937542 @default.
- W3005014603 cites W2600407737 @default.
- W3005014603 cites W2616026176 @default.
- W3005014603 cites W2727237484 @default.
- W3005014603 cites W2734764141 @default.
- W3005014603 cites W2774962890 @default.
- W3005014603 cites W2784788330 @default.
- W3005014603 cites W2789843538 @default.
- W3005014603 cites W2889664156 @default.