Matches in SemOpenAlex for { <https://semopenalex.org/work/W3126948783> ?p ?o ?g. }
- W3126948783 abstract "Abstract Shigella and enteroinvasive Escherichia coli (EIEC) cause human bacillary dysentery with similar invasion mechanisms and share similar physiological, biochemical and genetic characteristics. The ability to differentiate Shigella and EIEC from each other is important for clinical diagnostic and epidemiologic investigations. The existing genetic signatures may not discriminate between Shigella and EIEC. However, phylogenetically, Shigella and EIEC strains are composed of multiple clusters and are different forms of E. coli. In this study, we identified 10 Shigella clusters, 7 EIEC clusters and 53 sporadic types of EIEC by examining over 17,000 publicly available Shigella/EIEC genomes. We compared Shigella and EIEC accessory genomes to identify the cluster-specific gene markers or marker sets for the 17 clusters and 53 sporadic types. The gene markers showed 99.63% accuracy and more than 97.02% specificity. In addition, we developed a freely available in silico serotyping pipeline named Shigella EIEC Cluster Enhanced Serotype Finder (ShigEiFinder) by incorporating the cluster-specific gene markers and established Shigella/EIEC serotype specific O antigen genes and modification genes into typing. ShigEiFinder can process either paired end Illumina sequencing reads or assembled genomes and almost perfectly differentiated Shigella from EIEC with 99.70% and 99.81% cluster assignment accuracy for the assembled genomes and mapped reads respectively. ShigEiFinder was able to serotype over 59 Shigella serotypes and 22 EIEC serotypes and provided a high specificity with 99.40% for assembled genomes and 99.38% for mapped reads for serotyping. The cluster markers and our new serotyping tool, ShigEiFinder (https://github.com/LanLab/ShigEiFinder), will be useful for epidemiologic and diagnostic investigations. Impact statement The differentiation of Shigella strains from enteroinvasive E. coli (EIEC) is important for clinical diagnosis and public health epidemiologic investigations. The similarities between Shigella and EIEC strains make this differentiation very difficult as both share common ancestries within E. coli. However, Shigella and EIEC are phylogenetically separated into multiple clusters, making high resolution separation using cluster specific genomic markers possible. In this study, we identified 17 Shigella or EIEC clusters including five that were newly identified through examination of over 17,000 publicly available Shigella and EIEC genomes. We further identified an individual or a set of cluster-specific gene markers for each cluster using comparative genomic analysis. These markers can then be used to classify isolates into clusters and were used to develop an in silico pipeline, ShigEiFinder (https://github.com/LanLab/ShigEiFinder) for accurate differentiation, cluster typing and serotyping of Shigella and EIEC from Illumina sequencing reads or assembled genomes. This study will have broad application from understanding the evolution of Shigella/EIEC to diagnosis and epidemiology. Data summary Sequencing data have been deposited at the National Center for Biotechnology Information under BioProject number PRJNA692536. Repositories Raw sequence data are available from NCBI under the BioProject number PRJNA692536." @default.
- W3126948783 created "2021-02-15" @default.
- W3126948783 creator A5000119945 @default.
- W3126948783 creator A5017114509 @default.
- W3126948783 creator A5022098811 @default.
- W3126948783 creator A5028511579 @default.
- W3126948783 creator A5036382866 @default.
- W3126948783 date "2021-01-30" @default.
- W3126948783 modified "2023-10-17" @default.
- W3126948783 title "Cluster-specific gene markers enhance Shigella and Enteroinvasive Escherichia coli in silico serotyping" @default.
- W3126948783 cites W1490889904 @default.
- W3126948783 cites W1607161276 @default.
- W3126948783 cites W1709258400 @default.
- W3126948783 cites W1729424228 @default.
- W3126948783 cites W1814150976 @default.
- W3126948783 cites W1932244720 @default.
- W3126948783 cites W1955389919 @default.
- W3126948783 cites W1961504819 @default.
- W3126948783 cites W1963587327 @default.
- W3126948783 cites W1963626819 @default.
- W3126948783 cites W1978438390 @default.
- W3126948783 cites W1980042804 @default.
- W3126948783 cites W1980702664 @default.
- W3126948783 cites W2004548026 @default.
- W3126948783 cites W2008080212 @default.
- W3126948783 cites W2015624834 @default.
- W3126948783 cites W2033775119 @default.
- W3126948783 cites W2053940732 @default.
- W3126948783 cites W2054108638 @default.
- W3126948783 cites W2065419969 @default.
- W3126948783 cites W2080723443 @default.
- W3126948783 cites W2090874339 @default.
- W3126948783 cites W2096093282 @default.
- W3126948783 cites W2102619694 @default.
- W3126948783 cites W2105067592 @default.
- W3126948783 cites W2105517716 @default.
- W3126948783 cites W2107772251 @default.
- W3126948783 cites W2108234281 @default.
- W3126948783 cites W2113908179 @default.
- W3126948783 cites W2114053580 @default.
- W3126948783 cites W2114674713 @default.
- W3126948783 cites W2115687464 @default.
- W3126948783 cites W2117530634 @default.
- W3126948783 cites W2119129605 @default.
- W3126948783 cites W2120164127 @default.
- W3126948783 cites W2120902911 @default.
- W3126948783 cites W2122673596 @default.
- W3126948783 cites W2123227667 @default.
- W3126948783 cites W2123349167 @default.
- W3126948783 cites W2127519951 @default.
- W3126948783 cites W2133130415 @default.
- W3126948783 cites W2135512116 @default.
- W3126948783 cites W2142678478 @default.
- W3126948783 cites W2147337541 @default.
- W3126948783 cites W2150083824 @default.
- W3126948783 cites W2152792077 @default.
- W3126948783 cites W2159954944 @default.
- W3126948783 cites W2162567909 @default.
- W3126948783 cites W2164439056 @default.
- W3126948783 cites W2184405267 @default.
- W3126948783 cites W2287849783 @default.
- W3126948783 cites W2290281403 @default.
- W3126948783 cites W2315281034 @default.
- W3126948783 cites W2327924310 @default.
- W3126948783 cites W2418082614 @default.
- W3126948783 cites W2521607002 @default.
- W3126948783 cites W2527082466 @default.
- W3126948783 cites W2536091702 @default.
- W3126948783 cites W2547329187 @default.
- W3126948783 cites W2568548070 @default.
- W3126948783 cites W2592811885 @default.
- W3126948783 cites W2606524141 @default.
- W3126948783 cites W2620681788 @default.
- W3126948783 cites W2762072130 @default.
- W3126948783 cites W2768017947 @default.
- W3126948783 cites W2768967257 @default.
- W3126948783 cites W2796241996 @default.
- W3126948783 cites W2797364692 @default.
- W3126948783 cites W2810194623 @default.
- W3126948783 cites W2893080428 @default.
- W3126948783 cites W2917504095 @default.
- W3126948783 cites W2994826553 @default.
- W3126948783 doi "https://doi.org/10.1101/2021.01.30.428723" @default.
- W3126948783 hasPublicationYear "2021" @default.
- W3126948783 type Work @default.
- W3126948783 sameAs 3126948783 @default.
- W3126948783 citedByCount "2" @default.
- W3126948783 countsByYear W31269487832021 @default.
- W3126948783 crossrefType "posted-content" @default.
- W3126948783 hasAuthorship W3126948783A5000119945 @default.
- W3126948783 hasAuthorship W3126948783A5017114509 @default.
- W3126948783 hasAuthorship W3126948783A5022098811 @default.
- W3126948783 hasAuthorship W3126948783A5028511579 @default.
- W3126948783 hasAuthorship W3126948783A5036382866 @default.
- W3126948783 hasBestOaLocation W31269487831 @default.
- W3126948783 hasConcept C10389963 @default.
- W3126948783 hasConcept C104317684 @default.
- W3126948783 hasConcept C141231307 @default.
- W3126948783 hasConcept C2775905019 @default.
- W3126948783 hasConcept C2776986154 @default.