Matches in SemOpenAlex for { <https://semopenalex.org/work/W4313305812> ?p ?o ?g. }
Showing items 1 to 71 of
71
with 100 items per page.
- W4313305812 endingPage "2397" @default.
- W4313305812 startingPage "2395" @default.
- W4313305812 abstract "Human gastrointestinal tract harbors trillions of commensal microbes including bacteria, viruses, and fungi to form an ecological community, commonly known as the gut microbiome. Being the second most dominant taxonomic kingdom in the gut (5.8% of total microbiome DNA), enteric viruses are increasingly acknowledged for their roles in shaping the overall composition of the gut microbiome and maintaining human health.[1,2] Recent advancements in metagenomic sequencing have brought breakthrough in deciphering the features and functions of the gut virome.[3] Notably, there are several factors restricting the application of this technology in virome study. For instance, the amount of isolated viral DNA from human samples is usually insufficient for metagenomic sequencing. To solve this issue, amplification techniques such as polymerase chain reaction and multiple displacement amplification are widely used to increase the amount of extracted viral DNA prior to sequencing, which in turn could introduce amplification bias, chimeric reads, and random mutations.[4] Meanwhile, given that most studies analyzed metagenomic reads at low depth (from 0.0003 GB to 7.7 GB),[5,6] there is currently lack of high quality assembled viral genomes, as well as failure in identifying viruses with low abundance in samples. In addition, Illumina sequencing that generates short reads, has been frequently used. However, short sequencing reads were reported to be inadequate for accurate assembly of viral genomes as they contain hypervariable sequences and repeating regions.[7] All these issues have greatly obstructed virome investigations, as evidenced by the high portion (75–95%) of viral metagenomic reads from the human gut being unclassified.[8,9] A plausible solution is to conduct deep metagenomic sequencing as well as long-read sequencing (Pacific Biosciences [PacBio] or Oxford Nanopore), which could be an important complement to short-read sequencing.[1] The team in the Chinese University of Hong Kong previously reported the alteration of gut virome in patients with colorectal cancer (CRC),[10] while it is also of importance to investigate the complete profile of the gut virome in nondisease conditions. To this end, the study by Zhao et al[11], published in June 2022 online issue of Gastroenterology, has performed ultra-deep amplification-free metagenomic sequencing on fecal samples of healthy individuals. First and foremost, sufficient amount of viral DNA is necessary for sequencing at such depth; hence, Zhao et al adopted and modified extraction protocols from multiple previous studies. For example, multi-steps filtration and lysozymes were used to hydrolyze bacterial cell walls and eliminate as many bacteria as possible.[12] To obtain viral DNA, viral capsids need to be disrupted and lysed which is commonly done by the treatment of phenol/chloroform/isoamyl alcohol.[13] However, the extraction yield of this method has been unsatisfactory due to the low efficiency in DNA lysis and precipitation. In comparison, previous studies reported that column-based DNA purification has greater recovery efficiency than the conventional lysis method.[14] Zhao et al therefore applied this column-based approach, which has a single lysis step with more lenient experimental conditions, and succeeded to extract an increased amount of viral DNA compared to previous protocols. Together, these modifications including differential lysis, depletion of cell-free DNA, and multiple-time filtration were capable of improving the extraction yield and depleting nonviral DNA in human fecal samples, thereby facilitating amplification-free library preparation prior to metagenomic sequencing. To comprehensively evaluate the gut virome, Zhao et al[11] parallelly performed Illumina and PacBio High-Fidelity sequencing on the extracted viral DNA. While Illumina short-read sequencing has been used extensively, PacBio sequencing is a more recent technology that can generate long reads. In general, long reads can improve contig generation from genome regions that have high similarity or complexity, as well as prevent formation of chimeric contigs.[15] Once sequencing reads are generated, they need to be assembled as contigs for constructing genomes. As two different sequencing platforms were used, Zhao et al developed a strategy for de novo assembly to accurately identify viral genomes from raw long and short reads. Typically, a single assembler is employed for genome assembly.[16] To take advantage of each assembler, Zhao et al[11] used a combination of six assemblers to achieve better recovery of viral genomes. For instance, assemblers for short reads (e.g., megagit and metaSPAdes) are well established and proved to have great performance,[17] whereas assemblers for long reads are much more diverse probably due to the recent bloom of their development. Filtering is another critical process in distinguishing viral genomes from complex contigs. A series of filters was therefore established to recognize clean viral genomes with high precision and remove any bacterial or other nonviral sequences, eventually obtaining 1178 complete virus contigs. Hence, as demonstrated in the study by Zhao et al, a rigorous approach to processing raw reads is critical for analyzing viral genomes. Among all assembled viral genomes, 1058 of them were newly identified in the study by Zhao et al[11] with subsequent validation in three published metagenomes and fecal samples from additional healthy individuals. Long reads are known to benefit the assembly of microbial genomes with long sequences. Indeed, 13 viral genomes with lengths greater than the longest phage sequence found in human samples (393 kb) were identified.[18] Particularly, two genomes were assembled (HugePhage1 and Hugephage2), which are even longer than the current reported largest phage sequence (735 kb).[19] Given that many viruses are newly identified, it is of interest to depict the evolutionary history of these viruses. Zhao et al[11] therefore constructed a phylogenetic tree to reveal uncharacterized viruses based on the hidden Markov model (HMM) alignment scores of four conserved viral proteins (major capsid protein [MCP], primase, terminase, and portal protein). Viruses with similar HMM alignment scores were clustered together to form nine clades (Hkyuvirus 1–9), and each clade has specific HMM alignment scores. For example, although MCP is the major component of viral capsids, it is absent in viruses from Hkyuvirus 4 and Hkyuvirus 7 clades. To date, a large portion of human enteric viruses is still undefined. In the current study by Zhao et al, the authors developed a strategy to assort viruses based on conserved viral proteins, of which the establishment of these protein-based clades could benefit the characterization of enteric viruses with unknown taxonomy. Zhao et al[11] further incorporated their viral genomes into the NCBI RefSeq database, and this significantly improved the mapping percentage of published metagenomic datasets by up to 18.1 times. RefSeq has been heavily used for the alignment of viral metagenomic reads. However, it only includes a limited number of well-characterized viruses, while most gut commensal viruses remain unrecognized. The substantial insufficiency of viral genomes included in RefSeq was suggested to be the major cause of unmappable genomes, as some studies reported that ≤98% of sequencing reads could not be assigned to viral taxa when using RefSeq.[1] Similarly, Zhao et al found that only 40–45% of their raw reads were recognized by RefSeq. Notably, incorporating viral genomes identified by Zhao et al into RefSeq markedly increased the read mapping ratio to over 80%. The assembled viral genomes could therefore expand the reference genomes in RefSeq, enabling future studies to have a more complete mapping of sequencing reads. Based on the identified viral genomes, Zhao et al[11] established a biomarker panel comprising of 14 newly identified viruses, and this panel demonstrated great performance in discriminating patients with CRC from healthy subjects. Among 11 enriched viruses, 8 belong to Podoviridae, Myoviridae, and Siphoviridae which are frequently reported as the most abundant viral families in the human gut.[6] The enrichment of Lactobacillus prophage phiadh was also observed in CRC patients. As probiotic Lactobacillus are known to be depleted in CRC, increased abundance of Lactobacillus-infecting phage/prophage may contribute to Lactobacillus depletion, indicating a negative correlation between gut bacteria and phage in CRC. In general, Zhao et al showed that even uncharacterized viruses could be potential biomarkers; thus, it is worth investigating whether incorporating enteric viruses into established biomarker panel could further improve diagnostic performance. Nevertheless, complete characterization and functional investigation are necessary before utilizing these newly identified viral genomes in clinical practice. In conclusion, using the modified extraction protocol, Zhao et al obtained sufficient viral DNA that largely represent the human gut virome. Through performing ultra-deep metagenomic sequencing with de novo assembly and multistage filtering, Zhao et al succeeded to accurately classify the viral genomes. This enabled the identification of 1058 novel viral genomes including 13 long viral sequences, as well as improved read mapping ratio of published metagenomic datasets. Altogether, these findings will contribute to clinical diagnosis, current viral reference genome database and future in-depth investigation of the human gut virome. In addition, it is noteworthy to highlight that only double-stranded DNA viruses were considered in the study by Zhao et al, as the library preparation strategy used in amplification-free Illumina and PacBio sequencing could only identify double-stranded DNA.[20,21] This sequencing approach is also incapable of discriminating foodborne viruses from gut commensal viruses, as the majority of foodborne viruses is not double-stranded DNA viruses.[22] Future study with deep metagenomic sequencing is needed to investigate the roles of other virus types, including single-stranded DNA and RNA viruses in human gut virome. Conflicts of interest None." @default.
- W4313305812 created "2023-01-06" @default.
- W4313305812 creator A5024721864 @default.
- W4313305812 creator A5069395778 @default.
- W4313305812 date "2022-10-20" @default.
- W4313305812 modified "2023-10-14" @default.
- W4313305812 title "Uncovering novel human gut virome using ultra-deep metagenomic sequencing" @default.
- W4313305812 cites W2116570187 @default.
- W4313305812 cites W2161251845 @default.
- W4313305812 cites W2169899471 @default.
- W4313305812 cites W2215093806 @default.
- W4313305812 cites W2300639547 @default.
- W4313305812 cites W2732496903 @default.
- W4313305812 cites W2761734077 @default.
- W4313305812 cites W2800015127 @default.
- W4313305812 cites W2811431636 @default.
- W4313305812 cites W2913714888 @default.
- W4313305812 cites W2922170278 @default.
- W4313305812 cites W2941199225 @default.
- W4313305812 cites W2950060642 @default.
- W4313305812 cites W2951371411 @default.
- W4313305812 cites W2989706094 @default.
- W4313305812 cites W3006022816 @default.
- W4313305812 cites W3016425662 @default.
- W4313305812 cites W3043242071 @default.
- W4313305812 cites W3081166727 @default.
- W4313305812 cites W4281800584 @default.
- W4313305812 doi "https://doi.org/10.1097/cm9.0000000000002382" @default.
- W4313305812 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/36583859" @default.
- W4313305812 hasPublicationYear "2022" @default.
- W4313305812 type Work @default.
- W4313305812 citedByCount "0" @default.
- W4313305812 crossrefType "journal-article" @default.
- W4313305812 hasAuthorship W4313305812A5024721864 @default.
- W4313305812 hasAuthorship W4313305812A5069395778 @default.
- W4313305812 hasBestOaLocation W43133058121 @default.
- W4313305812 hasConcept C104317684 @default.
- W4313305812 hasConcept C15151743 @default.
- W4313305812 hasConcept C190743605 @default.
- W4313305812 hasConcept C54355233 @default.
- W4313305812 hasConcept C70721500 @default.
- W4313305812 hasConcept C86803240 @default.
- W4313305812 hasConceptScore W4313305812C104317684 @default.
- W4313305812 hasConceptScore W4313305812C15151743 @default.
- W4313305812 hasConceptScore W4313305812C190743605 @default.
- W4313305812 hasConceptScore W4313305812C54355233 @default.
- W4313305812 hasConceptScore W4313305812C70721500 @default.
- W4313305812 hasConceptScore W4313305812C86803240 @default.
- W4313305812 hasIssue "20" @default.
- W4313305812 hasLocation W43133058121 @default.
- W4313305812 hasLocation W43133058122 @default.
- W4313305812 hasLocation W43133058123 @default.
- W4313305812 hasLocation W43133058124 @default.
- W4313305812 hasOpenAccess W4313305812 @default.
- W4313305812 hasPrimaryLocation W43133058121 @default.
- W4313305812 hasRelatedWork W2101878615 @default.
- W4313305812 hasRelatedWork W2289294330 @default.
- W4313305812 hasRelatedWork W2902131597 @default.
- W4313305812 hasRelatedWork W2951684367 @default.
- W4313305812 hasRelatedWork W2989625545 @default.
- W4313305812 hasRelatedWork W3033129278 @default.
- W4313305812 hasRelatedWork W3182597671 @default.
- W4313305812 hasRelatedWork W4220807255 @default.
- W4313305812 hasRelatedWork W4249046229 @default.
- W4313305812 hasRelatedWork W4311784838 @default.
- W4313305812 hasVolume "135" @default.
- W4313305812 isParatext "false" @default.
- W4313305812 isRetracted "false" @default.
- W4313305812 workType "article" @default.