Matches in SemOpenAlex for { <https://semopenalex.org/work/W4200505767> ?p ?o ?g. }
- W4200505767 endingPage "1222" @default.
- W4200505767 startingPage "1216" @default.
- W4200505767 abstract "Viruses, the most abundant biological entities on earth, are important components of microbial communities, and as major human pathogens, they are responsible for human mortality and morbidity. The identification of viral sequences from metagenomes is critical for viral analysis. As massive quantities of short sequences are generated by next-generation sequencing, most methods utilize discrete and sparse one-hot vectors to encode nucleotide sequences, which are usually ineffective in viral identification.In this article, Virtifier, a deep learning-based viral identifier for sequences from metagenomic data is proposed. It includes a meaningful nucleotide sequence encoding method named Seq2Vec and a variant viral sequence predictor with an attention-based long short-term memory (LSTM) network. By utilizing a fully trained embedding matrix to encode codons, Seq2Vec can efficiently extract the relationships among those codons in a nucleotide sequence. Combined with an attention layer, the LSTM neural network can further analyze the codon relationships and sift the parts that contribute to the final features. Experimental results of three datasets have shown that Virtifier can accurately identify short viral sequences (<500 bp) from metagenomes, surpassing three widely used methods, VirFinder, DeepVirFinder and PPR-Meta. Meanwhile, a comparable performance was achieved by Virtifier at longer lengths (>5000 bp).A Python implementation of Virtifier and the Python code developed for this study have been provided on Github https://github.com/crazyinter/Seq2Vec. The RefSeq genomes in this article are available in VirFinder at https://dx.doi.org/10.1186/s40168-017-0283-5. The CAMI Challenge Dataset 3 CAMI_high dataset in this article is available in CAMI at https://data.cami-challenge.org/participate. The real human gut metagenomes in this article are available at https://dx.doi.org/10.1101/gr.142315.112.Supplementary data are available at Bioinformatics online." @default.
- W4200505767 created "2021-12-31" @default.
- W4200505767 creator A5003065832 @default.
- W4200505767 creator A5004933429 @default.
- W4200505767 creator A5013401457 @default.
- W4200505767 creator A5041781809 @default.
- W4200505767 date "2021-12-15" @default.
- W4200505767 modified "2023-10-07" @default.
- W4200505767 title "Virtifier: a deep learning-based identifier for viral sequences from metagenomes" @default.
- W4200505767 cites W1693165432 @default.
- W4200505767 cites W1975375203 @default.
- W4200505767 cites W1981617416 @default.
- W4200505767 cites W1982265452 @default.
- W4200505767 cites W2003347102 @default.
- W4200505767 cites W2005129098 @default.
- W4200505767 cites W2011297553 @default.
- W4200505767 cites W2045204781 @default.
- W4200505767 cites W2048818637 @default.
- W4200505767 cites W2055043387 @default.
- W4200505767 cites W2063089818 @default.
- W4200505767 cites W2064675550 @default.
- W4200505767 cites W2072000548 @default.
- W4200505767 cites W2107878631 @default.
- W4200505767 cites W2119859604 @default.
- W4200505767 cites W2121472218 @default.
- W4200505767 cites W2145537580 @default.
- W4200505767 cites W2147152072 @default.
- W4200505767 cites W2159954944 @default.
- W4200505767 cites W2165612380 @default.
- W4200505767 cites W2321452352 @default.
- W4200505767 cites W2403288892 @default.
- W4200505767 cites W2610214149 @default.
- W4200505767 cites W2732139758 @default.
- W4200505767 cites W2740966378 @default.
- W4200505767 cites W2750954606 @default.
- W4200505767 cites W2886112809 @default.
- W4200505767 cites W2887749901 @default.
- W4200505767 cites W2889042945 @default.
- W4200505767 cites W2951160681 @default.
- W4200505767 cites W2973098646 @default.
- W4200505767 cites W3003110834 @default.
- W4200505767 cites W3027189436 @default.
- W4200505767 doi "https://doi.org/10.1093/bioinformatics/btab845" @default.
- W4200505767 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/34908121" @default.
- W4200505767 hasPublicationYear "2021" @default.
- W4200505767 type Work @default.
- W4200505767 citedByCount "12" @default.
- W4200505767 countsByYear W42005057672021 @default.
- W4200505767 countsByYear W42005057672022 @default.
- W4200505767 countsByYear W42005057672023 @default.
- W4200505767 crossrefType "journal-article" @default.
- W4200505767 hasAuthorship W4200505767A5003065832 @default.
- W4200505767 hasAuthorship W4200505767A5004933429 @default.
- W4200505767 hasAuthorship W4200505767A5013401457 @default.
- W4200505767 hasAuthorship W4200505767A5041781809 @default.
- W4200505767 hasConcept C104317684 @default.
- W4200505767 hasConcept C108583219 @default.
- W4200505767 hasConcept C116834253 @default.
- W4200505767 hasConcept C141231307 @default.
- W4200505767 hasConcept C15151743 @default.
- W4200505767 hasConcept C151810110 @default.
- W4200505767 hasConcept C154504017 @default.
- W4200505767 hasConcept C154945302 @default.
- W4200505767 hasConcept C199360897 @default.
- W4200505767 hasConcept C41008148 @default.
- W4200505767 hasConcept C43126263 @default.
- W4200505767 hasConcept C519991488 @default.
- W4200505767 hasConcept C54355233 @default.
- W4200505767 hasConcept C59822182 @default.
- W4200505767 hasConcept C66746571 @default.
- W4200505767 hasConcept C70721500 @default.
- W4200505767 hasConcept C86803240 @default.
- W4200505767 hasConceptScore W4200505767C104317684 @default.
- W4200505767 hasConceptScore W4200505767C108583219 @default.
- W4200505767 hasConceptScore W4200505767C116834253 @default.
- W4200505767 hasConceptScore W4200505767C141231307 @default.
- W4200505767 hasConceptScore W4200505767C15151743 @default.
- W4200505767 hasConceptScore W4200505767C151810110 @default.
- W4200505767 hasConceptScore W4200505767C154504017 @default.
- W4200505767 hasConceptScore W4200505767C154945302 @default.
- W4200505767 hasConceptScore W4200505767C199360897 @default.
- W4200505767 hasConceptScore W4200505767C41008148 @default.
- W4200505767 hasConceptScore W4200505767C43126263 @default.
- W4200505767 hasConceptScore W4200505767C519991488 @default.
- W4200505767 hasConceptScore W4200505767C54355233 @default.
- W4200505767 hasConceptScore W4200505767C59822182 @default.
- W4200505767 hasConceptScore W4200505767C66746571 @default.
- W4200505767 hasConceptScore W4200505767C70721500 @default.
- W4200505767 hasConceptScore W4200505767C86803240 @default.
- W4200505767 hasFunder F4320321543 @default.
- W4200505767 hasIssue "5" @default.
- W4200505767 hasLocation W42005057671 @default.
- W4200505767 hasLocation W42005057672 @default.
- W4200505767 hasOpenAccess W4200505767 @default.
- W4200505767 hasPrimaryLocation W42005057671 @default.
- W4200505767 hasRelatedWork W1981194976 @default.
- W4200505767 hasRelatedWork W2099969795 @default.
- W4200505767 hasRelatedWork W2107903949 @default.