Matches in SemOpenAlex for { <https://semopenalex.org/work/W2971597226> ?p ?o ?g. }
- W2971597226 abstract "Abstract Background Shotgun metagenomes are often assembled prior to annotation of genes which biases the functional capacity of a community towards its most abundant members. For an unbiased assessment of community function, short reads need to be mapped directly to a gene or protein database. The ability to detect genes in short read sequences is dependent on pre- and post-sequencing decisions. The objective of the current study was to determine how library size selection, read length and format, protein database, e-value threshold, and sequencing depth impact gene-centric analysis of human fecal microbiomes when using DIAMOND, an alignment tool that is up to 20,000 times faster than BLASTX. Results Using metagenomes simulated from a database of experimentally verified protein sequences, we find that read length, e-value threshold, and the choice of protein database dramatically impact detection of a known target, with best performance achieved with longer reads, stricter e-value thresholds, and a custom database. Using publicly available metagenomes, we evaluated library size selection, paired end read strategy, and sequencing depth. Longer read lengths were acheivable by merging paired ends when the sequencing library was size-selected to enable overlaps. When paired ends could not be merged, a congruent strategy in which both ends are independently mapped was acceptable. Sequencing depths of 5 million merged reads minimized the error of abundance estimates of specific target genes, including an antimicrobial resistance gene. Conclusions Shotgun metagenomes of DNA extracted from human fecal samples sequenced using the Illumina platform should be size-selected to enable merging of paired end reads and should be sequenced in the PE150 format with a minimum sequencing depth of 5 million merge-able reads to enable detection of specific target genes. Expecting the merged reads to be 180-250bp in length, the appropriate e-value threshold for DIAMOND would then need to be more strict than the default. Accurate and interpretable results for specific hypotheses will be best obtained using small databases customized for the research question." @default.
- W2971597226 created "2019-09-12" @default.
- W2971597226 creator A5030976378 @default.
- W2971597226 creator A5047165779 @default.
- W2971597226 creator A5054512806 @default.
- W2971597226 creator A5081619085 @default.
- W2971597226 creator A5083824130 @default.
- W2971597226 date "2019-09-08" @default.
- W2971597226 modified "2023-10-16" @default.
- W2971597226 title "Pre- and post-sequencing recommendations for functional annotation of human fecal metagenomes" @default.
- W2971597226 cites W1990453950 @default.
- W2971597226 cites W1998077837 @default.
- W2971597226 cites W2004014148 @default.
- W2971597226 cites W2008825357 @default.
- W2971597226 cites W2038802458 @default.
- W2971597226 cites W2044496774 @default.
- W2971597226 cites W2045204781 @default.
- W2971597226 cites W2047242127 @default.
- W2971597226 cites W2047387827 @default.
- W2971597226 cites W2048818637 @default.
- W2971597226 cites W2055567175 @default.
- W2971597226 cites W2079031212 @default.
- W2971597226 cites W2079477175 @default.
- W2971597226 cites W2097124003 @default.
- W2971597226 cites W2102455033 @default.
- W2971597226 cites W2108638811 @default.
- W2971597226 cites W2110256992 @default.
- W2971597226 cites W2110300022 @default.
- W2971597226 cites W2113883363 @default.
- W2971597226 cites W2114104545 @default.
- W2971597226 cites W2122559203 @default.
- W2971597226 cites W2125826054 @default.
- W2971597226 cites W2128711701 @default.
- W2971597226 cites W2128769815 @default.
- W2971597226 cites W2131271579 @default.
- W2971597226 cites W2132801908 @default.
- W2971597226 cites W2133371096 @default.
- W2971597226 cites W2142428670 @default.
- W2971597226 cites W2152942352 @default.
- W2971597226 cites W2158714788 @default.
- W2971597226 cites W2167768477 @default.
- W2971597226 cites W2169474803 @default.
- W2971597226 cites W2173732482 @default.
- W2971597226 cites W2179438025 @default.
- W2971597226 cites W2270107170 @default.
- W2971597226 cites W24553703 @default.
- W2971597226 cites W2463794784 @default.
- W2971597226 cites W2509730012 @default.
- W2971597226 cites W2510554120 @default.
- W2971597226 cites W2548957578 @default.
- W2971597226 cites W2759833325 @default.
- W2971597226 cites W2768781920 @default.
- W2971597226 cites W2769145826 @default.
- W2971597226 cites W2806408903 @default.
- W2971597226 cites W2889185930 @default.
- W2971597226 cites W2895270801 @default.
- W2971597226 cites W2898402099 @default.
- W2971597226 cites W2938574745 @default.
- W2971597226 cites W2947757730 @default.
- W2971597226 cites W2949383250 @default.
- W2971597226 cites W2951076599 @default.
- W2971597226 cites W2951083569 @default.
- W2971597226 cites W2952557828 @default.
- W2971597226 doi "https://doi.org/10.1101/760207" @default.
- W2971597226 hasPublicationYear "2019" @default.
- W2971597226 type Work @default.
- W2971597226 sameAs 2971597226 @default.
- W2971597226 citedByCount "0" @default.
- W2971597226 crossrefType "posted-content" @default.
- W2971597226 hasAuthorship W2971597226A5030976378 @default.
- W2971597226 hasAuthorship W2971597226A5047165779 @default.
- W2971597226 hasAuthorship W2971597226A5054512806 @default.
- W2971597226 hasAuthorship W2971597226A5081619085 @default.
- W2971597226 hasAuthorship W2971597226A5083824130 @default.
- W2971597226 hasBestOaLocation W29715972261 @default.
- W2971597226 hasConcept C101985253 @default.
- W2971597226 hasConcept C104317684 @default.
- W2971597226 hasConcept C132917006 @default.
- W2971597226 hasConcept C141231307 @default.
- W2971597226 hasConcept C15151743 @default.
- W2971597226 hasConcept C154945302 @default.
- W2971597226 hasConcept C2776321320 @default.
- W2971597226 hasConcept C2781434637 @default.
- W2971597226 hasConcept C41008148 @default.
- W2971597226 hasConcept C51679486 @default.
- W2971597226 hasConcept C54355233 @default.
- W2971597226 hasConcept C70721500 @default.
- W2971597226 hasConcept C81917197 @default.
- W2971597226 hasConcept C86803240 @default.
- W2971597226 hasConceptScore W2971597226C101985253 @default.
- W2971597226 hasConceptScore W2971597226C104317684 @default.
- W2971597226 hasConceptScore W2971597226C132917006 @default.
- W2971597226 hasConceptScore W2971597226C141231307 @default.
- W2971597226 hasConceptScore W2971597226C15151743 @default.
- W2971597226 hasConceptScore W2971597226C154945302 @default.
- W2971597226 hasConceptScore W2971597226C2776321320 @default.
- W2971597226 hasConceptScore W2971597226C2781434637 @default.
- W2971597226 hasConceptScore W2971597226C41008148 @default.
- W2971597226 hasConceptScore W2971597226C51679486 @default.
- W2971597226 hasConceptScore W2971597226C54355233 @default.