Matches in SemOpenAlex for { <https://semopenalex.org/work/W3182093451> ?p ?o ?g. }
- W3182093451 abstract "Abstract: The mitochondrial cytochrome C oxidase subunit I gene (COI) is commonly used in eDNA metabarcoding studies, especially for assessing metazoan diversity. Yet a great number of the COI operational taxonomic units and/or amplicon sequence variants retrieved from such studies do not get a taxonomic assignment with a reference sequence and are referred to as “dark matter”. For a thorough investigation of this dark matter, we have developed the Dark mAtteR iNvestigator (DARN) software tool. A reference COI-oriented phylogenetic tree was built from 1,240 consensus sequences covering all three domains of life, with more than 80% of those representing eukaryotic taxa. With respect to eukaryotes, consensus sequences at the family level were constructed from 183,330 sequences retrieved from the Midori reference 2 database. Similarly, sequences from 559 bacterial genera and 41 archaeal genera were retrieved from the BOLD database. DARN makes use of the phylogenetic tree to investigate and quantify pre-processed sequences of amplicon samples and to provide both a tabular and a graphical overview of phylogenetic assignments. To evaluate DARN, both environmental and bulk metabarcoding samples from different aquatic environments using various primer sets were analysed. We demonstrate that a large proportion of non-target prokaryotic organisms such as bacteria and archaea are also amplified in eDNA samples, and we suggest that bacterial COI sequences be included in the reference databases used for taxonomy assignment to allow for further analyses of dark matter. DARN source code is available on GitHub at https://github.com/hariszaf/darn and as a Docker image at https://hub.docker.com/r/hariszaf/darn . Author summary: DARN is a software approach aiming to provide further insight into the COI amplicon data coming from environmental samples. 
Building a COI-oriented reference phylogenetic tree is a challenging task, especially considering the small number of curated microbial COI sequences deposited in reference databases, e.g., ~4,000 bacterial and ~150 archaeal in BOLD. As more such sequences are collated, the DARN approach will improve. To provide a more interactive way of communicating both our approach and our results, we strongly encourage the reader to visit this Google Colab notebook, where all steps are described one by one, and also this GitHub page, where our results are demonstrated. Our approach corroborates the known presence of microbial sequences in COI environmental sequencing samples and highlights the need for curated bacterial and archaeal COI sequences and their integration into reference databases (e.g., Midori and BOLD). We argue that DARN will benefit researchers as a quality control tool for their sequenced samples in terms of distinguishing eukaryotic from non-eukaryotic OTUs/ASVs, but also in terms of understanding the unknown unknowns." @default.
- W3182093451 created "2021-07-19" @default.
- W3182093451 creator A5007385808 @default.
- W3182093451 creator A5034221206 @default.
- W3182093451 creator A5075811022 @default.
- W3182093451 creator A5086173989 @default.
- W3182093451 creator A5090933641 @default.
- W3182093451 date "2021-07-11" @default.
- W3182093451 modified "2023-10-16" @default.
- W3182093451 title "Bacteria are everywhere, even in your COI marker gene data!" @default.
- W3182093451 cites W1554544352 @default.
- W3182093451 cites W2064696227 @default.
- W3182093451 cites W2085525886 @default.
- W3182093451 cites W2085926377 @default.
- W3182093451 cites W2098869000 @default.
- W3182093451 cites W2101462556 @default.
- W3182093451 cites W2125275348 @default.
- W3182093451 cites W2127774996 @default.
- W3182093451 cites W2149415648 @default.
- W3182093451 cites W2194764270 @default.
- W3182093451 cites W2317481118 @default.
- W3182093451 cites W2401404581 @default.
- W3182093451 cites W2412804189 @default.
- W3182093451 cites W2425675749 @default.
- W3182093451 cites W2608541709 @default.
- W3182093451 cites W2621721648 @default.
- W3182093451 cites W2656226357 @default.
- W3182093451 cites W2754086603 @default.
- W3182093451 cites W2754769271 @default.
- W3182093451 cites W2780601052 @default.
- W3182093451 cites W2792809884 @default.
- W3182093451 cites W2804615759 @default.
- W3182093451 cites W2888451776 @default.
- W3182093451 cites W2897782355 @default.
- W3182093451 cites W2912959159 @default.
- W3182093451 cites W2949259316 @default.
- W3182093451 cites W2951358594 @default.
- W3182093451 cites W2952263670 @default.
- W3182093451 cites W2965120276 @default.
- W3182093451 cites W3010217457 @default.
- W3182093451 cites W3085313947 @default.
- W3182093451 cites W3107865087 @default.
- W3182093451 cites W3116417409 @default.
- W3182093451 cites W3121953277 @default.
- W3182093451 cites W3148604764 @default.
- W3182093451 cites W4242789727 @default.
- W3182093451 doi "https://doi.org/10.1101/2021.07.10.451903" @default.
- W3182093451 hasPublicationYear "2021" @default.
- W3182093451 type Work @default.
- W3182093451 sameAs 3182093451 @default.
- W3182093451 citedByCount "0" @default.
- W3182093451 crossrefType "posted-content" @default.
- W3182093451 hasAuthorship W3182093451A5007385808 @default.
- W3182093451 hasAuthorship W3182093451A5034221206 @default.
- W3182093451 hasAuthorship W3182093451A5075811022 @default.
- W3182093451 hasAuthorship W3182093451A5086173989 @default.
- W3182093451 hasAuthorship W3182093451A5090933641 @default.
- W3182093451 hasBestOaLocation W31820934511 @default.
- W3182093451 hasConcept C104317684 @default.
- W3182093451 hasConcept C124104306 @default.
- W3182093451 hasConcept C15151743 @default.
- W3182093451 hasConcept C18903297 @default.
- W3182093451 hasConcept C189592816 @default.
- W3182093451 hasConcept C193252679 @default.
- W3182093451 hasConcept C42062724 @default.
- W3182093451 hasConcept C49105822 @default.
- W3182093451 hasConcept C54355233 @default.
- W3182093451 hasConcept C550995028 @default.
- W3182093451 hasConcept C58642233 @default.
- W3182093451 hasConcept C70721500 @default.
- W3182093451 hasConcept C71640776 @default.
- W3182093451 hasConcept C78458016 @default.
- W3182093451 hasConcept C8185291 @default.
- W3182093451 hasConcept C86803240 @default.
- W3182093451 hasConcept C90132467 @default.
- W3182093451 hasConceptScore W3182093451C104317684 @default.
- W3182093451 hasConceptScore W3182093451C124104306 @default.
- W3182093451 hasConceptScore W3182093451C15151743 @default.
- W3182093451 hasConceptScore W3182093451C18903297 @default.
- W3182093451 hasConceptScore W3182093451C189592816 @default.
- W3182093451 hasConceptScore W3182093451C193252679 @default.
- W3182093451 hasConceptScore W3182093451C42062724 @default.
- W3182093451 hasConceptScore W3182093451C49105822 @default.
- W3182093451 hasConceptScore W3182093451C54355233 @default.
- W3182093451 hasConceptScore W3182093451C550995028 @default.
- W3182093451 hasConceptScore W3182093451C58642233 @default.
- W3182093451 hasConceptScore W3182093451C70721500 @default.
- W3182093451 hasConceptScore W3182093451C71640776 @default.
- W3182093451 hasConceptScore W3182093451C78458016 @default.
- W3182093451 hasConceptScore W3182093451C8185291 @default.
- W3182093451 hasConceptScore W3182093451C86803240 @default.
- W3182093451 hasConceptScore W3182093451C90132467 @default.
- W3182093451 hasLocation W31820934511 @default.
- W3182093451 hasOpenAccess W3182093451 @default.
- W3182093451 hasPrimaryLocation W31820934511 @default.
- W3182093451 hasRelatedWork W11324632 @default.
- W3182093451 hasRelatedWork W11373413 @default.
- W3182093451 hasRelatedWork W11404868 @default.
- W3182093451 hasRelatedWork W14374489 @default.
- W3182093451 hasRelatedWork W18442666 @default.