Matches in SemOpenAlex for { <https://semopenalex.org/work/W3004837829> ?p ?o ?g. }
- W3004837829 abstract "Comprehensive identification and cataloging of copy number variation (CNVs) are essential to providing a complete view of human genetic variation and to finding diseased genes. Due to the large-scale sequencing and cost control whole-genome sequencing (WGS) data, low-coverage data is favorably disposed towards CNV identification. However, such low-coverage data is sensitive to noise and sequencing biases, which results in low resolution of CNV detection in past experimental designs for WGS datasets. In this paper, we present a control-free Dirichlet process Gaussian mixture model (dpGMM) based approach, to analyze the read depth (RD) of low-coverage WGS datasets for CNV discovery. First, noise and biases of the RD signals are corrected through the preprocessing step of dpGMM. Then we assume that RD signals across genomic regions follow a Gaussian mixture model (GMM) in which each Gaussian distribution is followed by a copy number state. Without requiring the number of Gaussian distributions, dpGMM builds a Dirichlet process (DP) GMM for RD signals and further uses a DP prior to infer the number of Gaussian models. After that, we apply dpGMM to simulation datasets with different coverages and individual datasets, and compare ours to three widely used RD-based pipelines, CNVnator, GROM-RD, and BIC-seq2. Simulation results demonstrate that our approach, dpGMM, has a high F1 score in both low- and high- coverage sequences. Also, the number of overlaps between CNVs detected in real data by ours and the standard benchmark is twice as much as that detected by other tools such as CNVnator and GROM-RD." @default.
- W3004837829 created "2020-02-14" @default.
- W3004837829 creator A5009624424 @default.
- W3004837829 creator A5010671013 @default.
- W3004837829 creator A5017698164 @default.
- W3004837829 creator A5049916114 @default.
- W3004837829 date "2020-01-01" @default.
- W3004837829 modified "2023-09-24" @default.
- W3004837829 title "dpGMM: A Dirichlet Process Gaussian Mixture Model for Copy Number Variation Detection in Low-Coverage Whole-Genome Sequencing Data" @default.
- W3004837829 cites W1520880567 @default.
- W3004837829 cites W1928416196 @default.
- W3004837829 cites W1975874583 @default.
- W3004837829 cites W1986538275 @default.
- W3004837829 cites W1987050564 @default.
- W3004837829 cites W2012151892 @default.
- W3004837829 cites W2020213096 @default.
- W3004837829 cites W2023369015 @default.
- W3004837829 cites W2052549259 @default.
- W3004837829 cites W2075346916 @default.
- W3004837829 cites W2096020932 @default.
- W3004837829 cites W2096791516 @default.
- W3004837829 cites W2103441770 @default.
- W3004837829 cites W2116126626 @default.
- W3004837829 cites W2117608012 @default.
- W3004837829 cites W2118622562 @default.
- W3004837829 cites W2121743948 @default.
- W3004837829 cites W2130542470 @default.
- W3004837829 cites W2137779717 @default.
- W3004837829 cites W2140888103 @default.
- W3004837829 cites W2150506261 @default.
- W3004837829 cites W2154432179 @default.
- W3004837829 cites W2158392984 @default.
- W3004837829 cites W2169986510 @default.
- W3004837829 cites W2225287204 @default.
- W3004837829 cites W2419528132 @default.
- W3004837829 cites W2764274744 @default.
- W3004837829 cites W2775698023 @default.
- W3004837829 cites W2793000921 @default.
- W3004837829 cites W2902308587 @default.
- W3004837829 cites W2963074323 @default.
- W3004837829 cites W2997875816 @default.
- W3004837829 cites W337797674 @default.
- W3004837829 doi "https://doi.org/10.1109/access.2020.2971863" @default.
- W3004837829 hasPublicationYear "2020" @default.
- W3004837829 type Work @default.
- W3004837829 sameAs 3004837829 @default.
- W3004837829 citedByCount "0" @default.
- W3004837829 crossrefType "journal-article" @default.
- W3004837829 hasAuthorship W3004837829A5009624424 @default.
- W3004837829 hasAuthorship W3004837829A5010671013 @default.
- W3004837829 hasAuthorship W3004837829A5017698164 @default.
- W3004837829 hasAuthorship W3004837829A5049916114 @default.
- W3004837829 hasBestOaLocation W30048378291 @default.
- W3004837829 hasConcept C104317684 @default.
- W3004837829 hasConcept C11413529 @default.
- W3004837829 hasConcept C120821319 @default.
- W3004837829 hasConcept C121332964 @default.
- W3004837829 hasConcept C124101348 @default.
- W3004837829 hasConcept C13280743 @default.
- W3004837829 hasConcept C141231307 @default.
- W3004837829 hasConcept C153180895 @default.
- W3004837829 hasConcept C154945302 @default.
- W3004837829 hasConcept C163716315 @default.
- W3004837829 hasConcept C185798385 @default.
- W3004837829 hasConcept C205649164 @default.
- W3004837829 hasConcept C41008148 @default.
- W3004837829 hasConcept C54355233 @default.
- W3004837829 hasConcept C61224824 @default.
- W3004837829 hasConcept C61326573 @default.
- W3004837829 hasConcept C62520636 @default.
- W3004837829 hasConcept C86803240 @default.
- W3004837829 hasConceptScore W3004837829C104317684 @default.
- W3004837829 hasConceptScore W3004837829C11413529 @default.
- W3004837829 hasConceptScore W3004837829C120821319 @default.
- W3004837829 hasConceptScore W3004837829C121332964 @default.
- W3004837829 hasConceptScore W3004837829C124101348 @default.
- W3004837829 hasConceptScore W3004837829C13280743 @default.
- W3004837829 hasConceptScore W3004837829C141231307 @default.
- W3004837829 hasConceptScore W3004837829C153180895 @default.
- W3004837829 hasConceptScore W3004837829C154945302 @default.
- W3004837829 hasConceptScore W3004837829C163716315 @default.
- W3004837829 hasConceptScore W3004837829C185798385 @default.
- W3004837829 hasConceptScore W3004837829C205649164 @default.
- W3004837829 hasConceptScore W3004837829C41008148 @default.
- W3004837829 hasConceptScore W3004837829C54355233 @default.
- W3004837829 hasConceptScore W3004837829C61224824 @default.
- W3004837829 hasConceptScore W3004837829C61326573 @default.
- W3004837829 hasConceptScore W3004837829C62520636 @default.
- W3004837829 hasConceptScore W3004837829C86803240 @default.
- W3004837829 hasFunder F4320321001 @default.
- W3004837829 hasFunder F4320324173 @default.
- W3004837829 hasLocation W30048378291 @default.
- W3004837829 hasOpenAccess W3004837829 @default.
- W3004837829 hasPrimaryLocation W30048378291 @default.
- W3004837829 hasRelatedWork W10593963 @default.
- W3004837829 hasRelatedWork W11421100 @default.
- W3004837829 hasRelatedWork W13112433 @default.
- W3004837829 hasRelatedWork W13704299 @default.
- W3004837829 hasRelatedWork W2607572 @default.
- W3004837829 hasRelatedWork W3048226 @default.