Matches in SemOpenAlex for { <https://semopenalex.org/work/W2948686188> ?p ?o ?g. }
Showing items 1 to 59 of
59
with 100 items per page.
- W2948686188 abstract "Author(s): Cario, Clinton L | Advisor(s): Witte, John S | Abstract: Genomics is expected to soon overtake astronomy, particle physics, and even YouTube as the biggest creator of digital information (1). Analysis of this information has already led to important and ground breaking discoveries relevant to our health, but ongoing work will require creative solutions to the multitude of challenges arising from this volume of data. Practically speaking, one such challenge comes from determining what data should be collected and how it is to be managed. As cohort sizes in population based studies grow into the hundreds of thousands, practical issues about collection, storage, and filtering have begun to come more into focus. Additionally, frameworks that seamlessly integrate disparate datasets and also allow for flexible analysis will be required. Finally, as technical challenges and limitations arise, new analytical approaches and designs will have to be considered. This dissertation work was comprised of three projects relating to these questions as approached from the perspective of a bioinformatician. These projects describe the development of new software and methods for sample management, data integration and analysis, and design strategies to improve signal in noisy data. The first chapter of this dissertation consists of background material relating to the projects, including a description about the state of prostate cancer genomics, the development of biomarkers for its detection, and an exploration of a promising new biomarker, cell-free DNA (cfDNA). It also includes a discussion about some of the overarching questions of my PhD. The second chapter describes a web based sample management system, called Samasy. Born out of necessity, this tool addresses a very practical issue of sample subsetting that is often required of resequencing studies. Samasy was used to facilitate the selection of 16,600 samples from a much larger cohort of 54,000 while preserving ethnicity and age balance among cases and controls. This tool integrates with liquid handling systems and provides a visually intuitive interface for plate/sample management and batch sample transfer execution. The third chapter details Orchid, a framework designed to make machine learning of cancer variant data easy and extendible. It does so by integrating a variety of biological annotations (or features) and simple somatic tumor data available from large repositories like the The Cancer Genome Atlas (TCGA) or the International Cancer Genome Consortium (ICGC). This tool supports an efficient data store, MemSQL, that allows for very fast retrieval and filtering, and extends the popular python pandas and scikit-learn packages to facilitate machine learning of this data. Finally, the fourth chapter outlines the creation of a custom targeted sequencing panel for prostate cancer that was designed for screening tumor variants in cfDNA. Building upon the power of Orchid, we detail how machine learning on whole genome prostate tumor datasets can be used to rank mutations by likelihood of being found in a patient with few mutations, or in other words, involved in early state disease. This ranking was used to build a targeted sequencing panel for detection of tumor-derived cfDNA variants. This panel was then validated and applied to a cohort of nine UCSF prostate cancer patients with multiple tumor foci that were collected at time of Radical Prostatectomy (RP). Taken together, the information described in this dissertation provides tools and methodologies for the analysis of germline and somatic variants in prostate and other cancers. It also attempts to further technological development of cfDNA as biomarker for the detection or monitoring of diseases like cancer." @default.
- W2948686188 created "2019-06-14" @default.
- W2948686188 creator A5079726555 @default.
- W2948686188 date "2018-01-01" @default.
- W2948686188 modified "2023-09-26" @default.
- W2948686188 title "Management, Integration, and Mining of Tumor Data" @default.
- W2948686188 hasPublicationYear "2018" @default.
- W2948686188 type Work @default.
- W2948686188 sameAs 2948686188 @default.
- W2948686188 citedByCount "0" @default.
- W2948686188 crossrefType "journal-article" @default.
- W2948686188 hasAuthorship W2948686188A5079726555 @default.
- W2948686188 hasConcept C111472728 @default.
- W2948686188 hasConcept C124101348 @default.
- W2948686188 hasConcept C138885662 @default.
- W2948686188 hasConcept C1668388 @default.
- W2948686188 hasConcept C2522767166 @default.
- W2948686188 hasConcept C2780565519 @default.
- W2948686188 hasConcept C2908647359 @default.
- W2948686188 hasConcept C41008148 @default.
- W2948686188 hasConcept C71924100 @default.
- W2948686188 hasConcept C99454951 @default.
- W2948686188 hasConceptScore W2948686188C111472728 @default.
- W2948686188 hasConceptScore W2948686188C124101348 @default.
- W2948686188 hasConceptScore W2948686188C138885662 @default.
- W2948686188 hasConceptScore W2948686188C1668388 @default.
- W2948686188 hasConceptScore W2948686188C2522767166 @default.
- W2948686188 hasConceptScore W2948686188C2780565519 @default.
- W2948686188 hasConceptScore W2948686188C2908647359 @default.
- W2948686188 hasConceptScore W2948686188C41008148 @default.
- W2948686188 hasConceptScore W2948686188C71924100 @default.
- W2948686188 hasConceptScore W2948686188C99454951 @default.
- W2948686188 hasLocation W29486861881 @default.
- W2948686188 hasOpenAccess W2948686188 @default.
- W2948686188 hasPrimaryLocation W29486861881 @default.
- W2948686188 hasRelatedWork W1765881398 @default.
- W2948686188 hasRelatedWork W1978851402 @default.
- W2948686188 hasRelatedWork W1980052880 @default.
- W2948686188 hasRelatedWork W2020604801 @default.
- W2948686188 hasRelatedWork W2477622967 @default.
- W2948686188 hasRelatedWork W2564161042 @default.
- W2948686188 hasRelatedWork W2594729012 @default.
- W2948686188 hasRelatedWork W2725120278 @default.
- W2948686188 hasRelatedWork W2898709301 @default.
- W2948686188 hasRelatedWork W2900909328 @default.
- W2948686188 hasRelatedWork W2901284966 @default.
- W2948686188 hasRelatedWork W2955453537 @default.
- W2948686188 hasRelatedWork W2958662187 @default.
- W2948686188 hasRelatedWork W2968343391 @default.
- W2948686188 hasRelatedWork W3021851693 @default.
- W2948686188 hasRelatedWork W3082516559 @default.
- W2948686188 hasRelatedWork W3083383150 @default.
- W2948686188 hasRelatedWork W3093956594 @default.
- W2948686188 hasRelatedWork W3094438816 @default.
- W2948686188 hasRelatedWork W3170309561 @default.
- W2948686188 isParatext "false" @default.
- W2948686188 isRetracted "false" @default.
- W2948686188 magId "2948686188" @default.
- W2948686188 workType "article" @default.