Matches in SemOpenAlex for { <https://semopenalex.org/work/W3181080183> ?p ?o ?g. }
Showing items 1 to 83 of
83
with 100 items per page.
- W3181080183 abstract "Biological domain is one of the critical areas that always seek for useful knowledge and patterns observed through available methods, including data mining. One of genomic benchmark data sources is from Genome Wide Association Studies (GWAS), which uses a set of genetic variants, namely Single Nucleotide Polymorphisms (SNPs), in different individuals to observe the association of the variants with a particular trait. Usually, the association test in GWAS is done by finding the risk measure of each of the SNPs separately. But many of the variants and its effect remain a mystery which has high potential of knowledge discovery especially for complex diseases. The aim of the research is to develop an improved method for processing information and to find the relationship between genetic variants and disease with in-depth interpretation. Therefore, this research attempts to investigate the association between genetic variants to diseases, and thus propose a method that can identify multiple SNPs combination to form an association using Frequent Itemset Mining (FIM). Five main stages of methodology in this research are, data understanding, data representation and pruning items, FIM and analysis and validation of knowledge. This thesis elaborates a set of crucial tasks in FIM for GWAS datasets. It proposes a strategy of Ranking-based Pruning of Items (RPI) for SNPs. Next, the development of Weighted Support Model (WSM) was done to search for interesting itemsets. The measurement used are Information Gain for ranking to prune items and Weighted Support for interestingness of itemset. High dimensional dataset presented by SNPs confirmed the reason to apply row enumeration strategy algorithm to mine frequent closed itemsets. It is found that SNPs with known risks to Type 2 Diabetes Mellitus (T2DM) occur in low support values, that cause the process of searching frequent itemsets to be repeated many times until the low support values are retrieved. The implementation of WSM with Odds Ratio (OR) values, gives visibility of these itemsets as higher weighted support value. Finally, the validation for interestingness of produced itemsets is through the integration of available and relevant biological information with scrutinization of an expert as presented in the Descriptive Gene set Analysis (DGA). The information found in the itemsets concluded that the identified SNPs interact with other variants in the chain of T2DM. The scope of the work is using two most commonly chromosomes of T2DM studied, which are Chromosome 11 and 16. The results show that the itemsets with the T2DM risk variants were found within the support values of 40 to 48, and after the RPI and WSM are applied, the weighted support value increases to 50 and 97 within significant number of SNPs. These results show that RPI-WSM is able to solve the huge dataset problem and low support value problem in FIM. In addition, to improve the interpretation, each itemset is presented as combination of genes in DGA with gene annotation information, that supplies scientist with further valuing patterns. RPI, WSM and DGA are the contributions of the research and significant in discovering potential new knowledge and complimenting research by scientists to perform further validations. The study could also contribute to the advancement in healthcare and digital genome market, which focuses on developing healthier society through monitoring and early protection of any threats, especially of chronic diseases such as T2DM through personalized treatment or medicine." @default.
- W3181080183 created "2021-07-19" @default.
- W3181080183 creator A5062751861 @default.
- W3181080183 date "2019-01-01" @default.
- W3181080183 modified "2023-09-27" @default.
- W3181080183 title "Ranking-based pruning and weighted support model for gene association in frequent itemsets / Sofianita Mutalib" @default.
- W3181080183 hasPublicationYear "2019" @default.
- W3181080183 type Work @default.
- W3181080183 sameAs 3181080183 @default.
- W3181080183 citedByCount "0" @default.
- W3181080183 crossrefType "dissertation" @default.
- W3181080183 hasAuthorship W3181080183A5062751861 @default.
- W3181080183 hasConcept C104317684 @default.
- W3181080183 hasConcept C106208931 @default.
- W3181080183 hasConcept C106934330 @default.
- W3181080183 hasConcept C108010975 @default.
- W3181080183 hasConcept C119857082 @default.
- W3181080183 hasConcept C124101348 @default.
- W3181080183 hasConcept C135763542 @default.
- W3181080183 hasConcept C153209595 @default.
- W3181080183 hasConcept C154945302 @default.
- W3181080183 hasConcept C177264268 @default.
- W3181080183 hasConcept C185798385 @default.
- W3181080183 hasConcept C186413461 @default.
- W3181080183 hasConcept C189430467 @default.
- W3181080183 hasConcept C193524817 @default.
- W3181080183 hasConcept C199360897 @default.
- W3181080183 hasConcept C205649164 @default.
- W3181080183 hasConcept C41008148 @default.
- W3181080183 hasConcept C54355233 @default.
- W3181080183 hasConcept C58640448 @default.
- W3181080183 hasConcept C6557445 @default.
- W3181080183 hasConcept C70721500 @default.
- W3181080183 hasConcept C86803240 @default.
- W3181080183 hasConceptScore W3181080183C104317684 @default.
- W3181080183 hasConceptScore W3181080183C106208931 @default.
- W3181080183 hasConceptScore W3181080183C106934330 @default.
- W3181080183 hasConceptScore W3181080183C108010975 @default.
- W3181080183 hasConceptScore W3181080183C119857082 @default.
- W3181080183 hasConceptScore W3181080183C124101348 @default.
- W3181080183 hasConceptScore W3181080183C135763542 @default.
- W3181080183 hasConceptScore W3181080183C153209595 @default.
- W3181080183 hasConceptScore W3181080183C154945302 @default.
- W3181080183 hasConceptScore W3181080183C177264268 @default.
- W3181080183 hasConceptScore W3181080183C185798385 @default.
- W3181080183 hasConceptScore W3181080183C186413461 @default.
- W3181080183 hasConceptScore W3181080183C189430467 @default.
- W3181080183 hasConceptScore W3181080183C193524817 @default.
- W3181080183 hasConceptScore W3181080183C199360897 @default.
- W3181080183 hasConceptScore W3181080183C205649164 @default.
- W3181080183 hasConceptScore W3181080183C41008148 @default.
- W3181080183 hasConceptScore W3181080183C54355233 @default.
- W3181080183 hasConceptScore W3181080183C58640448 @default.
- W3181080183 hasConceptScore W3181080183C6557445 @default.
- W3181080183 hasConceptScore W3181080183C70721500 @default.
- W3181080183 hasConceptScore W3181080183C86803240 @default.
- W3181080183 hasLocation W31810801831 @default.
- W3181080183 hasOpenAccess W3181080183 @default.
- W3181080183 hasPrimaryLocation W31810801831 @default.
- W3181080183 hasRelatedWork W114359387 @default.
- W3181080183 hasRelatedWork W1572524301 @default.
- W3181080183 hasRelatedWork W1591721487 @default.
- W3181080183 hasRelatedWork W1936575861 @default.
- W3181080183 hasRelatedWork W1990459083 @default.
- W3181080183 hasRelatedWork W2005283425 @default.
- W3181080183 hasRelatedWork W2010953838 @default.
- W3181080183 hasRelatedWork W2026796161 @default.
- W3181080183 hasRelatedWork W2039333540 @default.
- W3181080183 hasRelatedWork W2056482097 @default.
- W3181080183 hasRelatedWork W2063916835 @default.
- W3181080183 hasRelatedWork W2087876307 @default.
- W3181080183 hasRelatedWork W2119238968 @default.
- W3181080183 hasRelatedWork W2322146135 @default.
- W3181080183 hasRelatedWork W2352559670 @default.
- W3181080183 hasRelatedWork W2407639434 @default.
- W3181080183 hasRelatedWork W56684036 @default.
- W3181080183 hasRelatedWork W637759898 @default.
- W3181080183 hasRelatedWork W2132297311 @default.
- W3181080183 hasRelatedWork W2338045944 @default.
- W3181080183 isParatext "false" @default.
- W3181080183 isRetracted "false" @default.
- W3181080183 magId "3181080183" @default.
- W3181080183 workType "dissertation" @default.