Matches in SemOpenAlex for { <https://semopenalex.org/work/W2079023204> ?p ?o ?g. }
- W2079023204 endingPage "227" @default.
- W2079023204 startingPage "216" @default.
- W2079023204 abstract "For high-dimensional genome-wide association (GWA) case-control data of complex disease, a large portion of single-nucleotide polymorphisms (SNPs) is usually irrelevant to the disease. A simple random sampling method in random forest, using the default mtry parameter to choose feature subspaces, will select too many subspaces without informative SNPs. An exhaustive search for an optimal mtry is often required in order to include useful and relevant SNPs and discard the vast number of non-informative SNPs. However, such a search is too time-consuming and impractical for high-dimensional GWA data. The main aim of this paper is to propose a stratified sampling method for feature subspace selection to generate decision trees in a random forest for high-dimensional GWA data. Our idea is to design an equal-width discretization scheme for informativeness to divide SNPs into multiple groups. In feature subspace selection, we randomly select the same number of SNPs from each group and combine them to form a subspace to generate a decision tree. The advantage of this stratified sampling procedure is that it ensures each subspace contains enough useful SNPs while avoiding the very high computational cost of an exhaustive search for an optimal mtry and maintaining the randomness of a random forest. We employ two genome-wide SNP data sets (Parkinson case-control data comprising 408 803 SNPs and Alzheimer case-control data comprising 380 157 SNPs) to demonstrate that the proposed stratified sampling method is effective and that it can generate better random forests, with higher accuracy and a lower error bound, than those produced by Breiman's random forest generation method. For the Parkinson data, we also show some interesting genes identified by the method, which may be associated with neurological disorders and merit further biological investigation." @default.
- W2079023204 created "2016-06-24" @default.
- W2079023204 creator A5002523892 @default.
- W2079023204 creator A5010561682 @default.
- W2079023204 creator A5023130798 @default.
- W2079023204 creator A5023363049 @default.
- W2079023204 date "2012-09-01" @default.
- W2079023204 modified "2023-10-12" @default.
- W2079023204 title "SNP Selection and Classification of Genome-Wide SNP Data Using Stratified Sampling Random Forests" @default.
- W2079023204 cites W1520812622 @default.
- W2079023204 cites W1539593569 @default.
- W2079023204 cites W1875061881 @default.
- W2079023204 cites W1986112393 @default.
- W2079023204 cites W1994664175 @default.
- W2079023204 cites W2013577952 @default.
- W2079023204 cites W2025143098 @default.
- W2079023204 cites W2043175314 @default.
- W2079023204 cites W2063575312 @default.
- W2079023204 cites W2065709315 @default.
- W2079023204 cites W2071767222 @default.
- W2079023204 cites W2086099578 @default.
- W2079023204 cites W2095499628 @default.
- W2079023204 cites W2101350555 @default.
- W2079023204 cites W2101889545 @default.
- W2079023204 cites W2111607888 @default.
- W2079023204 cites W2120372853 @default.
- W2079023204 cites W2128207034 @default.
- W2079023204 cites W2128302979 @default.
- W2079023204 cites W2131822674 @default.
- W2079023204 cites W2134783591 @default.
- W2079023204 cites W2143481518 @default.
- W2079023204 cites W2152905639 @default.
- W2079023204 cites W2154298141 @default.
- W2079023204 cites W2155635136 @default.
- W2079023204 cites W2158416439 @default.
- W2079023204 cites W2911964244 @default.
- W2079023204 cites W3152294918 @default.
- W2079023204 doi "https://doi.org/10.1109/tnb.2012.2214232" @default.
- W2079023204 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/22987127" @default.
- W2079023204 hasPublicationYear "2012" @default.
- W2079023204 type Work @default.
- W2079023204 sameAs 2079023204 @default.
- W2079023204 citedByCount "48" @default.
- W2079023204 countsByYear W20790232042013 @default.
- W2079023204 countsByYear W20790232042014 @default.
- W2079023204 countsByYear W20790232042015 @default.
- W2079023204 countsByYear W20790232042016 @default.
- W2079023204 countsByYear W20790232042017 @default.
- W2079023204 countsByYear W20790232042018 @default.
- W2079023204 countsByYear W20790232042019 @default.
- W2079023204 countsByYear W20790232042020 @default.
- W2079023204 countsByYear W20790232042021 @default.
- W2079023204 countsByYear W20790232042022 @default.
- W2079023204 countsByYear W20790232042023 @default.
- W2079023204 crossrefType "journal-article" @default.
- W2079023204 hasAuthorship W2079023204A5002523892 @default.
- W2079023204 hasAuthorship W2079023204A5010561682 @default.
- W2079023204 hasAuthorship W2079023204A5023130798 @default.
- W2079023204 hasAuthorship W2079023204A5023363049 @default.
- W2079023204 hasConcept C104317684 @default.
- W2079023204 hasConcept C106131492 @default.
- W2079023204 hasConcept C124101348 @default.
- W2079023204 hasConcept C135763542 @default.
- W2079023204 hasConcept C138885662 @default.
- W2079023204 hasConcept C139275648 @default.
- W2079023204 hasConcept C140779682 @default.
- W2079023204 hasConcept C144024400 @default.
- W2079023204 hasConcept C148483581 @default.
- W2079023204 hasConcept C149923435 @default.
- W2079023204 hasConcept C153180895 @default.
- W2079023204 hasConcept C153209595 @default.
- W2079023204 hasConcept C154945302 @default.
- W2079023204 hasConcept C169258074 @default.
- W2079023204 hasConcept C20353970 @default.
- W2079023204 hasConcept C2776401178 @default.
- W2079023204 hasConcept C2908647359 @default.
- W2079023204 hasConcept C31972630 @default.
- W2079023204 hasConcept C32834561 @default.
- W2079023204 hasConcept C41008148 @default.
- W2079023204 hasConcept C41895202 @default.
- W2079023204 hasConcept C54355233 @default.
- W2079023204 hasConcept C55060382 @default.
- W2079023204 hasConcept C81917197 @default.
- W2079023204 hasConcept C86803240 @default.
- W2079023204 hasConceptScore W2079023204C104317684 @default.
- W2079023204 hasConceptScore W2079023204C106131492 @default.
- W2079023204 hasConceptScore W2079023204C124101348 @default.
- W2079023204 hasConceptScore W2079023204C135763542 @default.
- W2079023204 hasConceptScore W2079023204C138885662 @default.
- W2079023204 hasConceptScore W2079023204C139275648 @default.
- W2079023204 hasConceptScore W2079023204C140779682 @default.
- W2079023204 hasConceptScore W2079023204C144024400 @default.
- W2079023204 hasConceptScore W2079023204C148483581 @default.
- W2079023204 hasConceptScore W2079023204C149923435 @default.
- W2079023204 hasConceptScore W2079023204C153180895 @default.
- W2079023204 hasConceptScore W2079023204C153209595 @default.
- W2079023204 hasConceptScore W2079023204C154945302 @default.
- W2079023204 hasConceptScore W2079023204C169258074 @default.
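
The abstract of W2079023204 describes two steps: an equal-width discretization of SNP informativeness into groups, then drawing the same number of SNPs from each group to form one feature subspace per tree. The following is a minimal sketch of that idea only, not the authors' implementation. It assumes the informativeness scores are already computed (the abstract does not say which measure is used), and the names `equal_width_groups`, `stratified_subspace`, and `per_group` are hypothetical.

```python
import random

def equal_width_groups(scores, n_groups):
    """Split feature indices into n_groups equal-width bins over the score range."""
    lo, hi = min(scores), max(scores)
    # Fall back to width 1.0 if every score is identical (assumption for the sketch).
    width = (hi - lo) / n_groups or 1.0
    groups = [[] for _ in range(n_groups)]
    for idx, score in enumerate(scores):
        # Clamp so the maximum score lands in the last bin.
        bin_id = min(int((score - lo) / width), n_groups - 1)
        groups[bin_id].append(idx)
    return groups

def stratified_subspace(groups, per_group, rng):
    """Draw up to per_group feature indices from every group, without replacement."""
    subspace = []
    for members in groups:
        k = min(per_group, len(members))  # small groups contribute all they have
        subspace.extend(rng.sample(members, k))
    return subspace

# Toy informativeness scores for nine hypothetical SNPs (made-up values).
scores = [0.01, 0.02, 0.03, 0.04, 0.05, 0.50, 0.55, 0.95, 0.99]
groups = equal_width_groups(scores, n_groups=3)
subspace = stratified_subspace(groups, per_group=2, rng=random.Random(0))
```

In this toy input most features fall in the lowest-score bin, yet every subspace still takes two features from each bin, so the few high-score features are not crowded out as they could be under uniform sampling. Each tree of a forest would call `stratified_subspace` again to obtain its own subspace.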