Matches in SemOpenAlex for { <https://semopenalex.org/work/W2040892106> ?p ?o ?g. }
Showing items 1 to 78 of 78 with 100 items per page.
- W2040892106 abstract "Identifying important biomarkers to improve disease diagnosis and treatment is a significant topic of research in bioinformatics. However, bioinformatics datasets frequently have a large number of features per sample or instance. This problem, known as “high dimensionality,” can be alleviated through the use of dimension reducing techniques such as feature (gene) selection which remove unnecessary features. There are many versions of feature selection, with varying biases and predictive abilities. However, predictive power is but one factor to consider when choosing a feature selection technique: one must also consider the technique's stability, that is, its ability to create feature subsets which remain valid in the face of changes to the data. While there has been work in determining the relative stability of different feature selection techniques, this does not always help determine whether a chosen feature selection technique will give stable feature subsets for a specific dataset. Factors such as difficulty of learning (e.g., dataset difficulty) may also influence feature selection stability, making generally-true facts about different techniques not applicable to a given dataset. In this work, we study how dataset difficulty can affect the stability of feature selection techniques, leading to good performance from bad techniques and vice versa. We use a set of twenty-six DNA microarray datasets with varying levels of difficulty of learning, along with four levels of dataset perturbation, six feature selection techniques with various levels of stability, and twelve feature subset sizes. The results show that as the dataset difficulty increases, the stability decreases. However, the relative stability between the techniques remains the same. Additionally, the more difficult the dataset, the more the stability is affected by changes to the data. We also found that unstable rankers are more affected by the transition between Easy and Moderate datasets, whereas the stable techniques are more affected by the change between Moderate and Hard datasets. Lastly, as the feature subset size increases, the stability increases and the difference between the levels of dataset difficulty decreases. Overall, we conclude that difficulty of learning must be taken into account before interpreting stability results." @default.
- W2040892106 created "2016-06-24" @default.
- W2040892106 creator A5047817565 @default.
- W2040892106 creator A5067147877 @default.
- W2040892106 creator A5089170562 @default.
- W2040892106 creator A5090913753 @default.
- W2040892106 date "2013-08-01" @default.
- W2040892106 modified "2023-10-18" @default.
- W2040892106 title "Gene selection stability's dependence on dataset difficulty" @default.
- W2040892106 cites W144440563 @default.
- W2040892106 cites W2017605669 @default.
- W2040892106 cites W2022441134 @default.
- W2040892106 cites W2032909675 @default.
- W2040892106 cites W2055271333 @default.
- W2040892106 cites W2075333494 @default.
- W2040892106 cites W2098740506 @default.
- W2040892106 cites W2103333826 @default.
- W2040892106 cites W2119387367 @default.
- W2040892106 cites W2131391419 @default.
- W2040892106 cites W2138776277 @default.
- W2040892106 cites W2146739527 @default.
- W2040892106 cites W968026779 @default.
- W2040892106 doi "https://doi.org/10.1109/iri.2013.6642491" @default.
- W2040892106 hasPublicationYear "2013" @default.
- W2040892106 type Work @default.
- W2040892106 sameAs 2040892106 @default.
- W2040892106 citedByCount "9" @default.
- W2040892106 countsByYear W20408921062014 @default.
- W2040892106 countsByYear W20408921062018 @default.
- W2040892106 countsByYear W20408921062020 @default.
- W2040892106 crossrefType "proceedings-article" @default.
- W2040892106 hasAuthorship W2040892106A5047817565 @default.
- W2040892106 hasAuthorship W2040892106A5067147877 @default.
- W2040892106 hasAuthorship W2040892106A5089170562 @default.
- W2040892106 hasAuthorship W2040892106A5090913753 @default.
- W2040892106 hasConcept C111030470 @default.
- W2040892106 hasConcept C112972136 @default.
- W2040892106 hasConcept C119857082 @default.
- W2040892106 hasConcept C124101348 @default.
- W2040892106 hasConcept C138885662 @default.
- W2040892106 hasConcept C148483581 @default.
- W2040892106 hasConcept C153180895 @default.
- W2040892106 hasConcept C154945302 @default.
- W2040892106 hasConcept C2776401178 @default.
- W2040892106 hasConcept C41008148 @default.
- W2040892106 hasConcept C41895202 @default.
- W2040892106 hasConcept C70518039 @default.
- W2040892106 hasConcept C81917197 @default.
- W2040892106 hasConceptScore W2040892106C111030470 @default.
- W2040892106 hasConceptScore W2040892106C112972136 @default.
- W2040892106 hasConceptScore W2040892106C119857082 @default.
- W2040892106 hasConceptScore W2040892106C124101348 @default.
- W2040892106 hasConceptScore W2040892106C138885662 @default.
- W2040892106 hasConceptScore W2040892106C148483581 @default.
- W2040892106 hasConceptScore W2040892106C153180895 @default.
- W2040892106 hasConceptScore W2040892106C154945302 @default.
- W2040892106 hasConceptScore W2040892106C2776401178 @default.
- W2040892106 hasConceptScore W2040892106C41008148 @default.
- W2040892106 hasConceptScore W2040892106C41895202 @default.
- W2040892106 hasConceptScore W2040892106C70518039 @default.
- W2040892106 hasConceptScore W2040892106C81917197 @default.
- W2040892106 hasLocation W20408921061 @default.
- W2040892106 hasOpenAccess W2040892106 @default.
- W2040892106 hasPrimaryLocation W20408921061 @default.
- W2040892106 hasRelatedWork W1965771882 @default.
- W2040892106 hasRelatedWork W2108104958 @default.
- W2040892106 hasRelatedWork W2156248978 @default.
- W2040892106 hasRelatedWork W2347213675 @default.
- W2040892106 hasRelatedWork W2385233088 @default.
- W2040892106 hasRelatedWork W2612877759 @default.
- W2040892106 hasRelatedWork W2767021621 @default.
- W2040892106 hasRelatedWork W2883447302 @default.
- W2040892106 hasRelatedWork W3211035526 @default.
- W2040892106 hasRelatedWork W4312247183 @default.
- W2040892106 isParatext "false" @default.
- W2040892106 isRetracted "false" @default.
- W2040892106 magId "2040892106" @default.
- W2040892106 workType "article" @default.