Matches in SemOpenAlex for { <https://semopenalex.org/work/W2123668032> ?p ?o ?g. }
- W2123668032 abstract "Data from biomedical domains often have an inherit hierarchical structure. As this structure is usually implicit, its existence can be overlooked by practitioners interested in constructing and evaluating predictive models from such data. Ignoring these constructs leads to potentially problematic and the routinely unrecognized bias in the models and results. In this work, we discuss this bias in detail and propose a simple, sampling-based solution for it. Next, we explore its sources and extent on synthetic data. Finally, we demonstrate how the state-of-the-art variant prioritization framework, eXtasy, benefits from using the described approach in its Random forest-based core classification model. The conducted simulations clearly indicate that the heterogeneous granularity of feature domains poses significant problems for both the standard Random forest classifier and a modification that relies on stratified bootstrapping. Conversely, using the proposed sampling scheme when training the classifier mitigates the described bias. Furthermore, when applied to the eXtasy data under a realistic class distribution scenario, a Random forest learned using the proposed sampling scheme displays much better precision that its standard version, without degrading recall. Moreover, the largest performance gains are achieved in the most important part of the operating range: the top of prioritized gene list." @default.
- W2123668032 created "2016-06-24" @default.
- W2123668032 creator A5018814006 @default.
- W2123668032 creator A5041984163 @default.
- W2123668032 creator A5045411063 @default.
- W2123668032 creator A5074752965 @default.
- W2123668032 creator A5085611600 @default.
- W2123668032 date "2015-02-23" @default.
- W2123668032 modified "2023-10-08" @default.
- W2123668032 title "Problems with the nested granularity of feature domains in bioinformatics: the eXtasy case" @default.
- W2123668032 cites W1976526581 @default.
- W2123668032 cites W1978005360 @default.
- W2123668032 cites W2038132549 @default.
- W2123668032 cites W2045164090 @default.
- W2123668032 cites W2059145105 @default.
- W2123668032 cites W2067315559 @default.
- W2123668032 cites W2076357933 @default.
- W2123668032 cites W2087588809 @default.
- W2123668032 cites W2089903000 @default.
- W2123668032 cites W2109555487 @default.
- W2123668032 cites W2117897510 @default.
- W2123668032 cites W2118246164 @default.
- W2123668032 cites W2135445066 @default.
- W2123668032 cites W2141307855 @default.
- W2123668032 cites W2143238378 @default.
- W2123668032 cites W2145187337 @default.
- W2123668032 cites W2145191876 @default.
- W2123668032 cites W2146841526 @default.
- W2123668032 cites W2156868954 @default.
- W2123668032 cites W2158116492 @default.
- W2123668032 cites W2158698691 @default.
- W2123668032 cites W2162974025 @default.
- W2123668032 cites W2167277498 @default.
- W2123668032 cites W2167917621 @default.
- W2123668032 cites W2171777347 @default.
- W2123668032 cites W2911964244 @default.
- W2123668032 doi "https://doi.org/10.1186/1471-2105-16-s4-s2" @default.
- W2123668032 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/4347616" @default.
- W2123668032 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/25734591" @default.
- W2123668032 hasPublicationYear "2015" @default.
- W2123668032 type Work @default.
- W2123668032 sameAs 2123668032 @default.
- W2123668032 citedByCount "5" @default.
- W2123668032 countsByYear W21236680322015 @default.
- W2123668032 countsByYear W21236680322018 @default.
- W2123668032 countsByYear W21236680322019 @default.
- W2123668032 crossrefType "journal-article" @default.
- W2123668032 hasAuthorship W2123668032A5018814006 @default.
- W2123668032 hasAuthorship W2123668032A5041984163 @default.
- W2123668032 hasAuthorship W2123668032A5045411063 @default.
- W2123668032 hasAuthorship W2123668032A5074752965 @default.
- W2123668032 hasAuthorship W2123668032A5085611600 @default.
- W2123668032 hasBestOaLocation W21236680321 @default.
- W2123668032 hasConcept C111919701 @default.
- W2123668032 hasConcept C119857082 @default.
- W2123668032 hasConcept C124101348 @default.
- W2123668032 hasConcept C138885662 @default.
- W2123668032 hasConcept C149782125 @default.
- W2123668032 hasConcept C154945302 @default.
- W2123668032 hasConcept C169258074 @default.
- W2123668032 hasConcept C177774035 @default.
- W2123668032 hasConcept C207609745 @default.
- W2123668032 hasConcept C2776401178 @default.
- W2123668032 hasConcept C33923547 @default.
- W2123668032 hasConcept C41008148 @default.
- W2123668032 hasConcept C41895202 @default.
- W2123668032 hasConcept C95623464 @default.
- W2123668032 hasConceptScore W2123668032C111919701 @default.
- W2123668032 hasConceptScore W2123668032C119857082 @default.
- W2123668032 hasConceptScore W2123668032C124101348 @default.
- W2123668032 hasConceptScore W2123668032C138885662 @default.
- W2123668032 hasConceptScore W2123668032C149782125 @default.
- W2123668032 hasConceptScore W2123668032C154945302 @default.
- W2123668032 hasConceptScore W2123668032C169258074 @default.
- W2123668032 hasConceptScore W2123668032C177774035 @default.
- W2123668032 hasConceptScore W2123668032C207609745 @default.
- W2123668032 hasConceptScore W2123668032C2776401178 @default.
- W2123668032 hasConceptScore W2123668032C33923547 @default.
- W2123668032 hasConceptScore W2123668032C41008148 @default.
- W2123668032 hasConceptScore W2123668032C41895202 @default.
- W2123668032 hasConceptScore W2123668032C95623464 @default.
- W2123668032 hasIssue "S4" @default.
- W2123668032 hasLocation W21236680321 @default.
- W2123668032 hasLocation W21236680322 @default.
- W2123668032 hasLocation W21236680323 @default.
- W2123668032 hasLocation W21236680324 @default.
- W2123668032 hasLocation W21236680325 @default.
- W2123668032 hasOpenAccess W2123668032 @default.
- W2123668032 hasPrimaryLocation W21236680321 @default.
- W2123668032 hasRelatedWork W2123668032 @default.
- W2123668032 hasRelatedWork W2911455822 @default.
- W2123668032 hasRelatedWork W3174196512 @default.
- W2123668032 hasRelatedWork W3211546796 @default.
- W2123668032 hasRelatedWork W4281560664 @default.
- W2123668032 hasRelatedWork W4281616679 @default.
- W2123668032 hasRelatedWork W4293525103 @default.
- W2123668032 hasRelatedWork W4308191010 @default.
- W2123668032 hasRelatedWork W4318350883 @default.
- W2123668032 hasRelatedWork W4323021782 @default.
- W2123668032 hasVolume "16" @default.