Matches in SemOpenAlex for { <https://semopenalex.org/work/W4387497625> ?p ?o ?g. }
Showing items 1 to 65 of 65, with 100 items per page.
- W4387497625 abstract "Adapting to new environments is a hallmark of animal and human cognition, and Reinforcement Learning (RL) models provide a powerful and general framework for studying such adaptation. A fundamental learning component identified by RL models is that in the absence of direct supervision, when learning is driven by trial-and-error, exploration is essential. The necessary ingredients of effective exploration have been studied extensively in machine learning. However, the relevance of some of these principles to humans’ exploration is still unknown. An important reason for this gap is the dominance of the Multi-Armed Bandit tasks in human exploration studies. In these tasks, the exploration component per se is simple, because local measures of uncertainty, most notably visit-counters, are sufficient to effectively direct exploration. By contrast, in more complex environments, actions have long-term exploratory consequences that should be accounted for when measuring their associated uncertainties. Here, we use a novel experimental task that goes beyond the bandit task to study human exploration. We show that when local measures of uncertainty are insufficient, humans use exploration strategies that propagate uncertainties over states and actions. Moreover, we show that the long-term exploration consequences are temporally-discounted, similar to the temporal discounting of rewards in standard RL tasks. Additionally, we show that human exploration is largely uncertainty-driven. Finally, we find that humans exhibit signatures of temporally-extended learning, rather than local, 1-step update rules which are commonly assumed in RL models. All these aspects of human exploration are well-captured by a computational model in which agents learn an exploration “value-function”, analogous to the standard (reward-based) value-function in RL." @default.
- W4387497625 created "2023-10-11" @default.
- W4387497625 creator A5019527475 @default.
- W4387497625 creator A5037676184 @default.
- W4387497625 creator A5074945649 @default.
- W4387497625 date "2023-10-10" @default.
- W4387497625 modified "2023-10-12" @default.
- W4387497625 title "On the computational principles underlying human exploration" @default.
- W4387497625 doi "https://doi.org/10.7554/elife.90684" @default.
- W4387497625 hasPublicationYear "2023" @default.
- W4387497625 type Work @default.
- W4387497625 citedByCount "0" @default.
- W4387497625 crossrefType "posted-content" @default.
- W4387497625 hasAuthorship W4387497625A5019527475 @default.
- W4387497625 hasAuthorship W4387497625A5037676184 @default.
- W4387497625 hasAuthorship W4387497625A5074945649 @default.
- W4387497625 hasBestOaLocation W43874976251 @default.
- W4387497625 hasConcept C119857082 @default.
- W4387497625 hasConcept C121332964 @default.
- W4387497625 hasConcept C139807058 @default.
- W4387497625 hasConcept C154945302 @default.
- W4387497625 hasConcept C15744967 @default.
- W4387497625 hasConcept C158154518 @default.
- W4387497625 hasConcept C162324750 @default.
- W4387497625 hasConcept C168167062 @default.
- W4387497625 hasConcept C169760540 @default.
- W4387497625 hasConcept C17744445 @default.
- W4387497625 hasConcept C187736073 @default.
- W4387497625 hasConcept C199539241 @default.
- W4387497625 hasConcept C2780451532 @default.
- W4387497625 hasConcept C41008148 @default.
- W4387497625 hasConcept C97355855 @default.
- W4387497625 hasConcept C97541855 @default.
- W4387497625 hasConceptScore W4387497625C119857082 @default.
- W4387497625 hasConceptScore W4387497625C121332964 @default.
- W4387497625 hasConceptScore W4387497625C139807058 @default.
- W4387497625 hasConceptScore W4387497625C154945302 @default.
- W4387497625 hasConceptScore W4387497625C15744967 @default.
- W4387497625 hasConceptScore W4387497625C158154518 @default.
- W4387497625 hasConceptScore W4387497625C162324750 @default.
- W4387497625 hasConceptScore W4387497625C168167062 @default.
- W4387497625 hasConceptScore W4387497625C169760540 @default.
- W4387497625 hasConceptScore W4387497625C17744445 @default.
- W4387497625 hasConceptScore W4387497625C187736073 @default.
- W4387497625 hasConceptScore W4387497625C199539241 @default.
- W4387497625 hasConceptScore W4387497625C2780451532 @default.
- W4387497625 hasConceptScore W4387497625C41008148 @default.
- W4387497625 hasConceptScore W4387497625C97355855 @default.
- W4387497625 hasConceptScore W4387497625C97541855 @default.
- W4387497625 hasLocation W43874976251 @default.
- W4387497625 hasOpenAccess W4387497625 @default.
- W4387497625 hasPrimaryLocation W43874976251 @default.
- W4387497625 hasRelatedWork W129772185 @default.
- W4387497625 hasRelatedWork W2031695474 @default.
- W4387497625 hasRelatedWork W2138720691 @default.
- W4387497625 hasRelatedWork W2141866091 @default.
- W4387497625 hasRelatedWork W2167645963 @default.
- W4387497625 hasRelatedWork W2997567050 @default.
- W4387497625 hasRelatedWork W4306904969 @default.
- W4387497625 hasRelatedWork W4362501864 @default.
- W4387497625 hasRelatedWork W4380318855 @default.
- W4387497625 hasRelatedWork W889372020 @default.
- W4387497625 isParatext "false" @default.
- W4387497625 isRetracted "false" @default.
- W4387497625 workType "article" @default.
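
The abstract above contrasts local uncertainty measures (visit-counters) with an exploration "value-function" that propagates uncertainty over states and actions and discounts it in time. The following is a minimal sketch of that idea, not the authors' model: it assumes a TD-style update in which every exploration value starts at 1.0 (fully unexplored) and moves toward a discounted copy of the next state-action's value, with a target of 0 at dead ends. The state and action names (`start`, `hub`, `left`, `right`, `door_0`) and the parameters `ALPHA` and `GAMMA` are hypothetical, chosen only for illustration.

```python
from collections import defaultdict

ALPHA = 0.5  # learning rate (assumed value)
GAMMA = 0.9  # temporal discount on future exploration (assumed value)


def make_tables():
    """Return (E, counts): exploration values default to 1.0, counters to 0."""
    E = defaultdict(lambda: 1.0)
    counts = defaultdict(int)
    return E, counts


def update(E, counts, s, a, s_next=None, a_next=None, alpha=ALPHA, gamma=GAMMA):
    """Record one visit to (s, a) and apply a TD-style exploration update.

    The target is gamma * E(s_next, a_next); if s_next is None the
    transition ends in a dead end and the target is 0.
    """
    counts[(s, a)] += 1
    target = 0.0 if s_next is None else gamma * E[(s_next, a_next)]
    E[(s, a)] += alpha * (target - E[(s, a)])
    return E[(s, a)]


E, counts = make_tables()

# "left" leads to a dead end; "right" leads to a hub whose actions are untried.
update(E, counts, "start", "left")
update(E, counts, "start", "right", s_next="hub", a_next="door_0")

print("visit counts:", counts[("start", "left")], counts[("start", "right")])
print("exploration values:", E[("start", "left")], E[("start", "right")])
```

After one visit each, the two actions have identical visit counts, so a purely local counter cannot tell them apart. The propagated value stays higher for `right` because it leads to still-unexplored options, which is the kind of long-term exploratory consequence the abstract refers to.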