Matches in SemOpenAlex for { <https://semopenalex.org/work/W3110819537> ?p ?o ?g. }
- W3110819537 endingPage "e3001028" @default.
- W3110819537 startingPage "e3001028" @default.
- W3110819537 abstract "While there is no doubt that social signals affect human reinforcement learning, there is still no consensus about how this process is computationally implemented. To address this issue, we compared three psychologically plausible hypotheses about the algorithmic implementation of imitation in reinforcement learning. The first hypothesis, decision biasing (DB), postulates that imitation consists in transiently biasing the learner’s action selection without affecting their value function. According to the second hypothesis, model-based imitation (MB), the learner infers the demonstrator’s value function through inverse reinforcement learning and uses it to bias action selection. Finally, according to the third hypothesis, value shaping (VS), the demonstrator’s actions directly affect the learner’s value function. We tested these three hypotheses in 2 experiments (N = 24 and N = 44) featuring a new variant of a social reinforcement learning task. We show through model comparison and model simulation that VS provides the best explanation of learner’s behavior. Results replicated in a third independent experiment featuring a larger cohort and a different design (N = 302). In our experiments, we also manipulated the quality of the demonstrators’ choices and found that learners were able to adapt their imitation rate, so that only skilled demonstrators were imitated. We proposed and tested an efficient meta-learning process to account for this effect, where imitation is regulated by the agreement between the learner and the demonstrator. In sum, our findings provide new insights and perspectives on the computational mechanisms underlying adaptive imitation in human reinforcement learning." @default.
- W3110819537 created "2020-12-21" @default.
- W3110819537 creator A5002179303 @default.
- W3110819537 creator A5028199267 @default.
- W3110819537 creator A5057267466 @default.
- W3110819537 creator A5084334220 @default.
- W3110819537 date "2020-12-08" @default.
- W3110819537 modified "2023-10-02" @default.
- W3110819537 title "The actions of others act as a pseudo-reward to drive imitation in the context of social reinforcement learning" @default.
- W3110819537 cites W1526915519 @default.
- W3110819537 cites W1909892879 @default.
- W3110819537 cites W1971597000 @default.
- W3110819537 cites W1981686173 @default.
- W3110819537 cites W1984345750 @default.
- W3110819537 cites W1996579288 @default.
- W3110819537 cites W2018922056 @default.
- W3110819537 cites W2023645850 @default.
- W3110819537 cites W2030802915 @default.
- W3110819537 cites W2033198212 @default.
- W3110819537 cites W2049657715 @default.
- W3110819537 cites W2050730932 @default.
- W3110819537 cites W2051226132 @default.
- W3110819537 cites W2063139645 @default.
- W3110819537 cites W2081614221 @default.
- W3110819537 cites W2098391782 @default.
- W3110819537 cites W2101837595 @default.
- W3110819537 cites W2103702476 @default.
- W3110819537 cites W2111968380 @default.
- W3110819537 cites W2112774974 @default.
- W3110819537 cites W2123429050 @default.
- W3110819537 cites W2125761275 @default.
- W3110819537 cites W2135173838 @default.
- W3110819537 cites W2146263285 @default.
- W3110819537 cites W2151516755 @default.
- W3110819537 cites W2157174816 @default.
- W3110819537 cites W2157300266 @default.
- W3110819537 cites W2158196600 @default.
- W3110819537 cites W2160172765 @default.
- W3110819537 cites W2331415569 @default.
- W3110819537 cites W2462574003 @default.
- W3110819537 cites W2582561810 @default.
- W3110819537 cites W2597613574 @default.
- W3110819537 cites W2620461957 @default.
- W3110819537 cites W2625369207 @default.
- W3110819537 cites W2626804262 @default.
- W3110819537 cites W2765861418 @default.
- W3110819537 cites W2769487915 @default.
- W3110819537 cites W2801901844 @default.
- W3110819537 cites W2884249461 @default.
- W3110819537 cites W2895764062 @default.
- W3110819537 cites W2908393636 @default.
- W3110819537 cites W2966242103 @default.
- W3110819537 cites W4234297223 @default.
- W3110819537 cites W4240799150 @default.
- W3110819537 doi "https://doi.org/10.1371/journal.pbio.3001028" @default.
- W3110819537 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/7723279" @default.
- W3110819537 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/33290387" @default.
- W3110819537 hasPublicationYear "2020" @default.
- W3110819537 type Work @default.
- W3110819537 sameAs 3110819537 @default.
- W3110819537 citedByCount "20" @default.
- W3110819537 countsByYear W31108195372020 @default.
- W3110819537 countsByYear W31108195372021 @default.
- W3110819537 countsByYear W31108195372022 @default.
- W3110819537 countsByYear W31108195372023 @default.
- W3110819537 crossrefType "journal-article" @default.
- W3110819537 hasAuthorship W3110819537A5002179303 @default.
- W3110819537 hasAuthorship W3110819537A5028199267 @default.
- W3110819537 hasAuthorship W3110819537A5057267466 @default.
- W3110819537 hasAuthorship W3110819537A5084334220 @default.
- W3110819537 hasBestOaLocation W31108195371 @default.
- W3110819537 hasConcept C119857082 @default.
- W3110819537 hasConcept C121332964 @default.
- W3110819537 hasConcept C126388530 @default.
- W3110819537 hasConcept C14036430 @default.
- W3110819537 hasConcept C151730666 @default.
- W3110819537 hasConcept C154945302 @default.
- W3110819537 hasConcept C15744967 @default.
- W3110819537 hasConcept C166109690 @default.
- W3110819537 hasConcept C169760540 @default.
- W3110819537 hasConcept C180747234 @default.
- W3110819537 hasConcept C26760741 @default.
- W3110819537 hasConcept C2776035688 @default.
- W3110819537 hasConcept C2776291640 @default.
- W3110819537 hasConcept C2779343474 @default.
- W3110819537 hasConcept C2780791683 @default.
- W3110819537 hasConcept C34868163 @default.
- W3110819537 hasConcept C41008148 @default.
- W3110819537 hasConcept C46312422 @default.
- W3110819537 hasConcept C56739046 @default.
- W3110819537 hasConcept C62520636 @default.
- W3110819537 hasConcept C67203356 @default.
- W3110819537 hasConcept C77805123 @default.
- W3110819537 hasConcept C78458016 @default.
- W3110819537 hasConcept C79416737 @default.
- W3110819537 hasConcept C81917197 @default.
- W3110819537 hasConcept C86803240 @default.
- W3110819537 hasConcept C97541855 @default.