Matches in SemOpenAlex for { <https://semopenalex.org/work/W1978046526> ?p ?o ?g. }
- W1978046526 endingPage "e1002691" @default.
- W1978046526 startingPage "e1002691" @default.
- W1978046526 abstract "Humans and animals face decision tasks in an uncertain multi-agent environment where an agent's strategy may change in time due to the co-adaptation of others strategies. The neuronal substrate and the computational algorithms underlying such adaptive decision making, however, is largely unknown. We propose a population coding model of spiking neurons with a policy gradient procedure that successfully acquires optimal strategies for classical game-theoretical tasks. The suggested population reinforcement learning reproduces data from human behavioral experiments for the blackjack and the inspector game. It performs optimally according to a pure (deterministic) and mixed (stochastic) Nash equilibrium, respectively. In contrast, temporal-difference(TD)-learning, covariance-learning, and basic reinforcement learning fail to perform optimally for the stochastic strategy. Spike-based population reinforcement learning, shown to follow the stochastic reward gradient, is therefore a viable candidate to explain automated decision learning of a Nash equilibrium in two-player games." @default.
- W1978046526 created "2016-06-24" @default.
- W1978046526 creator A5033106713 @default.
- W1978046526 creator A5088167504 @default.
- W1978046526 date "2012-09-27" @default.
- W1978046526 modified "2023-09-28" @default.
- W1978046526 title "Spike-based Decision Learning of Nash Equilibria in Two-Player Games" @default.
- W1978046526 cites W1492146705 @default.
- W1978046526 cites W1517741531 @default.
- W1978046526 cites W1542782021 @default.
- W1978046526 cites W1542941925 @default.
- W1978046526 cites W1564229172 @default.
- W1978046526 cites W160989634 @default.
- W1978046526 cites W1966767762 @default.
- W1978046526 cites W1967438932 @default.
- W1978046526 cites W1983624302 @default.
- W1978046526 cites W1985623473 @default.
- W1978046526 cites W1993594397 @default.
- W1978046526 cites W1998050452 @default.
- W1978046526 cites W2005629202 @default.
- W1978046526 cites W2024152195 @default.
- W1978046526 cites W2028145673 @default.
- W1978046526 cites W2034725503 @default.
- W1978046526 cites W2037917209 @default.
- W1978046526 cites W2041176801 @default.
- W1978046526 cites W2061897041 @default.
- W1978046526 cites W2062444449 @default.
- W1978046526 cites W2066947576 @default.
- W1978046526 cites W2069538478 @default.
- W1978046526 cites W2069873783 @default.
- W1978046526 cites W2074621710 @default.
- W1978046526 cites W2075567596 @default.
- W1978046526 cites W2084912121 @default.
- W1978046526 cites W2093394772 @default.
- W1978046526 cites W2093657864 @default.
- W1978046526 cites W2094536313 @default.
- W1978046526 cites W2112439305 @default.
- W1978046526 cites W2113545762 @default.
- W1978046526 cites W2117726420 @default.
- W1978046526 cites W2119717200 @default.
- W1978046526 cites W2120846115 @default.
- W1978046526 cites W2121863487 @default.
- W1978046526 cites W2126404188 @default.
- W1978046526 cites W2127978988 @default.
- W1978046526 cites W2135886642 @default.
- W1978046526 cites W2136518234 @default.
- W1978046526 cites W2144846366 @default.
- W1978046526 cites W2152571732 @default.
- W1978046526 cites W2154583360 @default.
- W1978046526 cites W2155910333 @default.
- W1978046526 cites W2157518752 @default.
- W1978046526 cites W2169519447 @default.
- W1978046526 cites W2172030907 @default.
- W1978046526 cites W3015812362 @default.
- W1978046526 cites W609092168 @default.
- W1978046526 cites W646470451 @default.
- W1978046526 cites W69361866 @default.
- W1978046526 cites W2282055800 @default.
- W1978046526 doi "https://doi.org/10.1371/journal.pcbi.1002691" @default.
- W1978046526 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/3459907" @default.
- W1978046526 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/23028289" @default.
- W1978046526 hasPublicationYear "2012" @default.
- W1978046526 type Work @default.
- W1978046526 sameAs 1978046526 @default.
- W1978046526 citedByCount "6" @default.
- W1978046526 countsByYear W19780465262014 @default.
- W1978046526 countsByYear W19780465262016 @default.
- W1978046526 countsByYear W19780465262017 @default.
- W1978046526 crossrefType "journal-article" @default.
- W1978046526 hasAuthorship W1978046526A5033106713 @default.
- W1978046526 hasAuthorship W1978046526A5088167504 @default.
- W1978046526 hasBestOaLocation W19780465261 @default.
- W1978046526 hasConcept C119857082 @default.
- W1978046526 hasConcept C125014702 @default.
- W1978046526 hasConcept C126255220 @default.
- W1978046526 hasConcept C139807058 @default.
- W1978046526 hasConcept C144024400 @default.
- W1978046526 hasConcept C144237770 @default.
- W1978046526 hasConcept C149923435 @default.
- W1978046526 hasConcept C154945302 @default.
- W1978046526 hasConcept C15744967 @default.
- W1978046526 hasConcept C169760540 @default.
- W1978046526 hasConcept C177142836 @default.
- W1978046526 hasConcept C2908647359 @default.
- W1978046526 hasConcept C32407928 @default.
- W1978046526 hasConcept C33923547 @default.
- W1978046526 hasConcept C41008148 @default.
- W1978046526 hasConcept C46814582 @default.
- W1978046526 hasConcept C97541855 @default.
- W1978046526 hasConceptScore W1978046526C119857082 @default.
- W1978046526 hasConceptScore W1978046526C125014702 @default.
- W1978046526 hasConceptScore W1978046526C126255220 @default.
- W1978046526 hasConceptScore W1978046526C139807058 @default.
- W1978046526 hasConceptScore W1978046526C144024400 @default.
- W1978046526 hasConceptScore W1978046526C144237770 @default.
- W1978046526 hasConceptScore W1978046526C149923435 @default.
- W1978046526 hasConceptScore W1978046526C154945302 @default.
- W1978046526 hasConceptScore W1978046526C15744967 @default.