Matches in SemOpenAlex for { <https://semopenalex.org/work/W2026729042> ?p ?o ?g. }
Showing items 1 to 96 of 96 with 100 items per page.
- W2026729042 endingPage "307" @default.
- W2026729042 startingPage "289" @default.
- W2026729042 abstract "We present a continuous-time master-equation formulation of reinforcement learning. Both non-associative (stochastic learning automaton) and associative (neural network) cases are considered. A Fokker–Planck equation for the stochastic dynamics of the learning process is derived using a small-fluctuation expansion of the master equation. We then show how the Fokker–Planck approximation can be used to determine the global asymptotic behaviour of ergodic learning schemes such as linear reward–penalty (L_{R-P}) and associative reward–penalty (A_{R-P}), in the limit of small learning rates. A simple example of reinforcement learning in a non-stationary environment is studied." @default.
- W2026729042 created "2016-06-24" @default.
- W2026729042 creator A5013056878 @default.
- W2026729042 date "1995-01-01" @default.
- W2026729042 modified "2023-09-27" @default.
- W2026729042 title "Stochastic dynamics of reinforcement learning" @default.
- W2026729042 cites W1605158001 @default.
- W2026729042 cites W1990359011 @default.
- W2026729042 cites W2019904074 @default.
- W2026729042 cites W2021801581 @default.
- W2026729042 cites W2047054315 @default.
- W2026729042 cites W2054901429 @default.
- W2026729042 cites W2064018461 @default.
- W2026729042 cites W207289571 @default.
- W2026729042 cites W2090614046 @default.
- W2026729042 cites W2092643070 @default.
- W2026729042 cites W2109049531 @default.
- W2026729042 cites W2150872430 @default.
- W2026729042 cites W2158832545 @default.
- W2026729042 cites W2164301806 @default.
- W2026729042 cites W2164472445 @default.
- W2026729042 cites W2235056388 @default.
- W2026729042 doi "https://doi.org/10.1088/0954-898x_6_2_009" @default.
- W2026729042 hasPublicationYear "1995" @default.
- W2026729042 type Work @default.
- W2026729042 sameAs 2026729042 @default.
- W2026729042 citedByCount "1" @default.
- W2026729042 crossrefType "journal-article" @default.
- W2026729042 hasAuthorship W2026729042A5013056878 @default.
- W2026729042 hasConcept C105795698 @default.
- W2026729042 hasConcept C121332964 @default.
- W2026729042 hasConcept C121864883 @default.
- W2026729042 hasConcept C122044880 @default.
- W2026729042 hasConcept C128805008 @default.
- W2026729042 hasConcept C134306372 @default.
- W2026729042 hasConcept C151201525 @default.
- W2026729042 hasConcept C154945302 @default.
- W2026729042 hasConcept C159423971 @default.
- W2026729042 hasConcept C169760540 @default.
- W2026729042 hasConcept C202444582 @default.
- W2026729042 hasConcept C28826006 @default.
- W2026729042 hasConcept C2983526489 @default.
- W2026729042 hasConcept C33923547 @default.
- W2026729042 hasConcept C41008148 @default.
- W2026729042 hasConcept C50644808 @default.
- W2026729042 hasConcept C62520636 @default.
- W2026729042 hasConcept C69123182 @default.
- W2026729042 hasConcept C78045399 @default.
- W2026729042 hasConcept C8272713 @default.
- W2026729042 hasConcept C84114770 @default.
- W2026729042 hasConcept C86803240 @default.
- W2026729042 hasConcept C97541855 @default.
- W2026729042 hasConceptScore W2026729042C105795698 @default.
- W2026729042 hasConceptScore W2026729042C121332964 @default.
- W2026729042 hasConceptScore W2026729042C121864883 @default.
- W2026729042 hasConceptScore W2026729042C122044880 @default.
- W2026729042 hasConceptScore W2026729042C128805008 @default.
- W2026729042 hasConceptScore W2026729042C134306372 @default.
- W2026729042 hasConceptScore W2026729042C151201525 @default.
- W2026729042 hasConceptScore W2026729042C154945302 @default.
- W2026729042 hasConceptScore W2026729042C159423971 @default.
- W2026729042 hasConceptScore W2026729042C169760540 @default.
- W2026729042 hasConceptScore W2026729042C202444582 @default.
- W2026729042 hasConceptScore W2026729042C28826006 @default.
- W2026729042 hasConceptScore W2026729042C2983526489 @default.
- W2026729042 hasConceptScore W2026729042C33923547 @default.
- W2026729042 hasConceptScore W2026729042C41008148 @default.
- W2026729042 hasConceptScore W2026729042C50644808 @default.
- W2026729042 hasConceptScore W2026729042C62520636 @default.
- W2026729042 hasConceptScore W2026729042C69123182 @default.
- W2026729042 hasConceptScore W2026729042C78045399 @default.
- W2026729042 hasConceptScore W2026729042C8272713 @default.
- W2026729042 hasConceptScore W2026729042C84114770 @default.
- W2026729042 hasConceptScore W2026729042C86803240 @default.
- W2026729042 hasConceptScore W2026729042C97541855 @default.
- W2026729042 hasIssue "2" @default.
- W2026729042 hasLocation W20267290421 @default.
- W2026729042 hasOpenAccess W2026729042 @default.
- W2026729042 hasPrimaryLocation W20267290421 @default.
- W2026729042 hasRelatedWork W2026729042 @default.
- W2026729042 hasRelatedWork W2067366940 @default.
- W2026729042 hasRelatedWork W2076727598 @default.
- W2026729042 hasRelatedWork W2334009344 @default.
- W2026729042 hasRelatedWork W2533678160 @default.
- W2026729042 hasRelatedWork W2753279558 @default.
- W2026729042 hasRelatedWork W2792553019 @default.
- W2026729042 hasRelatedWork W3099804662 @default.
- W2026729042 hasRelatedWork W632533387 @default.
- W2026729042 hasRelatedWork W2949635667 @default.
- W2026729042 hasVolume "6" @default.
- W2026729042 isParatext "false" @default.
- W2026729042 isRetracted "false" @default.
- W2026729042 magId "2026729042" @default.
- W2026729042 workType "article" @default.
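
The items above all follow the shape "- SUBJECT predicate OBJECT @GRAPH.", so they can be read back into (subject, predicate, object, graph) tuples. The following is a minimal sketch under that assumption only; the names `LINE_RE`, `parse_line`, and `group_by_predicate` are hypothetical helpers written for this illustration and are not part of any SemOpenAlex tooling.

```python
import re
from collections import defaultdict

# Assumed line shape (inferred from the listing, not from a specification):
#   - SUBJECT predicate OBJECT @GRAPH.
LINE_RE = re.compile(r'^- (\S+) (\S+) (.+) @(\S+)\.$')


def parse_line(line):
    """Return (subject, predicate, object, graph), or None if the line does not fit."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    subject, predicate, obj, graph = match.groups()
    # Quoted objects are literals; drop the surrounding double quotes.
    if len(obj) >= 2 and obj[0] == '"' and obj[-1] == '"':
        obj = obj[1:-1]
    return subject, predicate, obj, graph


def group_by_predicate(lines):
    """Collect object values per predicate, skipping lines that do not parse."""
    grouped = defaultdict(list)
    for line in lines:
        parsed = parse_line(line)
        if parsed is not None:
            _, predicate, obj, _ = parsed
            grouped[predicate].append(obj)
    return dict(grouped)


# A few lines copied from the listing above.
sample = [
    '- W2026729042 title "Stochastic dynamics of reinforcement learning" @default.',
    '- W2026729042 cites W1605158001 @default.',
    '- W2026729042 cites W1990359011 @default.',
    '- W2026729042 hasVolume "6" @default.',
]
result = group_by_predicate(sample)
```

Grouping by predicate makes repeated properties such as `cites`, `hasConcept`, and `hasRelatedWork` easy to count, while single-valued properties such as `title` end up as one-element lists. Lines that are not list items, such as the pagination line, return `None` from `parse_line` and are skipped.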