Matches in SemOpenAlex for { <https://semopenalex.org/work/W2183708240> ?p ?o ?g. }
Showing items 1 to 82 of 82 with 100 items per page.
- W2183708240 abstract "We present an empirical survey of reinforcement learning techniques and relate these techniques to concepts from behaviorism, a field of psychology concerned with the learning process. Specifically, we examine two standard RL algorithms, model-free SARSA, and model-based R-MAX, when used with various shaping techniques. We consider multiple techniques for incorporating shaping into these algorithms, including the use of options and potential-based shaping. Findings indicate any improvement in sample complexity that results from shaping is limited at best. We suggest that this is either due to reinforcement learning not modeling behaviorism well, or behaviorism not modeling animal learning well. We further suggest that a paradigm shift in reinforcement learning techniques is required before the kind of learning performance that techniques from behaviorism indicate are possible can be realized." @default.
- W2183708240 created "2016-06-24" @default.
- W2183708240 creator A5009722403 @default.
- W2183708240 creator A5012351691 @default.
- W2183708240 creator A5017027485 @default.
- W2183708240 creator A5029561177 @default.
- W2183708240 creator A5070914351 @default.
- W2183708240 date "2012-01-01" @default.
- W2183708240 modified "2023-09-24" @default.
- W2183708240 title "An Empirical Analysis of RL's Drift From Its Behaviorism Roots" @default.
- W2183708240 cites W1514621373 @default.
- W2183708240 cites W1532534028 @default.
- W2183708240 cites W1777239053 @default.
- W2183708240 cites W2028357975 @default.
- W2183708240 cites W2066947576 @default.
- W2183708240 cites W2109910161 @default.
- W2183708240 cites W2152166054 @default.
- W2183708240 cites W2403763024 @default.
- W2183708240 cites W2487265914 @default.
- W2183708240 cites W2911283634 @default.
- W2183708240 cites W2914656440 @default.
- W2183708240 hasPublicationYear "2012" @default.
- W2183708240 type Work @default.
- W2183708240 sameAs 2183708240 @default.
- W2183708240 citedByCount "0" @default.
- W2183708240 crossrefType "journal-article" @default.
- W2183708240 hasAuthorship W2183708240A5009722403 @default.
- W2183708240 hasAuthorship W2183708240A5012351691 @default.
- W2183708240 hasAuthorship W2183708240A5017027485 @default.
- W2183708240 hasAuthorship W2183708240A5029561177 @default.
- W2183708240 hasAuthorship W2183708240A5070914351 @default.
- W2183708240 hasConcept C119857082 @default.
- W2183708240 hasConcept C154945302 @default.
- W2183708240 hasConcept C15744967 @default.
- W2183708240 hasConcept C180747234 @default.
- W2183708240 hasConcept C188147891 @default.
- W2183708240 hasConcept C41008148 @default.
- W2183708240 hasConcept C542102704 @default.
- W2183708240 hasConcept C67203356 @default.
- W2183708240 hasConcept C77805123 @default.
- W2183708240 hasConcept C79053195 @default.
- W2183708240 hasConcept C92393732 @default.
- W2183708240 hasConcept C97541855 @default.
- W2183708240 hasConceptScore W2183708240C119857082 @default.
- W2183708240 hasConceptScore W2183708240C154945302 @default.
- W2183708240 hasConceptScore W2183708240C15744967 @default.
- W2183708240 hasConceptScore W2183708240C180747234 @default.
- W2183708240 hasConceptScore W2183708240C188147891 @default.
- W2183708240 hasConceptScore W2183708240C41008148 @default.
- W2183708240 hasConceptScore W2183708240C542102704 @default.
- W2183708240 hasConceptScore W2183708240C67203356 @default.
- W2183708240 hasConceptScore W2183708240C77805123 @default.
- W2183708240 hasConceptScore W2183708240C79053195 @default.
- W2183708240 hasConceptScore W2183708240C92393732 @default.
- W2183708240 hasConceptScore W2183708240C97541855 @default.
- W2183708240 hasLocation W21837082401 @default.
- W2183708240 hasOpenAccess W2183708240 @default.
- W2183708240 hasPrimaryLocation W21837082401 @default.
- W2183708240 hasRelatedWork W137357473 @default.
- W2183708240 hasRelatedWork W2009656295 @default.
- W2183708240 hasRelatedWork W2063270381 @default.
- W2183708240 hasRelatedWork W2101921692 @default.
- W2183708240 hasRelatedWork W2120096117 @default.
- W2183708240 hasRelatedWork W2164035091 @default.
- W2183708240 hasRelatedWork W2172264025 @default.
- W2183708240 hasRelatedWork W2271262588 @default.
- W2183708240 hasRelatedWork W2323058736 @default.
- W2183708240 hasRelatedWork W2904815624 @default.
- W2183708240 hasRelatedWork W2945791024 @default.
- W2183708240 hasRelatedWork W2950658018 @default.
- W2183708240 hasRelatedWork W2963423916 @default.
- W2183708240 hasRelatedWork W2987753679 @default.
- W2183708240 hasRelatedWork W3040440659 @default.
- W2183708240 hasRelatedWork W3087487143 @default.
- W2183708240 hasRelatedWork W3126044426 @default.
- W2183708240 hasRelatedWork W3206188474 @default.
- W2183708240 hasRelatedWork W3212259478 @default.
- W2183708240 hasRelatedWork W86588979 @default.
- W2183708240 isParatext "false" @default.
- W2183708240 isRetracted "false" @default.
- W2183708240 magId "2183708240" @default.
- W2183708240 workType "article" @default.
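
The entries above all follow the same shape: a subject, a predicate, an object, and a graph label after `@`. As a rough illustration only, the listing could be grouped by predicate with a few lines of Python. The function name `parse_listing` and the regular expression are assumptions made for this sketch; they are not part of SemOpenAlex or any of its tooling, and the pattern only covers the line shape shown on this page.

```python
import re
from collections import defaultdict

# Assumed line shape (taken from the listing above, not from any spec):
#   - <subject> <predicate> <object> @<graph>.
LINE_RE = re.compile(r'^-\s+(\S+)\s+(\S+)\s+(.*?)\s+@(\w+)\.\s*$')


def parse_listing(text):
    """Group object values by predicate; lines that do not match are skipped."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if match is None:
            continue  # header or pagination lines, blank lines, etc.
        _subject, predicate, obj, _graph = match.groups()
        # Literal values are wrapped in double quotes; drop the outer quotes.
        grouped[predicate].append(obj.strip('"'))
    return dict(grouped)


# A few lines copied from the listing above.
sample = """\
- W2183708240 created "2016-06-24" @default.
- W2183708240 creator A5009722403 @default.
- W2183708240 creator A5012351691 @default.
- W2183708240 hasPublicationYear "2012" @default.
"""

record = parse_listing(sample)
print(record["creator"])
```

Grouping into lists keeps repeated predicates such as `creator`, `cites`, and `hasRelatedWork` together, while single-valued predicates such as `created` end up as one-element lists.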