Matches in SemOpenAlex for { <https://semopenalex.org/work/W2186649097> ?p ?o ?g. }
- W2186649097 endingPage "351" @default.
- W2186649097 startingPage "331" @default.
- W2186649097 abstract "This chapter introduces the reinforcement learning framework and gives a brief background to the origins and history of reinforcement learning models of decision-making. Reinforcement learning provides a normative framework, within which conditioning can be analyzed. That is, this suggests a means by which optimal prediction and action selection can be achieved, and exposes explicitly the computations that must be realized in the service of these. In contrast to descriptive models that describe behavior as it is, normative models study behavior from the point of view of its hypothesized function—that is, they study behavior, as it should be if it were to accomplish specific goals in an optimal way. The appeal of normative models derives from several sources. Historically, the core ideas in reinforcement learning arose from two separate and parallel lines of research. One axis is mainly associated with Richard Sutton, formerly an undergraduate psychology major, and his PhD advisor, Andrew Barto, a computer scientist. Interested in artificial intelligence and agent-based learning, Sutton and Barto developed algorithms for reinforcement learning that were inspired by the psychological literature on Pavlovian and instrumental conditioning." @default.
- W2186649097 created "2016-06-24" @default.
- W2186649097 creator A5055862445 @default.
- W2186649097 creator A5077225105 @default.
- W2186649097 date "2009-01-01" @default.
- W2186649097 modified "2023-09-28" @default.
- W2186649097 title "Theoretical and Empirical Studies of Learning" @default.
- W2186649097 cites W1503962455 @default.
- W2186649097 cites W1542275754 @default.
- W2186649097 cites W1642699591 @default.
- W2186649097 cites W1646707810 @default.
- W2186649097 cites W1652173018 @default.
- W2186649097 cites W1810663112 @default.
- W2186649097 cites W1968490092 @default.
- W2186649097 cites W1970115165 @default.
- W2186649097 cites W1979215131 @default.
- W2186649097 cites W1979640215 @default.
- W2186649097 cites W1983272530 @default.
- W2186649097 cites W1993631201 @default.
- W2186649097 cites W1993866011 @default.
- W2186649097 cites W1994483275 @default.
- W2186649097 cites W1994984635 @default.
- W2186649097 cites W1998152406 @default.
- W2186649097 cites W2002352860 @default.
- W2186649097 cites W2004014356 @default.
- W2186649097 cites W2005866069 @default.
- W2186649097 cites W2007414406 @default.
- W2186649097 cites W2009303086 @default.
- W2186649097 cites W2016186194 @default.
- W2186649097 cites W2031336666 @default.
- W2186649097 cites W2037197807 @default.
- W2186649097 cites W2037457092 @default.
- W2186649097 cites W2040418090 @default.
- W2186649097 cites W2041589773 @default.
- W2186649097 cites W2041639631 @default.
- W2186649097 cites W2042343007 @default.
- W2186649097 cites W2046713808 @default.
- W2186649097 cites W2046837952 @default.
- W2186649097 cites W2056908956 @default.
- W2186649097 cites W2059638332 @default.
- W2186649097 cites W2060039072 @default.
- W2186649097 cites W2061350749 @default.
- W2186649097 cites W2061659108 @default.
- W2186649097 cites W2069258922 @default.
- W2186649097 cites W2074870954 @default.
- W2186649097 cites W2076256116 @default.
- W2186649097 cites W2077611535 @default.
- W2186649097 cites W2078725761 @default.
- W2186649097 cites W2080839667 @default.
- W2186649097 cites W2084912121 @default.
- W2186649097 cites W2088247534 @default.
- W2186649097 cites W2090763917 @default.
- W2186649097 cites W2091565802 @default.
- W2186649097 cites W2092641982 @default.
- W2186649097 cites W2096684870 @default.
- W2186649097 cites W2098534820 @default.
- W2186649097 cites W2103073562 @default.
- W2186649097 cites W2105454649 @default.
- W2186649097 cites W2109059823 @default.
- W2186649097 cites W2109152498 @default.
- W2186649097 cites W2112917428 @default.
- W2186649097 cites W2113501460 @default.
- W2186649097 cites W2114207481 @default.
- W2186649097 cites W2114524506 @default.
- W2186649097 cites W2116085129 @default.
- W2186649097 cites W2117649346 @default.
- W2186649097 cites W2117726420 @default.
- W2186649097 cites W2119128116 @default.
- W2186649097 cites W2119170562 @default.
- W2186649097 cites W2119717200 @default.
- W2186649097 cites W2123093508 @default.
- W2186649097 cites W2123429050 @default.
- W2186649097 cites W2126680030 @default.
- W2186649097 cites W2128211763 @default.
- W2186649097 cites W2128521145 @default.
- W2186649097 cites W2128562176 @default.
- W2186649097 cites W2128577709 @default.
- W2186649097 cites W2129478155 @default.
- W2186649097 cites W2129717318 @default.
- W2186649097 cites W2133166069 @default.
- W2186649097 cites W2135259298 @default.
- W2186649097 cites W2136748963 @default.
- W2186649097 cites W2139582082 @default.
- W2186649097 cites W2139635035 @default.
- W2186649097 cites W2143839686 @default.
- W2186649097 cites W2145594774 @default.
- W2186649097 cites W2148228571 @default.
- W2186649097 cites W2148819999 @default.
- W2186649097 cites W2148902913 @default.
- W2186649097 cites W2149774750 @default.
- W2186649097 cites W2154145851 @default.
- W2186649097 cites W2154562678 @default.
- W2186649097 cites W2156952192 @default.
- W2186649097 cites W2157088567 @default.
- W2186649097 cites W2158420012 @default.
- W2186649097 cites W2159088729 @default.
- W2186649097 cites W2161563886 @default.
- W2186649097 cites W2162580191 @default.