Matches in SemOpenAlex for { <https://semopenalex.org/work/W2168839459> ?p ?o ?g. }
- W2168839459 endingPage "1770" @default.
- W2168839459 startingPage "1729" @default.
- W2168839459 abstract "Bayesian learning methods have recently been shown to provide an elegant solution to the exploration-exploitation trade-off in reinforcement learning. However, most investigations of Bayesian reinforcement learning to date focus on the standard Markov Decision Processes (MDPs). The primary focus of this paper is to extend these ideas to the case of partially observable domains, by introducing the Bayes-Adaptive Partially Observable Markov Decision Processes. This new framework can be used to simultaneously (1) learn a model of the POMDP domain through interaction with the environment, (2) track the state of the system under partial observability, and (3) plan (near-)optimal sequences of actions. An important contribution of this paper is to provide theoretical results showing how the model can be finitely approximated while preserving good learning performance. We present approximate algorithms for belief tracking and planning in this model, as well as empirical results that illustrate how the model estimate and agent's return improve as a function of experience." @default.
- W2168839459 created "2016-06-24" @default.
- W2168839459 creator A5002976111 @default.
- W2168839459 creator A5014065467 @default.
- W2168839459 creator A5057087592 @default.
- W2168839459 creator A5080591144 @default.
- W2168839459 date "2011-02-01" @default.
- W2168839459 modified "2023-09-23" @default.
- W2168839459 title "A Bayesian Approach for Learning and Planning in Partially Observable Markov Decision Processes" @default.
- W2168839459 cites W1483307070 @default.
- W2168839459 cites W1484113995 @default.
- W2168839459 cites W1496855202 @default.
- W2168839459 cites W1497039698 @default.
- W2168839459 cites W1505937442 @default.
- W2168839459 cites W1515851193 @default.
- W2168839459 cites W1526144783 @default.
- W2168839459 cites W1532688806 @default.
- W2168839459 cites W1552684655 @default.
- W2168839459 cites W1576369122 @default.
- W2168839459 cites W1582436621 @default.
- W2168839459 cites W1583380718 @default.
- W2168839459 cites W1591803298 @default.
- W2168839459 cites W1850488217 @default.
- W2168839459 cites W2013391942 @default.
- W2168839459 cites W2028169578 @default.
- W2168839459 cites W2034725503 @default.
- W2168839459 cites W2058066080 @default.
- W2168839459 cites W2061226732 @default.
- W2168839459 cites W2071814471 @default.
- W2168839459 cites W2073384958 @default.
- W2168839459 cites W2082691056 @default.
- W2168839459 cites W2097931172 @default.
- W2168839459 cites W2100785108 @default.
- W2168839459 cites W2102579467 @default.
- W2168839459 cites W2111833414 @default.
- W2168839459 cites W2116459397 @default.
- W2168839459 cites W2121863487 @default.
- W2168839459 cites W2123372395 @default.
- W2168839459 cites W2123803235 @default.
- W2168839459 cites W2124352385 @default.
- W2168839459 cites W2125710232 @default.
- W2168839459 cites W2126163471 @default.
- W2168839459 cites W2132849848 @default.
- W2168839459 cites W2132908009 @default.
- W2168839459 cites W2134802714 @default.
- W2168839459 cites W2135681007 @default.
- W2168839459 cites W2137509429 @default.
- W2168839459 cites W2144794447 @default.
- W2168839459 cites W2144913588 @default.
- W2168839459 cites W2145049547 @default.
- W2168839459 cites W2145432078 @default.
- W2168839459 cites W2156974606 @default.
- W2168839459 cites W2158282517 @default.
- W2168839459 cites W2158640361 @default.
- W2168839459 cites W2164569010 @default.
- W2168839459 cites W2167724876 @default.
- W2168839459 cites W2168359464 @default.
- W2168839459 cites W2170112109 @default.
- W2168839459 cites W2171029115 @default.
- W2168839459 cites W2171084228 @default.
- W2168839459 cites W2294604761 @default.
- W2168839459 cites W2406242985 @default.
- W2168839459 cites W2782975919 @default.
- W2168839459 cites W2799002609 @default.
- W2168839459 cites W3023151133 @default.
- W2168839459 cites W3023407077 @default.
- W2168839459 hasPublicationYear "2011" @default.
- W2168839459 type Work @default.
- W2168839459 sameAs 2168839459 @default.
- W2168839459 citedByCount "62" @default.
- W2168839459 countsByYear W21688394592012 @default.
- W2168839459 countsByYear W21688394592013 @default.
- W2168839459 countsByYear W21688394592014 @default.
- W2168839459 countsByYear W21688394592015 @default.
- W2168839459 countsByYear W21688394592016 @default.
- W2168839459 countsByYear W21688394592017 @default.
- W2168839459 countsByYear W21688394592018 @default.
- W2168839459 countsByYear W21688394592019 @default.
- W2168839459 countsByYear W21688394592020 @default.
- W2168839459 countsByYear W21688394592021 @default.
- W2168839459 crossrefType "journal-article" @default.
- W2168839459 hasAuthorship W2168839459A5002976111 @default.
- W2168839459 hasAuthorship W2168839459A5014065467 @default.
- W2168839459 hasAuthorship W2168839459A5057087592 @default.
- W2168839459 hasAuthorship W2168839459A5080591144 @default.
- W2168839459 hasConcept C105795698 @default.
- W2168839459 hasConcept C106189395 @default.
- W2168839459 hasConcept C107673813 @default.
- W2168839459 hasConcept C119857082 @default.
- W2168839459 hasConcept C120665830 @default.
- W2168839459 hasConcept C121332964 @default.
- W2168839459 hasConcept C126255220 @default.
- W2168839459 hasConcept C154945302 @default.
- W2168839459 hasConcept C159886148 @default.
- W2168839459 hasConcept C163836022 @default.
- W2168839459 hasConcept C17098449 @default.
- W2168839459 hasConcept C192209626 @default.
- W2168839459 hasConcept C28826006 @default.