Matches in SemOpenAlex for { <https://semopenalex.org/work/W2103061585> ?p ?o ?g. }
Showing items 1 to 85 of 85 with 100 items per page.
- W2103061585 abstract "This paper presents a modified version of U-tree (A.K. McCallum, 1996), a memory-based reinforcement learning (RL) algorithm that uses selective perception and short-term memory to handle partially observable Markovian decision processes (POMDP). Conventional RL algorithms rely on a set of pre-defined states to model the environment, even though it can learn the state transitions from experience. U-tree is not only able to do that, it can also build the state model by itself based on raw sensor inputs. This paper enhances U-Tree's model generation process. The paper also shows that because of the simplified and yet effective state model generated by U-tree, it is feasible and preferable to adopt the classical dynamic programming (DP) algorithm for average reward MDP to solve some difficult POMDP problems. The new U-tree is tested using a car-driving task with 31,224 world states, with the agent having very limited sensory information and little knowledge about the dynamics of the environment." @default.
- W2103061585 created "2016-06-24" @default.
- W2103061585 creator A5006841849 @default.
- W2103061585 creator A5063229067 @default.
- W2103061585 creator A5090385093 @default.
- W2103061585 date "2008-06-01" @default.
- W2103061585 modified "2023-10-16" @default.
- W2103061585 title "A memory-based reinforcement learning algorithm for partially observable Markovian decision processes" @default.
- W2103061585 cites W2098432798 @default.
- W2103061585 cites W2101242010 @default.
- W2103061585 cites W2107726111 @default.
- W2103061585 cites W2121863487 @default.
- W2103061585 cites W2169294731 @default.
- W2103061585 cites W2170120409 @default.
- W2103061585 cites W2341171179 @default.
- W2103061585 cites W3139460557 @default.
- W2103061585 doi "https://doi.org/10.1109/ijcnn.2008.4633888" @default.
- W2103061585 hasPublicationYear "2008" @default.
- W2103061585 type Work @default.
- W2103061585 sameAs 2103061585 @default.
- W2103061585 citedByCount "2" @default.
- W2103061585 countsByYear W21030615852021 @default.
- W2103061585 crossrefType "proceedings-article" @default.
- W2103061585 hasAuthorship W2103061585A5006841849 @default.
- W2103061585 hasAuthorship W2103061585A5063229067 @default.
- W2103061585 hasAuthorship W2103061585A5090385093 @default.
- W2103061585 hasConcept C105795698 @default.
- W2103061585 hasConcept C106189395 @default.
- W2103061585 hasConcept C113174947 @default.
- W2103061585 hasConcept C11413529 @default.
- W2103061585 hasConcept C119857082 @default.
- W2103061585 hasConcept C121332964 @default.
- W2103061585 hasConcept C134306372 @default.
- W2103061585 hasConcept C154945302 @default.
- W2103061585 hasConcept C159886148 @default.
- W2103061585 hasConcept C163836022 @default.
- W2103061585 hasConcept C17098449 @default.
- W2103061585 hasConcept C177264268 @default.
- W2103061585 hasConcept C199360897 @default.
- W2103061585 hasConcept C32848918 @default.
- W2103061585 hasConcept C33923547 @default.
- W2103061585 hasConcept C37404715 @default.
- W2103061585 hasConcept C41008148 @default.
- W2103061585 hasConcept C48103436 @default.
- W2103061585 hasConcept C62520636 @default.
- W2103061585 hasConcept C97541855 @default.
- W2103061585 hasConcept C98763669 @default.
- W2103061585 hasConceptScore W2103061585C105795698 @default.
- W2103061585 hasConceptScore W2103061585C106189395 @default.
- W2103061585 hasConceptScore W2103061585C113174947 @default.
- W2103061585 hasConceptScore W2103061585C11413529 @default.
- W2103061585 hasConceptScore W2103061585C119857082 @default.
- W2103061585 hasConceptScore W2103061585C121332964 @default.
- W2103061585 hasConceptScore W2103061585C134306372 @default.
- W2103061585 hasConceptScore W2103061585C154945302 @default.
- W2103061585 hasConceptScore W2103061585C159886148 @default.
- W2103061585 hasConceptScore W2103061585C163836022 @default.
- W2103061585 hasConceptScore W2103061585C17098449 @default.
- W2103061585 hasConceptScore W2103061585C177264268 @default.
- W2103061585 hasConceptScore W2103061585C199360897 @default.
- W2103061585 hasConceptScore W2103061585C32848918 @default.
- W2103061585 hasConceptScore W2103061585C33923547 @default.
- W2103061585 hasConceptScore W2103061585C37404715 @default.
- W2103061585 hasConceptScore W2103061585C41008148 @default.
- W2103061585 hasConceptScore W2103061585C48103436 @default.
- W2103061585 hasConceptScore W2103061585C62520636 @default.
- W2103061585 hasConceptScore W2103061585C97541855 @default.
- W2103061585 hasConceptScore W2103061585C98763669 @default.
- W2103061585 hasLocation W21030615851 @default.
- W2103061585 hasOpenAccess W2103061585 @default.
- W2103061585 hasPrimaryLocation W21030615851 @default.
- W2103061585 hasRelatedWork W1511927616 @default.
- W2103061585 hasRelatedWork W1541084404 @default.
- W2103061585 hasRelatedWork W1568770747 @default.
- W2103061585 hasRelatedWork W2146763310 @default.
- W2103061585 hasRelatedWork W2332673980 @default.
- W2103061585 hasRelatedWork W2347690758 @default.
- W2103061585 hasRelatedWork W2381909226 @default.
- W2103061585 hasRelatedWork W2922301831 @default.
- W2103061585 hasRelatedWork W3198564127 @default.
- W2103061585 hasRelatedWork W3211465897 @default.
- W2103061585 isParatext "false" @default.
- W2103061585 isRetracted "false" @default.
- W2103061585 magId "2103061585" @default.
- W2103061585 workType "article" @default.