Matches in SemOpenAlex for { <https://semopenalex.org/work/W1932117986> ?p ?o ?g. }
Showing items 1 to 95 of 95 with 100 items per page.
- W1932117986 abstract "The most widely used reinforcement learning (RL) algorithms, such as Q-learning and TD(λ) are limited to Markovian environments. Recent research on reinforcement learning algorithms has concentrated on partially observable Markov decision process (POMDP). The only way to overcome partial observability is to use memory to estimate state. In this paper, we present a new memory architecture of RL algorithms to solve certain type of POMDPs. Our algorithm, which we call labeling Q-learning (LQ-learning), is applied to test problems of simple mazes taken from recent literature. The results demonstrate LQ-learning's ability to work well in near optimal manner." @default.
- W1932117986 created "2016-06-24" @default.
- W1932117986 creator A5016065680 @default.
- W1932117986 creator A5027059197 @default.
- W1932117986 creator A5028642666 @default.
- W1932117986 date "2003-01-20" @default.
- W1932117986 modified "2023-09-24" @default.
- W1932117986 title "Labeling Q-learning for non-Markovian environments" @default.
- W1932117986 cites W1499371387 @default.
- W1932117986 cites W2100677568 @default.
- W1932117986 cites W2107726111 @default.
- W1932117986 cites W2113913482 @default.
- W1932117986 cites W2121863487 @default.
- W1932117986 cites W2158091072 @default.
- W1932117986 cites W2912185451 @default.
- W1932117986 cites W2914656440 @default.
- W1932117986 cites W32403112 @default.
- W1932117986 cites W6242441 @default.
- W1932117986 doi "https://doi.org/10.1109/icsmc.1999.815599" @default.
- W1932117986 hasPublicationYear "2003" @default.
- W1932117986 type Work @default.
- W1932117986 sameAs 1932117986 @default.
- W1932117986 citedByCount "2" @default.
- W1932117986 countsByYear W19321179862020 @default.
- W1932117986 crossrefType "proceedings-article" @default.
- W1932117986 hasAuthorship W1932117986A5016065680 @default.
- W1932117986 hasAuthorship W1932117986A5027059197 @default.
- W1932117986 hasAuthorship W1932117986A5028642666 @default.
- W1932117986 hasConcept C105795698 @default.
- W1932117986 hasConcept C106189395 @default.
- W1932117986 hasConcept C111472728 @default.
- W1932117986 hasConcept C119857082 @default.
- W1932117986 hasConcept C121332964 @default.
- W1932117986 hasConcept C138885662 @default.
- W1932117986 hasConcept C154945302 @default.
- W1932117986 hasConcept C159886148 @default.
- W1932117986 hasConcept C163836022 @default.
- W1932117986 hasConcept C17098449 @default.
- W1932117986 hasConcept C188116033 @default.
- W1932117986 hasConcept C2780586882 @default.
- W1932117986 hasConcept C28826006 @default.
- W1932117986 hasConcept C32848918 @default.
- W1932117986 hasConcept C33923547 @default.
- W1932117986 hasConcept C36299963 @default.
- W1932117986 hasConcept C41008148 @default.
- W1932117986 hasConcept C62520636 @default.
- W1932117986 hasConcept C97541855 @default.
- W1932117986 hasConcept C98763669 @default.
- W1932117986 hasConceptScore W1932117986C105795698 @default.
- W1932117986 hasConceptScore W1932117986C106189395 @default.
- W1932117986 hasConceptScore W1932117986C111472728 @default.
- W1932117986 hasConceptScore W1932117986C119857082 @default.
- W1932117986 hasConceptScore W1932117986C121332964 @default.
- W1932117986 hasConceptScore W1932117986C138885662 @default.
- W1932117986 hasConceptScore W1932117986C154945302 @default.
- W1932117986 hasConceptScore W1932117986C159886148 @default.
- W1932117986 hasConceptScore W1932117986C163836022 @default.
- W1932117986 hasConceptScore W1932117986C17098449 @default.
- W1932117986 hasConceptScore W1932117986C188116033 @default.
- W1932117986 hasConceptScore W1932117986C2780586882 @default.
- W1932117986 hasConceptScore W1932117986C28826006 @default.
- W1932117986 hasConceptScore W1932117986C32848918 @default.
- W1932117986 hasConceptScore W1932117986C33923547 @default.
- W1932117986 hasConceptScore W1932117986C36299963 @default.
- W1932117986 hasConceptScore W1932117986C41008148 @default.
- W1932117986 hasConceptScore W1932117986C62520636 @default.
- W1932117986 hasConceptScore W1932117986C97541855 @default.
- W1932117986 hasConceptScore W1932117986C98763669 @default.
- W1932117986 hasLocation W19321179861 @default.
- W1932117986 hasOpenAccess W1932117986 @default.
- W1932117986 hasPrimaryLocation W19321179861 @default.
- W1932117986 hasRelatedWork W1486341833 @default.
- W1932117986 hasRelatedWork W1521228173 @default.
- W1932117986 hasRelatedWork W1583080569 @default.
- W1932117986 hasRelatedWork W1624779897 @default.
- W1932117986 hasRelatedWork W1860787181 @default.
- W1932117986 hasRelatedWork W2008491003 @default.
- W1932117986 hasRelatedWork W2014081627 @default.
- W1932117986 hasRelatedWork W2033606355 @default.
- W1932117986 hasRelatedWork W2038771780 @default.
- W1932117986 hasRelatedWork W2080409389 @default.
- W1932117986 hasRelatedWork W2086724964 @default.
- W1932117986 hasRelatedWork W2113013121 @default.
- W1932117986 hasRelatedWork W2115615930 @default.
- W1932117986 hasRelatedWork W2147852745 @default.
- W1932117986 hasRelatedWork W2208012000 @default.
- W1932117986 hasRelatedWork W2557003457 @default.
- W1932117986 hasRelatedWork W2936135090 @default.
- W1932117986 hasRelatedWork W2967002752 @default.
- W1932117986 hasRelatedWork W3174939050 @default.
- W1932117986 hasRelatedWork W2100033121 @default.
- W1932117986 isParatext "false" @default.
- W1932117986 isRetracted "false" @default.
- W1932117986 magId "1932117986" @default.
- W1932117986 workType "article" @default.