Matches in SemOpenAlex for { <https://semopenalex.org/work/W2383312578> ?p ?o ?g. }
Showing items 1 to 87 of
87
with 100 items per page.
- W2383312578 abstract "A reinforcement-learning agent solves its decision problems by learning optimal decision mapping from a state to an action. Reinforcement learning is a method in which an agent improves its actions by using experiment and interacting with environment. Markov decision process (MDP)model is the general frame for solving reinforcement learning problems. TD(λ) is the learning value functions algorithm related with policy in Markov decision processes. Generally, the agent must remember all of its value functions. This remembering quantity becomes very large while the state space increases its range of size. A forgetting algorithm is given, in which the forgetting principle in psychology is introduced into reinforcement learning.With the use of forgetting algorithm, the large POMDP problems are solved." @default.
- W2383312578 created "2016-06-24" @default.
- W2383312578 creator A5000057250 @default.
- W2383312578 date "2003-01-01" @default.
- W2383312578 modified "2023-09-25" @default.
- W2383312578 title "Average Asymptotic Temporal Difference LearningForgetting Algorithm Based on Eligibility Trace" @default.
- W2383312578 hasPublicationYear "2003" @default.
- W2383312578 type Work @default.
- W2383312578 sameAs 2383312578 @default.
- W2383312578 citedByCount "0" @default.
- W2383312578 crossrefType "journal-article" @default.
- W2383312578 hasAuthorship W2383312578A5000057250 @default.
- W2383312578 hasConcept C105795698 @default.
- W2383312578 hasConcept C106189395 @default.
- W2383312578 hasConcept C119857082 @default.
- W2383312578 hasConcept C126255220 @default.
- W2383312578 hasConcept C127413603 @default.
- W2383312578 hasConcept C138885662 @default.
- W2383312578 hasConcept C14646407 @default.
- W2383312578 hasConcept C146978453 @default.
- W2383312578 hasConcept C154945302 @default.
- W2383312578 hasConcept C159886148 @default.
- W2383312578 hasConcept C17098449 @default.
- W2383312578 hasConcept C188116033 @default.
- W2383312578 hasConcept C196340769 @default.
- W2383312578 hasConcept C204323151 @default.
- W2383312578 hasConcept C33923547 @default.
- W2383312578 hasConcept C41008148 @default.
- W2383312578 hasConcept C41895202 @default.
- W2383312578 hasConcept C66938386 @default.
- W2383312578 hasConcept C67203356 @default.
- W2383312578 hasConcept C7149132 @default.
- W2383312578 hasConcept C72434380 @default.
- W2383312578 hasConcept C75291252 @default.
- W2383312578 hasConcept C97541855 @default.
- W2383312578 hasConcept C98763669 @default.
- W2383312578 hasConceptScore W2383312578C105795698 @default.
- W2383312578 hasConceptScore W2383312578C106189395 @default.
- W2383312578 hasConceptScore W2383312578C119857082 @default.
- W2383312578 hasConceptScore W2383312578C126255220 @default.
- W2383312578 hasConceptScore W2383312578C127413603 @default.
- W2383312578 hasConceptScore W2383312578C138885662 @default.
- W2383312578 hasConceptScore W2383312578C14646407 @default.
- W2383312578 hasConceptScore W2383312578C146978453 @default.
- W2383312578 hasConceptScore W2383312578C154945302 @default.
- W2383312578 hasConceptScore W2383312578C159886148 @default.
- W2383312578 hasConceptScore W2383312578C17098449 @default.
- W2383312578 hasConceptScore W2383312578C188116033 @default.
- W2383312578 hasConceptScore W2383312578C196340769 @default.
- W2383312578 hasConceptScore W2383312578C204323151 @default.
- W2383312578 hasConceptScore W2383312578C33923547 @default.
- W2383312578 hasConceptScore W2383312578C41008148 @default.
- W2383312578 hasConceptScore W2383312578C41895202 @default.
- W2383312578 hasConceptScore W2383312578C66938386 @default.
- W2383312578 hasConceptScore W2383312578C67203356 @default.
- W2383312578 hasConceptScore W2383312578C7149132 @default.
- W2383312578 hasConceptScore W2383312578C72434380 @default.
- W2383312578 hasConceptScore W2383312578C75291252 @default.
- W2383312578 hasConceptScore W2383312578C97541855 @default.
- W2383312578 hasConceptScore W2383312578C98763669 @default.
- W2383312578 hasLocation W23833125781 @default.
- W2383312578 hasOpenAccess W2383312578 @default.
- W2383312578 hasPrimaryLocation W23833125781 @default.
- W2383312578 hasRelatedWork W1521228173 @default.
- W2383312578 hasRelatedWork W1547859793 @default.
- W2383312578 hasRelatedWork W1895872655 @default.
- W2383312578 hasRelatedWork W2006115215 @default.
- W2383312578 hasRelatedWork W2059069309 @default.
- W2383312578 hasRelatedWork W2067483065 @default.
- W2383312578 hasRelatedWork W2097031964 @default.
- W2383312578 hasRelatedWork W2107222090 @default.
- W2383312578 hasRelatedWork W2117626647 @default.
- W2383312578 hasRelatedWork W2144655553 @default.
- W2383312578 hasRelatedWork W2146763310 @default.
- W2383312578 hasRelatedWork W2348457532 @default.
- W2383312578 hasRelatedWork W2357817307 @default.
- W2383312578 hasRelatedWork W2370576243 @default.
- W2383312578 hasRelatedWork W2524346060 @default.
- W2383312578 hasRelatedWork W2892172348 @default.
- W2383312578 hasRelatedWork W2964010185 @default.
- W2383312578 hasRelatedWork W3139043417 @default.
- W2383312578 hasRelatedWork W315747311 @default.
- W2383312578 hasRelatedWork W3174939050 @default.
- W2383312578 isParatext "false" @default.
- W2383312578 isRetracted "false" @default.
- W2383312578 magId "2383312578" @default.
- W2383312578 workType "article" @default.