Matches in SemOpenAlex for { <https://semopenalex.org/work/W1706571876> ?p ?o ?g. }
- W1706571876 endingPage "318" @default.
- W1706571876 startingPage "287" @default.
- W1706571876 abstract "Temporal difference (TD) methods constitute a class of methods for learning predictions in multi-step prediction problems, parameterized by a recency factor lambda. Currently the most important application of these methods is to temporal credit assignment in reinforcement learning. Well known reinforcement learning algorithms, such as AHC or Q-learning, may be viewed as instances of TD learning. This paper examines the issues of the efficient and general implementation of TD(lambda) for arbitrary lambda, for use with reinforcement learning algorithms optimizing the discounted sum of rewards. The traditional approach, based on eligibility traces, is argued to suffer from both inefficiency and lack of generality. The TTD (Truncated Temporal Differences) procedure is proposed as an alternative, that indeed only approximates TD(lambda), but requires very little computation per action and can be used with arbitrary function representation methods. The idea from which it is derived is fairly simple and not new, but probably unexplored so far. Encouraging experimental results are presented, suggesting that using lambda > 0 with the TTD procedure allows one to obtain a significant learning speedup at essentially the same cost as usual TD(0) learning." @default.
- W1706571876 created "2016-06-24" @default.
- W1706571876 creator A5068140368 @default.
- W1706571876 date "1995-01-01" @default.
- W1706571876 modified "2023-10-06" @default.
- W1706571876 title "Truncating Temporal Differences: On the Efficient Implementation of TD(lambda) for Reinforcement Learning" @default.
- W1706571876 cites W134786152 @default.
- W1706571876 cites W1491843047 @default.
- W1706571876 cites W1549353711 @default.
- W1706571876 cites W1569296262 @default.
- W1706571876 cites W1595483645 @default.
- W1706571876 cites W1599610710 @default.
- W1706571876 cites W1610678877 @default.
- W1706571876 cites W1965227651 @default.
- W1706571876 cites W2091565802 @default.
- W1706571876 cites W2100677568 @default.
- W1706571876 cites W2101242010 @default.
- W1706571876 cites W2103626435 @default.
- W1706571876 cites W2141559645 @default.
- W1706571876 cites W2158091072 @default.
- W1706571876 cites W2165131254 @default.
- W1706571876 cites W2808421695 @default.
- W1706571876 cites W3011120880 @default.
- W1706571876 cites W32403112 @default.
- W1706571876 doi "https://doi.org/10.1613/jair.135" @default.
- W1706571876 hasPublicationYear "1995" @default.
- W1706571876 type Work @default.
- W1706571876 sameAs 1706571876 @default.
- W1706571876 citedByCount "44" @default.
- W1706571876 countsByYear W17065718762012 @default.
- W1706571876 countsByYear W17065718762013 @default.
- W1706571876 countsByYear W17065718762014 @default.
- W1706571876 countsByYear W17065718762016 @default.
- W1706571876 countsByYear W17065718762017 @default.
- W1706571876 countsByYear W17065718762018 @default.
- W1706571876 countsByYear W17065718762020 @default.
- W1706571876 countsByYear W17065718762021 @default.
- W1706571876 countsByYear W17065718762022 @default.
- W1706571876 countsByYear W17065718762023 @default.
- W1706571876 crossrefType "journal-article" @default.
- W1706571876 hasAuthorship W1706571876A5068140368 @default.
- W1706571876 hasBestOaLocation W17065718761 @default.
- W1706571876 hasConcept C111919701 @default.
- W1706571876 hasConcept C11413529 @default.
- W1706571876 hasConcept C119857082 @default.
- W1706571876 hasConcept C120665830 @default.
- W1706571876 hasConcept C121332964 @default.
- W1706571876 hasConcept C14036430 @default.
- W1706571876 hasConcept C154945302 @default.
- W1706571876 hasConcept C15744967 @default.
- W1706571876 hasConcept C162324750 @default.
- W1706571876 hasConcept C165464430 @default.
- W1706571876 hasConcept C175444787 @default.
- W1706571876 hasConcept C17744445 @default.
- W1706571876 hasConcept C196340769 @default.
- W1706571876 hasConcept C199539241 @default.
- W1706571876 hasConcept C2776359362 @default.
- W1706571876 hasConcept C2778113609 @default.
- W1706571876 hasConcept C2778869765 @default.
- W1706571876 hasConcept C2780767217 @default.
- W1706571876 hasConcept C41008148 @default.
- W1706571876 hasConcept C45374587 @default.
- W1706571876 hasConcept C542102704 @default.
- W1706571876 hasConcept C68339613 @default.
- W1706571876 hasConcept C78458016 @default.
- W1706571876 hasConcept C86803240 @default.
- W1706571876 hasConcept C94625758 @default.
- W1706571876 hasConcept C97541855 @default.
- W1706571876 hasConceptScore W1706571876C111919701 @default.
- W1706571876 hasConceptScore W1706571876C11413529 @default.
- W1706571876 hasConceptScore W1706571876C119857082 @default.
- W1706571876 hasConceptScore W1706571876C120665830 @default.
- W1706571876 hasConceptScore W1706571876C121332964 @default.
- W1706571876 hasConceptScore W1706571876C14036430 @default.
- W1706571876 hasConceptScore W1706571876C154945302 @default.
- W1706571876 hasConceptScore W1706571876C15744967 @default.
- W1706571876 hasConceptScore W1706571876C162324750 @default.
- W1706571876 hasConceptScore W1706571876C165464430 @default.
- W1706571876 hasConceptScore W1706571876C175444787 @default.
- W1706571876 hasConceptScore W1706571876C17744445 @default.
- W1706571876 hasConceptScore W1706571876C196340769 @default.
- W1706571876 hasConceptScore W1706571876C199539241 @default.
- W1706571876 hasConceptScore W1706571876C2776359362 @default.
- W1706571876 hasConceptScore W1706571876C2778113609 @default.
- W1706571876 hasConceptScore W1706571876C2778869765 @default.
- W1706571876 hasConceptScore W1706571876C2780767217 @default.
- W1706571876 hasConceptScore W1706571876C41008148 @default.
- W1706571876 hasConceptScore W1706571876C45374587 @default.
- W1706571876 hasConceptScore W1706571876C542102704 @default.
- W1706571876 hasConceptScore W1706571876C68339613 @default.
- W1706571876 hasConceptScore W1706571876C78458016 @default.
- W1706571876 hasConceptScore W1706571876C86803240 @default.
- W1706571876 hasConceptScore W1706571876C94625758 @default.
- W1706571876 hasConceptScore W1706571876C97541855 @default.
- W1706571876 hasLocation W17065718761 @default.
- W1706571876 hasOpenAccess W1706571876 @default.
- W1706571876 hasPrimaryLocation W17065718761 @default.
- W1706571876 hasRelatedWork W1586189497 @default.