Matches in SemOpenAlex for { <https://semopenalex.org/work/W2902098903> ?p ?o ?g. }
Showing items 1 to 88 of
88
with 100 items per page.
- W2902098903 abstract "We know from reinforcement learning theory that temporal difference learning can fail in certain cases. Sutton and Barto (2018) identify a deadly triad of function approximation, bootstrapping, and off-policy learning. When these three properties are combined, learning can diverge with the value estimates becoming unbounded. However, several algorithms successfully combine these three properties, which indicates that there is at least a partial gap in our understanding. In this work, we investigate the impact of the deadly triad in practice, in the context of a family of popular deep reinforcement learning models - deep Q-networks trained with experience replay - analysing how the components of this system play a role in the emergence of the deadly triad, and in the agent's performance" @default.
- W2902098903 created "2018-12-11" @default.
- W2902098903 creator A5006017771 @default.
- W2902098903 creator A5024394972 @default.
- W2902098903 creator A5033070893 @default.
- W2902098903 creator A5033135596 @default.
- W2902098903 creator A5036908874 @default.
- W2902098903 creator A5054065284 @default.
- W2902098903 date "2018-12-10" @default.
- W2902098903 modified "2023-09-23" @default.
- W2902098903 title "Deep Reinforcement Learning and the Deadly Triad" @default.
- W2902098903 cites W1515851193 @default.
- W2902098903 cites W1547925194 @default.
- W2902098903 cites W1600046456 @default.
- W2902098903 cites W1646707810 @default.
- W2902098903 cites W1757796397 @default.
- W2902098903 cites W2028145673 @default.
- W2902098903 cites W2094387729 @default.
- W2902098903 cites W2100677568 @default.
- W2902098903 cites W2119567691 @default.
- W2902098903 cites W2121863487 @default.
- W2902098903 cites W2139418546 @default.
- W2902098903 cites W2145339207 @default.
- W2902098903 cites W2155968351 @default.
- W2902098903 cites W2173564293 @default.
- W2902098903 cites W2341171179 @default.
- W2902098903 cites W2473364827 @default.
- W2902098903 cites W2761873684 @default.
- W2902098903 cites W2786928559 @default.
- W2902098903 cites W2953334758 @default.
- W2902098903 cites W2963477884 @default.
- W2902098903 cites W2964043796 @default.
- W2902098903 cites W2964121744 @default.
- W2902098903 cites W3011120880 @default.
- W2902098903 cites W359568995 @default.
- W2902098903 cites W755046805 @default.
- W2902098903 cites W42630095 @default.
- W2902098903 hasPublicationYear "2018" @default.
- W2902098903 type Work @default.
- W2902098903 sameAs 2902098903 @default.
- W2902098903 citedByCount "56" @default.
- W2902098903 countsByYear W29020989032018 @default.
- W2902098903 countsByYear W29020989032019 @default.
- W2902098903 countsByYear W29020989032020 @default.
- W2902098903 countsByYear W29020989032021 @default.
- W2902098903 countsByYear W29020989032022 @default.
- W2902098903 crossrefType "posted-content" @default.
- W2902098903 hasAuthorship W2902098903A5006017771 @default.
- W2902098903 hasAuthorship W2902098903A5024394972 @default.
- W2902098903 hasAuthorship W2902098903A5033070893 @default.
- W2902098903 hasAuthorship W2902098903A5033135596 @default.
- W2902098903 hasAuthorship W2902098903A5036908874 @default.
- W2902098903 hasAuthorship W2902098903A5054065284 @default.
- W2902098903 hasBestOaLocation W29020989032 @default.
- W2902098903 hasConcept C11171543 @default.
- W2902098903 hasConcept C154945302 @default.
- W2902098903 hasConcept C15744967 @default.
- W2902098903 hasConcept C176038584 @default.
- W2902098903 hasConcept C41008148 @default.
- W2902098903 hasConcept C67203356 @default.
- W2902098903 hasConcept C77805123 @default.
- W2902098903 hasConcept C97541855 @default.
- W2902098903 hasConceptScore W2902098903C11171543 @default.
- W2902098903 hasConceptScore W2902098903C154945302 @default.
- W2902098903 hasConceptScore W2902098903C15744967 @default.
- W2902098903 hasConceptScore W2902098903C176038584 @default.
- W2902098903 hasConceptScore W2902098903C41008148 @default.
- W2902098903 hasConceptScore W2902098903C67203356 @default.
- W2902098903 hasConceptScore W2902098903C77805123 @default.
- W2902098903 hasConceptScore W2902098903C97541855 @default.
- W2902098903 hasLocation W29020989031 @default.
- W2902098903 hasLocation W29020989032 @default.
- W2902098903 hasOpenAccess W2902098903 @default.
- W2902098903 hasPrimaryLocation W29020989031 @default.
- W2902098903 hasRelatedWork W1965768489 @default.
- W2902098903 hasRelatedWork W1968918481 @default.
- W2902098903 hasRelatedWork W1972770565 @default.
- W2902098903 hasRelatedWork W1981082505 @default.
- W2902098903 hasRelatedWork W1990835700 @default.
- W2902098903 hasRelatedWork W2022498796 @default.
- W2902098903 hasRelatedWork W2062574812 @default.
- W2902098903 hasRelatedWork W2076575706 @default.
- W2902098903 hasRelatedWork W2077383779 @default.
- W2902098903 hasRelatedWork W2091779189 @default.
- W2902098903 isParatext "false" @default.
- W2902098903 isRetracted "false" @default.
- W2902098903 magId "2902098903" @default.
- W2902098903 workType "article" @default.