Matches in SemOpenAlex for { <https://semopenalex.org/work/W4384616108> ?p ?o ?g. }
Showing items 1 to 100 of
100
with 100 items per page.
- W4384616108 abstract "Reinforcement learning has revolutionized our understanding of evolved systems and our ability to engineer systems based on a theoretical framework for understanding how to maximize expected reward. However, time delays between the observation and action are estimated to be roughly $ensuremath{sim}150phantom{rule{0.16em}{0ex}}mathrm{ms}$ for humans, and this should affect reinforcement learning algorithms. We reformulate the Markov Decision Process framework to include time delays in action, first deriving a new Bellman equation in a way that unifies previous attempts and then implementing the corresponding SARSA-like algorithm. The main ramification---potentially useful for both evolved and engineered systems---is that, when the size of the state space is lower than that of the action space, the modified reinforcement learning algorithms will prefer to operate on sequences of states rather than just the present state with the length of the sequence equal to 1 plus the time delay." @default.
- W4384616108 created "2023-07-18" @default.
- W4384616108 creator A5003673584 @default.
- W4384616108 creator A5034685587 @default.
- W4384616108 creator A5092489800 @default.
- W4384616108 date "2023-07-18" @default.
- W4384616108 modified "2023-09-25" @default.
- W4384616108 title "Framework for solving time-delayed Markov Decision Processes" @default.
- W4384616108 cites W1652173018 @default.
- W4384616108 cites W1970006557 @default.
- W4384616108 cites W1974049455 @default.
- W4384616108 cites W1977655452 @default.
- W4384616108 cites W1986389067 @default.
- W4384616108 cites W2053816989 @default.
- W4384616108 cites W2091845343 @default.
- W4384616108 cites W2104148727 @default.
- W4384616108 cites W2104974239 @default.
- W4384616108 cites W2117726420 @default.
- W4384616108 cites W2117864026 @default.
- W4384616108 cites W2144662207 @default.
- W4384616108 cites W2150393154 @default.
- W4384616108 cites W3013672175 @default.
- W4384616108 cites W3082381308 @default.
- W4384616108 cites W4242962246 @default.
- W4384616108 cites W4280567972 @default.
- W4384616108 cites W4299401133 @default.
- W4384616108 doi "https://doi.org/10.1103/physrevresearch.5.033034" @default.
- W4384616108 hasPublicationYear "2023" @default.
- W4384616108 type Work @default.
- W4384616108 citedByCount "0" @default.
- W4384616108 crossrefType "journal-article" @default.
- W4384616108 hasAuthorship W4384616108A5003673584 @default.
- W4384616108 hasAuthorship W4384616108A5034685587 @default.
- W4384616108 hasAuthorship W4384616108A5092489800 @default.
- W4384616108 hasBestOaLocation W43846161081 @default.
- W4384616108 hasConcept C105795698 @default.
- W4384616108 hasConcept C106189395 @default.
- W4384616108 hasConcept C111919701 @default.
- W4384616108 hasConcept C11413529 @default.
- W4384616108 hasConcept C119857082 @default.
- W4384616108 hasConcept C121332964 @default.
- W4384616108 hasConcept C126255220 @default.
- W4384616108 hasConcept C154945302 @default.
- W4384616108 hasConcept C159886148 @default.
- W4384616108 hasConcept C17098449 @default.
- W4384616108 hasConcept C2778112365 @default.
- W4384616108 hasConcept C2778572836 @default.
- W4384616108 hasConcept C2780791683 @default.
- W4384616108 hasConcept C33923547 @default.
- W4384616108 hasConcept C41008148 @default.
- W4384616108 hasConcept C48103436 @default.
- W4384616108 hasConcept C54355233 @default.
- W4384616108 hasConcept C62520636 @default.
- W4384616108 hasConcept C72434380 @default.
- W4384616108 hasConcept C86803240 @default.
- W4384616108 hasConcept C97541855 @default.
- W4384616108 hasConcept C98045186 @default.
- W4384616108 hasConcept C98763669 @default.
- W4384616108 hasConceptScore W4384616108C105795698 @default.
- W4384616108 hasConceptScore W4384616108C106189395 @default.
- W4384616108 hasConceptScore W4384616108C111919701 @default.
- W4384616108 hasConceptScore W4384616108C11413529 @default.
- W4384616108 hasConceptScore W4384616108C119857082 @default.
- W4384616108 hasConceptScore W4384616108C121332964 @default.
- W4384616108 hasConceptScore W4384616108C126255220 @default.
- W4384616108 hasConceptScore W4384616108C154945302 @default.
- W4384616108 hasConceptScore W4384616108C159886148 @default.
- W4384616108 hasConceptScore W4384616108C17098449 @default.
- W4384616108 hasConceptScore W4384616108C2778112365 @default.
- W4384616108 hasConceptScore W4384616108C2778572836 @default.
- W4384616108 hasConceptScore W4384616108C2780791683 @default.
- W4384616108 hasConceptScore W4384616108C33923547 @default.
- W4384616108 hasConceptScore W4384616108C41008148 @default.
- W4384616108 hasConceptScore W4384616108C48103436 @default.
- W4384616108 hasConceptScore W4384616108C54355233 @default.
- W4384616108 hasConceptScore W4384616108C62520636 @default.
- W4384616108 hasConceptScore W4384616108C72434380 @default.
- W4384616108 hasConceptScore W4384616108C86803240 @default.
- W4384616108 hasConceptScore W4384616108C97541855 @default.
- W4384616108 hasConceptScore W4384616108C98045186 @default.
- W4384616108 hasConceptScore W4384616108C98763669 @default.
- W4384616108 hasFunder F4320338279 @default.
- W4384616108 hasIssue "3" @default.
- W4384616108 hasLocation W43846161081 @default.
- W4384616108 hasOpenAccess W4384616108 @default.
- W4384616108 hasPrimaryLocation W43846161081 @default.
- W4384616108 hasRelatedWork W1966071689 @default.
- W4384616108 hasRelatedWork W1996326480 @default.
- W4384616108 hasRelatedWork W2133764300 @default.
- W4384616108 hasRelatedWork W2149476049 @default.
- W4384616108 hasRelatedWork W2314068453 @default.
- W4384616108 hasRelatedWork W2381909226 @default.
- W4384616108 hasRelatedWork W2482498454 @default.
- W4384616108 hasRelatedWork W3201878770 @default.
- W4384616108 hasRelatedWork W4246015605 @default.
- W4384616108 hasRelatedWork W60247044 @default.
- W4384616108 hasVolume "5" @default.
- W4384616108 isParatext "false" @default.
- W4384616108 isRetracted "false" @default.
- W4384616108 workType "article" @default.