Matches in SemOpenAlex for { <https://semopenalex.org/work/W3163174904> ?p ?o ?g. }
- W3163174904 abstract "In many deep reinforcement learning settings, when an agent takes an action, it repeats the same action a predefined number of times without observing the states until the next action-decision point. This technique of action repetition has several merits in training the agent, but the data between action-decision points (i.e., intermediate frames) are, in effect, discarded. Since the amount of training data is inversely proportional to the interval of action repeats, they can have a negative impact on the sample efficiency of training. In this paper, we propose a simple but effective approach to alleviate to this problem by introducing the concept of pseudo-actions. The key idea of our method is making the transition between action-decision points usable as training data by considering pseudo-actions. Pseudo-actions for continuous control tasks are obtained as the average of the action sequence straddling an action-decision point. For discrete control tasks, pseudo-actions are computed from learned action embeddings. This method can be combined with any model-free reinforcement learning algorithm that involves the learning of Q-functions. We demonstrate the effectiveness of our approach on both continuous and discrete control tasks in OpenAI Gym." @default.
- W3163174904 created "2021-05-24" @default.
- W3163174904 creator A5064113904 @default.
- W3163174904 creator A5074770940 @default.
- W3163174904 date "2021-05-07" @default.
- W3163174904 modified "2023-10-03" @default.
- W3163174904 title "Utilizing Skipped Frames in Action Repeats via Pseudo-Actions." @default.
- W3163174904 cites W1757796397 @default.
- W3163174904 cites W2155968351 @default.
- W3163174904 cites W2157864803 @default.
- W3163174904 cites W2158782408 @default.
- W3163174904 cites W2292128556 @default.
- W3163174904 cites W2592915494 @default.
- W3163174904 cites W2614839826 @default.
- W3163174904 cites W2736601468 @default.
- W3163174904 cites W2781585732 @default.
- W3163174904 cites W2900152462 @default.
- W3163174904 cites W2904246096 @default.
- W3163174904 cites W2913403708 @default.
- W3163174904 cites W2951032747 @default.
- W3163174904 cites W2962847657 @default.
- W3163174904 cites W2962902376 @default.
- W3163174904 cites W2963403143 @default.
- W3163174904 cites W2964158321 @default.
- W3163174904 cites W2964174623 @default.
- W3163174904 cites W2964291307 @default.
- W3163174904 cites W2977481643 @default.
- W3163174904 cites W2994714051 @default.
- W3163174904 cites W2995298643 @default.
- W3163174904 cites W3022566517 @default.
- W3163174904 cites W3023640063 @default.
- W3163174904 cites W3034607397 @default.
- W3163174904 cites W3101283005 @default.
- W3163174904 cites W3115293622 @default.
- W3163174904 hasPublicationYear "2021" @default.
- W3163174904 type Work @default.
- W3163174904 sameAs 3163174904 @default.
- W3163174904 citedByCount "0" @default.
- W3163174904 crossrefType "posted-content" @default.
- W3163174904 hasAuthorship W3163174904A5064113904 @default.
- W3163174904 hasAuthorship W3163174904A5074770940 @default.
- W3163174904 hasConcept C119857082 @default.
- W3163174904 hasConcept C121332964 @default.
- W3163174904 hasConcept C136764020 @default.
- W3163174904 hasConcept C154945302 @default.
- W3163174904 hasConcept C2524010 @default.
- W3163174904 hasConcept C26517878 @default.
- W3163174904 hasConcept C2775924081 @default.
- W3163174904 hasConcept C2778112365 @default.
- W3163174904 hasConcept C2780615836 @default.
- W3163174904 hasConcept C2780791683 @default.
- W3163174904 hasConcept C28719098 @default.
- W3163174904 hasConcept C33923547 @default.
- W3163174904 hasConcept C38652104 @default.
- W3163174904 hasConcept C41008148 @default.
- W3163174904 hasConcept C54355233 @default.
- W3163174904 hasConcept C62520636 @default.
- W3163174904 hasConcept C86803240 @default.
- W3163174904 hasConcept C97541855 @default.
- W3163174904 hasConceptScore W3163174904C119857082 @default.
- W3163174904 hasConceptScore W3163174904C121332964 @default.
- W3163174904 hasConceptScore W3163174904C136764020 @default.
- W3163174904 hasConceptScore W3163174904C154945302 @default.
- W3163174904 hasConceptScore W3163174904C2524010 @default.
- W3163174904 hasConceptScore W3163174904C26517878 @default.
- W3163174904 hasConceptScore W3163174904C2775924081 @default.
- W3163174904 hasConceptScore W3163174904C2778112365 @default.
- W3163174904 hasConceptScore W3163174904C2780615836 @default.
- W3163174904 hasConceptScore W3163174904C2780791683 @default.
- W3163174904 hasConceptScore W3163174904C28719098 @default.
- W3163174904 hasConceptScore W3163174904C33923547 @default.
- W3163174904 hasConceptScore W3163174904C38652104 @default.
- W3163174904 hasConceptScore W3163174904C41008148 @default.
- W3163174904 hasConceptScore W3163174904C54355233 @default.
- W3163174904 hasConceptScore W3163174904C62520636 @default.
- W3163174904 hasConceptScore W3163174904C86803240 @default.
- W3163174904 hasConceptScore W3163174904C97541855 @default.
- W3163174904 hasLocation W31631749041 @default.
- W3163174904 hasOpenAccess W3163174904 @default.
- W3163174904 hasPrimaryLocation W31631749041 @default.
- W3163174904 hasRelatedWork W115077809 @default.
- W3163174904 hasRelatedWork W1151968828 @default.
- W3163174904 hasRelatedWork W1561485809 @default.
- W3163174904 hasRelatedWork W197857547 @default.
- W3163174904 hasRelatedWork W2020648518 @default.
- W3163174904 hasRelatedWork W2097451572 @default.
- W3163174904 hasRelatedWork W2162817713 @default.
- W3163174904 hasRelatedWork W2165792602 @default.
- W3163174904 hasRelatedWork W2256420211 @default.
- W3163174904 hasRelatedWork W2283374337 @default.
- W3163174904 hasRelatedWork W2896412913 @default.
- W3163174904 hasRelatedWork W2903892364 @default.
- W3163174904 hasRelatedWork W2941168284 @default.
- W3163174904 hasRelatedWork W3126150352 @default.
- W3163174904 hasRelatedWork W3131249706 @default.
- W3163174904 hasRelatedWork W3188280031 @default.
- W3163174904 hasRelatedWork W3200221028 @default.
- W3163174904 hasRelatedWork W3206169115 @default.
- W3163174904 hasRelatedWork W2757627919 @default.
- W3163174904 hasRelatedWork W2984639565 @default.