Matches in SemOpenAlex for { <https://semopenalex.org/work/W2981668237> ?p ?o ?g. }
Showing items 1 to 80 of 80 with 100 items per page.
- W2981668237 abstract "Reinforcement Learning makes it possible to train an agent via interaction with the environment. However, in the majority of real-world scenarios, the extrinsic feedback is sparse or insufficient, so intrinsic reward formulations are needed to train the agent successfully. This work investigates and extends the paradigm of curiosity-driven exploration. First, a probabilistic approach is taken to exploit the advantages of the attention mechanism, which has been successfully applied in other domains of Deep Learning. Combining them, we propose new methods, such as AttA2C, an extension of the Actor-Critic framework. Second, another curiosity-based approach, ICM, is extended. The proposed model utilizes attention to emphasize features for the dynamic models within ICM. Moreover, we modify the loss function, resulting in a new curiosity formulation, which we call rational curiosity. The corresponding implementation can be found at this https URL." @default.
- W2981668237 created "2019-11-01" @default.
- W2981668237 creator A5013735103 @default.
- W2981668237 creator A5034660028 @default.
- W2981668237 date "2019-10-23" @default.
- W2981668237 modified "2023-09-27" @default.
- W2981668237 title "Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning." @default.
- W2981668237 cites W2099471712 @default.
- W2981668237 cites W2121863487 @default.
- W2981668237 cites W2614839826 @default.
- W2981668237 cites W2626778328 @default.
- W2981668237 cites W2798405286 @default.
- W2981668237 cites W2885550588 @default.
- W2981668237 cites W2899771611 @default.
- W2981668237 cites W2948199691 @default.
- W2981668237 cites W2963966654 @default.
- W2981668237 cites W2964043796 @default.
- W2981668237 cites W3037207827 @default.
- W2981668237 hasPublicationYear "2019" @default.
- W2981668237 type Work @default.
- W2981668237 sameAs 2981668237 @default.
- W2981668237 citedByCount "0" @default.
- W2981668237 crossrefType "posted-content" @default.
- W2981668237 hasAuthorship W2981668237A5013735103 @default.
- W2981668237 hasAuthorship W2981668237A5034660028 @default.
- W2981668237 hasConcept C107457646 @default.
- W2981668237 hasConcept C119857082 @default.
- W2981668237 hasConcept C14036430 @default.
- W2981668237 hasConcept C154945302 @default.
- W2981668237 hasConcept C15744967 @default.
- W2981668237 hasConcept C165696696 @default.
- W2981668237 hasConcept C33435437 @default.
- W2981668237 hasConcept C38652104 @default.
- W2981668237 hasConcept C41008148 @default.
- W2981668237 hasConcept C49937458 @default.
- W2981668237 hasConcept C77805123 @default.
- W2981668237 hasConcept C78458016 @default.
- W2981668237 hasConcept C86803240 @default.
- W2981668237 hasConcept C97541855 @default.
- W2981668237 hasConceptScore W2981668237C107457646 @default.
- W2981668237 hasConceptScore W2981668237C119857082 @default.
- W2981668237 hasConceptScore W2981668237C14036430 @default.
- W2981668237 hasConceptScore W2981668237C154945302 @default.
- W2981668237 hasConceptScore W2981668237C15744967 @default.
- W2981668237 hasConceptScore W2981668237C165696696 @default.
- W2981668237 hasConceptScore W2981668237C33435437 @default.
- W2981668237 hasConceptScore W2981668237C38652104 @default.
- W2981668237 hasConceptScore W2981668237C41008148 @default.
- W2981668237 hasConceptScore W2981668237C49937458 @default.
- W2981668237 hasConceptScore W2981668237C77805123 @default.
- W2981668237 hasConceptScore W2981668237C78458016 @default.
- W2981668237 hasConceptScore W2981668237C86803240 @default.
- W2981668237 hasConceptScore W2981668237C97541855 @default.
- W2981668237 hasLocation W29816682371 @default.
- W2981668237 hasOpenAccess W2981668237 @default.
- W2981668237 hasPrimaryLocation W29816682371 @default.
- W2981668237 hasRelatedWork W2328358444 @default.
- W2981668237 hasRelatedWork W2335959470 @default.
- W2981668237 hasRelatedWork W2751973545 @default.
- W2981668237 hasRelatedWork W2884970059 @default.
- W2981668237 hasRelatedWork W2913879094 @default.
- W2981668237 hasRelatedWork W2922388521 @default.
- W2981668237 hasRelatedWork W2953772919 @default.
- W2981668237 hasRelatedWork W2970909667 @default.
- W2981668237 hasRelatedWork W2982393770 @default.
- W2981668237 hasRelatedWork W2989640809 @default.
- W2981668237 hasRelatedWork W3010930801 @default.
- W2981668237 hasRelatedWork W3016913676 @default.
- W2981668237 hasRelatedWork W3036175138 @default.
- W2981668237 hasRelatedWork W3037636088 @default.
- W2981668237 hasRelatedWork W3042772609 @default.
- W2981668237 hasRelatedWork W3081661003 @default.
- W2981668237 hasRelatedWork W3085832734 @default.
- W2981668237 hasRelatedWork W3090196474 @default.
- W2981668237 hasRelatedWork W3169914016 @default.
- W2981668237 hasRelatedWork W3210493102 @default.
- W2981668237 isParatext "false" @default.
- W2981668237 isRetracted "false" @default.
- W2981668237 magId "2981668237" @default.
- W2981668237 workType "article" @default.