Matches in SemOpenAlex for { <https://semopenalex.org/work/W2991032634> ?p ?o ?g. }
- W2991032634 abstract "Balancing exploration and exploitation is a fundamental part of reinforcement learning, yet most state-of-the-art algorithms use a naive exploration protocol like $epsilon$-greedy. This contributes to the problem of high sample complexity, as the algorithm wastes effort by repeatedly visiting parts of the state space that have already been explored. We introduce a novel method based on Bayesian linear regression and latent space embedding to generate an intrinsic reward signal that encourages the learning agent to seek out unexplored parts of the state space. This method is computationally efficient, simple to implement, and can extend any state-of-the-art reinforcement learning algorithm. We evaluate the method on a range of algorithms and challenging control tasks, on both simulated and physical robots, demonstrating how the proposed method can significantly improve sample complexity." @default.
- W2991032634 created "2019-12-05" @default.
- W2991032634 creator A5022111646 @default.
- W2991032634 creator A5033725695 @default.
- W2991032634 creator A5062619542 @default.
- W2991032634 date "2019-11-20" @default.
- W2991032634 modified "2023-09-27" @default.
- W2991032634 title "Bayesian Curiosity for Efficient Exploration in Reinforcement Learning." @default.
- W2991032634 cites W1480053368 @default.
- W2991032634 cites W1506806321 @default.
- W2991032634 cites W1771410628 @default.
- W2991032634 cites W1998498767 @default.
- W2991032634 cites W2039522160 @default.
- W2991032634 cites W2082511574 @default.
- W2991032634 cites W2108114251 @default.
- W2991032634 cites W2121863487 @default.
- W2991032634 cites W2145339207 @default.
- W2991032634 cites W2155027007 @default.
- W2991032634 cites W2170912685 @default.
- W2991032634 cites W2173248099 @default.
- W2991032634 cites W2247242831 @default.
- W2991032634 cites W2561776174 @default.
- W2991032634 cites W2593766708 @default.
- W2991032634 cites W2751973545 @default.
- W2991032634 cites W2898963091 @default.
- W2991032634 cites W2937555108 @default.
- W2991032634 cites W2949608212 @default.
- W2991032634 cites W2962767126 @default.
- W2991032634 cites W2962839548 @default.
- W2991032634 cites W2962996309 @default.
- W2991032634 cites W2963276097 @default.
- W2991032634 cites W2963438456 @default.
- W2991032634 cites W2963639957 @default.
- W2991032634 cites W2963641140 @default.
- W2991032634 cites W2963751259 @default.
- W2991032634 cites W2963797557 @default.
- W2991032634 cites W2964067469 @default.
- W2991032634 cites W2964121744 @default.
- W2991032634 cites W2997289589 @default.
- W2991032634 cites W3031547186 @default.
- W2991032634 cites W652265168 @default.
- W2991032634 hasPublicationYear "2019" @default.
- W2991032634 type Work @default.
- W2991032634 sameAs 2991032634 @default.
- W2991032634 citedByCount "2" @default.
- W2991032634 countsByYear W29910326342020 @default.
- W2991032634 crossrefType "posted-content" @default.
- W2991032634 hasAuthorship W2991032634A5022111646 @default.
- W2991032634 hasAuthorship W2991032634A5033725695 @default.
- W2991032634 hasAuthorship W2991032634A5062619542 @default.
- W2991032634 hasConcept C105795698 @default.
- W2991032634 hasConcept C111472728 @default.
- W2991032634 hasConcept C111919701 @default.
- W2991032634 hasConcept C11413529 @default.
- W2991032634 hasConcept C119857082 @default.
- W2991032634 hasConcept C127413603 @default.
- W2991032634 hasConcept C138885662 @default.
- W2991032634 hasConcept C146978453 @default.
- W2991032634 hasConcept C154945302 @default.
- W2991032634 hasConcept C185592680 @default.
- W2991032634 hasConcept C198531522 @default.
- W2991032634 hasConcept C204323151 @default.
- W2991032634 hasConcept C2778445095 @default.
- W2991032634 hasConcept C2778572836 @default.
- W2991032634 hasConcept C2780586882 @default.
- W2991032634 hasConcept C33923547 @default.
- W2991032634 hasConcept C41008148 @default.
- W2991032634 hasConcept C41608201 @default.
- W2991032634 hasConcept C43617362 @default.
- W2991032634 hasConcept C48103436 @default.
- W2991032634 hasConcept C72434380 @default.
- W2991032634 hasConcept C90509273 @default.
- W2991032634 hasConcept C97541855 @default.
- W2991032634 hasConceptScore W2991032634C105795698 @default.
- W2991032634 hasConceptScore W2991032634C111472728 @default.
- W2991032634 hasConceptScore W2991032634C111919701 @default.
- W2991032634 hasConceptScore W2991032634C11413529 @default.
- W2991032634 hasConceptScore W2991032634C119857082 @default.
- W2991032634 hasConceptScore W2991032634C127413603 @default.
- W2991032634 hasConceptScore W2991032634C138885662 @default.
- W2991032634 hasConceptScore W2991032634C146978453 @default.
- W2991032634 hasConceptScore W2991032634C154945302 @default.
- W2991032634 hasConceptScore W2991032634C185592680 @default.
- W2991032634 hasConceptScore W2991032634C198531522 @default.
- W2991032634 hasConceptScore W2991032634C204323151 @default.
- W2991032634 hasConceptScore W2991032634C2778445095 @default.
- W2991032634 hasConceptScore W2991032634C2778572836 @default.
- W2991032634 hasConceptScore W2991032634C2780586882 @default.
- W2991032634 hasConceptScore W2991032634C33923547 @default.
- W2991032634 hasConceptScore W2991032634C41008148 @default.
- W2991032634 hasConceptScore W2991032634C41608201 @default.
- W2991032634 hasConceptScore W2991032634C43617362 @default.
- W2991032634 hasConceptScore W2991032634C48103436 @default.
- W2991032634 hasConceptScore W2991032634C72434380 @default.
- W2991032634 hasConceptScore W2991032634C90509273 @default.
- W2991032634 hasConceptScore W2991032634C97541855 @default.
- W2991032634 hasLocation W29910326341 @default.
- W2991032634 hasOpenAccess W2991032634 @default.
- W2991032634 hasPrimaryLocation W29910326341 @default.
- W2991032634 hasRelatedWork W2132676037 @default.