Matches in SemOpenAlex for { <https://semopenalex.org/work/W2963276097> ?p ?o ?g. }
Showing items 1 to 97 of
97
with 100 items per page.
- W2963276097 endingPage "1487" @default.
- W2963276097 startingPage "1479" @default.
- W2963276097 abstract "We consider an agent's uncertainty about its environment and the problem of generalizing this uncertainty across states. Specifically, we focus on the problem of exploration in non-tabular reinforcement learning. Drawing inspiration from the intrinsic motivation literature, we use density models to measure uncertainty, and propose a novel algorithm for deriving a pseudo-count from an arbitrary density model. This technique enables us to generalize count-based exploration algorithms to the non-tabular case. We apply our ideas to Atari 2600 games, providing sensible pseudo-counts from raw pixels. We transform these pseudo-counts into exploration bonuses and obtain significantly improved exploration in a number of hard games, including the infamously difficult MONTEZUMA'S REVENGE." @default.
- W2963276097 created "2019-07-30" @default.
- W2963276097 creator A5001087292 @default.
- W2963276097 creator A5016419651 @default.
- W2963276097 creator A5016618355 @default.
- W2963276097 creator A5017449169 @default.
- W2963276097 creator A5073997003 @default.
- W2963276097 creator A5081322018 @default.
- W2963276097 date "2016-12-05" @default.
- W2963276097 modified "2023-09-29" @default.
- W2963276097 title "Unifying count-based exploration and intrinsic motivation" @default.
- W2963276097 cites W1576452626 @default.
- W2963276097 cites W164946830 @default.
- W2963276097 cites W172298727 @default.
- W2963276097 cites W1917036087 @default.
- W2963276097 cites W1988526405 @default.
- W2963276097 cites W2101524054 @default.
- W2963276097 cites W2108147145 @default.
- W2963276097 cites W2124352385 @default.
- W2963276097 cites W2136065708 @default.
- W2963276097 cites W2139612737 @default.
- W2963276097 cites W2145339207 @default.
- W2963276097 cites W2155968351 @default.
- W2963276097 cites W2160589914 @default.
- W2963276097 cites W2181068523 @default.
- W2963276097 cites W2188721763 @default.
- W2963276097 cites W2267126114 @default.
- W2963276097 cites W2281341692 @default.
- W2963276097 cites W2417786368 @default.
- W2963276097 cites W2567415945 @default.
- W2963276097 hasPublicationYear "2016" @default.
- W2963276097 type Work @default.
- W2963276097 sameAs 2963276097 @default.
- W2963276097 citedByCount "384" @default.
- W2963276097 countsByYear W29632760972016 @default.
- W2963276097 countsByYear W29632760972017 @default.
- W2963276097 countsByYear W29632760972018 @default.
- W2963276097 countsByYear W29632760972019 @default.
- W2963276097 countsByYear W29632760972020 @default.
- W2963276097 countsByYear W29632760972021 @default.
- W2963276097 countsByYear W29632760972022 @default.
- W2963276097 countsByYear W29632760972023 @default.
- W2963276097 crossrefType "proceedings-article" @default.
- W2963276097 hasAuthorship W2963276097A5001087292 @default.
- W2963276097 hasAuthorship W2963276097A5016419651 @default.
- W2963276097 hasAuthorship W2963276097A5016618355 @default.
- W2963276097 hasAuthorship W2963276097A5017449169 @default.
- W2963276097 hasAuthorship W2963276097A5073997003 @default.
- W2963276097 hasAuthorship W2963276097A5081322018 @default.
- W2963276097 hasConcept C120665830 @default.
- W2963276097 hasConcept C121332964 @default.
- W2963276097 hasConcept C124101348 @default.
- W2963276097 hasConcept C154945302 @default.
- W2963276097 hasConcept C160633673 @default.
- W2963276097 hasConcept C192209626 @default.
- W2963276097 hasConcept C2780009758 @default.
- W2963276097 hasConcept C41008148 @default.
- W2963276097 hasConcept C97541855 @default.
- W2963276097 hasConceptScore W2963276097C120665830 @default.
- W2963276097 hasConceptScore W2963276097C121332964 @default.
- W2963276097 hasConceptScore W2963276097C124101348 @default.
- W2963276097 hasConceptScore W2963276097C154945302 @default.
- W2963276097 hasConceptScore W2963276097C160633673 @default.
- W2963276097 hasConceptScore W2963276097C192209626 @default.
- W2963276097 hasConceptScore W2963276097C2780009758 @default.
- W2963276097 hasConceptScore W2963276097C41008148 @default.
- W2963276097 hasConceptScore W2963276097C97541855 @default.
- W2963276097 hasLocation W29632760971 @default.
- W2963276097 hasOpenAccess W2963276097 @default.
- W2963276097 hasPrimaryLocation W29632760971 @default.
- W2963276097 hasRelatedWork W172298727 @default.
- W2963276097 hasRelatedWork W1757796397 @default.
- W2963276097 hasRelatedWork W1771410628 @default.
- W2963276097 hasRelatedWork W2121863487 @default.
- W2963276097 hasRelatedWork W2145339207 @default.
- W2963276097 hasRelatedWork W2158782408 @default.
- W2963276097 hasRelatedWork W2173248099 @default.
- W2963276097 hasRelatedWork W2257979135 @default.
- W2963276097 hasRelatedWork W2561776174 @default.
- W2963276097 hasRelatedWork W2736601468 @default.
- W2963276097 hasRelatedWork W2751973545 @default.
- W2963276097 hasRelatedWork W2899205164 @default.
- W2963276097 hasRelatedWork W2963160877 @default.
- W2963276097 hasRelatedWork W2963438456 @default.
- W2963276097 hasRelatedWork W2963639957 @default.
- W2963276097 hasRelatedWork W2963938771 @default.
- W2963276097 hasRelatedWork W2964043796 @default.
- W2963276097 hasRelatedWork W2964121744 @default.
- W2963276097 hasRelatedWork W3103780890 @default.
- W2963276097 hasRelatedWork W779494576 @default.
- W2963276097 hasVolume "29" @default.
- W2963276097 isParatext "false" @default.
- W2963276097 isRetracted "false" @default.
- W2963276097 magId "2963276097" @default.
- W2963276097 workType "article" @default.