Matches in SemOpenAlex for { <https://semopenalex.org/work/W2912757816> ?p ?o ?g. }
- W2912757816 abstract "In this paper we consider the problem of how a reinforcement learning agent that is tasked with solving a sequence of reinforcement learning problems (a sequence of Markov decision processes) can use knowledge acquired early in its lifetime to improve its ability to solve new problems. We argue that previous experience with similar problems can provide an agent with information about how it should explore when facing a new but related problem. We show that the search for an optimal exploration strategy can be formulated as a reinforcement learning problem itself and demonstrate that such strategy can leverage patterns found in the structure of related problems. We conclude with experiments that show the benefits of optimizing an exploration strategy using our proposed approach." @default.
- W2912757816 created "2019-02-21" @default.
- W2912757816 creator A5066332280 @default.
- W2912757816 creator A5083266453 @default.
- W2912757816 date "2019-02-03" @default.
- W2912757816 modified "2023-10-12" @default.
- W2912757816 title "A Meta-MDP Approach to Exploration for Lifelong Reinforcement Learning" @default.
- W2912757816 cites W1515851193 @default.
- W2912757816 cites W157259654 @default.
- W2912757816 cites W1586504939 @default.
- W2912757816 cites W2002260889 @default.
- W2912757816 cites W2031727428 @default.
- W2912757816 cites W2050884147 @default.
- W2912757816 cites W2111625828 @default.
- W2912757816 cites W2119717200 @default.
- W2912757816 cites W2143435603 @default.
- W2912757816 cites W2143958939 @default.
- W2912757816 cites W2145339207 @default.
- W2912757816 cites W2154623234 @default.
- W2912757816 cites W2186389117 @default.
- W2912757816 cites W2514775068 @default.
- W2912757816 cites W2544765807 @default.
- W2912757816 cites W2604763608 @default.
- W2912757816 cites W2606568940 @default.
- W2912757816 cites W2614839826 @default.
- W2912757816 cites W2624731731 @default.
- W2912757816 cites W2663108269 @default.
- W2912757816 cites W2736601468 @default.
- W2912757816 cites W2767175863 @default.
- W2912757816 cites W2797028887 @default.
- W2912757816 cites W2949475445 @default.
- W2912757816 cites W2950040888 @default.
- W2912757816 cites W2952366022 @default.
- W2912757816 cites W2963184621 @default.
- W2912757816 cites W2963775850 @default.
- W2912757816 cites W2964054583 @default.
- W2912757816 hasPublicationYear "2019" @default.
- W2912757816 type Work @default.
- W2912757816 sameAs 2912757816 @default.
- W2912757816 citedByCount "0" @default.
- W2912757816 crossrefType "posted-content" @default.
- W2912757816 hasAuthorship W2912757816A5066332280 @default.
- W2912757816 hasAuthorship W2912757816A5083266453 @default.
- W2912757816 hasConcept C105795698 @default.
- W2912757816 hasConcept C106189395 @default.
- W2912757816 hasConcept C108771440 @default.
- W2912757816 hasConcept C119857082 @default.
- W2912757816 hasConcept C127413603 @default.
- W2912757816 hasConcept C153083717 @default.
- W2912757816 hasConcept C154945302 @default.
- W2912757816 hasConcept C15744967 @default.
- W2912757816 hasConcept C159886148 @default.
- W2912757816 hasConcept C19417346 @default.
- W2912757816 hasConcept C2778112365 @default.
- W2912757816 hasConcept C33923547 @default.
- W2912757816 hasConcept C41008148 @default.
- W2912757816 hasConcept C47932503 @default.
- W2912757816 hasConcept C54355233 @default.
- W2912757816 hasConcept C66938386 @default.
- W2912757816 hasConcept C67203356 @default.
- W2912757816 hasConcept C86803240 @default.
- W2912757816 hasConcept C97541855 @default.
- W2912757816 hasConceptScore W2912757816C105795698 @default.
- W2912757816 hasConceptScore W2912757816C106189395 @default.
- W2912757816 hasConceptScore W2912757816C108771440 @default.
- W2912757816 hasConceptScore W2912757816C119857082 @default.
- W2912757816 hasConceptScore W2912757816C127413603 @default.
- W2912757816 hasConceptScore W2912757816C153083717 @default.
- W2912757816 hasConceptScore W2912757816C154945302 @default.
- W2912757816 hasConceptScore W2912757816C15744967 @default.
- W2912757816 hasConceptScore W2912757816C159886148 @default.
- W2912757816 hasConceptScore W2912757816C19417346 @default.
- W2912757816 hasConceptScore W2912757816C2778112365 @default.
- W2912757816 hasConceptScore W2912757816C33923547 @default.
- W2912757816 hasConceptScore W2912757816C41008148 @default.
- W2912757816 hasConceptScore W2912757816C47932503 @default.
- W2912757816 hasConceptScore W2912757816C54355233 @default.
- W2912757816 hasConceptScore W2912757816C66938386 @default.
- W2912757816 hasConceptScore W2912757816C67203356 @default.
- W2912757816 hasConceptScore W2912757816C86803240 @default.
- W2912757816 hasConceptScore W2912757816C97541855 @default.
- W2912757816 hasLocation W29127578161 @default.
- W2912757816 hasOpenAccess W2912757816 @default.
- W2912757816 hasPrimaryLocation W29127578161 @default.
- W2912757816 hasRelatedWork W143164768 @default.
- W2912757816 hasRelatedWork W1576253121 @default.
- W2912757816 hasRelatedWork W1658094677 @default.
- W2912757816 hasRelatedWork W1884070896 @default.
- W2912757816 hasRelatedWork W2032854309 @default.
- W2912757816 hasRelatedWork W2062122188 @default.
- W2912757816 hasRelatedWork W2099945315 @default.
- W2912757816 hasRelatedWork W2140332127 @default.
- W2912757816 hasRelatedWork W2158548602 @default.
- W2912757816 hasRelatedWork W2225207041 @default.
- W2912757816 hasRelatedWork W2539164647 @default.
- W2912757816 hasRelatedWork W2906650924 @default.
- W2912757816 hasRelatedWork W2962845991 @default.
- W2912757816 hasRelatedWork W2963065769 @default.
- W2912757816 hasRelatedWork W2972555645 @default.
- W2912757816 hasRelatedWork W3072315125 @default.