Matches in SemOpenAlex for { <https://semopenalex.org/work/W3215733596> ?p ?o ?g. }
Showing items 1 to 85 of 85 with 100 items per page.
- W3215733596 abstract "Consistency is the theoretical property of a meta learning algorithm that ensures that, under certain assumptions, it can adapt to any task at test time. An open question is whether and how theoretical consistency translates into practice, in comparison to inconsistent algorithms. In this paper, we empirically investigate this question on a set of representative meta-RL algorithms. We find that theoretically consistent algorithms can indeed usually adapt to out-of-distribution (OOD) tasks, while inconsistent ones cannot, although they can still fail in practice for reasons like poor exploration. We further find that theoretically inconsistent algorithms can be made consistent by continuing to update all agent components on the OOD tasks, and adapt as well or better than originally consistent ones. We conclude that theoretical consistency is indeed a desirable property, and inconsistent meta-RL algorithms can easily be made consistent to enjoy the same benefits." @default.
- W3215733596 created "2021-12-06" @default.
- W3215733596 creator A5001815216 @default.
- W3215733596 creator A5020365771 @default.
- W3215733596 creator A5023346681 @default.
- W3215733596 creator A5025141355 @default.
- W3215733596 creator A5056879203 @default.
- W3215733596 date "2021-12-01" @default.
- W3215733596 modified "2023-10-17" @default.
- W3215733596 title "On the Practical Consistency of Meta-Reinforcement Learning Algorithms" @default.
- W3215733596 cites W1959608418 @default.
- W3215733596 cites W2141559023 @default.
- W3215733596 cites W2158782408 @default.
- W3215733596 cites W2550182557 @default.
- W3215733596 cites W2604763608 @default.
- W3215733596 cites W2785397462 @default.
- W3215733596 cites W2808682055 @default.
- W3215733596 cites W2914752403 @default.
- W3215733596 cites W2938321354 @default.
- W3215733596 cites W2963641140 @default.
- W3215733596 cites W2964227899 @default.
- W3215733596 cites W2964296021 @default.
- W3215733596 cites W2995686239 @default.
- W3215733596 cites W3035216917 @default.
- W3215733596 cites W3110161557 @default.
- W3215733596 cites W3137695714 @default.
- W3215733596 doi "https://doi.org/10.48550/arxiv.2112.00478" @default.
- W3215733596 hasPublicationYear "2021" @default.
- W3215733596 type Work @default.
- W3215733596 sameAs 3215733596 @default.
- W3215733596 citedByCount "0" @default.
- W3215733596 crossrefType "posted-content" @default.
- W3215733596 hasAuthorship W3215733596A5001815216 @default.
- W3215733596 hasAuthorship W3215733596A5020365771 @default.
- W3215733596 hasAuthorship W3215733596A5023346681 @default.
- W3215733596 hasAuthorship W3215733596A5025141355 @default.
- W3215733596 hasAuthorship W3215733596A5056879203 @default.
- W3215733596 hasBestOaLocation W32157335961 @default.
- W3215733596 hasConcept C111472728 @default.
- W3215733596 hasConcept C11413529 @default.
- W3215733596 hasConcept C119857082 @default.
- W3215733596 hasConcept C138885662 @default.
- W3215733596 hasConcept C154945302 @default.
- W3215733596 hasConcept C162324750 @default.
- W3215733596 hasConcept C177264268 @default.
- W3215733596 hasConcept C187736073 @default.
- W3215733596 hasConcept C189950617 @default.
- W3215733596 hasConcept C199360897 @default.
- W3215733596 hasConcept C2776436953 @default.
- W3215733596 hasConcept C2780451532 @default.
- W3215733596 hasConcept C2781002164 @default.
- W3215733596 hasConcept C41008148 @default.
- W3215733596 hasConcept C97541855 @default.
- W3215733596 hasConceptScore W3215733596C111472728 @default.
- W3215733596 hasConceptScore W3215733596C11413529 @default.
- W3215733596 hasConceptScore W3215733596C119857082 @default.
- W3215733596 hasConceptScore W3215733596C138885662 @default.
- W3215733596 hasConceptScore W3215733596C154945302 @default.
- W3215733596 hasConceptScore W3215733596C162324750 @default.
- W3215733596 hasConceptScore W3215733596C177264268 @default.
- W3215733596 hasConceptScore W3215733596C187736073 @default.
- W3215733596 hasConceptScore W3215733596C189950617 @default.
- W3215733596 hasConceptScore W3215733596C199360897 @default.
- W3215733596 hasConceptScore W3215733596C2776436953 @default.
- W3215733596 hasConceptScore W3215733596C2780451532 @default.
- W3215733596 hasConceptScore W3215733596C2781002164 @default.
- W3215733596 hasConceptScore W3215733596C41008148 @default.
- W3215733596 hasConceptScore W3215733596C97541855 @default.
- W3215733596 hasLocation W32157335961 @default.
- W3215733596 hasOpenAccess W3215733596 @default.
- W3215733596 hasPrimaryLocation W32157335961 @default.
- W3215733596 hasRelatedWork W3022038857 @default.
- W3215733596 hasRelatedWork W3092824172 @default.
- W3215733596 hasRelatedWork W3105036711 @default.
- W3215733596 hasRelatedWork W3187584516 @default.
- W3215733596 hasRelatedWork W3200361725 @default.
- W3215733596 hasRelatedWork W4225307033 @default.
- W3215733596 hasRelatedWork W4226082087 @default.
- W3215733596 hasRelatedWork W4287647350 @default.
- W3215733596 hasRelatedWork W4319083788 @default.
- W3215733596 hasRelatedWork W4319309271 @default.
- W3215733596 isParatext "false" @default.
- W3215733596 isRetracted "false" @default.
- W3215733596 magId "3215733596" @default.
- W3215733596 workType "article" @default.