Matches in SemOpenAlex for { <https://semopenalex.org/work/W2295618152> ?p ?o ?g. }
- W2295618152 abstract "Sampling and computation budgets are two of the key elements that determine the performance of a reinforcement learning algorithm. In essence, any reinforcement learning agent must sample the environment and perform some computation over the samples to decide its best action. Although fundamental, the trade-off between sampling and computation is still not well understood. In this paper, we explore this trade-off from an actor-critic perspective. First, we propose a new RL algorithm, Dyna-MLAC, which uses model-based actor-critic updates (MLAC) within the Dyna framework. Then, we show numerically that the convergence time of Dyna-MLAC is smaller than that of existing solutions, and that Dyna-MLAC allows the number of samples to be traded efficiently against computation time." @default.
- W2295618152 created "2016-06-24" @default.
- W2295618152 creator A5034604991 @default.
- W2295618152 creator A5043160730 @default.
- W2295618152 creator A5072025049 @default.
- W2295618152 date "2015-11-01" @default.
- W2295618152 modified "2023-10-12" @default.
- W2295618152 title "Dyna-MLAC: Trading Computational and Sample Complexities in Actor-Critic Reinforcement Learning" @default.
- W2295618152 cites W107583932 @default.
- W2295618152 cites W1491843047 @default.
- W2295618152 cites W1557517019 @default.
- W2295618152 cites W1626155273 @default.
- W2295618152 cites W1689445748 @default.
- W2295618152 cites W1966086707 @default.
- W2295618152 cites W1979638690 @default.
- W2295618152 cites W2080894081 @default.
- W2295618152 cites W2091565802 @default.
- W2295618152 cites W2100677568 @default.
- W2295618152 cites W2101124704 @default.
- W2295618152 cites W2113921460 @default.
- W2295618152 cites W2114537044 @default.
- W2295618152 cites W2119567691 @default.
- W2295618152 cites W2121863487 @default.
- W2295618152 cites W2131044023 @default.
- W2295618152 cites W2132001804 @default.
- W2295618152 cites W2140135625 @default.
- W2295618152 cites W21891419 @default.
- W2295618152 cites W2408978589 @default.
- W2295618152 cites W2951326042 @default.
- W2295618152 cites W32403112 @default.
- W2295618152 doi "https://doi.org/10.1109/bracis.2015.62" @default.
- W2295618152 hasPublicationYear "2015" @default.
- W2295618152 type Work @default.
- W2295618152 sameAs 2295618152 @default.
- W2295618152 citedByCount "1" @default.
- W2295618152 countsByYear W22956181522020 @default.
- W2295618152 crossrefType "proceedings-article" @default.
- W2295618152 hasAuthorship W2295618152A5034604991 @default.
- W2295618152 hasAuthorship W2295618152A5043160730 @default.
- W2295618152 hasAuthorship W2295618152A5072025049 @default.
- W2295618152 hasConcept C106131492 @default.
- W2295618152 hasConcept C11413529 @default.
- W2295618152 hasConcept C12713177 @default.
- W2295618152 hasConcept C140779682 @default.
- W2295618152 hasConcept C154945302 @default.
- W2295618152 hasConcept C162324750 @default.
- W2295618152 hasConcept C185592680 @default.
- W2295618152 hasConcept C198531522 @default.
- W2295618152 hasConcept C26517878 @default.
- W2295618152 hasConcept C2777303404 @default.
- W2295618152 hasConcept C31972630 @default.
- W2295618152 hasConcept C38652104 @default.
- W2295618152 hasConcept C41008148 @default.
- W2295618152 hasConcept C43617362 @default.
- W2295618152 hasConcept C45374587 @default.
- W2295618152 hasConcept C50522688 @default.
- W2295618152 hasConcept C97541855 @default.
- W2295618152 hasConceptScore W2295618152C106131492 @default.
- W2295618152 hasConceptScore W2295618152C11413529 @default.
- W2295618152 hasConceptScore W2295618152C12713177 @default.
- W2295618152 hasConceptScore W2295618152C140779682 @default.
- W2295618152 hasConceptScore W2295618152C154945302 @default.
- W2295618152 hasConceptScore W2295618152C162324750 @default.
- W2295618152 hasConceptScore W2295618152C185592680 @default.
- W2295618152 hasConceptScore W2295618152C198531522 @default.
- W2295618152 hasConceptScore W2295618152C26517878 @default.
- W2295618152 hasConceptScore W2295618152C2777303404 @default.
- W2295618152 hasConceptScore W2295618152C31972630 @default.
- W2295618152 hasConceptScore W2295618152C38652104 @default.
- W2295618152 hasConceptScore W2295618152C41008148 @default.
- W2295618152 hasConceptScore W2295618152C43617362 @default.
- W2295618152 hasConceptScore W2295618152C45374587 @default.
- W2295618152 hasConceptScore W2295618152C50522688 @default.
- W2295618152 hasConceptScore W2295618152C97541855 @default.
- W2295618152 hasLocation W22956181521 @default.
- W2295618152 hasOpenAccess W2295618152 @default.
- W2295618152 hasPrimaryLocation W22956181521 @default.
- W2295618152 hasRelatedWork W1491843047 @default.
- W2295618152 hasRelatedWork W1506145880 @default.
- W2295618152 hasRelatedWork W16046748 @default.
- W2295618152 hasRelatedWork W1970391951 @default.
- W2295618152 hasRelatedWork W1980620643 @default.
- W2295618152 hasRelatedWork W2005189114 @default.
- W2295618152 hasRelatedWork W2016974233 @default.
- W2295618152 hasRelatedWork W2093823610 @default.
- W2295618152 hasRelatedWork W2114384389 @default.
- W2295618152 hasRelatedWork W2126173398 @default.
- W2295618152 hasRelatedWork W2134760705 @default.
- W2295618152 hasRelatedWork W2141559645 @default.
- W2295618152 hasRelatedWork W2143680741 @default.
- W2295618152 hasRelatedWork W2148227001 @default.
- W2295618152 hasRelatedWork W2168133894 @default.
- W2295618152 hasRelatedWork W2258059573 @default.
- W2295618152 hasRelatedWork W2753511062 @default.
- W2295618152 hasRelatedWork W3006325002 @default.
- W2295618152 hasRelatedWork W5547603 @default.
- W2295618152 hasRelatedWork W3115641986 @default.
- W2295618152 isParatext "false" @default.
- W2295618152 isRetracted "false" @default.
- W2295618152 magId "2295618152" @default.