Matches in SemOpenAlex for { <https://semopenalex.org/work/W3034974382> ?p ?o ?g. }
- W3034974382 abstract "Decision-making agents with planning capabilities have achieved huge success in the challenging domain like Chess, Shogi, and Go. In an effort to generalize the planning ability to the more general tasks where the environment dynamics are not available to the agent, researchers proposed the MuZero algorithm that can learn the dynamical model through the interactions with the environment. In this paper, we provide a way and the necessary theoretical results to extend the MuZero algorithm to more generalized environments with continuous action space. Through numerical results on two relatively low-dimensional MuJoCo environments, we show the proposed algorithm outperforms the soft actor-critic (SAC) algorithm, a state-of-the-art model-free deep reinforcement learning algorithm." @default.
- W3034974382 created "2020-06-19" @default.
- W3034974382 creator A5013598917 @default.
- W3034974382 creator A5017428639 @default.
- W3034974382 creator A5076069530 @default.
- W3034974382 date "2020-06-12" @default.
- W3034974382 modified "2023-10-16" @default.
- W3034974382 title "Continuous Control for Searching and Planning with a Learned Model" @default.
- W3034974382 cites W107054272 @default.
- W3034974382 cites W1491843047 @default.
- W3034974382 cites W1500868819 @default.
- W3034974382 cites W1625390266 @default.
- W3034974382 cites W1771410628 @default.
- W3034974382 cites W1977989560 @default.
- W3034974382 cites W1980035368 @default.
- W3034974382 cites W2001150023 @default.
- W3034974382 cites W2075848246 @default.
- W3034974382 cites W2087617385 @default.
- W3034974382 cites W2120090487 @default.
- W3034974382 cites W2121863487 @default.
- W3034974382 cites W2134461419 @default.
- W3034974382 cites W2135997697 @default.
- W3034974382 cites W2158858912 @default.
- W3034974382 cites W2173248099 @default.
- W3034974382 cites W2186241545 @default.
- W3034974382 cites W2201581102 @default.
- W3034974382 cites W2257979135 @default.
- W3034974382 cites W2290354866 @default.
- W3034974382 cites W2557283755 @default.
- W3034974382 cites W2574227367 @default.
- W3034974382 cites W2736601468 @default.
- W3034974382 cites W2766447205 @default.
- W3034974382 cites W2772709170 @default.
- W3034974382 cites W2781726626 @default.
- W3034974382 cites W2803928381 @default.
- W3034974382 cites W2805560727 @default.
- W3034974382 cites W2890208753 @default.
- W3034974382 cites W2902907165 @default.
- W3034974382 cites W2946901134 @default.
- W3034974382 cites W2949608212 @default.
- W3034974382 cites W2950004691 @default.
- W3034974382 cites W2953708620 @default.
- W3034974382 cites W2963641140 @default.
- W3034974382 cites W2964220198 @default.
- W3034974382 cites W2994714051 @default.
- W3034974382 cites W3118210634 @default.
- W3034974382 cites W2803683443 @default.
- W3034974382 doi "https://doi.org/10.48550/arxiv.2006.07430" @default.
- W3034974382 hasPublicationYear "2020" @default.
- W3034974382 type Work @default.
- W3034974382 sameAs 3034974382 @default.
- W3034974382 citedByCount "1" @default.
- W3034974382 countsByYear W30349743822021 @default.
- W3034974382 crossrefType "posted-content" @default.
- W3034974382 hasAuthorship W3034974382A5013598917 @default.
- W3034974382 hasAuthorship W3034974382A5017428639 @default.
- W3034974382 hasAuthorship W3034974382A5076069530 @default.
- W3034974382 hasBestOaLocation W30349743821 @default.
- W3034974382 hasConcept C105795698 @default.
- W3034974382 hasConcept C111919701 @default.
- W3034974382 hasConcept C11413529 @default.
- W3034974382 hasConcept C121332964 @default.
- W3034974382 hasConcept C126255220 @default.
- W3034974382 hasConcept C134306372 @default.
- W3034974382 hasConcept C154945302 @default.
- W3034974382 hasConcept C2775924081 @default.
- W3034974382 hasConcept C2778572836 @default.
- W3034974382 hasConcept C2780791683 @default.
- W3034974382 hasConcept C33923547 @default.
- W3034974382 hasConcept C36503486 @default.
- W3034974382 hasConcept C41008148 @default.
- W3034974382 hasConcept C48103436 @default.
- W3034974382 hasConcept C62520636 @default.
- W3034974382 hasConcept C72434380 @default.
- W3034974382 hasConcept C97541855 @default.
- W3034974382 hasConceptScore W3034974382C105795698 @default.
- W3034974382 hasConceptScore W3034974382C111919701 @default.
- W3034974382 hasConceptScore W3034974382C11413529 @default.
- W3034974382 hasConceptScore W3034974382C121332964 @default.
- W3034974382 hasConceptScore W3034974382C126255220 @default.
- W3034974382 hasConceptScore W3034974382C134306372 @default.
- W3034974382 hasConceptScore W3034974382C154945302 @default.
- W3034974382 hasConceptScore W3034974382C2775924081 @default.
- W3034974382 hasConceptScore W3034974382C2778572836 @default.
- W3034974382 hasConceptScore W3034974382C2780791683 @default.
- W3034974382 hasConceptScore W3034974382C33923547 @default.
- W3034974382 hasConceptScore W3034974382C36503486 @default.
- W3034974382 hasConceptScore W3034974382C41008148 @default.
- W3034974382 hasConceptScore W3034974382C48103436 @default.
- W3034974382 hasConceptScore W3034974382C62520636 @default.
- W3034974382 hasConceptScore W3034974382C72434380 @default.
- W3034974382 hasConceptScore W3034974382C97541855 @default.
- W3034974382 hasLocation W30349743821 @default.
- W3034974382 hasOpenAccess W3034974382 @default.
- W3034974382 hasPrimaryLocation W30349743821 @default.
- W3034974382 hasRelatedWork W1492014007 @default.
- W3034974382 hasRelatedWork W2094557321 @default.
- W3034974382 hasRelatedWork W2335095937 @default.
- W3034974382 hasRelatedWork W3103643887 @default.
- W3034974382 hasRelatedWork W3170446423 @default.