Matches in SemOpenAlex for { <https://semopenalex.org/work/W2949745502> ?p ?o ?g. }
- W2949745502 abstract "Although many reinforcement learning methods have been proposed for learning the optimal solutions in single-agent continuous-action domains, multiagent coordination domains with continuous actions have received relatively few investigations. In this paper, we propose an independent learner hierarchical method, named Sample Continuous Coordination with recursive Frequency Maximum Q-Value (SCC-rFMQ), which divides the cooperative problem with continuous actions into two layers. The first layer samples a finite set of actions from the continuous action spaces by a re-sampling mechanism with variable exploratory rates, and the second layer evaluates the actions in the sampled action set and updates the policy using a reinforcement learning cooperative method. By constructing cooperative mechanisms at both levels, SCC-rFMQ can handle cooperative problems in continuous action cooperative Markov games effectively. The effectiveness of SCC-rFMQ is experimentally demonstrated on two well-designed games, i.e., a continuous version of the climbing game and a cooperative version of the boat problem. Experimental results show that SCC-rFMQ outperforms other reinforcement learning algorithms." @default.
- W2949745502 created "2019-06-27" @default.
- W2949745502 creator A5001714538 @default.
- W2949745502 creator A5008547992 @default.
- W2949745502 creator A5027220635 @default.
- W2949745502 creator A5047509839 @default.
- W2949745502 creator A5063514061 @default.
- W2949745502 creator A5082662124 @default.
- W2949745502 creator A5083729859 @default.
- W2949745502 creator A5090385327 @default.
- W2949745502 date "2018-09-18" @default.
- W2949745502 modified "2023-09-25" @default.
- W2949745502 title "SCC-rFMQ Learning in Cooperative Markov Games with Continuous Actions" @default.
- W2949745502 cites W1560074431 @default.
- W2949745502 cites W1574638321 @default.
- W2949745502 cites W1757796397 @default.
- W2949745502 cites W2075268401 @default.
- W2949745502 cites W2096145798 @default.
- W2949745502 cites W2097451572 @default.
- W2949745502 cites W2108449787 @default.
- W2949745502 cites W2108892923 @default.
- W2949745502 cites W2109102709 @default.
- W2949745502 cites W2120968583 @default.
- W2949745502 cites W2127412976 @default.
- W2949745502 cites W2145339207 @default.
- W2949745502 cites W2145805610 @default.
- W2949745502 cites W2156737235 @default.
- W2949745502 cites W2165150801 @default.
- W2949745502 cites W2173248099 @default.
- W2949745502 cites W2183243664 @default.
- W2949745502 cites W2398042709 @default.
- W2949745502 cites W2466211196 @default.
- W2949745502 cites W2623431351 @default.
- W2949745502 cites W3102974627 @default.
- W2949745502 cites W620125583 @default.
- W2949745502 cites W755046805 @default.
- W2949745502 hasPublicationYear "2018" @default.
- W2949745502 type Work @default.
- W2949745502 sameAs 2949745502 @default.
- W2949745502 citedByCount "0" @default.
- W2949745502 crossrefType "posted-content" @default.
- W2949745502 hasAuthorship W2949745502A5001714538 @default.
- W2949745502 hasAuthorship W2949745502A5008547992 @default.
- W2949745502 hasAuthorship W2949745502A5027220635 @default.
- W2949745502 hasAuthorship W2949745502A5047509839 @default.
- W2949745502 hasAuthorship W2949745502A5063514061 @default.
- W2949745502 hasAuthorship W2949745502A5082662124 @default.
- W2949745502 hasAuthorship W2949745502A5083729859 @default.
- W2949745502 hasAuthorship W2949745502A5090385327 @default.
- W2949745502 hasConcept C105795698 @default.
- W2949745502 hasConcept C106189395 @default.
- W2949745502 hasConcept C119857082 @default.
- W2949745502 hasConcept C121332964 @default.
- W2949745502 hasConcept C126255220 @default.
- W2949745502 hasConcept C154945302 @default.
- W2949745502 hasConcept C159886148 @default.
- W2949745502 hasConcept C177264268 @default.
- W2949745502 hasConcept C199360897 @default.
- W2949745502 hasConcept C2780791683 @default.
- W2949745502 hasConcept C33923547 @default.
- W2949745502 hasConcept C41008148 @default.
- W2949745502 hasConcept C62520636 @default.
- W2949745502 hasConcept C97541855 @default.
- W2949745502 hasConcept C98763669 @default.
- W2949745502 hasConceptScore W2949745502C105795698 @default.
- W2949745502 hasConceptScore W2949745502C106189395 @default.
- W2949745502 hasConceptScore W2949745502C119857082 @default.
- W2949745502 hasConceptScore W2949745502C121332964 @default.
- W2949745502 hasConceptScore W2949745502C126255220 @default.
- W2949745502 hasConceptScore W2949745502C154945302 @default.
- W2949745502 hasConceptScore W2949745502C159886148 @default.
- W2949745502 hasConceptScore W2949745502C177264268 @default.
- W2949745502 hasConceptScore W2949745502C199360897 @default.
- W2949745502 hasConceptScore W2949745502C2780791683 @default.
- W2949745502 hasConceptScore W2949745502C33923547 @default.
- W2949745502 hasConceptScore W2949745502C41008148 @default.
- W2949745502 hasConceptScore W2949745502C62520636 @default.
- W2949745502 hasConceptScore W2949745502C97541855 @default.
- W2949745502 hasConceptScore W2949745502C98763669 @default.
- W2949745502 hasLocation W29497455021 @default.
- W2949745502 hasOpenAccess W2949745502 @default.
- W2949745502 hasPrimaryLocation W29497455021 @default.
- W2949745502 hasRelatedWork W1218520990 @default.
- W2949745502 hasRelatedWork W1569074635 @default.
- W2949745502 hasRelatedWork W178386168 @default.
- W2949745502 hasRelatedWork W1830271598 @default.
- W2949745502 hasRelatedWork W2011231614 @default.
- W2949745502 hasRelatedWork W2085366587 @default.
- W2949745502 hasRelatedWork W2118318536 @default.
- W2949745502 hasRelatedWork W2338351427 @default.
- W2949745502 hasRelatedWork W2347960316 @default.
- W2949745502 hasRelatedWork W2380374763 @default.
- W2949745502 hasRelatedWork W2389533683 @default.
- W2949745502 hasRelatedWork W2807918663 @default.
- W2949745502 hasRelatedWork W2888996021 @default.
- W2949745502 hasRelatedWork W3001618744 @default.
- W2949745502 hasRelatedWork W3092429416 @default.
- W2949745502 hasRelatedWork W3143815010 @default.
- W2949745502 hasRelatedWork W3175224103 @default.
- W2949745502 hasRelatedWork W3192815666 @default.