Matches in SemOpenAlex for { <https://semopenalex.org/work/W3113382536> ?p ?o ?g. }
Showing items 1 to 89 of
89
with 100 items per page.
- W3113382536 abstract "Cooperation among agents with partial observation is an important task in multi-agent reinforcement learning (MARL), aiming to maximize a common reward. Most existing cooperative MARL approaches focus on building different model frameworks, such as centralized, decentralized, and centralized training with decentralized execution. These methods employ partial observation of agents as input directly, but rarely consider the local relationship between agents. The local relationship can help agents integrate observation information among different agents in a local range, and then adopt a more effective cooperation policy. In this paper, we propose a MARL method based on spatial relationship called hierarchical relation graph soft actor-critic (HRG-SAC). The method first uses a hierarchical relation graph generation module to represent the spatial relationship between agents in local space. Second, it integrates feature information of the relation graph through the graph convolution network (GCN). Finally, the soft actor-critic (SAC) is used to optimize agents' actions in training for compliance control. We conduct experiments on the Food Collector task and compare HRG-SAC with three baseline methods. The results demonstrate that the hierarchical relation graph can significantly improve MARL performance in the cooperative task." @default.
- W3113382536 created "2021-01-05" @default.
- W3113382536 creator A5036388641 @default.
- W3113382536 creator A5046597133 @default.
- W3113382536 creator A5048419868 @default.
- W3113382536 creator A5064842058 @default.
- W3113382536 creator A5067265615 @default.
- W3113382536 creator A5086702173 @default.
- W3113382536 date "2020-11-01" @default.
- W3113382536 modified "2023-09-27" @default.
- W3113382536 title "Cooperative Multi-Agent Reinforcement Learning with Hierarchical Relation Graph under Partial Observability" @default.
- W3113382536 cites W1513468570 @default.
- W3113382536 cites W1641379095 @default.
- W3113382536 cites W1757796397 @default.
- W3113382536 cites W2029534657 @default.
- W3113382536 cites W2104602264 @default.
- W3113382536 cites W2116341502 @default.
- W3113382536 cites W2145339207 @default.
- W3113382536 cites W2257979135 @default.
- W3113382536 cites W2519887557 @default.
- W3113382536 cites W2763208138 @default.
- W3113382536 cites W2786915849 @default.
- W3113382536 cites W2889987506 @default.
- W3113382536 cites W2896451037 @default.
- W3113382536 cites W2897086622 @default.
- W3113382536 cites W2903006607 @default.
- W3113382536 cites W2904246096 @default.
- W3113382536 cites W2962938168 @default.
- W3113382536 cites W2963407617 @default.
- W3113382536 cites W2963858333 @default.
- W3113382536 cites W2964311892 @default.
- W3113382536 cites W2998367975 @default.
- W3113382536 cites W2999614490 @default.
- W3113382536 cites W3005803637 @default.
- W3113382536 cites W3024507639 @default.
- W3113382536 cites W4210257598 @default.
- W3113382536 doi "https://doi.org/10.1109/ictai50040.2020.00011" @default.
- W3113382536 hasPublicationYear "2020" @default.
- W3113382536 type Work @default.
- W3113382536 sameAs 3113382536 @default.
- W3113382536 citedByCount "0" @default.
- W3113382536 crossrefType "proceedings-article" @default.
- W3113382536 hasAuthorship W3113382536A5036388641 @default.
- W3113382536 hasAuthorship W3113382536A5046597133 @default.
- W3113382536 hasAuthorship W3113382536A5048419868 @default.
- W3113382536 hasAuthorship W3113382536A5064842058 @default.
- W3113382536 hasAuthorship W3113382536A5067265615 @default.
- W3113382536 hasAuthorship W3113382536A5086702173 @default.
- W3113382536 hasConcept C120314980 @default.
- W3113382536 hasConcept C124101348 @default.
- W3113382536 hasConcept C132525143 @default.
- W3113382536 hasConcept C154945302 @default.
- W3113382536 hasConcept C25343380 @default.
- W3113382536 hasConcept C28826006 @default.
- W3113382536 hasConcept C33923547 @default.
- W3113382536 hasConcept C36299963 @default.
- W3113382536 hasConcept C41008148 @default.
- W3113382536 hasConcept C80444323 @default.
- W3113382536 hasConcept C97541855 @default.
- W3113382536 hasConceptScore W3113382536C120314980 @default.
- W3113382536 hasConceptScore W3113382536C124101348 @default.
- W3113382536 hasConceptScore W3113382536C132525143 @default.
- W3113382536 hasConceptScore W3113382536C154945302 @default.
- W3113382536 hasConceptScore W3113382536C25343380 @default.
- W3113382536 hasConceptScore W3113382536C28826006 @default.
- W3113382536 hasConceptScore W3113382536C33923547 @default.
- W3113382536 hasConceptScore W3113382536C36299963 @default.
- W3113382536 hasConceptScore W3113382536C41008148 @default.
- W3113382536 hasConceptScore W3113382536C80444323 @default.
- W3113382536 hasConceptScore W3113382536C97541855 @default.
- W3113382536 hasFunder F4320321001 @default.
- W3113382536 hasFunder F4320323970 @default.
- W3113382536 hasLocation W31133825361 @default.
- W3113382536 hasOpenAccess W3113382536 @default.
- W3113382536 hasPrimaryLocation W31133825361 @default.
- W3113382536 hasRelatedWork W11104910 @default.
- W3113382536 hasRelatedWork W13501734 @default.
- W3113382536 hasRelatedWork W3884061 @default.
- W3113382536 hasRelatedWork W4191668 @default.
- W3113382536 hasRelatedWork W6014836 @default.
- W3113382536 hasRelatedWork W6043852 @default.
- W3113382536 hasRelatedWork W6242441 @default.
- W3113382536 hasRelatedWork W7092785 @default.
- W3113382536 hasRelatedWork W7981553 @default.
- W3113382536 hasRelatedWork W8801238 @default.
- W3113382536 isParatext "false" @default.
- W3113382536 isRetracted "false" @default.
- W3113382536 magId "3113382536" @default.
- W3113382536 workType "article" @default.