Matches in SemOpenAlex for { <https://semopenalex.org/work/W4377141572> ?p ?o ?g. }
Showing items 1 to 82 of 82, with 100 items per page.
- W4377141572 endingPage "690" @default.
- W4377141572 startingPage "681" @default.
- W4377141572 abstract "Credit assignment is a crucial issue in multi-agent tasks employing a centralized training and decentralized execution paradigm. While value decomposition has demonstrated strong performance in Q-learning-based approaches and certain Actor-Critic variants, it remains challenging to achieve efficient credit assignment in multi-agent tasks using policy gradient methods due to decomposable value limitations. This paper introduces Predictive Contribution Measurement, an explicit credit assignment method that compares prediction errors among agents and allocates surrogate rewards based on their relevance to global state transitions, with a theoretical guarantee. With multi-agent proximal policy optimization (MAPPO) as a training backend, we propose Predictive Contribution MAPPO (PC-MAPPO). Our experiments demonstrate that PC-MAPPO, with a 10% warm-up phase, outperforms MAPPO, QMIX, and Weighted QMIX on StarCraft multi-agent challenge tasks, particularly in maps requiring heightened cooperation to defeat enemies, such as the map corridor. Employing a pre-trained predictor, PC-MAPPO achieves significantly improved performance on all tested super-hard maps. In parallel training scenarios, PC-MAPPO exhibits superior data efficiency and achieves state-of-the-art performance compared to other methods." @default.
- W4377141572 created "2023-05-21" @default.
- W4377141572 creator A5003725848 @default.
- W4377141572 creator A5078570390 @default.
- W4377141572 date "2023-07-01" @default.
- W4377141572 modified "2023-10-15" @default.
- W4377141572 title "Credit assignment with predictive contribution measurement in multi-agent reinforcement learning" @default.
- W4377141572 cites W1591723504 @default.
- W4377141572 cites W2059859034 @default.
- W4377141572 cites W206679605 @default.
- W4377141572 cites W2617547828 @default.
- W4377141572 cites W2892258706 @default.
- W4377141572 cites W2915117209 @default.
- W4377141572 cites W2997070234 @default.
- W4377141572 cites W3013298095 @default.
- W4377141572 cites W3070092463 @default.
- W4377141572 cites W3104860527 @default.
- W4377141572 cites W4205939747 @default.
- W4377141572 cites W4283727098 @default.
- W4377141572 doi "https://doi.org/10.1016/j.neunet.2023.05.021" @default.
- W4377141572 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/37257392" @default.
- W4377141572 hasPublicationYear "2023" @default.
- W4377141572 type Work @default.
- W4377141572 citedByCount "0" @default.
- W4377141572 crossrefType "journal-article" @default.
- W4377141572 hasAuthorship W4377141572A5003725848 @default.
- W4377141572 hasAuthorship W4377141572A5078570390 @default.
- W4377141572 hasConcept C11413529 @default.
- W4377141572 hasConcept C119857082 @default.
- W4377141572 hasConcept C124681953 @default.
- W4377141572 hasConcept C126322002 @default.
- W4377141572 hasConcept C154945302 @default.
- W4377141572 hasConcept C158154518 @default.
- W4377141572 hasConcept C17744445 @default.
- W4377141572 hasConcept C18903297 @default.
- W4377141572 hasConcept C199539241 @default.
- W4377141572 hasConcept C2776291640 @default.
- W4377141572 hasConcept C3019719930 @default.
- W4377141572 hasConcept C41008148 @default.
- W4377141572 hasConcept C48103436 @default.
- W4377141572 hasConcept C71924100 @default.
- W4377141572 hasConcept C86803240 @default.
- W4377141572 hasConcept C97541855 @default.
- W4377141572 hasConceptScore W4377141572C11413529 @default.
- W4377141572 hasConceptScore W4377141572C119857082 @default.
- W4377141572 hasConceptScore W4377141572C124681953 @default.
- W4377141572 hasConceptScore W4377141572C126322002 @default.
- W4377141572 hasConceptScore W4377141572C154945302 @default.
- W4377141572 hasConceptScore W4377141572C158154518 @default.
- W4377141572 hasConceptScore W4377141572C17744445 @default.
- W4377141572 hasConceptScore W4377141572C18903297 @default.
- W4377141572 hasConceptScore W4377141572C199539241 @default.
- W4377141572 hasConceptScore W4377141572C2776291640 @default.
- W4377141572 hasConceptScore W4377141572C3019719930 @default.
- W4377141572 hasConceptScore W4377141572C41008148 @default.
- W4377141572 hasConceptScore W4377141572C48103436 @default.
- W4377141572 hasConceptScore W4377141572C71924100 @default.
- W4377141572 hasConceptScore W4377141572C86803240 @default.
- W4377141572 hasConceptScore W4377141572C97541855 @default.
- W4377141572 hasFunder F4320321001 @default.
- W4377141572 hasFunder F4320321540 @default.
- W4377141572 hasFunder F4320335777 @default.
- W4377141572 hasLocation W43771415721 @default.
- W4377141572 hasLocation W43771415722 @default.
- W4377141572 hasOpenAccess W4377141572 @default.
- W4377141572 hasPrimaryLocation W43771415721 @default.
- W4377141572 hasRelatedWork W1531601525 @default.
- W4377141572 hasRelatedWork W2031695474 @default.
- W4377141572 hasRelatedWork W2138720691 @default.
- W4377141572 hasRelatedWork W2948807893 @default.
- W4377141572 hasRelatedWork W3173606202 @default.
- W4377141572 hasRelatedWork W4306904969 @default.
- W4377141572 hasRelatedWork W4362501864 @default.
- W4377141572 hasRelatedWork W4380318855 @default.
- W4377141572 hasRelatedWork W2778153218 @default.
- W4377141572 hasRelatedWork W3110381201 @default.
- W4377141572 hasVolume "164" @default.
- W4377141572 isParatext "false" @default.
- W4377141572 isRetracted "false" @default.
- W4377141572 workType "article" @default.