Matches in SemOpenAlex for { <https://semopenalex.org/work/W4306178194> ?p ?o ?g. }
Showing items 1 to 83 of 83 with 100 items per page.
- W4306178194 abstract "We introduce hybrid execution in multi-agent reinforcement learning (MARL), a new paradigm in which agents aim to successfully complete cooperative tasks with arbitrary communication levels at execution time by taking advantage of information-sharing among the agents. Under hybrid execution, the communication level can range from a setting in which no communication is allowed between agents (fully decentralized), to a setting featuring full communication (fully centralized), but the agents do not know beforehand which communication level they will encounter at execution time. To formalize our setting, we define a new class of multi-agent partially observable Markov decision processes (POMDPs) that we name hybrid-POMDPs, which explicitly model a communication process between the agents. We contribute MARO, an approach that makes use of an auto-regressive predictive model, trained in a centralized manner, to estimate missing agents' observations at execution time. We evaluate MARO on standard scenarios and extensions of previous benchmarks tailored to emphasize the negative impact of partial observability in MARL. Experimental results show that our method consistently outperforms relevant baselines, allowing agents to act with faulty communication while successfully exploiting shared information." @default.
- W4306178194 created "2022-10-14" @default.
- W4306178194 creator A5009216751 @default.
- W4306178194 creator A5032046108 @default.
- W4306178194 creator A5056845990 @default.
- W4306178194 creator A5063841377 @default.
- W4306178194 creator A5075446522 @default.
- W4306178194 creator A5077860392 @default.
- W4306178194 creator A5091668153 @default.
- W4306178194 date "2022-10-12" @default.
- W4306178194 modified "2023-09-30" @default.
- W4306178194 title "Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning" @default.
- W4306178194 doi "https://doi.org/10.48550/arxiv.2210.06274" @default.
- W4306178194 hasPublicationYear "2022" @default.
- W4306178194 type Work @default.
- W4306178194 citedByCount "0" @default.
- W4306178194 crossrefType "posted-content" @default.
- W4306178194 hasAuthorship W4306178194A5009216751 @default.
- W4306178194 hasAuthorship W4306178194A5032046108 @default.
- W4306178194 hasAuthorship W4306178194A5056845990 @default.
- W4306178194 hasAuthorship W4306178194A5063841377 @default.
- W4306178194 hasAuthorship W4306178194A5075446522 @default.
- W4306178194 hasAuthorship W4306178194A5077860392 @default.
- W4306178194 hasAuthorship W4306178194A5091668153 @default.
- W4306178194 hasBestOaLocation W43061781941 @default.
- W4306178194 hasConcept C105795698 @default.
- W4306178194 hasConcept C106189395 @default.
- W4306178194 hasConcept C114466953 @default.
- W4306178194 hasConcept C119857082 @default.
- W4306178194 hasConcept C120314980 @default.
- W4306178194 hasConcept C154945302 @default.
- W4306178194 hasConcept C159886148 @default.
- W4306178194 hasConcept C159985019 @default.
- W4306178194 hasConcept C163836022 @default.
- W4306178194 hasConcept C17098449 @default.
- W4306178194 hasConcept C192562407 @default.
- W4306178194 hasConcept C199360897 @default.
- W4306178194 hasConcept C204323151 @default.
- W4306178194 hasConcept C2777212361 @default.
- W4306178194 hasConcept C28826006 @default.
- W4306178194 hasConcept C33923547 @default.
- W4306178194 hasConcept C36299963 @default.
- W4306178194 hasConcept C41008148 @default.
- W4306178194 hasConcept C97541855 @default.
- W4306178194 hasConcept C98045186 @default.
- W4306178194 hasConcept C98763669 @default.
- W4306178194 hasConceptScore W4306178194C105795698 @default.
- W4306178194 hasConceptScore W4306178194C106189395 @default.
- W4306178194 hasConceptScore W4306178194C114466953 @default.
- W4306178194 hasConceptScore W4306178194C119857082 @default.
- W4306178194 hasConceptScore W4306178194C120314980 @default.
- W4306178194 hasConceptScore W4306178194C154945302 @default.
- W4306178194 hasConceptScore W4306178194C159886148 @default.
- W4306178194 hasConceptScore W4306178194C159985019 @default.
- W4306178194 hasConceptScore W4306178194C163836022 @default.
- W4306178194 hasConceptScore W4306178194C17098449 @default.
- W4306178194 hasConceptScore W4306178194C192562407 @default.
- W4306178194 hasConceptScore W4306178194C199360897 @default.
- W4306178194 hasConceptScore W4306178194C204323151 @default.
- W4306178194 hasConceptScore W4306178194C2777212361 @default.
- W4306178194 hasConceptScore W4306178194C28826006 @default.
- W4306178194 hasConceptScore W4306178194C33923547 @default.
- W4306178194 hasConceptScore W4306178194C36299963 @default.
- W4306178194 hasConceptScore W4306178194C41008148 @default.
- W4306178194 hasConceptScore W4306178194C97541855 @default.
- W4306178194 hasConceptScore W4306178194C98045186 @default.
- W4306178194 hasConceptScore W4306178194C98763669 @default.
- W4306178194 hasLocation W43061781941 @default.
- W4306178194 hasOpenAccess W4306178194 @default.
- W4306178194 hasPrimaryLocation W43061781941 @default.
- W4306178194 hasRelatedWork W1561563290 @default.
- W4306178194 hasRelatedWork W1932117986 @default.
- W4306178194 hasRelatedWork W2149126181 @default.
- W4306178194 hasRelatedWork W2802349643 @default.
- W4306178194 hasRelatedWork W2963561234 @default.
- W4306178194 hasRelatedWork W2967060478 @default.
- W4306178194 hasRelatedWork W3128073777 @default.
- W4306178194 hasRelatedWork W3174896399 @default.
- W4306178194 hasRelatedWork W3206095144 @default.
- W4306178194 hasRelatedWork W4283455536 @default.
- W4306178194 isParatext "false" @default.
- W4306178194 isRetracted "false" @default.
- W4306178194 workType "article" @default.