Matches in SemOpenAlex for { <https://semopenalex.org/work/W4287758380> ?p ?o ?g. }
- W4287758380 abstract "Learning to communicate in order to share state information is an active problem in the area of multi-agent reinforcement learning (MARL). The credit assignment problem, the non-stationarity of the communication environment and the creation of influenceable agents are major challenges within this research field which need to be overcome in order to learn a valid communication protocol. This paper introduces the novel multi-agent counterfactual communication learning (MACC) method which adapts counterfactual reasoning in order to overcome the credit assignment problem for communicating agents. Secondly, the non-stationarity of the communication environment while learning the communication Q-function is overcome by creating the communication Q-function using the action policy of the other agents and the Q-function of the action environment. Additionally, a social loss function is introduced in order to create influenceable agents which is required to learn a valid communication protocol. Our experiments show that MACC is able to outperform the state-of-the-art baselines in four different scenarios in the Particle environment." @default.
- W4287758380 created "2022-07-26" @default.
- W4287758380 creator A5006262997 @default.
- W4287758380 creator A5008171683 @default.
- W4287758380 creator A5015248763 @default.
- W4287758380 creator A5041138120 @default.
- W4287758380 creator A5049021976 @default.
- W4287758380 creator A5070558865 @default.
- W4287758380 creator A5074520540 @default.
- W4287758380 creator A5084961388 @default.
- W4287758380 date "2020-06-12" @default.
- W4287758380 modified "2023-10-14" @default.
- W4287758380 title "Learning to Communicate Using Counterfactual Reasoning" @default.
- W4287758380 doi "https://doi.org/10.48550/arxiv.2006.07200" @default.
- W4287758380 hasPublicationYear "2020" @default.
- W4287758380 type Work @default.
- W4287758380 citedByCount "0" @default.
- W4287758380 crossrefType "posted-content" @default.
- W4287758380 hasAuthorship W4287758380A5006262997 @default.
- W4287758380 hasAuthorship W4287758380A5008171683 @default.
- W4287758380 hasAuthorship W4287758380A5015248763 @default.
- W4287758380 hasAuthorship W4287758380A5041138120 @default.
- W4287758380 hasAuthorship W4287758380A5049021976 @default.
- W4287758380 hasAuthorship W4287758380A5070558865 @default.
- W4287758380 hasAuthorship W4287758380A5074520540 @default.
- W4287758380 hasAuthorship W4287758380A5084961388 @default.
- W4287758380 hasBestOaLocation W42877583801 @default.
- W4287758380 hasConcept C10138342 @default.
- W4287758380 hasConcept C107457646 @default.
- W4287758380 hasConcept C108650721 @default.
- W4287758380 hasConcept C11413529 @default.
- W4287758380 hasConcept C121332964 @default.
- W4287758380 hasConcept C12269588 @default.
- W4287758380 hasConcept C14036430 @default.
- W4287758380 hasConcept C142724271 @default.
- W4287758380 hasConcept C154945302 @default.
- W4287758380 hasConcept C15744967 @default.
- W4287758380 hasConcept C158156997 @default.
- W4287758380 hasConcept C162324750 @default.
- W4287758380 hasConcept C182306322 @default.
- W4287758380 hasConcept C202444582 @default.
- W4287758380 hasConcept C204787440 @default.
- W4287758380 hasConcept C2780385302 @default.
- W4287758380 hasConcept C2780791683 @default.
- W4287758380 hasConcept C31258907 @default.
- W4287758380 hasConcept C33923547 @default.
- W4287758380 hasConcept C41008148 @default.
- W4287758380 hasConcept C46312422 @default.
- W4287758380 hasConcept C48103436 @default.
- W4287758380 hasConcept C62520636 @default.
- W4287758380 hasConcept C71924100 @default.
- W4287758380 hasConcept C77805123 @default.
- W4287758380 hasConcept C78458016 @default.
- W4287758380 hasConcept C86803240 @default.
- W4287758380 hasConcept C9652623 @default.
- W4287758380 hasConcept C97541855 @default.
- W4287758380 hasConceptScore W4287758380C10138342 @default.
- W4287758380 hasConceptScore W4287758380C107457646 @default.
- W4287758380 hasConceptScore W4287758380C108650721 @default.
- W4287758380 hasConceptScore W4287758380C11413529 @default.
- W4287758380 hasConceptScore W4287758380C121332964 @default.
- W4287758380 hasConceptScore W4287758380C12269588 @default.
- W4287758380 hasConceptScore W4287758380C14036430 @default.
- W4287758380 hasConceptScore W4287758380C142724271 @default.
- W4287758380 hasConceptScore W4287758380C154945302 @default.
- W4287758380 hasConceptScore W4287758380C15744967 @default.
- W4287758380 hasConceptScore W4287758380C158156997 @default.
- W4287758380 hasConceptScore W4287758380C162324750 @default.
- W4287758380 hasConceptScore W4287758380C182306322 @default.
- W4287758380 hasConceptScore W4287758380C202444582 @default.
- W4287758380 hasConceptScore W4287758380C204787440 @default.
- W4287758380 hasConceptScore W4287758380C2780385302 @default.
- W4287758380 hasConceptScore W4287758380C2780791683 @default.
- W4287758380 hasConceptScore W4287758380C31258907 @default.
- W4287758380 hasConceptScore W4287758380C33923547 @default.
- W4287758380 hasConceptScore W4287758380C41008148 @default.
- W4287758380 hasConceptScore W4287758380C46312422 @default.
- W4287758380 hasConceptScore W4287758380C48103436 @default.
- W4287758380 hasConceptScore W4287758380C62520636 @default.
- W4287758380 hasConceptScore W4287758380C71924100 @default.
- W4287758380 hasConceptScore W4287758380C77805123 @default.
- W4287758380 hasConceptScore W4287758380C78458016 @default.
- W4287758380 hasConceptScore W4287758380C86803240 @default.
- W4287758380 hasConceptScore W4287758380C9652623 @default.
- W4287758380 hasConceptScore W4287758380C97541855 @default.
- W4287758380 hasLocation W42877583801 @default.
- W4287758380 hasOpenAccess W4287758380 @default.
- W4287758380 hasPrimaryLocation W42877583801 @default.
- W4287758380 hasRelatedWork W1500064003 @default.
- W4287758380 hasRelatedWork W1596157658 @default.
- W4287758380 hasRelatedWork W1972320025 @default.
- W4287758380 hasRelatedWork W2348606995 @default.
- W4287758380 hasRelatedWork W2352936227 @default.
- W4287758380 hasRelatedWork W2375403706 @default.
- W4287758380 hasRelatedWork W2389524614 @default.
- W4287758380 hasRelatedWork W2616670909 @default.
- W4287758380 hasRelatedWork W4247441475 @default.
- W4287758380 hasRelatedWork W4308806426 @default.
- W4287758380 isParatext "false" @default.
- W4287758380 isRetracted "false" @default.
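
The listing above is the result of matching the pattern `<https://semopenalex.org/work/W4287758380> ?p ?o ?g` against the graph. The following is a minimal sketch of how such a lookup might be phrased as a SPARQL query and packaged as an HTTP GET URL. The endpoint URL and the helper names `build_query` and `build_request_url` are assumptions made for illustration and are not taken from the listing. Because every row above is reported in the `@default` graph, the sketch queries only the default graph and omits `?g`. It builds the request but does not send it.

```python
from urllib.parse import urlencode

# IRI of the work shown in the listing above.
WORK_IRI = "https://semopenalex.org/work/W4287758380"

# Assumption: illustrative endpoint address; verify against the SemOpenAlex docs.
ENDPOINT = "https://semopenalex.org/sparql"


def build_query(subject_iri: str) -> str:
    """Return a SPARQL SELECT listing every predicate/object pair of subject_iri."""
    return f"SELECT ?p ?o WHERE {{ <{subject_iri}> ?p ?o . }}"


def build_request_url(endpoint: str, query: str) -> str:
    """Encode the query as the standard `query` parameter of a SPARQL GET request."""
    return endpoint + "?" + urlencode({"query": query})


query = build_query(WORK_IRI)
url = build_request_url(ENDPOINT, query)
print(query)
print(url)
```

Each result row would correspond to one line of the listing, for example `?p` bound to the `title` predicate and `?o` bound to "Learning to Communicate Using Counterfactual Reasoning".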