Matches in SemOpenAlex for { <https://semopenalex.org/work/W3090204380> ?p ?o ?g. }
- W3090204380 abstract "We consider the problem of learning to communicate using multi-agent reinforcement learning (MARL). A common approach is to learn off-policy, using data sampled from a replay buffer. However, messages received in the past may not accurately reflect the current communication policy of each agent, and this complicates learning. We therefore introduce a 'communication correction' which accounts for the non-stationarity of observed communication induced by multi-agent learning. It works by relabelling the received message to make it likely under the communicator's current policy, and thus be a better reflection of the receiver's current environment. To account for cases in which agents are both senders and receivers, we introduce an ordered relabelling scheme. Our correction is computationally efficient and can be integrated with a range of off-policy algorithms. We find in our experiments that it substantially improves the ability of communicating MARL systems to learn across a variety of cooperative and competitive tasks." @default.
- W3090204380 created "2020-10-08" @default.
- W3090204380 creator A5074964568 @default.
- W3090204380 creator A5075063030 @default.
- W3090204380 date "2020-10-02" @default.
- W3090204380 modified "2023-09-23" @default.
- W3090204380 title "Correcting Experience Replay for Multi-Agent Communication" @default.
- W3090204380 cites W1513468570 @default.
- W3090204380 cites W1542941925 @default.
- W3090204380 cites W1757796397 @default.
- W3090204380 cites W1985093013 @default.
- W3090204380 cites W2008134955 @default.
- W3090204380 cites W2099618002 @default.
- W3090204380 cites W2107544712 @default.
- W3090204380 cites W2121092017 @default.
- W3090204380 cites W2160371091 @default.
- W3090204380 cites W2173248099 @default.
- W3090204380 cites W2201581102 @default.
- W3090204380 cites W2547875792 @default.
- W3090204380 cites W2565313327 @default.
- W3090204380 cites W2594829461 @default.
- W3090204380 cites W2626637010 @default.
- W3090204380 cites W2749807327 @default.
- W3090204380 cites W2756196406 @default.
- W3090204380 cites W2797569913 @default.
- W3090204380 cites W2798511001 @default.
- W3090204380 cites W2803281228 @default.
- W3090204380 cites W2804672169 @default.
- W3090204380 cites W2908064123 @default.
- W3090204380 cites W2914351253 @default.
- W3090204380 cites W2948099544 @default.
- W3090204380 cites W2948342290 @default.
- W3090204380 cites W2949201811 @default.
- W3090204380 cites W2952165242 @default.
- W3090204380 cites W2962686687 @default.
- W3090204380 cites W2962938168 @default.
- W3090204380 cites W2962966033 @default.
- W3090204380 cites W2963000099 @default.
- W3090204380 cites W2963147362 @default.
- W3090204380 cites W2963588154 @default.
- W3090204380 cites W2963681240 @default.
- W3090204380 cites W2963717208 @default.
- W3090204380 cites W2963881016 @default.
- W3090204380 cites W2964001908 @default.
- W3090204380 cites W2964338167 @default.
- W3090204380 cites W2971094937 @default.
- W3090204380 cites W3093287223 @default.
- W3090204380 hasPublicationYear "2020" @default.
- W3090204380 type Work @default.
- W3090204380 sameAs 3090204380 @default.
- W3090204380 citedByCount "1" @default.
- W3090204380 countsByYear W30902043802021 @default.
- W3090204380 crossrefType "posted-content" @default.
- W3090204380 hasAuthorship W3090204380A5074964568 @default.
- W3090204380 hasAuthorship W3090204380A5075063030 @default.
- W3090204380 hasConcept C120314980 @default.
- W3090204380 hasConcept C127413603 @default.
- W3090204380 hasConcept C134306372 @default.
- W3090204380 hasConcept C136197465 @default.
- W3090204380 hasConcept C146978453 @default.
- W3090204380 hasConcept C154945302 @default.
- W3090204380 hasConcept C199360897 @default.
- W3090204380 hasConcept C204323151 @default.
- W3090204380 hasConcept C33923547 @default.
- W3090204380 hasConcept C41008148 @default.
- W3090204380 hasConcept C65682993 @default.
- W3090204380 hasConcept C77618280 @default.
- W3090204380 hasConcept C97541855 @default.
- W3090204380 hasConceptScore W3090204380C120314980 @default.
- W3090204380 hasConceptScore W3090204380C127413603 @default.
- W3090204380 hasConceptScore W3090204380C134306372 @default.
- W3090204380 hasConceptScore W3090204380C136197465 @default.
- W3090204380 hasConceptScore W3090204380C146978453 @default.
- W3090204380 hasConceptScore W3090204380C154945302 @default.
- W3090204380 hasConceptScore W3090204380C199360897 @default.
- W3090204380 hasConceptScore W3090204380C204323151 @default.
- W3090204380 hasConceptScore W3090204380C33923547 @default.
- W3090204380 hasConceptScore W3090204380C41008148 @default.
- W3090204380 hasConceptScore W3090204380C65682993 @default.
- W3090204380 hasConceptScore W3090204380C77618280 @default.
- W3090204380 hasConceptScore W3090204380C97541855 @default.
- W3090204380 hasOpenAccess W3090204380 @default.
- W3090204380 hasRelatedWork W1546568226 @default.
- W3090204380 hasRelatedWork W1595483645 @default.
- W3090204380 hasRelatedWork W2111770102 @default.
- W3090204380 hasRelatedWork W2272929109 @default.
- W3090204380 hasRelatedWork W2900477380 @default.
- W3090204380 hasRelatedWork W2942486667 @default.
- W3090204380 hasRelatedWork W2952348496 @default.
- W3090204380 hasRelatedWork W2970909667 @default.
- W3090204380 hasRelatedWork W2977730262 @default.
- W3090204380 hasRelatedWork W2992642034 @default.
- W3090204380 hasRelatedWork W3006231530 @default.
- W3090204380 hasRelatedWork W3035868808 @default.
- W3090204380 hasRelatedWork W3084888924 @default.
- W3090204380 hasRelatedWork W3127063420 @default.
- W3090204380 hasRelatedWork W3132143885 @default.
- W3090204380 hasRelatedWork W3151079898 @default.
- W3090204380 hasRelatedWork W3174682485 @default.
- W3090204380 hasRelatedWork W3176048614 @default.
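
The abstract above describes the communication correction as relabelling a stored message so that it is likely under the communicator's current policy. The following is a minimal sketch of that idea for a single sender and a single receiver only; it does not implement the ordered relabelling scheme the abstract mentions for agents that both send and receive. All names (`Transition`, `relabel_messages`, `current_policy`) and the toy threshold policy are hypothetical illustrations and are not taken from the paper or from any library.

```python
from dataclasses import dataclass, replace
from typing import Callable, List, Tuple

Observation = Tuple[float, ...]
Message = int


@dataclass(frozen=True)
class Transition:
    """One replay-buffer entry (hypothetical layout, for illustration only)."""
    sender_obs: Observation  # what the sender observed when it emitted the message
    message: Message         # message stored at collection time (may be stale)
    receiver_action: int
    reward: float


def relabel_messages(
    batch: List[Transition],
    sender_policy: Callable[[Observation], Message],
) -> List[Transition]:
    """Return copies of the sampled transitions whose messages are regenerated
    by the sender's current policy from the stored sender observation."""
    return [replace(t, message=sender_policy(t.sender_obs)) for t in batch]


def current_policy(obs: Observation) -> Message:
    # Assumed toy policy: send 1 when the first observation feature exceeds 0.5.
    return int(obs[0] > 0.5)


# Messages below were recorded under an older policy and disagree with the current one.
batch = [
    Transition(sender_obs=(0.9,), message=0, receiver_action=1, reward=1.0),
    Transition(sender_obs=(0.1,), message=1, receiver_action=0, reward=0.0),
]
corrected = relabel_messages(batch, current_policy)
```

In this sketch the stored transitions are left untouched and only the sampled copies are relabelled, so the same buffer entry can be corrected again later as the sender's policy continues to change.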