Matches in SemOpenAlex for { <https://semopenalex.org/work/W3100019413> ?p ?o ?g. }
- W3100019413 endingPage "1747" @default.
- W3100019413 startingPage "1727" @default.
- W3100019413 abstract "Abstract Deep reinforcement learning algorithms have recently been used to train multiple interacting agents in a centralised manner whilst keeping their execution decentralised. When the agents can only acquire partial observations and are faced with tasks requiring coordination and synchronisation skills, inter-agent communication plays an essential role. In this work, we propose a framework for multi-agent training using deep deterministic policy gradients that enables concurrent, end-to-end learning of an explicit communication protocol through a memory device. During training, the agents learn to perform read and write operations enabling them to infer a shared representation of the world. We empirically demonstrate that concurrent learning of the communication device and individual policies can improve inter-agent coordination and performance in small-scale systems. Our experimental results show that the proposed method achieves superior performance in scenarios with up to six agents. We illustrate how different communication patterns can emerge on six different tasks of increasing complexity. Furthermore, we study the effects of corrupting the communication channel, provide a visualisation of the time-varying memory content as the underlying task is being solved and validate the building blocks of the proposed memory device through ablation studies." @default.
- W3100019413 created "2020-11-23" @default.
- W3100019413 creator A5010581004 @default.
- W3100019413 creator A5059716182 @default.
- W3100019413 date "2020-01-23" @default.
- W3100019413 modified "2023-09-25" @default.
- W3100019413 title "Improving coordination in small-scale multi-agent deep reinforcement learning through memory-driven communication" @default.
- W3100019413 cites W1518858799 @default.
- W3100019413 cites W1541084404 @default.
- W3100019413 cites W1542941925 @default.
- W3100019413 cites W1574700590 @default.
- W3100019413 cites W1579184372 @default.
- W3100019413 cites W1964644919 @default.
- W3100019413 cites W1964883026 @default.
- W3100019413 cites W1977136897 @default.
- W3100019413 cites W1985955147 @default.
- W3100019413 cites W1986268386 @default.
- W3100019413 cites W1997881026 @default.
- W3100019413 cites W1997932767 @default.
- W3100019413 cites W2005636787 @default.
- W3100019413 cites W2011751276 @default.
- W3100019413 cites W2017957151 @default.
- W3100019413 cites W2034184326 @default.
- W3100019413 cites W2034692547 @default.
- W3100019413 cites W2039533720 @default.
- W3100019413 cites W2048519937 @default.
- W3100019413 cites W2059529072 @default.
- W3100019413 cites W2064675550 @default.
- W3100019413 cites W2066866890 @default.
- W3100019413 cites W2076063813 @default.
- W3100019413 cites W2089773578 @default.
- W3100019413 cites W2106701811 @default.
- W3100019413 cites W2107544712 @default.
- W3100019413 cites W2108892923 @default.
- W3100019413 cites W2128453677 @default.
- W3100019413 cites W2139232530 @default.
- W3100019413 cites W2145339207 @default.
- W3100019413 cites W2148831627 @default.
- W3100019413 cites W2151389381 @default.
- W3100019413 cites W2160643434 @default.
- W3100019413 cites W2165321212 @default.
- W3100019413 cites W2173200640 @default.
- W3100019413 cites W2292533394 @default.
- W3100019413 cites W2496794451 @default.
- W3100019413 cites W2586491110 @default.
- W3100019413 cites W2625978909 @default.
- W3100019413 cites W2768629321 @default.
- W3100019413 cites W2778610932 @default.
- W3100019413 cites W2919115771 @default.
- W3100019413 cites W2951276827 @default.
- W3100019413 cites W2963658727 @default.
- W3100019413 doi "https://doi.org/10.1007/s10994-019-05864-5" @default.
- W3100019413 hasPublicationYear "2020" @default.
- W3100019413 type Work @default.
- W3100019413 sameAs 3100019413 @default.
- W3100019413 citedByCount "29" @default.
- W3100019413 countsByYear W31000194132020 @default.
- W3100019413 countsByYear W31000194132021 @default.
- W3100019413 countsByYear W31000194132022 @default.
- W3100019413 countsByYear W31000194132023 @default.
- W3100019413 crossrefType "journal-article" @default.
- W3100019413 hasAuthorship W3100019413A5010581004 @default.
- W3100019413 hasAuthorship W3100019413A5059716182 @default.
- W3100019413 hasBestOaLocation W31000194131 @default.
- W3100019413 hasConcept C107457646 @default.
- W3100019413 hasConcept C120314980 @default.
- W3100019413 hasConcept C121332964 @default.
- W3100019413 hasConcept C12269588 @default.
- W3100019413 hasConcept C127162648 @default.
- W3100019413 hasConcept C142724271 @default.
- W3100019413 hasConcept C154945302 @default.
- W3100019413 hasConcept C162324750 @default.
- W3100019413 hasConcept C17744445 @default.
- W3100019413 hasConcept C187736073 @default.
- W3100019413 hasConcept C199539241 @default.
- W3100019413 hasConcept C204787440 @default.
- W3100019413 hasConcept C2776359362 @default.
- W3100019413 hasConcept C2778755073 @default.
- W3100019413 hasConcept C2780385302 @default.
- W3100019413 hasConcept C2780451532 @default.
- W3100019413 hasConcept C31258907 @default.
- W3100019413 hasConcept C41008148 @default.
- W3100019413 hasConcept C62520636 @default.
- W3100019413 hasConcept C71924100 @default.
- W3100019413 hasConcept C94625758 @default.
- W3100019413 hasConcept C97541855 @default.
- W3100019413 hasConceptScore W3100019413C107457646 @default.
- W3100019413 hasConceptScore W3100019413C120314980 @default.
- W3100019413 hasConceptScore W3100019413C121332964 @default.
- W3100019413 hasConceptScore W3100019413C12269588 @default.
- W3100019413 hasConceptScore W3100019413C127162648 @default.
- W3100019413 hasConceptScore W3100019413C142724271 @default.
- W3100019413 hasConceptScore W3100019413C154945302 @default.
- W3100019413 hasConceptScore W3100019413C162324750 @default.
- W3100019413 hasConceptScore W3100019413C17744445 @default.
- W3100019413 hasConceptScore W3100019413C187736073 @default.
- W3100019413 hasConceptScore W3100019413C199539241 @default.
- W3100019413 hasConceptScore W3100019413C204787440 @default.