Matches in SemOpenAlex for { <https://semopenalex.org/work/W4317526194> ?p ?o ?g. }
Showing items 1 to 67 of
67
with 100 items per page.
- W4317526194 endingPage "61" @default.
- W4317526194 startingPage "50" @default.
- W4317526194 abstract "In multi-agent systems, deep reinforcement learning policy gradient algorithms can converge excessively slowly or even fail to converge if the agent size as well as the state information quickly grows. We consequently present a policy gradient algorithm for generalised centralised training and decentralised execution (CTDE) based on the principle of masking. We transform the global state information of the critic network in the original (MADDPG) algorithm to the state information of local random agents as the input of the critic network. In addition, we have changed the way Polyak updates the target network so that it can dynamically and adaptively update the target network. Under the new framework, our approach considerably decreases the training strain on the critic network while taking into consideration the efficiency of agent sample learning and speeding up the multi-agent discovery of superior strategies. Combining these two improvements, our suggested approaches can be extended to any other CTDE-based multi-agent deep reinforcement learning algorithms, rather than being limited to the MADDPG conventional multi-agent reinforcement learning algorithm. We made the code publicly available at https://github.com/ZVEzhangyu/SMPG-master ." @default.
- W4317526194 created "2023-01-20" @default.
- W4317526194 creator A5013532487 @default.
- W4317526194 creator A5035082721 @default.
- W4317526194 creator A5039085563 @default.
- W4317526194 creator A5050550539 @default.
- W4317526194 creator A5071773009 @default.
- W4317526194 creator A5090751520 @default.
- W4317526194 date "2022-01-01" @default.
- W4317526194 modified "2023-10-01" @default.
- W4317526194 title "SMPG: Adaptive Soft Update for Masked MADDPG" @default.
- W4317526194 cites W2617547828 @default.
- W4317526194 cites W2915117209 @default.
- W4317526194 cites W2982316857 @default.
- W4317526194 cites W4283789768 @default.
- W4317526194 cites W4313156423 @default.
- W4317526194 doi "https://doi.org/10.1007/978-981-19-9297-1_5" @default.
- W4317526194 hasPublicationYear "2022" @default.
- W4317526194 type Work @default.
- W4317526194 citedByCount "0" @default.
- W4317526194 crossrefType "book-chapter" @default.
- W4317526194 hasAuthorship W4317526194A5013532487 @default.
- W4317526194 hasAuthorship W4317526194A5035082721 @default.
- W4317526194 hasAuthorship W4317526194A5039085563 @default.
- W4317526194 hasAuthorship W4317526194A5050550539 @default.
- W4317526194 hasAuthorship W4317526194A5071773009 @default.
- W4317526194 hasAuthorship W4317526194A5090751520 @default.
- W4317526194 hasConcept C11413529 @default.
- W4317526194 hasConcept C142362112 @default.
- W4317526194 hasConcept C153349607 @default.
- W4317526194 hasConcept C154945302 @default.
- W4317526194 hasConcept C177264268 @default.
- W4317526194 hasConcept C199360897 @default.
- W4317526194 hasConcept C2776760102 @default.
- W4317526194 hasConcept C2777402240 @default.
- W4317526194 hasConcept C41008148 @default.
- W4317526194 hasConcept C48103436 @default.
- W4317526194 hasConcept C97541855 @default.
- W4317526194 hasConceptScore W4317526194C11413529 @default.
- W4317526194 hasConceptScore W4317526194C142362112 @default.
- W4317526194 hasConceptScore W4317526194C153349607 @default.
- W4317526194 hasConceptScore W4317526194C154945302 @default.
- W4317526194 hasConceptScore W4317526194C177264268 @default.
- W4317526194 hasConceptScore W4317526194C199360897 @default.
- W4317526194 hasConceptScore W4317526194C2776760102 @default.
- W4317526194 hasConceptScore W4317526194C2777402240 @default.
- W4317526194 hasConceptScore W4317526194C41008148 @default.
- W4317526194 hasConceptScore W4317526194C48103436 @default.
- W4317526194 hasConceptScore W4317526194C97541855 @default.
- W4317526194 hasLocation W43175261941 @default.
- W4317526194 hasOpenAccess W4317526194 @default.
- W4317526194 hasPrimaryLocation W43175261941 @default.
- W4317526194 hasRelatedWork W2373724792 @default.
- W4317526194 hasRelatedWork W260766989 @default.
- W4317526194 hasRelatedWork W2959276766 @default.
- W4317526194 hasRelatedWork W3074294383 @default.
- W4317526194 hasRelatedWork W3111983280 @default.
- W4317526194 hasRelatedWork W3139193008 @default.
- W4317526194 hasRelatedWork W3164468573 @default.
- W4317526194 hasRelatedWork W4200114095 @default.
- W4317526194 hasRelatedWork W4206669594 @default.
- W4317526194 hasRelatedWork W4295941380 @default.
- W4317526194 isParatext "false" @default.
- W4317526194 isRetracted "false" @default.
- W4317526194 workType "book-chapter" @default.