Matches in SemOpenAlex for { <https://semopenalex.org/work/W2952465248> ?p ?o ?g. }
- W2952465248 abstract "Many cooperative multiagent reinforcement learning environments provide agents with a sparse team-based reward, as well as a dense agent-specific reward that incentivizes learning basic skills. Training policies solely on the team-based reward is often difficult due to its sparsity. Furthermore, relying solely on the agent-specific reward is sub-optimal because it usually does not capture the team coordination objective. A common approach is to use reward shaping to construct a proxy reward by combining the individual rewards. However, this requires manual tuning for each environment. We introduce Multiagent Evolutionary Reinforcement Learning (MERL), a split-level training platform that handles the two objectives separately through two optimization processes. An evolutionary algorithm maximizes the sparse team-based objective through neuroevolution on a population of teams. Concurrently, a gradient-based optimizer trains policies to only maximize the dense agent-specific rewards. The gradient-based policies are periodically added to the evolutionary population as a way of information transfer between the two optimization processes. This enables the evolutionary algorithm to use skills learned via the agent-specific rewards toward optimizing the global objective. Results demonstrate that MERL significantly outperforms state-of-the-art methods, such as MADDPG, on a number of difficult coordination benchmarks." @default.
- W2952465248 created "2019-06-27" @default.
- W2952465248 creator A5008832526 @default.
- W2952465248 creator A5013678286 @default.
- W2952465248 creator A5064779744 @default.
- W2952465248 creator A5070857594 @default.
- W2952465248 creator A5084748531 @default.
- W2952465248 date "2019-06-18" @default.
- W2952465248 modified "2023-09-27" @default.
- W2952465248 title "Evolutionary Reinforcement Learning for Sample-Efficient Multiagent Coordination" @default.
- W2952465248 cites W1504352750 @default.
- W2952465248 cites W1542941925 @default.
- W2952465248 cites W157468466 @default.
- W2952465248 cites W1603022299 @default.
- W2952465248 cites W1674110665 @default.
- W2952465248 cites W1777239053 @default.
- W2952465248 cites W1973183176 @default.
- W2952465248 cites W1978970913 @default.
- W2952465248 cites W2031369387 @default.
- W2952465248 cites W2034587361 @default.
- W2952465248 cites W2098877483 @default.
- W2952465248 cites W2121272499 @default.
- W2952465248 cites W2145339207 @default.
- W2952465248 cites W2171658832 @default.
- W2952465248 cites W2173248099 @default.
- W2952465248 cites W2262257077 @default.
- W2952465248 cites W2509924317 @default.
- W2952465248 cites W2530849036 @default.
- W2952465248 cites W2594794854 @default.
- W2952465248 cites W2596367596 @default.
- W2952465248 cites W2598247389 @default.
- W2952465248 cites W2617547828 @default.
- W2952465248 cites W2623431351 @default.
- W2952465248 cites W2749807327 @default.
- W2952465248 cites W2753984105 @default.
- W2952465248 cites W2772709170 @default.
- W2952465248 cites W2785542505 @default.
- W2952465248 cites W2924656332 @default.
- W2952465248 cites W2949201811 @default.
- W2952465248 cites W2949899112 @default.
- W2952465248 cites W2950472486 @default.
- W2952465248 cites W2951799422 @default.
- W2952465248 cites W2952095743 @default.
- W2952465248 cites W2963243556 @default.
- W2952465248 cites W2963689090 @default.
- W2952465248 cites W2963881016 @default.
- W2952465248 cites W2770298516 @default.
- W2952465248 hasPublicationYear "2019" @default.
- W2952465248 type Work @default.
- W2952465248 sameAs 2952465248 @default.
- W2952465248 citedByCount "3" @default.
- W2952465248 countsByYear W29524652482019 @default.
- W2952465248 countsByYear W29524652482020 @default.
- W2952465248 crossrefType "posted-content" @default.
- W2952465248 hasAuthorship W2952465248A5008832526 @default.
- W2952465248 hasAuthorship W2952465248A5013678286 @default.
- W2952465248 hasAuthorship W2952465248A5064779744 @default.
- W2952465248 hasAuthorship W2952465248A5070857594 @default.
- W2952465248 hasAuthorship W2952465248A5084748531 @default.
- W2952465248 hasConcept C118070581 @default.
- W2952465248 hasConcept C119857082 @default.
- W2952465248 hasConcept C144024400 @default.
- W2952465248 hasConcept C149923435 @default.
- W2952465248 hasConcept C154945302 @default.
- W2952465248 hasConcept C159149176 @default.
- W2952465248 hasConcept C2908647359 @default.
- W2952465248 hasConcept C41008148 @default.
- W2952465248 hasConcept C50644808 @default.
- W2952465248 hasConcept C97541855 @default.
- W2952465248 hasConceptScore W2952465248C118070581 @default.
- W2952465248 hasConceptScore W2952465248C119857082 @default.
- W2952465248 hasConceptScore W2952465248C144024400 @default.
- W2952465248 hasConceptScore W2952465248C149923435 @default.
- W2952465248 hasConceptScore W2952465248C154945302 @default.
- W2952465248 hasConceptScore W2952465248C159149176 @default.
- W2952465248 hasConceptScore W2952465248C2908647359 @default.
- W2952465248 hasConceptScore W2952465248C41008148 @default.
- W2952465248 hasConceptScore W2952465248C50644808 @default.
- W2952465248 hasConceptScore W2952465248C97541855 @default.
- W2952465248 hasLocation W29524652481 @default.
- W2952465248 hasOpenAccess W2952465248 @default.
- W2952465248 hasPrimaryLocation W29524652481 @default.
- W2952465248 hasRelatedWork W1969302761 @default.
- W2952465248 hasRelatedWork W2014394432 @default.
- W2952465248 hasRelatedWork W2095564494 @default.
- W2952465248 hasRelatedWork W2130703626 @default.
- W2952465248 hasRelatedWork W2152050309 @default.
- W2952465248 hasRelatedWork W2621111958 @default.
- W2952465248 hasRelatedWork W2796835114 @default.
- W2952465248 hasRelatedWork W2882982657 @default.
- W2952465248 hasRelatedWork W2897200624 @default.
- W2952465248 hasRelatedWork W2970868077 @default.
- W2952465248 hasRelatedWork W2997372031 @default.
- W2952465248 hasRelatedWork W3014001968 @default.
- W2952465248 hasRelatedWork W3037856217 @default.
- W2952465248 hasRelatedWork W3089491416 @default.
- W2952465248 hasRelatedWork W3091147964 @default.
- W2952465248 hasRelatedWork W3175972451 @default.
- W2952465248 hasRelatedWork W3199291429 @default.
- W2952465248 hasRelatedWork W3208111980 @default.