Matches in SemOpenAlex for { <https://semopenalex.org/work/W2911850043> ?p ?o ?g. }
Showing items 1 to 88 of 88, with 100 items per page.
- W2911850043 endingPage "2164" @default.
- W2911850043 startingPage "2162" @default.
- W2911850043 abstract "We propose Strong Emergent Policy (STEP) approximation, a scalable approach to learn strong decentralized policies for cooperative MAS with a distributed variant of policy iteration. For that, we use function approximation to learn from action recommendations of a decentralized multi-agent planning algorithm. STEP combines decentralized multi-agent planning with centralized learning, only requiring a generative model for distributed black box optimization. We experimentally evaluate STEP in two challenging and stochastic domains with large state and joint action spaces and show that STEP is able to learn stronger policies than standard multi-agent reinforcement learning algorithms, when combining multi-agent open-loop planning with centralized function approximation. The learned policies can be reintegrated into the multi-agent planning process to further improve performance." @default.
- W2911850043 created "2019-02-21" @default.
- W2911850043 creator A5024645861 @default.
- W2911850043 creator A5051707356 @default.
- W2911850043 creator A5061867637 @default.
- W2911850043 creator A5062401089 @default.
- W2911850043 creator A5064469861 @default.
- W2911850043 creator A5068564686 @default.
- W2911850043 date "2019-05-08" @default.
- W2911850043 modified "2023-10-18" @default.
- W2911850043 title "Distributed Policy Iteration for Scalable Approximation of Cooperative Multi-Agent Policies" @default.
- W2911850043 hasPublicationYear "2019" @default.
- W2911850043 type Work @default.
- W2911850043 sameAs 2911850043 @default.
- W2911850043 citedByCount "1" @default.
- W2911850043 countsByYear W29118500432021 @default.
- W2911850043 crossrefType "proceedings-article" @default.
- W2911850043 hasAuthorship W2911850043A5024645861 @default.
- W2911850043 hasAuthorship W2911850043A5051707356 @default.
- W2911850043 hasAuthorship W2911850043A5061867637 @default.
- W2911850043 hasAuthorship W2911850043A5062401089 @default.
- W2911850043 hasAuthorship W2911850043A5064469861 @default.
- W2911850043 hasAuthorship W2911850043A5068564686 @default.
- W2911850043 hasConcept C111919701 @default.
- W2911850043 hasConcept C120314980 @default.
- W2911850043 hasConcept C126255220 @default.
- W2911850043 hasConcept C14036430 @default.
- W2911850043 hasConcept C154945302 @default.
- W2911850043 hasConcept C26517878 @default.
- W2911850043 hasConcept C33923547 @default.
- W2911850043 hasConcept C38652104 @default.
- W2911850043 hasConcept C41008148 @default.
- W2911850043 hasConcept C48044578 @default.
- W2911850043 hasConcept C50644808 @default.
- W2911850043 hasConcept C55479107 @default.
- W2911850043 hasConcept C77088390 @default.
- W2911850043 hasConcept C78458016 @default.
- W2911850043 hasConcept C86803240 @default.
- W2911850043 hasConcept C91873725 @default.
- W2911850043 hasConcept C97541855 @default.
- W2911850043 hasConcept C98045186 @default.
- W2911850043 hasConceptScore W2911850043C111919701 @default.
- W2911850043 hasConceptScore W2911850043C120314980 @default.
- W2911850043 hasConceptScore W2911850043C126255220 @default.
- W2911850043 hasConceptScore W2911850043C14036430 @default.
- W2911850043 hasConceptScore W2911850043C154945302 @default.
- W2911850043 hasConceptScore W2911850043C26517878 @default.
- W2911850043 hasConceptScore W2911850043C33923547 @default.
- W2911850043 hasConceptScore W2911850043C38652104 @default.
- W2911850043 hasConceptScore W2911850043C41008148 @default.
- W2911850043 hasConceptScore W2911850043C48044578 @default.
- W2911850043 hasConceptScore W2911850043C50644808 @default.
- W2911850043 hasConceptScore W2911850043C55479107 @default.
- W2911850043 hasConceptScore W2911850043C77088390 @default.
- W2911850043 hasConceptScore W2911850043C78458016 @default.
- W2911850043 hasConceptScore W2911850043C86803240 @default.
- W2911850043 hasConceptScore W2911850043C91873725 @default.
- W2911850043 hasConceptScore W2911850043C97541855 @default.
- W2911850043 hasConceptScore W2911850043C98045186 @default.
- W2911850043 hasLocation W29118500431 @default.
- W2911850043 hasOpenAccess W2911850043 @default.
- W2911850043 hasPrimaryLocation W29118500431 @default.
- W2911850043 hasRelatedWork W1602079996 @default.
- W2911850043 hasRelatedWork W1616569656 @default.
- W2911850043 hasRelatedWork W2120268911 @default.
- W2911850043 hasRelatedWork W2395750143 @default.
- W2911850043 hasRelatedWork W2898993886 @default.
- W2911850043 hasRelatedWork W2902709966 @default.
- W2911850043 hasRelatedWork W2937587379 @default.
- W2911850043 hasRelatedWork W2953318161 @default.
- W2911850043 hasRelatedWork W3006194577 @default.
- W2911850043 hasRelatedWork W3087708082 @default.
- W2911850043 hasRelatedWork W3100801254 @default.
- W2911850043 hasRelatedWork W3121607432 @default.
- W2911850043 hasRelatedWork W3158051401 @default.
- W2911850043 hasRelatedWork W3173294282 @default.
- W2911850043 hasRelatedWork W3174374257 @default.
- W2911850043 hasRelatedWork W3196861772 @default.
- W2911850043 hasRelatedWork W3203898015 @default.
- W2911850043 hasRelatedWork W3205469446 @default.
- W2911850043 hasRelatedWork W3206822911 @default.
- W2911850043 hasRelatedWork W3208331203 @default.
- W2911850043 isParatext "false" @default.
- W2911850043 isRetracted "false" @default.
- W2911850043 magId "2911850043" @default.
- W2911850043 workType "article" @default.