Matches in SemOpenAlex for { <https://semopenalex.org/work/W2891663723> ?p ?o ?g. }
- W2891663723 endingPage "8113" @default.
- W2891663723 startingPage "8102" @default.
- W2891663723 abstract "Scaling decision theoretic planning to large multiagent systems is challenging due to uncertainty and partial observability in the environment. We focus on a multiagent planning model subclass, relevant to urban settings, where agent interactions are dependent on their ``collective influence'' on each other, rather than their identities. Unlike previous work, we address a general setting where system reward is not decomposable among agents. We develop collective actor-critic RL approaches for this setting, and address the problem of multiagent credit assignment, and computing low variance policy gradient estimates that result in faster convergence to high quality solutions. We also develop difference rewards based credit assignment methods for the collective setting. Empirically our new approaches provide significantly better solutions than previous methods in the presence of global rewards on two real world problems modeling taxi fleet optimization and multiagent patrolling, and a synthetic grid navigation domain." @default.
- W2891663723 created "2018-09-27" @default.
- W2891663723 creator A5015466370 @default.
- W2891663723 creator A5051203367 @default.
- W2891663723 creator A5084307504 @default.
- W2891663723 date "2018-01-01" @default.
- W2891663723 modified "2023-09-23" @default.
- W2891663723 title "Credit Assignment For Collective Multiagent RL With Global Rewards" @default.
- W2891663723 cites W1586258527 @default.
- W2891663723 cites W1588304026 @default.
- W2891663723 cites W1641379095 @default.
- W2891663723 cites W169775351 @default.
- W2891663723 cites W1749432972 @default.
- W2891663723 cites W1841698879 @default.
- W2891663723 cites W1852310822 @default.
- W2891663723 cites W2009303086 @default.
- W2891663723 cites W2088956500 @default.
- W2891663723 cites W2098877483 @default.
- W2891663723 cites W2102764452 @default.
- W2891663723 cites W2110906765 @default.
- W2891663723 cites W2116600741 @default.
- W2891663723 cites W2122763142 @default.
- W2891663723 cites W2126673675 @default.
- W2891663723 cites W2134281179 @default.
- W2891663723 cites W2134779831 @default.
- W2891663723 cites W2144817350 @default.
- W2891663723 cites W2155027007 @default.
- W2891663723 cites W2162392127 @default.
- W2891663723 cites W2169560253 @default.
- W2891663723 cites W2189058185 @default.
- W2891663723 cites W2404646363 @default.
- W2891663723 cites W2548322271 @default.
- W2891663723 cites W2565610523 @default.
- W2891663723 cites W2785853704 @default.
- W2891663723 cites W2788249729 @default.
- W2891663723 cites W30453094 @default.
- W2891663723 cites W3093287223 @default.
- W2891663723 cites W6043852 @default.
- W2891663723 hasPublicationYear "2018" @default.
- W2891663723 type Work @default.
- W2891663723 sameAs 2891663723 @default.
- W2891663723 citedByCount "26" @default.
- W2891663723 countsByYear W28916637232018 @default.
- W2891663723 countsByYear W28916637232019 @default.
- W2891663723 countsByYear W28916637232020 @default.
- W2891663723 countsByYear W28916637232021 @default.
- W2891663723 crossrefType "proceedings-article" @default.
- W2891663723 hasAuthorship W2891663723A5015466370 @default.
- W2891663723 hasAuthorship W2891663723A5051203367 @default.
- W2891663723 hasAuthorship W2891663723A5084307504 @default.
- W2891663723 hasConcept C110698143 @default.
- W2891663723 hasConcept C111472728 @default.
- W2891663723 hasConcept C120314980 @default.
- W2891663723 hasConcept C120665830 @default.
- W2891663723 hasConcept C121332964 @default.
- W2891663723 hasConcept C126255220 @default.
- W2891663723 hasConcept C13280743 @default.
- W2891663723 hasConcept C134306372 @default.
- W2891663723 hasConcept C138885662 @default.
- W2891663723 hasConcept C154945302 @default.
- W2891663723 hasConcept C162324750 @default.
- W2891663723 hasConcept C17744445 @default.
- W2891663723 hasConcept C185798385 @default.
- W2891663723 hasConcept C192209626 @default.
- W2891663723 hasConcept C199539241 @default.
- W2891663723 hasConcept C205649164 @default.
- W2891663723 hasConcept C2777303404 @default.
- W2891663723 hasConcept C2779530757 @default.
- W2891663723 hasConcept C28826006 @default.
- W2891663723 hasConcept C33923547 @default.
- W2891663723 hasConcept C36299963 @default.
- W2891663723 hasConcept C36503486 @default.
- W2891663723 hasConcept C41008148 @default.
- W2891663723 hasConcept C41550386 @default.
- W2891663723 hasConcept C42475967 @default.
- W2891663723 hasConcept C50522688 @default.
- W2891663723 hasConceptScore W2891663723C110698143 @default.
- W2891663723 hasConceptScore W2891663723C111472728 @default.
- W2891663723 hasConceptScore W2891663723C120314980 @default.
- W2891663723 hasConceptScore W2891663723C120665830 @default.
- W2891663723 hasConceptScore W2891663723C121332964 @default.
- W2891663723 hasConceptScore W2891663723C126255220 @default.
- W2891663723 hasConceptScore W2891663723C13280743 @default.
- W2891663723 hasConceptScore W2891663723C134306372 @default.
- W2891663723 hasConceptScore W2891663723C138885662 @default.
- W2891663723 hasConceptScore W2891663723C154945302 @default.
- W2891663723 hasConceptScore W2891663723C162324750 @default.
- W2891663723 hasConceptScore W2891663723C17744445 @default.
- W2891663723 hasConceptScore W2891663723C185798385 @default.
- W2891663723 hasConceptScore W2891663723C192209626 @default.
- W2891663723 hasConceptScore W2891663723C199539241 @default.
- W2891663723 hasConceptScore W2891663723C205649164 @default.
- W2891663723 hasConceptScore W2891663723C2777303404 @default.
- W2891663723 hasConceptScore W2891663723C2779530757 @default.
- W2891663723 hasConceptScore W2891663723C28826006 @default.
- W2891663723 hasConceptScore W2891663723C33923547 @default.
- W2891663723 hasConceptScore W2891663723C36299963 @default.
- W2891663723 hasConceptScore W2891663723C36503486 @default.