Matches in SemOpenAlex for { <https://semopenalex.org/work/W2952006856> ?p ?o ?g. }
Showing items 1 to 76 of
76
with 100 items per page.
- W2952006856 abstract "Stochastic games generalize Markov decision processes (MDPs) to a multiagent setting by allowing the state transitions to depend jointly on all player actions, and having rewards determined by multiplayer matrix games at each state. We consider the problem of computing Nash equilibria in stochastic games, the analogue of planning in MDPs. We begin by providing a generalization of finite-horizon value iteration that computes a Nash strategy for each player in generalsum stochastic games. The algorithm takes an arbitrary Nash selection function as input, which allows the translation of local choices between multiple Nash equilibria into the selection of a single global Nash equilibrium. Our main technical result is an algorithm for computing near-Nash equilibria in large or infinite state spaces. This algorithm builds on our finite-horizon value iteration algorithm, and adapts the sparse sampling methods of Kearns, Mansour and Ng (1999) to stochastic games. We conclude by descrbing a counterexample showing that infinite-horizon discounted value iteration, which was shown by shaplely to converge in the zero-sum case (a result we give extend slightly here), does not converge in the general-sum case." @default.
- W2952006856 created "2019-06-27" @default.
- W2952006856 creator A5014637159 @default.
- W2952006856 creator A5029730907 @default.
- W2952006856 creator A5065366930 @default.
- W2952006856 date "2013-01-16" @default.
- W2952006856 modified "2023-09-23" @default.
- W2952006856 title "Fast Planning in Stochastic Games" @default.
- W2952006856 cites W1556274146 @default.
- W2952006856 cites W1848214029 @default.
- W2952006856 cites W1968136409 @default.
- W2952006856 cites W2575731723 @default.
- W2952006856 cites W3023151133 @default.
- W2952006856 hasPublicationYear "2013" @default.
- W2952006856 type Work @default.
- W2952006856 sameAs 2952006856 @default.
- W2952006856 citedByCount "0" @default.
- W2952006856 crossrefType "posted-content" @default.
- W2952006856 hasAuthorship W2952006856A5014637159 @default.
- W2952006856 hasAuthorship W2952006856A5029730907 @default.
- W2952006856 hasAuthorship W2952006856A5065366930 @default.
- W2952006856 hasConcept C105795698 @default.
- W2952006856 hasConcept C106189395 @default.
- W2952006856 hasConcept C118615104 @default.
- W2952006856 hasConcept C126255220 @default.
- W2952006856 hasConcept C134306372 @default.
- W2952006856 hasConcept C144237770 @default.
- W2952006856 hasConcept C14646407 @default.
- W2952006856 hasConcept C159886148 @default.
- W2952006856 hasConcept C162838799 @default.
- W2952006856 hasConcept C177148314 @default.
- W2952006856 hasConcept C32407928 @default.
- W2952006856 hasConcept C33923547 @default.
- W2952006856 hasConcept C41008148 @default.
- W2952006856 hasConcept C46814582 @default.
- W2952006856 hasConceptScore W2952006856C105795698 @default.
- W2952006856 hasConceptScore W2952006856C106189395 @default.
- W2952006856 hasConceptScore W2952006856C118615104 @default.
- W2952006856 hasConceptScore W2952006856C126255220 @default.
- W2952006856 hasConceptScore W2952006856C134306372 @default.
- W2952006856 hasConceptScore W2952006856C144237770 @default.
- W2952006856 hasConceptScore W2952006856C14646407 @default.
- W2952006856 hasConceptScore W2952006856C159886148 @default.
- W2952006856 hasConceptScore W2952006856C162838799 @default.
- W2952006856 hasConceptScore W2952006856C177148314 @default.
- W2952006856 hasConceptScore W2952006856C32407928 @default.
- W2952006856 hasConceptScore W2952006856C33923547 @default.
- W2952006856 hasConceptScore W2952006856C41008148 @default.
- W2952006856 hasConceptScore W2952006856C46814582 @default.
- W2952006856 hasLocation W29520068561 @default.
- W2952006856 hasOpenAccess W2952006856 @default.
- W2952006856 hasPrimaryLocation W29520068561 @default.
- W2952006856 hasRelatedWork W1511309222 @default.
- W2952006856 hasRelatedWork W1533809857 @default.
- W2952006856 hasRelatedWork W1544822727 @default.
- W2952006856 hasRelatedWork W1566755558 @default.
- W2952006856 hasRelatedWork W1878531498 @default.
- W2952006856 hasRelatedWork W1977823909 @default.
- W2952006856 hasRelatedWork W2114120924 @default.
- W2952006856 hasRelatedWork W2151033164 @default.
- W2952006856 hasRelatedWork W2290131603 @default.
- W2952006856 hasRelatedWork W2315288221 @default.
- W2952006856 hasRelatedWork W2463221887 @default.
- W2952006856 hasRelatedWork W2485392636 @default.
- W2952006856 hasRelatedWork W2667760 @default.
- W2952006856 hasRelatedWork W2768727567 @default.
- W2952006856 hasRelatedWork W2783914570 @default.
- W2952006856 hasRelatedWork W2951924730 @default.
- W2952006856 hasRelatedWork W3046553904 @default.
- W2952006856 hasRelatedWork W3121228496 @default.
- W2952006856 hasRelatedWork W3123507951 @default.
- W2952006856 hasRelatedWork W3162477856 @default.
- W2952006856 isParatext "false" @default.
- W2952006856 isRetracted "false" @default.
- W2952006856 magId "2952006856" @default.
- W2952006856 workType "article" @default.