Matches in SemOpenAlex for { <https://semopenalex.org/work/W1640548472> ?p ?o ?g. }
Showing items 1 to 93 of
93
with 100 items per page.
- W1640548472 abstract "Markov Decision Processes (MDPs) have been used to formulate many decision-making problems in science and engineering. The objective is to synthesize the best decision (action selection) policies to maximize expected rewards (minimize costs) in a given stochastic dynamical environment. In many practical scenarios (multi-agent systems, telecommunication, queuing, etc.), the decision-making problem can have state constraints that must be satisfied, which leads to Constrained MDP (CMDP) problems. In the presence of such state constraints, the optimal policies can be very hard to characterize. This paper introduces a new approach for finding non-stationary randomized policies for finite-horizon CMDPs. An efficient algorithm based on Linear Programming (LP) and duality theory is proposed, which gives the convex set of feasible policies and ensures that the expected total reward is above a computable lower-bound. The resulting decision policy is a randomized policy, which is the projection of the unconstrained deterministic MDP policy on this convex set. To the best of our knowledge, this is the first result in state constrained MDPs to give an efficient algorithm for generating finite horizon randomized policies for CMDP with optimality guarantees. A simulation example of a swarm of autonomous agents running MDPs is also presented to demonstrate the proposed CMDP solution algorithm." @default.
- W1640548472 created "2016-06-24" @default.
- W1640548472 creator A5027729106 @default.
- W1640548472 creator A5044772884 @default.
- W1640548472 date "2015-07-06" @default.
- W1640548472 modified "2023-09-27" @default.
- W1640548472 title "Finite-Horizon Markov Decision Processes with State Constraints" @default.
- W1640548472 cites W1515851193 @default.
- W1640548472 cites W1597160459 @default.
- W1640548472 cites W1988737313 @default.
- W1640548472 cites W2004883573 @default.
- W1640548472 cites W2010557618 @default.
- W1640548472 cites W2027614944 @default.
- W1640548472 cites W2040543362 @default.
- W1640548472 cites W2046579994 @default.
- W1640548472 cites W2047360753 @default.
- W1640548472 cites W2073384958 @default.
- W1640548472 cites W2097148013 @default.
- W1640548472 cites W2098542244 @default.
- W1640548472 cites W2109516767 @default.
- W1640548472 cites W2118761999 @default.
- W1640548472 cites W2120107478 @default.
- W1640548472 cites W2123649906 @default.
- W1640548472 cites W2155397423 @default.
- W1640548472 cites W2159089602 @default.
- W1640548472 cites W2160821179 @default.
- W1640548472 cites W2165882932 @default.
- W1640548472 cites W2296319761 @default.
- W1640548472 hasPublicationYear "2015" @default.
- W1640548472 type Work @default.
- W1640548472 sameAs 1640548472 @default.
- W1640548472 citedByCount "1" @default.
- W1640548472 countsByYear W16405484722020 @default.
- W1640548472 crossrefType "posted-content" @default.
- W1640548472 hasAuthorship W1640548472A5027729106 @default.
- W1640548472 hasAuthorship W1640548472A5044772884 @default.
- W1640548472 hasConcept C105795698 @default.
- W1640548472 hasConcept C106189395 @default.
- W1640548472 hasConcept C11413529 @default.
- W1640548472 hasConcept C118615104 @default.
- W1640548472 hasConcept C126255220 @default.
- W1640548472 hasConcept C134306372 @default.
- W1640548472 hasConcept C159886148 @default.
- W1640548472 hasConcept C162392398 @default.
- W1640548472 hasConcept C177264268 @default.
- W1640548472 hasConcept C199360897 @default.
- W1640548472 hasConcept C2778023678 @default.
- W1640548472 hasConcept C33923547 @default.
- W1640548472 hasConcept C41008148 @default.
- W1640548472 hasConcept C41045048 @default.
- W1640548472 hasConcept C48103436 @default.
- W1640548472 hasConceptScore W1640548472C105795698 @default.
- W1640548472 hasConceptScore W1640548472C106189395 @default.
- W1640548472 hasConceptScore W1640548472C11413529 @default.
- W1640548472 hasConceptScore W1640548472C118615104 @default.
- W1640548472 hasConceptScore W1640548472C126255220 @default.
- W1640548472 hasConceptScore W1640548472C134306372 @default.
- W1640548472 hasConceptScore W1640548472C159886148 @default.
- W1640548472 hasConceptScore W1640548472C162392398 @default.
- W1640548472 hasConceptScore W1640548472C177264268 @default.
- W1640548472 hasConceptScore W1640548472C199360897 @default.
- W1640548472 hasConceptScore W1640548472C2778023678 @default.
- W1640548472 hasConceptScore W1640548472C33923547 @default.
- W1640548472 hasConceptScore W1640548472C41008148 @default.
- W1640548472 hasConceptScore W1640548472C41045048 @default.
- W1640548472 hasConceptScore W1640548472C48103436 @default.
- W1640548472 hasLocation W16405484721 @default.
- W1640548472 hasOpenAccess W1640548472 @default.
- W1640548472 hasPrimaryLocation W16405484721 @default.
- W1640548472 hasRelatedWork W153346180 @default.
- W1640548472 hasRelatedWork W1593140824 @default.
- W1640548472 hasRelatedWork W1899613237 @default.
- W1640548472 hasRelatedWork W202853768 @default.
- W1640548472 hasRelatedWork W2043903686 @default.
- W1640548472 hasRelatedWork W2094153443 @default.
- W1640548472 hasRelatedWork W2114958204 @default.
- W1640548472 hasRelatedWork W2188911627 @default.
- W1640548472 hasRelatedWork W2200122131 @default.
- W1640548472 hasRelatedWork W2495570292 @default.
- W1640548472 hasRelatedWork W2570535465 @default.
- W1640548472 hasRelatedWork W2920238156 @default.
- W1640548472 hasRelatedWork W3009922106 @default.
- W1640548472 hasRelatedWork W3092636900 @default.
- W1640548472 hasRelatedWork W3154040352 @default.
- W1640548472 hasRelatedWork W3174740571 @default.
- W1640548472 hasRelatedWork W3196720387 @default.
- W1640548472 hasRelatedWork W3201878770 @default.
- W1640548472 hasRelatedWork W37272079 @default.
- W1640548472 hasRelatedWork W800657014 @default.
- W1640548472 isParatext "false" @default.
- W1640548472 isRetracted "false" @default.
- W1640548472 magId "1640548472" @default.
- W1640548472 workType "article" @default.