Matches in SemOpenAlex for { <https://semopenalex.org/work/W3125031134> ?p ?o ?g. }
- W3125031134 abstract "In many operations management problems, we need to make decisions sequentially to minimize the cost while satisfying certain constraints. One modeling approach to study such problems is constrained Markov decision process (CMDP). When solving the CMDP to derive good operational policies, there are two key challenges: one is the prohibitively large state space and action space; the other is the hard-to-compute transition kernel. In this work, we develop a sampling-based primal-dual algorithm to solve CMDPs. Our approach alternatively applies regularized policy iteration to improve the policy and subgradient ascent to maintain the constraints. Under mild regularity conditions, we show that the algorithm converges at rate $ O(log(T)/sqrt{T})$, where T is the number of iterations. When the CMDP has a weakly coupled structure, our approach can substantially reduce the dimension of the problem through an embedded decomposition. We apply the algorithm to two important applications with weakly coupled structures: multi-product inventory management and multi-class queue scheduling, and show that it generates controls that outperform state-of-art heuristics." @default.
- W3125031134 created "2021-02-01" @default.
- W3125031134 creator A5028323106 @default.
- W3125031134 creator A5063800558 @default.
- W3125031134 creator A5078210646 @default.
- W3125031134 date "2021-01-26" @default.
- W3125031134 modified "2023-09-26" @default.
- W3125031134 title "A Primal-Dual Approach to Constrained Markov Decision Processes" @default.
- W3125031134 cites W1518931405 @default.
- W3125031134 cites W1575592356 @default.
- W3125031134 cites W1985077192 @default.
- W3125031134 cites W1989821343 @default.
- W3125031134 cites W1993986344 @default.
- W3125031134 cites W1999102639 @default.
- W3125031134 cites W2005155776 @default.
- W3125031134 cites W2020123437 @default.
- W3125031134 cites W2044069028 @default.
- W3125031134 cites W2073314543 @default.
- W3125031134 cites W2094153443 @default.
- W3125031134 cites W2104379682 @default.
- W3125031134 cites W2108197659 @default.
- W3125031134 cites W2119567691 @default.
- W3125031134 cites W2121863487 @default.
- W3125031134 cites W2192240806 @default.
- W3125031134 cites W2305373430 @default.
- W3125031134 cites W2510384539 @default.
- W3125031134 cites W2562316362 @default.
- W3125031134 cites W2601832346 @default.
- W3125031134 cites W2612788245 @default.
- W3125031134 cites W2616964725 @default.
- W3125031134 cites W2736601468 @default.
- W3125031134 cites W2741452308 @default.
- W3125031134 cites W2804791273 @default.
- W3125031134 cites W2911357856 @default.
- W3125031134 cites W2913022628 @default.
- W3125031134 cites W2949561945 @default.
- W3125031134 cites W2949608212 @default.
- W3125031134 cites W2952500758 @default.
- W3125031134 cites W2962734844 @default.
- W3125031134 cites W2970749192 @default.
- W3125031134 cites W2971587637 @default.
- W3125031134 cites W2972166034 @default.
- W3125031134 cites W2998619042 @default.
- W3125031134 cites W3001922646 @default.
- W3125031134 cites W3089787240 @default.
- W3125031134 cites W3099615934 @default.
- W3125031134 cites W3117137507 @default.
- W3125031134 cites W3124499345 @default.
- W3125031134 doi "https://doi.org/10.48550/arxiv.2101.10895" @default.
- W3125031134 hasPublicationYear "2021" @default.
- W3125031134 type Work @default.
- W3125031134 sameAs 3125031134 @default.
- W3125031134 citedByCount "3" @default.
- W3125031134 countsByYear W31250311342021 @default.
- W3125031134 countsByYear W31250311342022 @default.
- W3125031134 crossrefType "posted-content" @default.
- W3125031134 hasAuthorship W3125031134A5028323106 @default.
- W3125031134 hasAuthorship W3125031134A5063800558 @default.
- W3125031134 hasAuthorship W3125031134A5078210646 @default.
- W3125031134 hasBestOaLocation W31250311341 @default.
- W3125031134 hasConcept C105795698 @default.
- W3125031134 hasConcept C106189395 @default.
- W3125031134 hasConcept C121332964 @default.
- W3125031134 hasConcept C126255220 @default.
- W3125031134 hasConcept C127705205 @default.
- W3125031134 hasConcept C158968445 @default.
- W3125031134 hasConcept C159886148 @default.
- W3125031134 hasConcept C160403385 @default.
- W3125031134 hasConcept C163258240 @default.
- W3125031134 hasConcept C187633118 @default.
- W3125031134 hasConcept C199360897 @default.
- W3125031134 hasConcept C206729178 @default.
- W3125031134 hasConcept C33923547 @default.
- W3125031134 hasConcept C41008148 @default.
- W3125031134 hasConcept C62520636 @default.
- W3125031134 hasConcept C72434380 @default.
- W3125031134 hasConcept C89227174 @default.
- W3125031134 hasConceptScore W3125031134C105795698 @default.
- W3125031134 hasConceptScore W3125031134C106189395 @default.
- W3125031134 hasConceptScore W3125031134C121332964 @default.
- W3125031134 hasConceptScore W3125031134C126255220 @default.
- W3125031134 hasConceptScore W3125031134C127705205 @default.
- W3125031134 hasConceptScore W3125031134C158968445 @default.
- W3125031134 hasConceptScore W3125031134C159886148 @default.
- W3125031134 hasConceptScore W3125031134C160403385 @default.
- W3125031134 hasConceptScore W3125031134C163258240 @default.
- W3125031134 hasConceptScore W3125031134C187633118 @default.
- W3125031134 hasConceptScore W3125031134C199360897 @default.
- W3125031134 hasConceptScore W3125031134C206729178 @default.
- W3125031134 hasConceptScore W3125031134C33923547 @default.
- W3125031134 hasConceptScore W3125031134C41008148 @default.
- W3125031134 hasConceptScore W3125031134C62520636 @default.
- W3125031134 hasConceptScore W3125031134C72434380 @default.
- W3125031134 hasConceptScore W3125031134C89227174 @default.
- W3125031134 hasLocation W31250311341 @default.
- W3125031134 hasOpenAccess W3125031134 @default.
- W3125031134 hasPrimaryLocation W31250311341 @default.
- W3125031134 hasRelatedWork W127374984 @default.
- W3125031134 hasRelatedWork W1529126195 @default.
- W3125031134 hasRelatedWork W1980006501 @default.