Matches in SemOpenAlex for { <https://semopenalex.org/work/W1988217924> ?p ?o ?g. }
Showing items 1 to 90 of
90
with 100 items per page.
- W1988217924 endingPage "109" @default.
- W1988217924 startingPage "71" @default.
- W1988217924 abstract "In this paper, we introduce the notion of a bounded-parameter Markov decision process (BMDP) as a generalization of the familiar exact MDP. A bounded-parameter MDP is a set of exact MDPs specified by giving upper and lower bounds on transition probabilities and rewards (all the MDPs in the set share the same state and action space). BMDPs form an efficiently solvable special case of the already known class of MDPs with imprecise parameters (MDPIPs). Bounded-parameter MDPs can be used to represent variation or uncertainty concerning the parameters of sequential decision problems in cases where no prior probabilities on the parameter values are available. Bounded-parameter MDPs can also be used in aggregation schemes to represent the variation in the transition probabilities for different base states aggregated together in the same aggregate state. We introduce interval value functions as a natural extension of traditional value functions. An interval value function assigns a closed real interval to each state, representing the assertion that the value of that state falls within that interval. An interval value function can be used to bound the performance of a policy over the set of exact MDPs associated with a given bounded-parameter MDP. We describe an iterative dynamic programming algorithm called interval policy evaluation that computes an interval value function for a given BMDP and specified policy. Interval policy evaluation on a policy π computes the most restrictive interval value function that is sound, i.e., that bounds the value function for π in every exact MDP in the set defined by the bounded-parameter MDP. We define optimistic and pessimistic criteria for optimality, and provide a variant of value iteration (Bellman, 1957) that we call interval value iteration that computes policies for a BMDP that are optimal with respect to these criteria. We show that each algorithm we present converges to the desired values in a polynomial number of iterations given a fixed discount factor." @default.
- W1988217924 created "2016-06-24" @default.
- W1988217924 creator A5007751369 @default.
- W1988217924 creator A5041931115 @default.
- W1988217924 creator A5088683816 @default.
- W1988217924 date "2000-09-01" @default.
- W1988217924 modified "2023-10-17" @default.
- W1988217924 title "Bounded-parameter Markov decision processes" @default.
- W1988217924 cites W2036931518 @default.
- W1988217924 cites W2061504687 @default.
- W1988217924 cites W2069045459 @default.
- W1988217924 cites W2101167844 @default.
- W1988217924 cites W2146443332 @default.
- W1988217924 cites W2149476049 @default.
- W1988217924 cites W2575731723 @default.
- W1988217924 doi "https://doi.org/10.1016/s0004-3702(00)00047-3" @default.
- W1988217924 hasPublicationYear "2000" @default.
- W1988217924 type Work @default.
- W1988217924 sameAs 1988217924 @default.
- W1988217924 citedByCount "257" @default.
- W1988217924 countsByYear W19882179242012 @default.
- W1988217924 countsByYear W19882179242013 @default.
- W1988217924 countsByYear W19882179242014 @default.
- W1988217924 countsByYear W19882179242015 @default.
- W1988217924 countsByYear W19882179242016 @default.
- W1988217924 countsByYear W19882179242017 @default.
- W1988217924 countsByYear W19882179242018 @default.
- W1988217924 countsByYear W19882179242019 @default.
- W1988217924 countsByYear W19882179242020 @default.
- W1988217924 countsByYear W19882179242021 @default.
- W1988217924 countsByYear W19882179242022 @default.
- W1988217924 countsByYear W19882179242023 @default.
- W1988217924 crossrefType "journal-article" @default.
- W1988217924 hasAuthorship W1988217924A5007751369 @default.
- W1988217924 hasAuthorship W1988217924A5041931115 @default.
- W1988217924 hasAuthorship W1988217924A5088683816 @default.
- W1988217924 hasBestOaLocation W19882179241 @default.
- W1988217924 hasConcept C105795698 @default.
- W1988217924 hasConcept C106189395 @default.
- W1988217924 hasConcept C114614502 @default.
- W1988217924 hasConcept C126255220 @default.
- W1988217924 hasConcept C134306372 @default.
- W1988217924 hasConcept C14036430 @default.
- W1988217924 hasConcept C14646407 @default.
- W1988217924 hasConcept C159886148 @default.
- W1988217924 hasConcept C177148314 @default.
- W1988217924 hasConcept C2778067643 @default.
- W1988217924 hasConcept C28826006 @default.
- W1988217924 hasConcept C33923547 @default.
- W1988217924 hasConcept C34388435 @default.
- W1988217924 hasConcept C41008148 @default.
- W1988217924 hasConcept C78458016 @default.
- W1988217924 hasConcept C86803240 @default.
- W1988217924 hasConceptScore W1988217924C105795698 @default.
- W1988217924 hasConceptScore W1988217924C106189395 @default.
- W1988217924 hasConceptScore W1988217924C114614502 @default.
- W1988217924 hasConceptScore W1988217924C126255220 @default.
- W1988217924 hasConceptScore W1988217924C134306372 @default.
- W1988217924 hasConceptScore W1988217924C14036430 @default.
- W1988217924 hasConceptScore W1988217924C14646407 @default.
- W1988217924 hasConceptScore W1988217924C159886148 @default.
- W1988217924 hasConceptScore W1988217924C177148314 @default.
- W1988217924 hasConceptScore W1988217924C2778067643 @default.
- W1988217924 hasConceptScore W1988217924C28826006 @default.
- W1988217924 hasConceptScore W1988217924C33923547 @default.
- W1988217924 hasConceptScore W1988217924C34388435 @default.
- W1988217924 hasConceptScore W1988217924C41008148 @default.
- W1988217924 hasConceptScore W1988217924C78458016 @default.
- W1988217924 hasConceptScore W1988217924C86803240 @default.
- W1988217924 hasIssue "1-2" @default.
- W1988217924 hasLocation W19882179241 @default.
- W1988217924 hasOpenAccess W1988217924 @default.
- W1988217924 hasPrimaryLocation W19882179241 @default.
- W1988217924 hasRelatedWork W1982461629 @default.
- W1988217924 hasRelatedWork W1988217924 @default.
- W1988217924 hasRelatedWork W1990452411 @default.
- W1988217924 hasRelatedWork W2112369737 @default.
- W1988217924 hasRelatedWork W2149476049 @default.
- W1988217924 hasRelatedWork W2155355065 @default.
- W1988217924 hasRelatedWork W2156021013 @default.
- W1988217924 hasRelatedWork W2156992384 @default.
- W1988217924 hasRelatedWork W2172170808 @default.
- W1988217924 hasRelatedWork W4297610284 @default.
- W1988217924 hasVolume "122" @default.
- W1988217924 isParatext "false" @default.
- W1988217924 isRetracted "false" @default.
- W1988217924 magId "1988217924" @default.
- W1988217924 workType "article" @default.