Matches in SemOpenAlex for { <https://semopenalex.org/work/W4362589073> ?p ?o ?g. }
Showing items 1 to 82 of
82
with 100 items per page.
- W4362589073 endingPage "111024" @default.
- W4362589073 startingPage "111024" @default.
- W4362589073 abstract "Structural results impose sufficient conditions on the model parameters of a Markov decision process (MDP) so that the optimal policy is an increasing function of the underlying state. The classical assumptions for MDP structural results require supermodularity of the rewards and transition probabilities. However, supermodularity does not hold in many applications. This paper uses a sufficient condition for interval dominance (called I ) proposed in the micro-economics literature, to obtain structural results for MDPs under more general conditions. We present several MDP examples where supermodularity does not hold, yet I holds, and so the optimal policy is monotone; these include sigmoidal rewards (arising in prospect theory for human decision making), bi-diagonal and perturbed bi-diagonal transition matrices (in optimal allocation problems). We also consider MDPs with TP3 transition matrices and concave value functions. Finally, reinforcement learning algorithms that exploit the differential sparse structure of the optimal monotone policy are discussed." @default.
- W4362589073 created "2023-04-06" @default.
- W4362589073 creator A5068804090 @default.
- W4362589073 date "2023-07-01" @default.
- W4362589073 modified "2023-09-27" @default.
- W4362589073 title "Interval dominance based structural results for Markov decision process" @default.
- W4362589073 cites W2030947209 @default.
- W4362589073 cites W2045665199 @default.
- W4362589073 cites W2066986122 @default.
- W4362589073 cites W2124715093 @default.
- W4362589073 cites W2126640070 @default.
- W4362589073 cites W2612212767 @default.
- W4362589073 cites W3008838497 @default.
- W4362589073 cites W3011865677 @default.
- W4362589073 doi "https://doi.org/10.1016/j.automatica.2023.111024" @default.
- W4362589073 hasPublicationYear "2023" @default.
- W4362589073 type Work @default.
- W4362589073 citedByCount "0" @default.
- W4362589073 crossrefType "journal-article" @default.
- W4362589073 hasAuthorship W4362589073A5068804090 @default.
- W4362589073 hasBestOaLocation W43625890732 @default.
- W4362589073 hasConcept C105795698 @default.
- W4362589073 hasConcept C106189395 @default.
- W4362589073 hasConcept C114614502 @default.
- W4362589073 hasConcept C119857082 @default.
- W4362589073 hasConcept C126255220 @default.
- W4362589073 hasConcept C130367717 @default.
- W4362589073 hasConcept C144237770 @default.
- W4362589073 hasConcept C14646407 @default.
- W4362589073 hasConcept C159886148 @default.
- W4362589073 hasConcept C2524010 @default.
- W4362589073 hasConcept C2778067643 @default.
- W4362589073 hasConcept C2834757 @default.
- W4362589073 hasConcept C28826006 @default.
- W4362589073 hasConcept C33252445 @default.
- W4362589073 hasConcept C33923547 @default.
- W4362589073 hasConcept C41008148 @default.
- W4362589073 hasConcept C50644808 @default.
- W4362589073 hasConcept C81388566 @default.
- W4362589073 hasConcept C98763669 @default.
- W4362589073 hasConceptScore W4362589073C105795698 @default.
- W4362589073 hasConceptScore W4362589073C106189395 @default.
- W4362589073 hasConceptScore W4362589073C114614502 @default.
- W4362589073 hasConceptScore W4362589073C119857082 @default.
- W4362589073 hasConceptScore W4362589073C126255220 @default.
- W4362589073 hasConceptScore W4362589073C130367717 @default.
- W4362589073 hasConceptScore W4362589073C144237770 @default.
- W4362589073 hasConceptScore W4362589073C14646407 @default.
- W4362589073 hasConceptScore W4362589073C159886148 @default.
- W4362589073 hasConceptScore W4362589073C2524010 @default.
- W4362589073 hasConceptScore W4362589073C2778067643 @default.
- W4362589073 hasConceptScore W4362589073C2834757 @default.
- W4362589073 hasConceptScore W4362589073C28826006 @default.
- W4362589073 hasConceptScore W4362589073C33252445 @default.
- W4362589073 hasConceptScore W4362589073C33923547 @default.
- W4362589073 hasConceptScore W4362589073C41008148 @default.
- W4362589073 hasConceptScore W4362589073C50644808 @default.
- W4362589073 hasConceptScore W4362589073C81388566 @default.
- W4362589073 hasConceptScore W4362589073C98763669 @default.
- W4362589073 hasFunder F4320306076 @default.
- W4362589073 hasFunder F4320338279 @default.
- W4362589073 hasFunder F4320338281 @default.
- W4362589073 hasLocation W43625890731 @default.
- W4362589073 hasLocation W43625890732 @default.
- W4362589073 hasOpenAccess W4362589073 @default.
- W4362589073 hasPrimaryLocation W43625890731 @default.
- W4362589073 hasRelatedWork W1966452785 @default.
- W4362589073 hasRelatedWork W2076580159 @default.
- W4362589073 hasRelatedWork W2115182069 @default.
- W4362589073 hasRelatedWork W2117282672 @default.
- W4362589073 hasRelatedWork W2156992384 @default.
- W4362589073 hasRelatedWork W34472511 @default.
- W4362589073 hasRelatedWork W4221152175 @default.
- W4362589073 hasRelatedWork W4297940580 @default.
- W4362589073 hasRelatedWork W4308702637 @default.
- W4362589073 hasRelatedWork W4362589073 @default.
- W4362589073 hasVolume "153" @default.
- W4362589073 isParatext "false" @default.
- W4362589073 isRetracted "false" @default.
- W4362589073 workType "article" @default.