Matches in SemOpenAlex for { <https://semopenalex.org/work/W1594849649> ?p ?o ?g. }
- W1594849649 endingPage "730" @default.
- W1594849649 startingPage "691" @default.
- W1594849649 abstract "Inverse reinforcement learning (IRL) is the problem of recovering the underlying reward function from the behavior of an expert. Most of the existing IRL algorithms assume that the environment is modeled as a Markov decision process (MDP), although it is desirable to handle partially observable settings in order to handle more realistic scenarios. In this paper, we present IRL algorithms for partially observable environments that can be modeled as a partially observable Markov decision process (POMDP). We deal with two cases according to the representation of the given expert's behavior, namely the case in which the expert's policy is explicitly given, and the case in which the expert's trajectories are available instead. The IRL in POMDPs poses a greater challenge than in MDPs since it is not only ill-posed due to the nature of IRL, but also computationally intractable due to the hardness in solving POMDPs. To overcome these obstacles, we present algorithms that exploit some of the classical results from the POMDP literature. Experimental results on several benchmark POMDP domains show that our work is useful for partially observable settings." @default.
- W1594849649 created "2016-06-24" @default.
- W1594849649 creator A5029650176 @default.
- W1594849649 creator A5074569524 @default.
- W1594849649 date "2011-02-01" @default.
- W1594849649 modified "2023-09-23" @default.
- W1594849649 title "Inverse Reinforcement Learning in Partially Observable Environments" @default.
- W1594849649 cites W1484113995 @default.
- W1594849649 cites W1528120147 @default.
- W1594849649 cites W1542709260 @default.
- W1594849649 cites W1564229172 @default.
- W1594849649 cites W158205031 @default.
- W1594849649 cites W1777239053 @default.
- W1594849649 cites W1970916399 @default.
- W1594849649 cites W1981059217 @default.
- W1594849649 cites W1999874108 @default.
- W1594849649 cites W2011971614 @default.
- W1594849649 cites W2028145673 @default.
- W1594849649 cites W2031571562 @default.
- W1594849649 cites W2035346816 @default.
- W1594849649 cites W2061562262 @default.
- W1594849649 cites W2081030963 @default.
- W1594849649 cites W2097826433 @default.
- W1594849649 cites W2098774185 @default.
- W1594849649 cites W2099430963 @default.
- W1594849649 cites W2099873296 @default.
- W1594849649 cites W2101421095 @default.
- W1594849649 cites W2102847492 @default.
- W1594849649 cites W2106887613 @default.
- W1594849649 cites W2110962519 @default.
- W1594849649 cites W2113023245 @default.
- W1594849649 cites W2116442740 @default.
- W1594849649 cites W2117403214 @default.
- W1594849649 cites W2119785746 @default.
- W1594849649 cites W2134802714 @default.
- W1594849649 cites W2142819538 @default.
- W1594849649 cites W2154017217 @default.
- W1594849649 cites W2166692080 @default.
- W1594849649 cites W2167371646 @default.
- W1594849649 cites W2168359464 @default.
- W1594849649 cites W2169498096 @default.
- W1594849649 cites W2287282975 @default.
- W1594849649 cites W2438667436 @default.
- W1594849649 cites W246998018 @default.
- W1594849649 cites W2963889160 @default.
- W1594849649 cites W3214968550 @default.
- W1594849649 cites W1566073559 @default.
- W1594849649 cites W3215946589 @default.
- W1594849649 hasPublicationYear "2011" @default.
- W1594849649 type Work @default.
- W1594849649 sameAs 1594849649 @default.
- W1594849649 citedByCount "56" @default.
- W1594849649 countsByYear W15948496492012 @default.
- W1594849649 countsByYear W15948496492013 @default.
- W1594849649 countsByYear W15948496492014 @default.
- W1594849649 countsByYear W15948496492015 @default.
- W1594849649 countsByYear W15948496492016 @default.
- W1594849649 countsByYear W15948496492017 @default.
- W1594849649 countsByYear W15948496492018 @default.
- W1594849649 countsByYear W15948496492019 @default.
- W1594849649 countsByYear W15948496492020 @default.
- W1594849649 countsByYear W15948496492021 @default.
- W1594849649 countsByYear W15948496492022 @default.
- W1594849649 countsByYear W15948496492023 @default.
- W1594849649 crossrefType "journal-article" @default.
- W1594849649 hasAuthorship W1594849649A5029650176 @default.
- W1594849649 hasAuthorship W1594849649A5074569524 @default.
- W1594849649 hasConcept C105795698 @default.
- W1594849649 hasConcept C106189395 @default.
- W1594849649 hasConcept C111919701 @default.
- W1594849649 hasConcept C119857082 @default.
- W1594849649 hasConcept C121332964 @default.
- W1594849649 hasConcept C126255220 @default.
- W1594849649 hasConcept C13280743 @default.
- W1594849649 hasConcept C154945302 @default.
- W1594849649 hasConcept C159886148 @default.
- W1594849649 hasConcept C163836022 @default.
- W1594849649 hasConcept C165696696 @default.
- W1594849649 hasConcept C17098449 @default.
- W1594849649 hasConcept C17744445 @default.
- W1594849649 hasConcept C185798385 @default.
- W1594849649 hasConcept C199539241 @default.
- W1594849649 hasConcept C205649164 @default.
- W1594849649 hasConcept C2776359362 @default.
- W1594849649 hasConcept C32848918 @default.
- W1594849649 hasConcept C33923547 @default.
- W1594849649 hasConcept C38652104 @default.
- W1594849649 hasConcept C41008148 @default.
- W1594849649 hasConcept C62520636 @default.
- W1594849649 hasConcept C94625758 @default.
- W1594849649 hasConcept C97541855 @default.
- W1594849649 hasConcept C98045186 @default.
- W1594849649 hasConcept C98763669 @default.
- W1594849649 hasConceptScore W1594849649C105795698 @default.
- W1594849649 hasConceptScore W1594849649C106189395 @default.
- W1594849649 hasConceptScore W1594849649C111919701 @default.
- W1594849649 hasConceptScore W1594849649C119857082 @default.
- W1594849649 hasConceptScore W1594849649C121332964 @default.