Matches in SemOpenAlex for { <https://semopenalex.org/work/W1799240622> ?p ?o ?g. }
Showing items 1 to 88 of
88
with 100 items per page.
- W1799240622 abstract "This paper addresses the problem of apprenticeship learning, that is learning control policies from demonstration by an expert. An efficient framework for it is inverse reinforcement learning (IRL). Based on the assumption that the expert maximizes a utility function, IRL aims at learning the underlying reward from example trajectories. Many IRL algorithms assume that the reward function is linearly parameterized and rely on the computation of some associated feature expectations , which is done through Monte Carlo simulation. However, this assumes to have full trajectories for the expert policy as well as at least a generative model for intermediate policies. In this paper, we introduce a temporal difference method, namely LSTD-μ , to compute these feature expectations. This allows extending apprenticeship learning to a batch and off-policy setting." @default.
- W1799240622 created "2016-06-24" @default.
- W1799240622 creator A5000665687 @default.
- W1799240622 creator A5004267040 @default.
- W1799240622 creator A5008793524 @default.
- W1799240622 date "2012-01-01" @default.
- W1799240622 modified "2023-09-25" @default.
- W1799240622 title "Batch, Off-Policy and Model-Free Apprenticeship Learning" @default.
- W1799240622 cites W1507222174 @default.
- W1799240622 cites W1999874108 @default.
- W1799240622 cites W2031571562 @default.
- W1799240622 cites W2102847492 @default.
- W1799240622 cites W2169498096 @default.
- W1799240622 cites W2544683879 @default.
- W1799240622 doi "https://doi.org/10.1007/978-3-642-29946-9_28" @default.
- W1799240622 hasPublicationYear "2012" @default.
- W1799240622 type Work @default.
- W1799240622 sameAs 1799240622 @default.
- W1799240622 citedByCount "18" @default.
- W1799240622 countsByYear W17992406222012 @default.
- W1799240622 countsByYear W17992406222013 @default.
- W1799240622 countsByYear W17992406222015 @default.
- W1799240622 countsByYear W17992406222017 @default.
- W1799240622 countsByYear W17992406222020 @default.
- W1799240622 countsByYear W17992406222021 @default.
- W1799240622 crossrefType "book-chapter" @default.
- W1799240622 hasAuthorship W1799240622A5000665687 @default.
- W1799240622 hasAuthorship W1799240622A5004267040 @default.
- W1799240622 hasAuthorship W1799240622A5008793524 @default.
- W1799240622 hasBestOaLocation W17992406222 @default.
- W1799240622 hasConcept C107806365 @default.
- W1799240622 hasConcept C108583219 @default.
- W1799240622 hasConcept C11413529 @default.
- W1799240622 hasConcept C119857082 @default.
- W1799240622 hasConcept C138885662 @default.
- W1799240622 hasConcept C14036430 @default.
- W1799240622 hasConcept C154945302 @default.
- W1799240622 hasConcept C165464430 @default.
- W1799240622 hasConcept C2776401178 @default.
- W1799240622 hasConcept C2778827112 @default.
- W1799240622 hasConcept C41008148 @default.
- W1799240622 hasConcept C41895202 @default.
- W1799240622 hasConcept C78458016 @default.
- W1799240622 hasConcept C86803240 @default.
- W1799240622 hasConcept C97541855 @default.
- W1799240622 hasConceptScore W1799240622C107806365 @default.
- W1799240622 hasConceptScore W1799240622C108583219 @default.
- W1799240622 hasConceptScore W1799240622C11413529 @default.
- W1799240622 hasConceptScore W1799240622C119857082 @default.
- W1799240622 hasConceptScore W1799240622C138885662 @default.
- W1799240622 hasConceptScore W1799240622C14036430 @default.
- W1799240622 hasConceptScore W1799240622C154945302 @default.
- W1799240622 hasConceptScore W1799240622C165464430 @default.
- W1799240622 hasConceptScore W1799240622C2776401178 @default.
- W1799240622 hasConceptScore W1799240622C2778827112 @default.
- W1799240622 hasConceptScore W1799240622C41008148 @default.
- W1799240622 hasConceptScore W1799240622C41895202 @default.
- W1799240622 hasConceptScore W1799240622C78458016 @default.
- W1799240622 hasConceptScore W1799240622C86803240 @default.
- W1799240622 hasConceptScore W1799240622C97541855 @default.
- W1799240622 hasLocation W17992406221 @default.
- W1799240622 hasLocation W17992406222 @default.
- W1799240622 hasOpenAccess W1799240622 @default.
- W1799240622 hasPrimaryLocation W17992406221 @default.
- W1799240622 hasRelatedWork W1591675293 @default.
- W1799240622 hasRelatedWork W169931978 @default.
- W1799240622 hasRelatedWork W1757796397 @default.
- W1799240622 hasRelatedWork W1999874108 @default.
- W1799240622 hasRelatedWork W2054795804 @default.
- W1799240622 hasRelatedWork W2061562262 @default.
- W1799240622 hasRelatedWork W2098774185 @default.
- W1799240622 hasRelatedWork W2100401322 @default.
- W1799240622 hasRelatedWork W2130005627 @default.
- W1799240622 hasRelatedWork W2133068870 @default.
- W1799240622 hasRelatedWork W2156211713 @default.
- W1799240622 hasRelatedWork W2174803659 @default.
- W1799240622 hasRelatedWork W2396161314 @default.
- W1799240622 hasRelatedWork W2396881363 @default.
- W1799240622 hasRelatedWork W2604382266 @default.
- W1799240622 hasRelatedWork W2963508354 @default.
- W1799240622 hasRelatedWork W2966492803 @default.
- W1799240622 hasRelatedWork W2994977742 @default.
- W1799240622 hasRelatedWork W3038962357 @default.
- W1799240622 hasRelatedWork W64088143 @default.
- W1799240622 isParatext "false" @default.
- W1799240622 isRetracted "false" @default.
- W1799240622 magId "1799240622" @default.
- W1799240622 workType "book-chapter" @default.