Matches in SemOpenAlex for { <https://semopenalex.org/work/W2949673982> ?p ?o ?g. }
- W2949673982 abstract "Augmenting reinforcement learning with imitation learning is often hailed as a method by which to improve upon learning from scratch. However, most existing methods for integrating these two techniques are subject to several strong assumptions, chief among them that information about demonstrator actions is available. In this paper, we investigate the extent to which this assumption is necessary by introducing and evaluating reinforced inverse dynamics modeling (RIDM), a novel paradigm for combining imitation from observation (IfO) and reinforcement learning with no dependence on demonstrator action information. Moreover, RIDM requires only a single demonstration trajectory and is able to operate directly on raw (unaugmented) state features. We find experimentally that RIDM performs favorably compared to a baseline approach for several tasks in simulation as well as for tasks on a real UR5 robot arm. Experiment videos can be found at this https URL." @default.
- W2949673982 created "2019-06-27" @default.
- W2949673982 creator A5001594330 @default.
- W2949673982 creator A5008014974 @default.
- W2949673982 creator A5027320988 @default.
- W2949673982 creator A5060981901 @default.
- W2949673982 creator A5070355024 @default.
- W2949673982 date "2019-06-18" @default.
- W2949673982 modified "2023-10-01" @default.
- W2949673982 title "RIDM: Reinforced Inverse Dynamics Modeling for Learning from a Single Observed Demonstration" @default.
- W2949673982 cites W1516054196 @default.
- W2949673982 cites W1951992568 @default.
- W2949673982 cites W1986014385 @default.
- W2949673982 cites W2041242313 @default.
- W2949673982 cites W2051228319 @default.
- W2949673982 cites W2087617385 @default.
- W2949673982 cites W2116157560 @default.
- W2949673982 cites W2121863487 @default.
- W2949673982 cites W2137375617 @default.
- W2949673982 cites W2138537392 @default.
- W2949673982 cites W2148112459 @default.
- W2949673982 cites W2157340968 @default.
- W2949673982 cites W2158782408 @default.
- W2949673982 cites W2294422333 @default.
- W2949673982 cites W2342840547 @default.
- W2949673982 cites W2379502018 @default.
- W2949673982 cites W2415726935 @default.
- W2949673982 cites W2468462628 @default.
- W2949673982 cites W2481567506 @default.
- W2949673982 cites W2591957724 @default.
- W2949673982 cites W2592285981 @default.
- W2949673982 cites W2595845486 @default.
- W2949673982 cites W2596367596 @default.
- W2949673982 cites W2607198029 @default.
- W2949673982 cites W2736601468 @default.
- W2949673982 cites W2754517384 @default.
- W2949673982 cites W2757609746 @default.
- W2949673982 cites W2785962646 @default.
- W2949673982 cites W2788862220 @default.
- W2949673982 cites W2884247313 @default.
- W2949673982 cites W2946501375 @default.
- W2949673982 cites W2949121148 @default.
- W2949673982 cites W2949608212 @default.
- W2949673982 cites W2952165569 @default.
- W2949673982 cites W2962787969 @default.
- W2949673982 cites W2963099939 @default.
- W2949673982 cites W2963277051 @default.
- W2949673982 cites W2963802910 @default.
- W2949673982 cites W2964460729 @default.
- W2949673982 cites W2965273749 @default.
- W2949673982 cites W576027973 @default.
- W2949673982 cites W770013183 @default.
- W2949673982 hasPublicationYear "2019" @default.
- W2949673982 type Work @default.
- W2949673982 sameAs 2949673982 @default.
- W2949673982 citedByCount "10" @default.
- W2949673982 countsByYear W29496739822019 @default.
- W2949673982 countsByYear W29496739822020 @default.
- W2949673982 crossrefType "posted-content" @default.
- W2949673982 hasAuthorship W2949673982A5001594330 @default.
- W2949673982 hasAuthorship W2949673982A5008014974 @default.
- W2949673982 hasAuthorship W2949673982A5027320988 @default.
- W2949673982 hasAuthorship W2949673982A5060981901 @default.
- W2949673982 hasAuthorship W2949673982A5070355024 @default.
- W2949673982 hasConcept C107457646 @default.
- W2949673982 hasConcept C111368507 @default.
- W2949673982 hasConcept C119857082 @default.
- W2949673982 hasConcept C121332964 @default.
- W2949673982 hasConcept C126388530 @default.
- W2949673982 hasConcept C12725497 @default.
- W2949673982 hasConcept C127313418 @default.
- W2949673982 hasConcept C1276947 @default.
- W2949673982 hasConcept C13662910 @default.
- W2949673982 hasConcept C145912823 @default.
- W2949673982 hasConcept C154945302 @default.
- W2949673982 hasConcept C15744967 @default.
- W2949673982 hasConcept C187523126 @default.
- W2949673982 hasConcept C199360897 @default.
- W2949673982 hasConcept C207467116 @default.
- W2949673982 hasConcept C24890656 @default.
- W2949673982 hasConcept C2524010 @default.
- W2949673982 hasConcept C2780791683 @default.
- W2949673982 hasConcept C2781235140 @default.
- W2949673982 hasConcept C33923547 @default.
- W2949673982 hasConcept C39920418 @default.
- W2949673982 hasConcept C41008148 @default.
- W2949673982 hasConcept C62520636 @default.
- W2949673982 hasConcept C74650414 @default.
- W2949673982 hasConcept C77805123 @default.
- W2949673982 hasConcept C90509273 @default.
- W2949673982 hasConcept C97541855 @default.
- W2949673982 hasConceptScore W2949673982C107457646 @default.
- W2949673982 hasConceptScore W2949673982C111368507 @default.
- W2949673982 hasConceptScore W2949673982C119857082 @default.
- W2949673982 hasConceptScore W2949673982C121332964 @default.
- W2949673982 hasConceptScore W2949673982C126388530 @default.
- W2949673982 hasConceptScore W2949673982C12725497 @default.
- W2949673982 hasConceptScore W2949673982C127313418 @default.
- W2949673982 hasConceptScore W2949673982C1276947 @default.
- W2949673982 hasConceptScore W2949673982C13662910 @default.