Matches in SemOpenAlex for { <https://semopenalex.org/work/W3047507071> ?p ?o ?g. }
- W3047507071 abstract "Learning from demonstration is widely used as an efficient way for robots to acquire new skills. However, it typically requires that demonstrations provide full access to the state and action sequences. In contrast, learning from observation offers a way to utilize unlabeled demonstrations (e.g., video) to perform imitation learning. One approach to this is behavioral cloning from observation (BCO). The original implementation of BCO proceeds by first learning an inverse dynamics model and then using that model to estimate action labels, thereby reducing the problem to behavioral cloning. However, existing approaches to BCO require a large number of initial interactions in the first step. Here, we provide a novel theoretical analysis of BCO, introduce a modification BCO*, and show that in the semi-supervised setting, BCO* can concurrently improve both its estimate for the inverse dynamics model and the expert policy. This result allows us to eliminate the dependence on initial interactions and dramatically improve the sample complexity of BCO. We evaluate the effectiveness of our algorithm through experiments on various benchmark domains. The results demonstrate that concurrent training not only improves over the performance of BCO but also results in performance that is competitive with state-of-the-art imitation learning methods such as GAIL and Value-Dice." @default.
- W3047507071 created "2020-08-10" @default.
- W3047507071 creator A5071030696 @default.
- W3047507071 creator A5073906876 @default.
- W3047507071 date "2020-08-03" @default.
- W3047507071 modified "2023-09-27" @default.
- W3047507071 title "Concurrent Training Improves the Performance of Behavioral Cloning from Observation." @default.
- W3047507071 cites W112666333 @default.
- W3047507071 cites W2099471712 @default.
- W3047507071 cites W2108598243 @default.
- W3047507071 cites W2134491302 @default.
- W3047507071 cites W2145339207 @default.
- W3047507071 cites W2174803659 @default.
- W3047507071 cites W2296673577 @default.
- W3047507071 cites W2512051764 @default.
- W3047507071 cites W2575705757 @default.
- W3047507071 cites W2604884452 @default.
- W3047507071 cites W2740067745 @default.
- W3047507071 cites W2781726626 @default.
- W3047507071 cites W2800415562 @default.
- W3047507071 cites W2809090039 @default.
- W3047507071 cites W2946501375 @default.
- W3047507071 cites W2949673982 @default.
- W3047507071 cites W2952854274 @default.
- W3047507071 cites W2962369866 @default.
- W3047507071 cites W2962787969 @default.
- W3047507071 cites W2962957031 @default.
- W3047507071 cites W2963277051 @default.
- W3047507071 cites W2963511511 @default.
- W3047507071 cites W2970003882 @default.
- W3047507071 cites W2978426779 @default.
- W3047507071 cites W2995146921 @default.
- W3047507071 cites W2996086858 @default.
- W3047507071 cites W3028821797 @default.
- W3047507071 hasPublicationYear "2020" @default.
- W3047507071 type Work @default.
- W3047507071 sameAs 3047507071 @default.
- W3047507071 citedByCount "0" @default.
- W3047507071 crossrefType "posted-content" @default.
- W3047507071 hasAuthorship W3047507071A5071030696 @default.
- W3047507071 hasAuthorship W3047507071A5073906876 @default.
- W3047507071 hasConcept C119857082 @default.
- W3047507071 hasConcept C121050878 @default.
- W3047507071 hasConcept C121332964 @default.
- W3047507071 hasConcept C126388530 @default.
- W3047507071 hasConcept C13280743 @default.
- W3047507071 hasConcept C154945302 @default.
- W3047507071 hasConcept C15744967 @default.
- W3047507071 hasConcept C185798385 @default.
- W3047507071 hasConcept C199360897 @default.
- W3047507071 hasConcept C205649164 @default.
- W3047507071 hasConcept C207467116 @default.
- W3047507071 hasConcept C2524010 @default.
- W3047507071 hasConcept C2776502983 @default.
- W3047507071 hasConcept C2780791683 @default.
- W3047507071 hasConcept C33923547 @default.
- W3047507071 hasConcept C41008148 @default.
- W3047507071 hasConcept C62520636 @default.
- W3047507071 hasConcept C77805123 @default.
- W3047507071 hasConceptScore W3047507071C119857082 @default.
- W3047507071 hasConceptScore W3047507071C121050878 @default.
- W3047507071 hasConceptScore W3047507071C121332964 @default.
- W3047507071 hasConceptScore W3047507071C126388530 @default.
- W3047507071 hasConceptScore W3047507071C13280743 @default.
- W3047507071 hasConceptScore W3047507071C154945302 @default.
- W3047507071 hasConceptScore W3047507071C15744967 @default.
- W3047507071 hasConceptScore W3047507071C185798385 @default.
- W3047507071 hasConceptScore W3047507071C199360897 @default.
- W3047507071 hasConceptScore W3047507071C205649164 @default.
- W3047507071 hasConceptScore W3047507071C207467116 @default.
- W3047507071 hasConceptScore W3047507071C2524010 @default.
- W3047507071 hasConceptScore W3047507071C2776502983 @default.
- W3047507071 hasConceptScore W3047507071C2780791683 @default.
- W3047507071 hasConceptScore W3047507071C33923547 @default.
- W3047507071 hasConceptScore W3047507071C41008148 @default.
- W3047507071 hasConceptScore W3047507071C62520636 @default.
- W3047507071 hasConceptScore W3047507071C77805123 @default.
- W3047507071 hasLocation W30475070711 @default.
- W3047507071 hasOpenAccess W3047507071 @default.
- W3047507071 hasPrimaryLocation W30475070711 @default.
- W3047507071 hasRelatedWork W2623259071 @default.
- W3047507071 hasRelatedWork W2626860042 @default.
- W3047507071 hasRelatedWork W2755614540 @default.
- W3047507071 hasRelatedWork W2787457957 @default.
- W3047507071 hasRelatedWork W2950602341 @default.
- W3047507071 hasRelatedWork W2952193948 @default.
- W3047507071 hasRelatedWork W2964177255 @default.
- W3047507071 hasRelatedWork W2977481643 @default.
- W3047507071 hasRelatedWork W3030598573 @default.
- W3047507071 hasRelatedWork W3034394486 @default.
- W3047507071 hasRelatedWork W3045585575 @default.
- W3047507071 hasRelatedWork W3048412490 @default.
- W3047507071 hasRelatedWork W3090121858 @default.
- W3047507071 hasRelatedWork W3093147807 @default.
- W3047507071 hasRelatedWork W3138564770 @default.
- W3047507071 hasRelatedWork W3167305842 @default.
- W3047507071 hasRelatedWork W3176334129 @default.
- W3047507071 hasRelatedWork W3201617237 @default.
- W3047507071 hasRelatedWork W3210984963 @default.
- W3047507071 hasRelatedWork W3214552351 @default.