Matches in SemOpenAlex for { <https://semopenalex.org/work/W3039179219> ?p ?o ?g. }
- W3039179219 abstract "Supervised imitation learning, also known as behavioral cloning, suffers from distribution drift leading to failures during policy execution. One approach to mitigate this issue is to allow an expert to correct the agent's actions during task execution, based on the expert's determination that the agent has reached a `point of no return.' The agent's policy is then retrained using this new corrective data. This approach alone can enable high-performance agents to be learned, but at a substantial cost: the expert must vigilantly observe execution until the policy reaches a specified level of success, and even at that point, there is no guarantee that the policy will always succeed. To address these limitations, we present FIRE (Failure Identification to Reduce Expert Burden in intervention-based learning), a system that can predict when a running policy will fail, halt its execution, and request a correction from the expert. Unlike existing approaches that learn only from expert data, our approach learns from both expert and non-expert data, akin to adversarial learning. We demonstrate experimentally for a series of challenging manipulation tasks that our method is able to recognize state-action pairs that lead to failures. This permits seamless integration into an intervention-based learning system, where we show an order-of-magnitude gain in sample efficiency compared with a state-of-the-art inverse reinforcement learning method and dramatically improved performance over an equivalent amount of data learned with behavioral cloning." @default.
- W3039179219 created "2020-07-10" @default.
- W3039179219 creator A5001210516 @default.
- W3039179219 creator A5036575954 @default.
- W3039179219 creator A5083325657 @default.
- W3039179219 date "2020-07-01" @default.
- W3039179219 modified "2023-09-27" @default.
- W3039179219 title "Fighting Failures with FIRE: Failure Identification to Reduce Expert Burden in Intervention-Based Learning." @default.
- W3039179219 cites W1771410628 @default.
- W3039179219 cites W1980969546 @default.
- W3039179219 cites W2099471712 @default.
- W3039179219 cites W2167224731 @default.
- W3039179219 cites W2295827790 @default.
- W3039179219 cites W2296673577 @default.
- W3039179219 cites W2342840547 @default.
- W3039179219 cites W2396217537 @default.
- W3039179219 cites W2409942531 @default.
- W3039179219 cites W2594640072 @default.
- W3039179219 cites W2604173613 @default.
- W3039179219 cites W2736601468 @default.
- W3039179219 cites W2789008106 @default.
- W3039179219 cites W2806163172 @default.
- W3039179219 cites W2890026535 @default.
- W3039179219 cites W2911383979 @default.
- W3039179219 cites W2949608212 @default.
- W3039179219 cites W2962879692 @default.
- W3039179219 cites W2962957031 @default.
- W3039179219 cites W2963277051 @default.
- W3039179219 cites W2963367680 @default.
- W3039179219 cites W2963411833 @default.
- W3039179219 cites W2963508354 @default.
- W3039179219 cites W2963669336 @default.
- W3039179219 cites W2963923407 @default.
- W3039179219 cites W2964121744 @default.
- W3039179219 cites W2966994213 @default.
- W3039179219 cites W2968806961 @default.
- W3039179219 cites W2976087063 @default.
- W3039179219 cites W3003342008 @default.
- W3039179219 cites W3004815632 @default.
- W3039179219 cites W3037625705 @default.
- W3039179219 cites W3040490156 @default.
- W3039179219 cites W3210984963 @default.
- W3039179219 hasPublicationYear "2020" @default.
- W3039179219 type Work @default.
- W3039179219 sameAs 3039179219 @default.
- W3039179219 citedByCount "1" @default.
- W3039179219 countsByYear W30391792192020 @default.
- W3039179219 crossrefType "posted-content" @default.
- W3039179219 hasAuthorship W3039179219A5001210516 @default.
- W3039179219 hasAuthorship W3039179219A5036575954 @default.
- W3039179219 hasAuthorship W3039179219A5083325657 @default.
- W3039179219 hasConcept C105002631 @default.
- W3039179219 hasConcept C116834253 @default.
- W3039179219 hasConcept C119857082 @default.
- W3039179219 hasConcept C127413603 @default.
- W3039179219 hasConcept C154945302 @default.
- W3039179219 hasConcept C201995342 @default.
- W3039179219 hasConcept C2780451532 @default.
- W3039179219 hasConcept C41008148 @default.
- W3039179219 hasConcept C58328972 @default.
- W3039179219 hasConcept C59822182 @default.
- W3039179219 hasConcept C86803240 @default.
- W3039179219 hasConcept C97541855 @default.
- W3039179219 hasConceptScore W3039179219C105002631 @default.
- W3039179219 hasConceptScore W3039179219C116834253 @default.
- W3039179219 hasConceptScore W3039179219C119857082 @default.
- W3039179219 hasConceptScore W3039179219C127413603 @default.
- W3039179219 hasConceptScore W3039179219C154945302 @default.
- W3039179219 hasConceptScore W3039179219C201995342 @default.
- W3039179219 hasConceptScore W3039179219C2780451532 @default.
- W3039179219 hasConceptScore W3039179219C41008148 @default.
- W3039179219 hasConceptScore W3039179219C58328972 @default.
- W3039179219 hasConceptScore W3039179219C59822182 @default.
- W3039179219 hasConceptScore W3039179219C86803240 @default.
- W3039179219 hasConceptScore W3039179219C97541855 @default.
- W3039179219 hasLocation W30391792191 @default.
- W3039179219 hasOpenAccess W3039179219 @default.
- W3039179219 hasPrimaryLocation W30391792191 @default.
- W3039179219 hasRelatedWork W107245055 @default.
- W3039179219 hasRelatedWork W1419106667 @default.
- W3039179219 hasRelatedWork W1494395068 @default.
- W3039179219 hasRelatedWork W1550546124 @default.
- W3039179219 hasRelatedWork W158504997 @default.
- W3039179219 hasRelatedWork W193415179 @default.
- W3039179219 hasRelatedWork W2556606924 @default.
- W3039179219 hasRelatedWork W2809278843 @default.
- W3039179219 hasRelatedWork W2911048887 @default.
- W3039179219 hasRelatedWork W2949760349 @default.
- W3039179219 hasRelatedWork W2979071952 @default.
- W3039179219 hasRelatedWork W2985613481 @default.
- W3039179219 hasRelatedWork W3001411582 @default.
- W3039179219 hasRelatedWork W3092270557 @default.
- W3039179219 hasRelatedWork W3096386778 @default.
- W3039179219 hasRelatedWork W3105626663 @default.
- W3039179219 hasRelatedWork W3119172160 @default.
- W3039179219 hasRelatedWork W3132737487 @default.
- W3039179219 hasRelatedWork W3154061550 @default.
- W3039179219 hasRelatedWork W3927766 @default.
- W3039179219 isParatext "false" @default.
- W3039179219 isRetracted "false" @default.