Matches in SemOpenAlex for { <https://semopenalex.org/work/W3182474098> ?p ?o ?g. }
Showing items 1 to 91 of 91 with 100 items per page.
- W3182474098 endingPage "329" @default.
- W3182474098 startingPage "313" @default.
- W3182474098 abstract "We address the problem of imitation learning with multi-modal demonstrations. Instead of attempting to learn all modes, we argue that in many tasks it is sufficient to imitate any one of them. We show that the state-of-the-art methods such as GAIL and behavior cloning, due to their choice of loss function, often incorrectly interpolate between such modes. Our key insight is to minimize the right divergence between the learner and the expert state-action distributions, namely the reverse KL divergence or I-projection. We propose a general imitation learning framework for estimating and minimizing any f-Divergence. By plugging in different divergences, we are able to recover existing algorithms such as Behavior Cloning (Kullback-Leibler), GAIL (Jensen Shannon) and DAgger (Total Variation). Empirical results show that our approximate I-projection technique is able to imitate multi-modal behaviors more reliably than GAIL and behavior cloning." @default.
- W3182474098 created "2021-07-19" @default.
- W3182474098 creator A5023240185 @default.
- W3182474098 creator A5026314250 @default.
- W3182474098 creator A5032266950 @default.
- W3182474098 creator A5045998647 @default.
- W3182474098 creator A5057995939 @default.
- W3182474098 creator A5077719529 @default.
- W3182474098 date "2021-01-01" @default.
- W3182474098 modified "2023-10-06" @default.
- W3182474098 title "Imitation Learning as f-Divergence Minimization" @default.
- W3182474098 cites W1567876833 @default.
- W3182474098 cites W1974314970 @default.
- W3182474098 cites W1975463331 @default.
- W3182474098 cites W1980969546 @default.
- W3182474098 cites W1986014385 @default.
- W3182474098 cites W1999874108 @default.
- W3182474098 cites W2055309977 @default.
- W3182474098 cites W2131940723 @default.
- W3182474098 cites W2166944917 @default.
- W3182474098 cites W2169498096 @default.
- W3182474098 cites W2347074400 @default.
- W3182474098 cites W2409942531 @default.
- W3182474098 cites W2500624988 @default.
- W3182474098 cites W2527925052 @default.
- W3182474098 cites W2593841437 @default.
- W3182474098 cites W2794908222 @default.
- W3182474098 cites W2894766094 @default.
- W3182474098 cites W2894978157 @default.
- W3182474098 cites W2962787969 @default.
- W3182474098 cites W2963411833 @default.
- W3182474098 cites W2963576857 @default.
- W3182474098 doi "https://doi.org/10.1007/978-3-030-66723-8_19" @default.
- W3182474098 hasPublicationYear "2021" @default.
- W3182474098 type Work @default.
- W3182474098 sameAs 3182474098 @default.
- W3182474098 citedByCount "16" @default.
- W3182474098 countsByYear W31824740982020 @default.
- W3182474098 countsByYear W31824740982021 @default.
- W3182474098 countsByYear W31824740982022 @default.
- W3182474098 countsByYear W31824740982023 @default.
- W3182474098 crossrefType "book-chapter" @default.
- W3182474098 hasAuthorship W3182474098A5023240185 @default.
- W3182474098 hasAuthorship W3182474098A5026314250 @default.
- W3182474098 hasAuthorship W3182474098A5032266950 @default.
- W3182474098 hasAuthorship W3182474098A5045998647 @default.
- W3182474098 hasAuthorship W3182474098A5057995939 @default.
- W3182474098 hasAuthorship W3182474098A5077719529 @default.
- W3182474098 hasBestOaLocation W31824740982 @default.
- W3182474098 hasConcept C126255220 @default.
- W3182474098 hasConcept C126388530 @default.
- W3182474098 hasConcept C138885662 @default.
- W3182474098 hasConcept C147764199 @default.
- W3182474098 hasConcept C154945302 @default.
- W3182474098 hasConcept C15744967 @default.
- W3182474098 hasConcept C169760540 @default.
- W3182474098 hasConcept C207390915 @default.
- W3182474098 hasConcept C33923547 @default.
- W3182474098 hasConcept C41008148 @default.
- W3182474098 hasConcept C41895202 @default.
- W3182474098 hasConceptScore W3182474098C126255220 @default.
- W3182474098 hasConceptScore W3182474098C126388530 @default.
- W3182474098 hasConceptScore W3182474098C138885662 @default.
- W3182474098 hasConceptScore W3182474098C147764199 @default.
- W3182474098 hasConceptScore W3182474098C154945302 @default.
- W3182474098 hasConceptScore W3182474098C15744967 @default.
- W3182474098 hasConceptScore W3182474098C169760540 @default.
- W3182474098 hasConceptScore W3182474098C207390915 @default.
- W3182474098 hasConceptScore W3182474098C33923547 @default.
- W3182474098 hasConceptScore W3182474098C41008148 @default.
- W3182474098 hasConceptScore W3182474098C41895202 @default.
- W3182474098 hasLocation W31824740981 @default.
- W3182474098 hasLocation W31824740982 @default.
- W3182474098 hasOpenAccess W3182474098 @default.
- W3182474098 hasPrimaryLocation W31824740981 @default.
- W3182474098 hasRelatedWork W187846026 @default.
- W3182474098 hasRelatedWork W2049520302 @default.
- W3182474098 hasRelatedWork W2064048246 @default.
- W3182474098 hasRelatedWork W2087118616 @default.
- W3182474098 hasRelatedWork W2343708061 @default.
- W3182474098 hasRelatedWork W2554771414 @default.
- W3182474098 hasRelatedWork W2997873209 @default.
- W3182474098 hasRelatedWork W3107474891 @default.
- W3182474098 hasRelatedWork W3121675266 @default.
- W3182474098 hasRelatedWork W4287755493 @default.
- W3182474098 isParatext "false" @default.
- W3182474098 isRetracted "false" @default.
- W3182474098 magId "3182474098" @default.
- W3182474098 workType "book-chapter" @default.