Matches in SemOpenAlex for { <https://semopenalex.org/work/W2167117957> ?p ?o ?g. }
- W2167117957 endingPage "117" @default.
- W2167117957 startingPage "83" @default.
- W2167117957 abstract "Many motor skills in humanoid robotics can be learned using parametrized motor primitives as done in imitation learning. However, most interesting motor learning problems are high-dimensional reinforcement learning problems often beyond the reach of current methods. In this paper, we extend previous work on policy learning from the immediate reward case to episodic reinforcement learning. We show that this results in a general, common framework also connected to policy gradient methods and yielding a novel algorithm for policy learning that is particularly well-suited for dynamic motor primitives. The resulting algorithm is an EM-inspired algorithm applicable to complex motor learning tasks. We compare this algorithm to several well-known parametrized policy search methods and show that it outperforms them. We apply it in the context of motor learning and show that it can learn a complex Ball-in-a-Cup task using a real Barrett WAM™ robot arm." @default.
- W2167117957 created "2016-06-24" @default.
- W2167117957 creator A5035229829 @default.
- W2167117957 creator A5071367253 @default.
- W2167117957 date "2014-01-01" @default.
- W2167117957 modified "2023-10-06" @default.
- W2167117957 title "Policy Search for Motor Primitives in Robotics" @default.
- W2167117957 cites W1491843047 @default.
- W2167117957 cites W1515851193 @default.
- W2167117957 cites W1516801383 @default.
- W2167117957 cites W1530191702 @default.
- W2167117957 cites W1534355532 @default.
- W2167117957 cites W1555368087 @default.
- W2167117957 cites W1564755532 @default.
- W2167117957 cites W157454455 @default.
- W2167117957 cites W1575130825 @default.
- W2167117957 cites W1594783240 @default.
- W2167117957 cites W1601974704 @default.
- W2167117957 cites W173907030 @default.
- W2167117957 cites W1801737117 @default.
- W2167117957 cites W1832110895 @default.
- W2167117957 cites W1949804828 @default.
- W2167117957 cites W1949974402 @default.
- W2167117957 cites W1975318316 @default.
- W2167117957 cites W1988071341 @default.
- W2167117957 cites W1988560433 @default.
- W2167117957 cites W2049633694 @default.
- W2167117957 cites W2080039641 @default.
- W2167117957 cites W2095676919 @default.
- W2167117957 cites W2098546858 @default.
- W2167117957 cites W2105038027 @default.
- W2167117957 cites W2108579172 @default.
- W2167117957 cites W2109008048 @default.
- W2167117957 cites W2109169869 @default.
- W2167117957 cites W2110304639 @default.
- W2167117957 cites W2114537044 @default.
- W2167117957 cites W2117853077 @default.
- W2167117957 cites W2119717200 @default.
- W2167117957 cites W2120070743 @default.
- W2167117957 cites W2123327324 @default.
- W2167117957 cites W2123967136 @default.
- W2167117957 cites W2127107099 @default.
- W2167117957 cites W2129515556 @default.
- W2167117957 cites W2135194391 @default.
- W2167117957 cites W2136751544 @default.
- W2167117957 cites W2137104525 @default.
- W2167117957 cites W2137570937 @default.
- W2167117957 cites W2149192758 @default.
- W2167117957 cites W2154328025 @default.
- W2167117957 cites W2155027007 @default.
- W2167117957 cites W2156243072 @default.
- W2167117957 cites W2160162867 @default.
- W2167117957 cites W2161872510 @default.
- W2167117957 cites W2165131254 @default.
- W2167117957 cites W2165421048 @default.
- W2167117957 cites W2168501056 @default.
- W2167117957 cites W2277201334 @default.
- W2167117957 cites W2312442049 @default.
- W2167117957 cites W2408670836 @default.
- W2167117957 cites W2914656440 @default.
- W2167117957 cites W3022423118 @default.
- W2167117957 doi "https://doi.org/10.1007/978-3-319-03194-1_4" @default.
- W2167117957 hasPublicationYear "2014" @default.
- W2167117957 type Work @default.
- W2167117957 sameAs 2167117957 @default.
- W2167117957 citedByCount "133" @default.
- W2167117957 countsByYear W21671179572012 @default.
- W2167117957 countsByYear W21671179572013 @default.
- W2167117957 countsByYear W21671179572014 @default.
- W2167117957 countsByYear W21671179572015 @default.
- W2167117957 countsByYear W21671179572016 @default.
- W2167117957 countsByYear W21671179572017 @default.
- W2167117957 countsByYear W21671179572018 @default.
- W2167117957 countsByYear W21671179572019 @default.
- W2167117957 countsByYear W21671179572020 @default.
- W2167117957 countsByYear W21671179572021 @default.
- W2167117957 countsByYear W21671179572022 @default.
- W2167117957 crossrefType "book-chapter" @default.
- W2167117957 hasAuthorship W2167117957A5035229829 @default.
- W2167117957 hasAuthorship W2167117957A5071367253 @default.
- W2167117957 hasBestOaLocation W21671179572 @default.
- W2167117957 hasConcept C107457646 @default.
- W2167117957 hasConcept C154945302 @default.
- W2167117957 hasConcept C34413123 @default.
- W2167117957 hasConcept C41008148 @default.
- W2167117957 hasConcept C90509273 @default.
- W2167117957 hasConceptScore W2167117957C107457646 @default.
- W2167117957 hasConceptScore W2167117957C154945302 @default.
- W2167117957 hasConceptScore W2167117957C34413123 @default.
- W2167117957 hasConceptScore W2167117957C41008148 @default.
- W2167117957 hasConceptScore W2167117957C90509273 @default.
- W2167117957 hasLocation W21671179571 @default.
- W2167117957 hasLocation W21671179572 @default.
- W2167117957 hasLocation W21671179573 @default.
- W2167117957 hasLocation W21671179574 @default.
- W2167117957 hasOpenAccess W2167117957 @default.
- W2167117957 hasPrimaryLocation W21671179571 @default.
- W2167117957 hasRelatedWork W2116013011 @default.