Matches in SemOpenAlex for { <https://semopenalex.org/work/W3159667783> ?p ?o ?g. }
- W3159667783 endingPage "5887" @default.
- W3159667783 startingPage "5873" @default.
- W3159667783 abstract "We are motivated by the real challenges presented in a human–robot system to develop new designs that are efficient at data level and with performance guarantees, such as stability and optimality at system level. Existing approximate/adaptive dynamic programming (ADP) results that consider system performance theoretically are not readily providing practically useful learning control algorithms for this problem, and reinforcement learning (RL) algorithms that address the issue of data efficiency usually do not have performance guarantees for the controlled system. This study fills these important voids by introducing innovative features to the policy iteration algorithm. We introduce flexible policy iteration (FPI), which can flexibly and organically integrate experience replay and supplemental values from prior experience into the RL controller. We show system-level performances, including convergence of the approximate value function, (sub)optimality of the solution, and stability of the system. We demonstrate the effectiveness of the FPI via realistic simulations of the human–robot system. It is noted that the problem we face in this study may be difficult to address by design methods based on classical control theory as it is nearly impossible to obtain a customized mathematical model of a human–robot system either online or offline. The results we have obtained also indicate the great potential of RL control to solving realistic and challenging problems with high-dimensional control inputs." @default.
- W3159667783 created "2021-05-10" @default.
- W3159667783 creator A5006374930 @default.
- W3159667783 creator A5022922423 @default.
- W3159667783 creator A5036072216 @default.
- W3159667783 creator A5036489015 @default.
- W3159667783 creator A5085653399 @default.
- W3159667783 date "2022-10-01" @default.
- W3159667783 modified "2023-10-17" @default.
- W3159667783 title "Reinforcement Learning Control of Robotic Knee With Human-in-the-Loop by Flexible Policy Iteration" @default.
- W3159667783 cites W158722652 @default.
- W3159667783 cites W1778882367 @default.
- W3159667783 cites W1825869920 @default.
- W3159667783 cites W1854776945 @default.
- W3159667783 cites W1914756871 @default.
- W3159667783 cites W1981932940 @default.
- W3159667783 cites W1982262386 @default.
- W3159667783 cites W2005675298 @default.
- W3159667783 cites W2017239762 @default.
- W3159667783 cites W2017265991 @default.
- W3159667783 cites W2017380219 @default.
- W3159667783 cites W2048687352 @default.
- W3159667783 cites W2063054322 @default.
- W3159667783 cites W2073990179 @default.
- W3159667783 cites W2079247031 @default.
- W3159667783 cites W2085194340 @default.
- W3159667783 cites W2090771768 @default.
- W3159667783 cites W2093831009 @default.
- W3159667783 cites W2099683979 @default.
- W3159667783 cites W2106813881 @default.
- W3159667783 cites W2117608991 @default.
- W3159667783 cites W2120624705 @default.
- W3159667783 cites W2126565096 @default.
- W3159667783 cites W2133639509 @default.
- W3159667783 cites W2139044388 @default.
- W3159667783 cites W2141559645 @default.
- W3159667783 cites W2145339207 @default.
- W3159667783 cites W2165501837 @default.
- W3159667783 cites W2166310857 @default.
- W3159667783 cites W2183137222 @default.
- W3159667783 cites W2188644438 @default.
- W3159667783 cites W2245501338 @default.
- W3159667783 cites W2257979135 @default.
- W3159667783 cites W2314983263 @default.
- W3159667783 cites W2424242071 @default.
- W3159667783 cites W2471234362 @default.
- W3159667783 cites W2490084223 @default.
- W3159667783 cites W2490234001 @default.
- W3159667783 cites W2538885760 @default.
- W3159667783 cites W2572615266 @default.
- W3159667783 cites W2605603065 @default.
- W3159667783 cites W2700163520 @default.
- W3159667783 cites W2766013052 @default.
- W3159667783 cites W2766447205 @default.
- W3159667783 cites W2767307332 @default.
- W3159667783 cites W2775355371 @default.
- W3159667783 cites W2788331089 @default.
- W3159667783 cites W2802164917 @default.
- W3159667783 cites W2884730950 @default.
- W3159667783 cites W2909711564 @default.
- W3159667783 cites W2921163467 @default.
- W3159667783 cites W2964048876 @default.
- W3159667783 cites W2968028487 @default.
- W3159667783 cites W3103456419 @default.
- W3159667783 cites W4211253905 @default.
- W3159667783 cites W4236611057 @default.
- W3159667783 cites W814391925 @default.
- W3159667783 doi "https://doi.org/10.1109/tnnls.2021.3071727" @default.
- W3159667783 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/33956634" @default.
- W3159667783 hasPublicationYear "2022" @default.
- W3159667783 type Work @default.
- W3159667783 sameAs 3159667783 @default.
- W3159667783 citedByCount "16" @default.
- W3159667783 countsByYear W31596677832021 @default.
- W3159667783 countsByYear W31596677832022 @default.
- W3159667783 countsByYear W31596677832023 @default.
- W3159667783 crossrefType "journal-article" @default.
- W3159667783 hasAuthorship W3159667783A5006374930 @default.
- W3159667783 hasAuthorship W3159667783A5022922423 @default.
- W3159667783 hasAuthorship W3159667783A5036072216 @default.
- W3159667783 hasAuthorship W3159667783A5036489015 @default.
- W3159667783 hasAuthorship W3159667783A5085653399 @default.
- W3159667783 hasBestOaLocation W31596677832 @default.
- W3159667783 hasConcept C112972136 @default.
- W3159667783 hasConcept C11413529 @default.
- W3159667783 hasConcept C119599485 @default.
- W3159667783 hasConcept C119857082 @default.
- W3159667783 hasConcept C126255220 @default.
- W3159667783 hasConcept C127413603 @default.
- W3159667783 hasConcept C133731056 @default.
- W3159667783 hasConcept C136764020 @default.
- W3159667783 hasConcept C14036430 @default.
- W3159667783 hasConcept C14646407 @default.
- W3159667783 hasConcept C154945302 @default.
- W3159667783 hasConcept C162324750 @default.
- W3159667783 hasConcept C17500928 @default.
- W3159667783 hasConcept C203479927 @default.
- W3159667783 hasConcept C2775924081 @default.