Matches in SemOpenAlex for { <https://semopenalex.org/work/W4285115083> ?p ?o ?g. }
- W4285115083 endingPage "67604" @default.
- W4285115083 startingPage "67590" @default.
- W4285115083 abstract "Reinforcement learning (RL) has been successfully applied to motion control without requiring accurate models or manual selection of control parameters. In this paper, we propose a novel RL algorithm for quadrotor attitude control, based on the proximal policy optimization algorithm with dimension-wise clipping (PPO-DWC). First, a dimension-wise clipping technique is introduced to solve the zero-gradient problem of the PPO algorithm; it allows the algorithm to converge quickly while maintaining good sampling efficiency, thus improving control performance. Moreover, following the idea of a stability augmentation system (SAS), a feedback controller is designed and integrated into the environment before training the PPO controller, to avoid ineffective exploration and improve the system's convergence. The final controller consists of two parts: the first is the output of the actor neural network in the PPO algorithm, and the second is the output of the stability augmentation feedback controller. Both map the system state directly to control commands in an end-to-end style. This control architecture is applied to the attitude control of the quadrotor. The simulation results show that, after training with the improved PPO algorithm, the quadrotor tracks the command quickly and accurately and has a small steady-state error. Compared with the traditional PID controller and the basic PPO algorithm, the proposed PPO-DWC algorithm with the stability augmentation framework achieves better tracking accuracy and robustness." @default.
- W4285115083 created "2022-07-14" @default.
- W4285115083 creator A5037996388 @default.
- W4285115083 creator A5039490048 @default.
- W4285115083 creator A5055245926 @default.
- W4285115083 creator A5067068994 @default.
- W4285115083 date "2022-01-01" @default.
- W4285115083 modified "2023-10-10" @default.
- W4285115083 title "Improved Reinforcement Learning Using Stability Augmentation With Application to Quadrotor Attitude Control" @default.
- W4285115083 cites W1669416386 @default.
- W4285115083 cites W1923344279 @default.
- W4285115083 cites W2145339207 @default.
- W4285115083 cites W2158025740 @default.
- W4285115083 cites W2586878774 @default.
- W4285115083 cites W2754242591 @default.
- W4285115083 cites W2810217655 @default.
- W4285115083 cites W2885871221 @default.
- W4285115083 cites W2890755534 @default.
- W4285115083 cites W2907214660 @default.
- W4285115083 cites W2907877674 @default.
- W4285115083 cites W2962374310 @default.
- W4285115083 cites W2962890638 @default.
- W4285115083 cites W2985707469 @default.
- W4285115083 cites W2992874519 @default.
- W4285115083 cites W3004045632 @default.
- W4285115083 cites W3008634534 @default.
- W4285115083 cites W3039793486 @default.
- W4285115083 cites W3041678225 @default.
- W4285115083 cites W3084522584 @default.
- W4285115083 cites W3089914711 @default.
- W4285115083 cites W3096632704 @default.
- W4285115083 cites W3106462682 @default.
- W4285115083 cites W3110884888 @default.
- W4285115083 cites W3111262864 @default.
- W4285115083 cites W3123524846 @default.
- W4285115083 cites W3133825289 @default.
- W4285115083 cites W3136545240 @default.
- W4285115083 cites W3156138155 @default.
- W4285115083 cites W3157046108 @default.
- W4285115083 cites W3203218610 @default.
- W4285115083 cites W3206369550 @default.
- W4285115083 cites W4200220124 @default.
- W4285115083 doi "https://doi.org/10.1109/access.2022.3185424" @default.
- W4285115083 hasPublicationYear "2022" @default.
- W4285115083 type Work @default.
- W4285115083 citedByCount "4" @default.
- W4285115083 countsByYear W42851150832022 @default.
- W4285115083 countsByYear W42851150832023 @default.
- W4285115083 crossrefType "journal-article" @default.
- W4285115083 hasAuthorship W4285115083A5037996388 @default.
- W4285115083 hasAuthorship W4285115083A5039490048 @default.
- W4285115083 hasAuthorship W4285115083A5055245926 @default.
- W4285115083 hasAuthorship W4285115083A5067068994 @default.
- W4285115083 hasBestOaLocation W42851150831 @default.
- W4285115083 hasConcept C104317684 @default.
- W4285115083 hasConcept C112972136 @default.
- W4285115083 hasConcept C119857082 @default.
- W4285115083 hasConcept C127413603 @default.
- W4285115083 hasConcept C133731056 @default.
- W4285115083 hasConcept C138885662 @default.
- W4285115083 hasConcept C154945302 @default.
- W4285115083 hasConcept C185592680 @default.
- W4285115083 hasConcept C203479927 @default.
- W4285115083 hasConcept C2775924081 @default.
- W4285115083 hasConcept C2776848632 @default.
- W4285115083 hasConcept C41008148 @default.
- W4285115083 hasConcept C41895202 @default.
- W4285115083 hasConcept C47116090 @default.
- W4285115083 hasConcept C47446073 @default.
- W4285115083 hasConcept C536315585 @default.
- W4285115083 hasConcept C55493867 @default.
- W4285115083 hasConcept C63479239 @default.
- W4285115083 hasConcept C6557445 @default.
- W4285115083 hasConcept C86803240 @default.
- W4285115083 hasConcept C97541855 @default.
- W4285115083 hasConceptScore W4285115083C104317684 @default.
- W4285115083 hasConceptScore W4285115083C112972136 @default.
- W4285115083 hasConceptScore W4285115083C119857082 @default.
- W4285115083 hasConceptScore W4285115083C127413603 @default.
- W4285115083 hasConceptScore W4285115083C133731056 @default.
- W4285115083 hasConceptScore W4285115083C138885662 @default.
- W4285115083 hasConceptScore W4285115083C154945302 @default.
- W4285115083 hasConceptScore W4285115083C185592680 @default.
- W4285115083 hasConceptScore W4285115083C203479927 @default.
- W4285115083 hasConceptScore W4285115083C2775924081 @default.
- W4285115083 hasConceptScore W4285115083C2776848632 @default.
- W4285115083 hasConceptScore W4285115083C41008148 @default.
- W4285115083 hasConceptScore W4285115083C41895202 @default.
- W4285115083 hasConceptScore W4285115083C47116090 @default.
- W4285115083 hasConceptScore W4285115083C47446073 @default.
- W4285115083 hasConceptScore W4285115083C536315585 @default.
- W4285115083 hasConceptScore W4285115083C55493867 @default.
- W4285115083 hasConceptScore W4285115083C63479239 @default.
- W4285115083 hasConceptScore W4285115083C6557445 @default.
- W4285115083 hasConceptScore W4285115083C86803240 @default.
- W4285115083 hasConceptScore W4285115083C97541855 @default.
- W4285115083 hasFunder F4320321001 @default.
- W4285115083 hasFunder F4320335440 @default.
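
Each match above follows one pattern: a leading dash, then subject, predicate, object, and the graph name written as `@default.`. As a minimal sketch, assuming that layout holds for every line (the names `Quad`, `LINE_RE`, and `parse_line` are hypothetical helpers for illustration, not part of SemOpenAlex), the listing could be parsed into tuples like this:

```python
import re
from typing import NamedTuple, Optional


class Quad(NamedTuple):
    """One row of the listing (hypothetical container for illustration)."""
    subject: str
    predicate: str
    obj: str
    graph: str
    quoted: bool  # True when the object was a quoted literal


# Assumed line shape: - <subject> <predicate> <object> @<graph>.
# The object is either a double-quoted literal (may contain spaces)
# or a single bare token such as an identifier.
LINE_RE = re.compile(
    r'^-\s+(?P<s>\S+)\s+(?P<p>\S+)\s+(?P<o>".*"|\S+)\s+@(?P<g>\S+?)\.$'
)


def parse_line(line: str) -> Optional[Quad]:
    """Return a Quad for a listing row, or None for any other line."""
    m = LINE_RE.match(line.strip())
    if m is None:
        return None
    obj = m.group("o")
    quoted = obj.startswith('"')
    if quoted:
        obj = obj[1:-1]  # drop the surrounding double quotes
    return Quad(m.group("s"), m.group("p"), obj, m.group("g"), quoted)


if __name__ == "__main__":
    sample = [
        "Matches in SemOpenAlex for { <https://semopenalex.org/work/W4285115083> ?p ?o ?g. }",
        '- W4285115083 startingPage "67590" @default.',
        "- W4285115083 cites W1669416386 @default.",
        "- W4285115083 type Work @default.",
    ]
    for row in sample:
        print(parse_line(row))
```

The header line does not start with a dash, so `parse_line` returns `None` for it. Keeping the `quoted` flag separates literals such as page numbers from identifiers such as cited works.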