Matches in SemOpenAlex for { <https://semopenalex.org/work/W3021997250> ?p ?o ?g. }
- W3021997250 abstract "In order to obviate the requirement of drift dynamics in adaptive dynamic programming (ADP), integral reinforcement learning (IRL) has been proposed as an alternate formulation of Bellman equation.However control coupling dynamics is still needed to obtain closed form expression of optimal control effort. In addition to this, initial stabilizing controller and two sets of neural networks (NN) (known as Actor-Critic) are required to implement IRL scheme. In this paper, a stabilizing term in the critic update law is leveraged to avoid the requirement of an initial stabilizing controller in IRL framework to solve optimal tracking problem with actuator constraints. With such a term, only one NN is needed to generate optimal control policies in IRL framework. This critic network is coupled with an experience replay (ER) enhanced identifier to obviate the necessity of control coupling dynamics in IRL algorithm. The weights of both identifier and critic NNs are simultaneously updated and it is shown that the ER-enhanced identifier is able to handle parametric variations better than without ER enhancement. The most salient feature of the novel update law is its variable learning rate, which scales the pace of learning based on instantaneous Hamilton-Jacobi-Bellman (HJB) error. Variable learning rate in critic NN coupled with ER technique in identifier NN help in achieving tighter residual set for state error and error in NN weights as shown in uniform ultimate boundedness (UUB) stability proof. The simulation results validate the presented identifier-critic NN on a nonlinear system." @default.
- W3021997250 created "2020-05-13" @default.
- W3021997250 creator A5039724691 @default.
- W3021997250 creator A5056304289 @default.
- W3021997250 date "2020-02-26" @default.
- W3021997250 modified "2023-09-27" @default.
- W3021997250 title "Simultaneous Identification and Optimal Tracking Control of Unknown Continuous Time Nonlinear System With Actuator Constraints Using Critic-Only Integral Reinforcement Learning" @default.
- W3021997250 cites W1527736490 @default.
- W3021997250 cites W1595516097 @default.
- W3021997250 cites W1606119439 @default.
- W3021997250 cites W1614417283 @default.
- W3021997250 cites W1968908471 @default.
- W3021997250 cites W1977237536 @default.
- W3021997250 cites W1983523797 @default.
- W3021997250 cites W2010152647 @default.
- W3021997250 cites W2013895638 @default.
- W3021997250 cites W2018160758 @default.
- W3021997250 cites W2024303516 @default.
- W3021997250 cites W2035003264 @default.
- W3021997250 cites W2037751122 @default.
- W3021997250 cites W2052305027 @default.
- W3021997250 cites W2057059845 @default.
- W3021997250 cites W2085194340 @default.
- W3021997250 cites W2093090294 @default.
- W3021997250 cites W2104843094 @default.
- W3021997250 cites W2108286682 @default.
- W3021997250 cites W2132468772 @default.
- W3021997250 cites W2161130209 @default.
- W3021997250 cites W2296530621 @default.
- W3021997250 cites W2489401300 @default.
- W3021997250 cites W2593469813 @default.
- W3021997250 cites W2755725247 @default.
- W3021997250 cites W2944972048 @default.
- W3021997250 cites W2988848600 @default.
- W3021997250 cites W3002463821 @default.
- W3021997250 cites W3003749591 @default.
- W3021997250 cites W3010475083 @default.
- W3021997250 cites W3021332258 @default.
- W3021997250 hasPublicationYear "2020" @default.
- W3021997250 type Work @default.
- W3021997250 sameAs 3021997250 @default.
- W3021997250 citedByCount "0" @default.
- W3021997250 crossrefType "posted-content" @default.
- W3021997250 hasAuthorship W3021997250A5039724691 @default.
- W3021997250 hasAuthorship W3021997250A5056304289 @default.
- W3021997250 hasConcept C11413529 @default.
- W3021997250 hasConcept C121332964 @default.
- W3021997250 hasConcept C126255220 @default.
- W3021997250 hasConcept C154504017 @default.
- W3021997250 hasConcept C154945302 @default.
- W3021997250 hasConcept C158622935 @default.
- W3021997250 hasConcept C183356978 @default.
- W3021997250 hasConcept C196978813 @default.
- W3021997250 hasConcept C199360897 @default.
- W3021997250 hasConcept C203479927 @default.
- W3021997250 hasConcept C2775924081 @default.
- W3021997250 hasConcept C33923547 @default.
- W3021997250 hasConcept C37404715 @default.
- W3021997250 hasConcept C41008148 @default.
- W3021997250 hasConcept C47446073 @default.
- W3021997250 hasConcept C50644808 @default.
- W3021997250 hasConcept C62520636 @default.
- W3021997250 hasConcept C6557445 @default.
- W3021997250 hasConcept C86803240 @default.
- W3021997250 hasConcept C91575142 @default.
- W3021997250 hasConcept C97541855 @default.
- W3021997250 hasConceptScore W3021997250C11413529 @default.
- W3021997250 hasConceptScore W3021997250C121332964 @default.
- W3021997250 hasConceptScore W3021997250C126255220 @default.
- W3021997250 hasConceptScore W3021997250C154504017 @default.
- W3021997250 hasConceptScore W3021997250C154945302 @default.
- W3021997250 hasConceptScore W3021997250C158622935 @default.
- W3021997250 hasConceptScore W3021997250C183356978 @default.
- W3021997250 hasConceptScore W3021997250C196978813 @default.
- W3021997250 hasConceptScore W3021997250C199360897 @default.
- W3021997250 hasConceptScore W3021997250C203479927 @default.
- W3021997250 hasConceptScore W3021997250C2775924081 @default.
- W3021997250 hasConceptScore W3021997250C33923547 @default.
- W3021997250 hasConceptScore W3021997250C37404715 @default.
- W3021997250 hasConceptScore W3021997250C41008148 @default.
- W3021997250 hasConceptScore W3021997250C47446073 @default.
- W3021997250 hasConceptScore W3021997250C50644808 @default.
- W3021997250 hasConceptScore W3021997250C62520636 @default.
- W3021997250 hasConceptScore W3021997250C6557445 @default.
- W3021997250 hasConceptScore W3021997250C86803240 @default.
- W3021997250 hasConceptScore W3021997250C91575142 @default.
- W3021997250 hasConceptScore W3021997250C97541855 @default.
- W3021997250 hasLocation W30219972501 @default.
- W3021997250 hasOpenAccess W3021997250 @default.
- W3021997250 hasPrimaryLocation W30219972501 @default.
- W3021997250 hasRelatedWork W1527736490 @default.
- W3021997250 hasRelatedWork W1599132769 @default.
- W3021997250 hasRelatedWork W1673034656 @default.
- W3021997250 hasRelatedWork W1969959431 @default.
- W3021997250 hasRelatedWork W1977237536 @default.
- W3021997250 hasRelatedWork W1978732706 @default.
- W3021997250 hasRelatedWork W1991119064 @default.
- W3021997250 hasRelatedWork W2083748997 @default.
- W3021997250 hasRelatedWork W2091130426 @default.
- W3021997250 hasRelatedWork W2343875079 @default.