Matches in SemOpenAlex for { <https://semopenalex.org/work/W2002260889> ?p ?o ?g. }
- W2002260889 endingPage "390" @default.
- W2002260889 startingPage "377" @default.
- W2002260889 abstract "In this paper, reinforcement learning state- and output-feedback-based adaptive critic controller designs are proposed by using online approximators (OLAs) for general multi-input and multi-output affine unknown nonlinear discrete-time systems in the presence of bounded disturbances. The proposed controller design has two entities: an action network that is designed to produce an optimal signal and a critic network that evaluates the performance of the action network. The critic estimates the cost-to-go function, which is tuned online using recursive equations derived from heuristic dynamic programming. Here, neural networks (NNs) are used for both the action and critic networks, although any OLAs, such as radial basis functions, splines, fuzzy logic, etc., can be utilized. For the output-feedback counterpart, an additional NN is designated as the observer to estimate the unavailable system states, and thus, the separation principle is not required. The NN weight tuning laws for the controller schemes are also derived while ensuring uniform ultimate boundedness of the closed-loop system using Lyapunov theory. Finally, the effectiveness of the two controllers is tested in simulation on a pendulum balancing system and a two-link robotic arm system." @default.
- W2002260889 created "2016-06-24" @default.
- W2002260889 creator A5062534500 @default.
- W2002260889 creator A5078910343 @default.
- W2002260889 date "2012-04-01" @default.
- W2002260889 modified "2023-10-02" @default.
- W2002260889 title "Reinforcement Learning Controller Design for Affine Nonlinear Discrete-Time Systems using Online Approximators" @default.
- W2002260889 cites W1557517019 @default.
- W2002260889 cites W1969166509 @default.
- W2002260889 cites W1986278072 @default.
- W2002260889 cites W1995622844 @default.
- W2002260889 cites W2005229381 @default.
- W2002260889 cites W2049063292 @default.
- W2002260889 cites W2059147929 @default.
- W2002260889 cites W2077195478 @default.
- W2002260889 cites W2091565802 @default.
- W2002260889 cites W2093831009 @default.
- W2002260889 cites W2104346286 @default.
- W2002260889 cites W2115249980 @default.
- W2002260889 cites W2131398727 @default.
- W2002260889 cites W2136064843 @default.
- W2002260889 cites W2145830976 @default.
- W2002260889 cites W2160561608 @default.
- W2002260889 cites W2165501837 @default.
- W2002260889 cites W3041202696 @default.
- W2002260889 cites W4230466265 @default.
- W2002260889 cites W4298300677 @default.
- W2002260889 doi "https://doi.org/10.1109/tsmcb.2011.2166384" @default.
- W2002260889 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/21947529" @default.
- W2002260889 hasPublicationYear "2012" @default.
- W2002260889 type Work @default.
- W2002260889 sameAs 2002260889 @default.
- W2002260889 citedByCount "151" @default.
- W2002260889 countsByYear W20022608892012 @default.
- W2002260889 countsByYear W20022608892013 @default.
- W2002260889 countsByYear W20022608892014 @default.
- W2002260889 countsByYear W20022608892015 @default.
- W2002260889 countsByYear W20022608892016 @default.
- W2002260889 countsByYear W20022608892017 @default.
- W2002260889 countsByYear W20022608892018 @default.
- W2002260889 countsByYear W20022608892019 @default.
- W2002260889 countsByYear W20022608892020 @default.
- W2002260889 countsByYear W20022608892021 @default.
- W2002260889 countsByYear W20022608892022 @default.
- W2002260889 countsByYear W20022608892023 @default.
- W2002260889 crossrefType "journal-article" @default.
- W2002260889 hasAuthorship W2002260889A5062534500 @default.
- W2002260889 hasAuthorship W2002260889A5078910343 @default.
- W2002260889 hasConcept C121332964 @default.
- W2002260889 hasConcept C126255220 @default.
- W2002260889 hasConcept C127413603 @default.
- W2002260889 hasConcept C133731056 @default.
- W2002260889 hasConcept C154945302 @default.
- W2002260889 hasConcept C158622935 @default.
- W2002260889 hasConcept C192921069 @default.
- W2002260889 hasConcept C203479927 @default.
- W2002260889 hasConcept C2775924081 @default.
- W2002260889 hasConcept C2780704645 @default.
- W2002260889 hasConcept C33923547 @default.
- W2002260889 hasConcept C41008148 @default.
- W2002260889 hasConcept C47446073 @default.
- W2002260889 hasConcept C50644808 @default.
- W2002260889 hasConcept C60640748 @default.
- W2002260889 hasConcept C62520636 @default.
- W2002260889 hasConcept C6557445 @default.
- W2002260889 hasConcept C86803240 @default.
- W2002260889 hasConcept C91575142 @default.
- W2002260889 hasConcept C97541855 @default.
- W2002260889 hasConceptScore W2002260889C121332964 @default.
- W2002260889 hasConceptScore W2002260889C126255220 @default.
- W2002260889 hasConceptScore W2002260889C127413603 @default.
- W2002260889 hasConceptScore W2002260889C133731056 @default.
- W2002260889 hasConceptScore W2002260889C154945302 @default.
- W2002260889 hasConceptScore W2002260889C158622935 @default.
- W2002260889 hasConceptScore W2002260889C192921069 @default.
- W2002260889 hasConceptScore W2002260889C203479927 @default.
- W2002260889 hasConceptScore W2002260889C2775924081 @default.
- W2002260889 hasConceptScore W2002260889C2780704645 @default.
- W2002260889 hasConceptScore W2002260889C33923547 @default.
- W2002260889 hasConceptScore W2002260889C41008148 @default.
- W2002260889 hasConceptScore W2002260889C47446073 @default.
- W2002260889 hasConceptScore W2002260889C50644808 @default.
- W2002260889 hasConceptScore W2002260889C60640748 @default.
- W2002260889 hasConceptScore W2002260889C62520636 @default.
- W2002260889 hasConceptScore W2002260889C6557445 @default.
- W2002260889 hasConceptScore W2002260889C86803240 @default.
- W2002260889 hasConceptScore W2002260889C91575142 @default.
- W2002260889 hasConceptScore W2002260889C97541855 @default.
- W2002260889 hasIssue "2" @default.
- W2002260889 hasLocation W20022608891 @default.
- W2002260889 hasLocation W20022608892 @default.
- W2002260889 hasOpenAccess W2002260889 @default.
- W2002260889 hasPrimaryLocation W20022608891 @default.
- W2002260889 hasRelatedWork W1968045209 @default.
- W2002260889 hasRelatedWork W1991899812 @default.
- W2002260889 hasRelatedWork W2150220609 @default.
- W2002260889 hasRelatedWork W2158581961 @default.
- W2002260889 hasRelatedWork W2365785480 @default.