Matches in SemOpenAlex for { <https://semopenalex.org/work/W2051269406> ?p ?o ?g. }
Showing items 1 to 86 of
86
with 100 items per page.
- W2051269406 endingPage "28" @default.
- W2051269406 startingPage "3" @default.
- W2051269406 abstract "An application of reinforcement learning to a linear-quadratic, differential game is presented. The reinforcement learning system uses a recently developed algorithm, the residual-gradient form of advantage updating. The game is a Markov decision process with continuous time, states, and actions, linear dynamics, and a quadratic cost function. The game consists of two players, a missile and a plane; the missile pursues the plane and the plane evades the missile. Although a missile and plane scenario was the chosen test bed, the reinforcement learning approach presented here is equally applicable to biologically based systems, such as a predator pursuing prey. The reinforcement learning algorithm for optimal control is modified for differential games to find the minimax point rather than the maximum. Simulation results are compared to the analytical solution, demonstrating that the simulated reinforcement learning system converges to the optimal answer. The performance of both the residual-gradient and non-residual-gradient forms of advantage updating and Q-learning are compared, demonstrating that advantage updating converges faster than Q-learning in all simulations. Advantage updating also is demonstrated to converge regardless of the time step duration; Q-learning is unable to converge as the time step duration grows small." @default.
- W2051269406 created "2016-06-24" @default.
- W2051269406 creator A5025439737 @default.
- W2051269406 creator A5038077029 @default.
- W2051269406 creator A5059725138 @default.
- W2051269406 date "1995-09-01" @default.
- W2051269406 modified "2023-10-03" @default.
- W2051269406 title "Reinforcement Learning Applied to a Differential Game" @default.
- W2051269406 cites W1498436455 @default.
- W2051269406 cites W1621423688 @default.
- W2051269406 cites W2037584050 @default.
- W2051269406 cites W2119717200 @default.
- W2051269406 cites W2143867281 @default.
- W2051269406 doi "https://doi.org/10.1177/105971239500400102" @default.
- W2051269406 hasPublicationYear "1995" @default.
- W2051269406 type Work @default.
- W2051269406 sameAs 2051269406 @default.
- W2051269406 citedByCount "37" @default.
- W2051269406 countsByYear W20512694062012 @default.
- W2051269406 countsByYear W20512694062013 @default.
- W2051269406 countsByYear W20512694062014 @default.
- W2051269406 countsByYear W20512694062015 @default.
- W2051269406 countsByYear W20512694062016 @default.
- W2051269406 countsByYear W20512694062018 @default.
- W2051269406 countsByYear W20512694062019 @default.
- W2051269406 countsByYear W20512694062020 @default.
- W2051269406 countsByYear W20512694062021 @default.
- W2051269406 countsByYear W20512694062022 @default.
- W2051269406 countsByYear W20512694062023 @default.
- W2051269406 crossrefType "journal-article" @default.
- W2051269406 hasAuthorship W2051269406A5025439737 @default.
- W2051269406 hasAuthorship W2051269406A5038077029 @default.
- W2051269406 hasAuthorship W2051269406A5059725138 @default.
- W2051269406 hasConcept C105795698 @default.
- W2051269406 hasConcept C106189395 @default.
- W2051269406 hasConcept C11413529 @default.
- W2051269406 hasConcept C126255220 @default.
- W2051269406 hasConcept C127413603 @default.
- W2051269406 hasConcept C146978453 @default.
- W2051269406 hasConcept C149728462 @default.
- W2051269406 hasConcept C154945302 @default.
- W2051269406 hasConcept C155512373 @default.
- W2051269406 hasConcept C159886148 @default.
- W2051269406 hasConcept C188116033 @default.
- W2051269406 hasConcept C2778857364 @default.
- W2051269406 hasConcept C2779006483 @default.
- W2051269406 hasConcept C33923547 @default.
- W2051269406 hasConcept C41008148 @default.
- W2051269406 hasConcept C97541855 @default.
- W2051269406 hasConceptScore W2051269406C105795698 @default.
- W2051269406 hasConceptScore W2051269406C106189395 @default.
- W2051269406 hasConceptScore W2051269406C11413529 @default.
- W2051269406 hasConceptScore W2051269406C126255220 @default.
- W2051269406 hasConceptScore W2051269406C127413603 @default.
- W2051269406 hasConceptScore W2051269406C146978453 @default.
- W2051269406 hasConceptScore W2051269406C149728462 @default.
- W2051269406 hasConceptScore W2051269406C154945302 @default.
- W2051269406 hasConceptScore W2051269406C155512373 @default.
- W2051269406 hasConceptScore W2051269406C159886148 @default.
- W2051269406 hasConceptScore W2051269406C188116033 @default.
- W2051269406 hasConceptScore W2051269406C2778857364 @default.
- W2051269406 hasConceptScore W2051269406C2779006483 @default.
- W2051269406 hasConceptScore W2051269406C33923547 @default.
- W2051269406 hasConceptScore W2051269406C41008148 @default.
- W2051269406 hasConceptScore W2051269406C97541855 @default.
- W2051269406 hasIssue "1" @default.
- W2051269406 hasLocation W20512694061 @default.
- W2051269406 hasOpenAccess W2051269406 @default.
- W2051269406 hasPrimaryLocation W20512694061 @default.
- W2051269406 hasRelatedWork W1511927616 @default.
- W2051269406 hasRelatedWork W1556532828 @default.
- W2051269406 hasRelatedWork W1574991376 @default.
- W2051269406 hasRelatedWork W1968481698 @default.
- W2051269406 hasRelatedWork W2051269406 @default.
- W2051269406 hasRelatedWork W2146763310 @default.
- W2051269406 hasRelatedWork W2182304831 @default.
- W2051269406 hasRelatedWork W2937181779 @default.
- W2051269406 hasRelatedWork W3167472281 @default.
- W2051269406 hasRelatedWork W4382935469 @default.
- W2051269406 hasVolume "4" @default.
- W2051269406 isParatext "false" @default.
- W2051269406 isRetracted "false" @default.
- W2051269406 magId "2051269406" @default.
- W2051269406 workType "article" @default.