Matches in SemOpenAlex for { <https://semopenalex.org/work/W3112009994> ?p ?o ?g. }
Showing items 1 to 94 of
94
with 100 items per page.
- W3112009994 endingPage "223755" @default.
- W3112009994 startingPage "223743" @default.
- W3112009994 abstract "Solving the Nash equilibrium is important for multi-agent game systems, and the speed of reaching Nash equilibrium is critical for the agent to quickly make real-time decisions. A typical scheme is the model-free reinforcement learning algorithm based on policy iteration, which is slow because each iteration will be calculated from the start state to the end state. In this paper, we propose a faster scheme based on value iteration, using Q-function in an online manner to solve the Nash equilibrium of the system. Since the calculation is based on the value from the last iteration, the convergence speed of the proposed scheme is much faster than the policy iteration. The rationality and convergence of this scheme are analyzed and proved theoretically. An actor-critic network structure is used to implement this scheme through simulation. The simulation results show that the convergence speed of our proposed scheme is about 10 times faster than that of the policy iteration algorithm." @default.
- W3112009994 created "2020-12-21" @default.
- W3112009994 creator A5027001173 @default.
- W3112009994 creator A5046339692 @default.
- W3112009994 date "2020-01-01" @default.
- W3112009994 modified "2023-10-05" @default.
- W3112009994 title "An Enhanced Model-Free Reinforcement Learning Algorithm to Solve Nash Equilibrium for Multi-Agent Cooperative Game Systems" @default.
- W3112009994 cites W1520347797 @default.
- W3112009994 cites W1981826826 @default.
- W3112009994 cites W1983523797 @default.
- W3112009994 cites W2026008276 @default.
- W3112009994 cites W2039354440 @default.
- W3112009994 cites W2086977346 @default.
- W3112009994 cites W2099618002 @default.
- W3112009994 cites W2108383324 @default.
- W3112009994 cites W2112197455 @default.
- W3112009994 cites W2114682338 @default.
- W3112009994 cites W2132858840 @default.
- W3112009994 cites W2149938689 @default.
- W3112009994 cites W2320262435 @default.
- W3112009994 cites W2338351427 @default.
- W3112009994 cites W2475651303 @default.
- W3112009994 cites W2484646121 @default.
- W3112009994 cites W2611994503 @default.
- W3112009994 cites W2897822515 @default.
- W3112009994 cites W2908445768 @default.
- W3112009994 cites W2909756869 @default.
- W3112009994 cites W2921869787 @default.
- W3112009994 cites W2980297462 @default.
- W3112009994 cites W2986211651 @default.
- W3112009994 cites W2998579696 @default.
- W3112009994 doi "https://doi.org/10.1109/access.2020.3043806" @default.
- W3112009994 hasPublicationYear "2020" @default.
- W3112009994 type Work @default.
- W3112009994 sameAs 3112009994 @default.
- W3112009994 citedByCount "2" @default.
- W3112009994 countsByYear W31120099942023 @default.
- W3112009994 crossrefType "journal-article" @default.
- W3112009994 hasAuthorship W3112009994A5027001173 @default.
- W3112009994 hasAuthorship W3112009994A5046339692 @default.
- W3112009994 hasBestOaLocation W31120099941 @default.
- W3112009994 hasConcept C11413529 @default.
- W3112009994 hasConcept C126255220 @default.
- W3112009994 hasConcept C134306372 @default.
- W3112009994 hasConcept C144237770 @default.
- W3112009994 hasConcept C14646407 @default.
- W3112009994 hasConcept C154945302 @default.
- W3112009994 hasConcept C162324750 @default.
- W3112009994 hasConcept C177142836 @default.
- W3112009994 hasConcept C2777303404 @default.
- W3112009994 hasConcept C32407928 @default.
- W3112009994 hasConcept C33923547 @default.
- W3112009994 hasConcept C41008148 @default.
- W3112009994 hasConcept C46814582 @default.
- W3112009994 hasConcept C50522688 @default.
- W3112009994 hasConcept C77618280 @default.
- W3112009994 hasConcept C97541855 @default.
- W3112009994 hasConceptScore W3112009994C11413529 @default.
- W3112009994 hasConceptScore W3112009994C126255220 @default.
- W3112009994 hasConceptScore W3112009994C134306372 @default.
- W3112009994 hasConceptScore W3112009994C144237770 @default.
- W3112009994 hasConceptScore W3112009994C14646407 @default.
- W3112009994 hasConceptScore W3112009994C154945302 @default.
- W3112009994 hasConceptScore W3112009994C162324750 @default.
- W3112009994 hasConceptScore W3112009994C177142836 @default.
- W3112009994 hasConceptScore W3112009994C2777303404 @default.
- W3112009994 hasConceptScore W3112009994C32407928 @default.
- W3112009994 hasConceptScore W3112009994C33923547 @default.
- W3112009994 hasConceptScore W3112009994C41008148 @default.
- W3112009994 hasConceptScore W3112009994C46814582 @default.
- W3112009994 hasConceptScore W3112009994C50522688 @default.
- W3112009994 hasConceptScore W3112009994C77618280 @default.
- W3112009994 hasConceptScore W3112009994C97541855 @default.
- W3112009994 hasFunder F4320321001 @default.
- W3112009994 hasLocation W31120099941 @default.
- W3112009994 hasOpenAccess W3112009994 @default.
- W3112009994 hasPrimaryLocation W31120099941 @default.
- W3112009994 hasRelatedWork W1853631319 @default.
- W3112009994 hasRelatedWork W1966224968 @default.
- W3112009994 hasRelatedWork W2142650434 @default.
- W3112009994 hasRelatedWork W2154480527 @default.
- W3112009994 hasRelatedWork W2156232164 @default.
- W3112009994 hasRelatedWork W2787184676 @default.
- W3112009994 hasRelatedWork W2888456894 @default.
- W3112009994 hasRelatedWork W3112009994 @default.
- W3112009994 hasRelatedWork W4214835929 @default.
- W3112009994 hasRelatedWork W4245873547 @default.
- W3112009994 hasVolume "8" @default.
- W3112009994 isParatext "false" @default.
- W3112009994 isRetracted "false" @default.
- W3112009994 magId "3112009994" @default.
- W3112009994 workType "article" @default.