Matches in SemOpenAlex for { <https://semopenalex.org/work/W2999963940> ?p ?o ?g. }
Showing items 1 to 96 of
96
with 100 items per page.
- W2999963940 endingPage "5727" @default.
- W2999963940 startingPage "5717" @default.
- W2999963940 abstract "An imposing task for a reinforcement learning agent in an uncertain environment is to expeditiously learn a policy or a sequence of actions, with which it can achieve the desired goal. In this article, we present an incremental model learning scheme to reconstruct the model of a stochastic environment. In the proposed learning scheme, we introduce a clustering algorithm to assimilate the model information and estimate the probability for each state transition. In addition, utilizing the reconstructed model, we present an experience replay strategy to create virtual interactive experiences by incorporating a balance between exploration and exploitation, which greatly accelerates learning and enables planning. Furthermore, we extend the proposed learning scheme for a multiagent framework to decrease the effort required for exploration and to reduce the learning time in a large environment. In this multiagent framework, we introduce a knowledge-sharing algorithm to share the reconstructed model information among the different agents, as needed, and develop a computationally efficient knowledge fusing mechanism to fuse the knowledge acquired using the agents’ own experience with the knowledge received from its teammates. Finally, the simulation results with comparative analysis are provided to demonstrate the efficacy of the proposed methods in the complex learning tasks." @default.
- W2999963940 created "2020-01-23" @default.
- W2999963940 creator A5001081733 @default.
- W2999963940 creator A5019813462 @default.
- W2999963940 creator A5079314465 @default.
- W2999963940 date "2021-12-01" @default.
- W2999963940 modified "2023-10-15" @default.
- W2999963940 title "Model Learning and Knowledge Sharing for Cooperative Multiagent Systems in Stochastic Environment" @default.
- W2999963940 cites W1977671496 @default.
- W2999963940 cites W2032378315 @default.
- W2999963940 cites W2077671246 @default.
- W2999963940 cites W2095989982 @default.
- W2999963940 cites W2124716489 @default.
- W2999963940 cites W2329769476 @default.
- W2999963940 cites W2338351427 @default.
- W2999963940 cites W2386591939 @default.
- W2999963940 cites W2408978589 @default.
- W2999963940 cites W2779774899 @default.
- W2999963940 doi "https://doi.org/10.1109/tcyb.2019.2958912" @default.
- W2999963940 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/7338261" @default.
- W2999963940 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/31944970" @default.
- W2999963940 hasPublicationYear "2021" @default.
- W2999963940 type Work @default.
- W2999963940 sameAs 2999963940 @default.
- W2999963940 citedByCount "8" @default.
- W2999963940 countsByYear W29999639402020 @default.
- W2999963940 countsByYear W29999639402021 @default.
- W2999963940 countsByYear W29999639402022 @default.
- W2999963940 countsByYear W29999639402023 @default.
- W2999963940 crossrefType "journal-article" @default.
- W2999963940 hasAuthorship W2999963940A5001081733 @default.
- W2999963940 hasAuthorship W2999963940A5019813462 @default.
- W2999963940 hasAuthorship W2999963940A5079314465 @default.
- W2999963940 hasBestOaLocation W29999639402 @default.
- W2999963940 hasConcept C119599485 @default.
- W2999963940 hasConcept C119857082 @default.
- W2999963940 hasConcept C127413603 @default.
- W2999963940 hasConcept C134306372 @default.
- W2999963940 hasConcept C141353440 @default.
- W2999963940 hasConcept C154945302 @default.
- W2999963940 hasConcept C201995342 @default.
- W2999963940 hasConcept C2776604539 @default.
- W2999963940 hasConcept C2778112365 @default.
- W2999963940 hasConcept C2780451532 @default.
- W2999963940 hasConcept C33923547 @default.
- W2999963940 hasConcept C41008148 @default.
- W2999963940 hasConcept C54355233 @default.
- W2999963940 hasConcept C56739046 @default.
- W2999963940 hasConcept C73555534 @default.
- W2999963940 hasConcept C77618280 @default.
- W2999963940 hasConcept C86803240 @default.
- W2999963940 hasConcept C97541855 @default.
- W2999963940 hasConceptScore W2999963940C119599485 @default.
- W2999963940 hasConceptScore W2999963940C119857082 @default.
- W2999963940 hasConceptScore W2999963940C127413603 @default.
- W2999963940 hasConceptScore W2999963940C134306372 @default.
- W2999963940 hasConceptScore W2999963940C141353440 @default.
- W2999963940 hasConceptScore W2999963940C154945302 @default.
- W2999963940 hasConceptScore W2999963940C201995342 @default.
- W2999963940 hasConceptScore W2999963940C2776604539 @default.
- W2999963940 hasConceptScore W2999963940C2778112365 @default.
- W2999963940 hasConceptScore W2999963940C2780451532 @default.
- W2999963940 hasConceptScore W2999963940C33923547 @default.
- W2999963940 hasConceptScore W2999963940C41008148 @default.
- W2999963940 hasConceptScore W2999963940C54355233 @default.
- W2999963940 hasConceptScore W2999963940C56739046 @default.
- W2999963940 hasConceptScore W2999963940C73555534 @default.
- W2999963940 hasConceptScore W2999963940C77618280 @default.
- W2999963940 hasConceptScore W2999963940C86803240 @default.
- W2999963940 hasConceptScore W2999963940C97541855 @default.
- W2999963940 hasFunder F4320306076 @default.
- W2999963940 hasFunder F4320322795 @default.
- W2999963940 hasFunder F4320332161 @default.
- W2999963940 hasIssue "12" @default.
- W2999963940 hasLocation W29999639401 @default.
- W2999963940 hasLocation W29999639402 @default.
- W2999963940 hasLocation W29999639403 @default.
- W2999963940 hasOpenAccess W2999963940 @default.
- W2999963940 hasPrimaryLocation W29999639401 @default.
- W2999963940 hasRelatedWork W1562959674 @default.
- W2999963940 hasRelatedWork W2739678353 @default.
- W2999963940 hasRelatedWork W2923653485 @default.
- W2999963940 hasRelatedWork W2957776456 @default.
- W2999963940 hasRelatedWork W3022038857 @default.
- W2999963940 hasRelatedWork W3198239567 @default.
- W2999963940 hasRelatedWork W4293149836 @default.
- W2999963940 hasRelatedWork W4319083788 @default.
- W2999963940 hasRelatedWork W4319453732 @default.
- W2999963940 hasRelatedWork W4361026739 @default.
- W2999963940 hasVolume "51" @default.
- W2999963940 isParatext "false" @default.
- W2999963940 isRetracted "false" @default.
- W2999963940 magId "2999963940" @default.
- W2999963940 workType "article" @default.