Matches in SemOpenAlex for { <https://semopenalex.org/work/W1982875596> ?p ?o ?g. }
Showing items 1 to 86 of 86 with 100 items per page.
- W1982875596 abstract "Reinforcement learning (RL) is an efficient learning method for Markov decision processes (MDPs); the ant colony system (ACS) is an efficient method for solving combinatorial optimization problems. Based on the update policy of reinforcement values in RL and the cooperation mechanism of indirect media communication in ACS, this paper proposes the Q-ACS multi-agent cooperative learning method, in which the learning agents share episodes beneficial to the exploitation of accumulated knowledge and use the learned reinforcement values efficiently. Further, taking the number of visits into account, this paper proposes the T-ACS multi-agent learning method, in which the learning agents share better policies beneficial to exploration during the agents' learning processes. Meanwhile, in light of the indirect media communication among heterogeneous multi-agents, this paper presents a heterogeneous multi-agent RL method, the D-ACS. The agents in our methods cooperate in a simple way, exchanging information in the form of reinforcement values updated in a model common to all agents. With the advantages of actively exploring the unknown environment and effectively exploiting learned knowledge, the proposed methods are able to solve both MDPs and combinatorial optimization problems effectively. Based on simulation results for the hunter game and the traveling salesman problem, this paper discusses the role of indirect media communication in the multi-agent cooperative learning system and analyzes its efficiency. The experimental results also demonstrate that our methods perform competitively with representative methods in each domain." @default.
- W1982875596 created "2016-06-24" @default.
- W1982875596 creator A5000098880 @default.
- W1982875596 creator A5072991521 @default.
- W1982875596 date "2006-10-01" @default.
- W1982875596 modified "2023-10-14" @default.
- W1982875596 title "Analysis about Efficiency of Indirect Media Communication on Multi-agent Cooperation Learning" @default.
- W1982875596 cites W1524865490 @default.
- W1982875596 cites W1557517019 @default.
- W1982875596 cites W1641379095 @default.
- W1982875596 cites W1792357815 @default.
- W1982875596 cites W1801849579 @default.
- W1982875596 cites W2002444764 @default.
- W1982875596 cites W2074272451 @default.
- W1982875596 cites W2100677568 @default.
- W1982875596 cites W2103791569 @default.
- W1982875596 cites W2107726111 @default.
- W1982875596 cites W2113836425 @default.
- W1982875596 cites W2117341272 @default.
- W1982875596 cites W2125965138 @default.
- W1982875596 cites W2130411910 @default.
- W1982875596 cites W2140481494 @default.
- W1982875596 cites W2154929945 @default.
- W1982875596 cites W2162125251 @default.
- W1982875596 cites W2169022337 @default.
- W1982875596 cites W2615688110 @default.
- W1982875596 cites W411215626 @default.
- W1982875596 doi "https://doi.org/10.1109/icsmc.2006.384790" @default.
- W1982875596 hasPublicationYear "2006" @default.
- W1982875596 type Work @default.
- W1982875596 sameAs 1982875596 @default.
- W1982875596 citedByCount "0" @default.
- W1982875596 crossrefType "proceedings-article" @default.
- W1982875596 hasAuthorship W1982875596A5000098880 @default.
- W1982875596 hasAuthorship W1982875596A5072991521 @default.
- W1982875596 hasConcept C105795698 @default.
- W1982875596 hasConcept C106189395 @default.
- W1982875596 hasConcept C11413529 @default.
- W1982875596 hasConcept C119857082 @default.
- W1982875596 hasConcept C120314980 @default.
- W1982875596 hasConcept C126255220 @default.
- W1982875596 hasConcept C134306372 @default.
- W1982875596 hasConcept C154945302 @default.
- W1982875596 hasConcept C159886148 @default.
- W1982875596 hasConcept C175859090 @default.
- W1982875596 hasConcept C188116033 @default.
- W1982875596 hasConcept C33923547 @default.
- W1982875596 hasConcept C36503486 @default.
- W1982875596 hasConcept C40128228 @default.
- W1982875596 hasConcept C41008148 @default.
- W1982875596 hasConcept C41550386 @default.
- W1982875596 hasConcept C97541855 @default.
- W1982875596 hasConceptScore W1982875596C105795698 @default.
- W1982875596 hasConceptScore W1982875596C106189395 @default.
- W1982875596 hasConceptScore W1982875596C11413529 @default.
- W1982875596 hasConceptScore W1982875596C119857082 @default.
- W1982875596 hasConceptScore W1982875596C120314980 @default.
- W1982875596 hasConceptScore W1982875596C126255220 @default.
- W1982875596 hasConceptScore W1982875596C134306372 @default.
- W1982875596 hasConceptScore W1982875596C154945302 @default.
- W1982875596 hasConceptScore W1982875596C159886148 @default.
- W1982875596 hasConceptScore W1982875596C175859090 @default.
- W1982875596 hasConceptScore W1982875596C188116033 @default.
- W1982875596 hasConceptScore W1982875596C33923547 @default.
- W1982875596 hasConceptScore W1982875596C36503486 @default.
- W1982875596 hasConceptScore W1982875596C40128228 @default.
- W1982875596 hasConceptScore W1982875596C41008148 @default.
- W1982875596 hasConceptScore W1982875596C41550386 @default.
- W1982875596 hasConceptScore W1982875596C97541855 @default.
- W1982875596 hasLocation W19828755961 @default.
- W1982875596 hasOpenAccess W1982875596 @default.
- W1982875596 hasPrimaryLocation W19828755961 @default.
- W1982875596 hasRelatedWork W1982875596 @default.
- W1982875596 hasRelatedWork W2116875315 @default.
- W1982875596 hasRelatedWork W2146763310 @default.
- W1982875596 hasRelatedWork W2150498751 @default.
- W1982875596 hasRelatedWork W2182304831 @default.
- W1982875596 hasRelatedWork W2808418668 @default.
- W1982875596 hasRelatedWork W2937181779 @default.
- W1982875596 hasRelatedWork W3096874164 @default.
- W1982875596 hasRelatedWork W3167472281 @default.
- W1982875596 hasRelatedWork W4206039910 @default.
- W1982875596 isParatext "false" @default.
- W1982875596 isRetracted "false" @default.
- W1982875596 magId "1982875596" @default.
- W1982875596 workType "article" @default.