Matches in SemOpenAlex for { <https://semopenalex.org/work/W1977031068> ?p ?o ?g. }
Showing items 1 to 87 of
87
with 100 items per page.
- W1977031068 abstract "In many multi-agent learning problems, it is difficult to determine, a priori, the agent reward structure that will lead to good performance. This problem is particularly pronounced in continuous, noisy domains ill-suited to simple table backup schemes commonly used in TD(λ)/Q-learning. In this paper, we present a new reward evaluation method that provides a visualization of the tradeoff between coordination among the agents and the difficulty of the learning problem each agent faces. This method is independent of the learning algorithm and is only a function of the problem domain and the agents' reward structure. We then use this reward property visualization method to determine an effective reward without performing extensive simulations. We test this method in both a static and a dynamic multi-rover learning domain where the agents have continuous state spaces and where their actions are noisy (e.g., the agents' movement decisions are not always carried out properly). Our results show that in the more difficult dynamic domain, the reward efficiency visualization method provides a two order of magnitude speedup in selecting a good reward. Most importantly it allows one to quickly create and verify rewards tailored to the observational limitations of the domain." @default.
- W1977031068 created "2016-06-24" @default.
- W1977031068 creator A5014767382 @default.
- W1977031068 creator A5047563625 @default.
- W1977031068 date "2005-07-25" @default.
- W1977031068 modified "2023-10-17" @default.
- W1977031068 title "Multi-agent reward analysis for learning in noisy domains" @default.
- W1977031068 cites W1918859184 @default.
- W1977031068 cites W1982239148 @default.
- W1977031068 cites W2008437221 @default.
- W1977031068 cites W2023491698 @default.
- W1977031068 cites W2071310709 @default.
- W1977031068 cites W2094304980 @default.
- W1977031068 cites W2134281179 @default.
- W1977031068 cites W2148157928 @default.
- W1977031068 cites W2149901355 @default.
- W1977031068 cites W2161061197 @default.
- W1977031068 cites W2169219648 @default.
- W1977031068 cites W3022436500 @default.
- W1977031068 doi "https://doi.org/10.1145/1082473.1082486" @default.
- W1977031068 hasPublicationYear "2005" @default.
- W1977031068 type Work @default.
- W1977031068 sameAs 1977031068 @default.
- W1977031068 citedByCount "34" @default.
- W1977031068 countsByYear W19770310682012 @default.
- W1977031068 countsByYear W19770310682013 @default.
- W1977031068 countsByYear W19770310682014 @default.
- W1977031068 countsByYear W19770310682015 @default.
- W1977031068 countsByYear W19770310682016 @default.
- W1977031068 countsByYear W19770310682017 @default.
- W1977031068 countsByYear W19770310682019 @default.
- W1977031068 countsByYear W19770310682020 @default.
- W1977031068 countsByYear W19770310682021 @default.
- W1977031068 countsByYear W19770310682022 @default.
- W1977031068 crossrefType "proceedings-article" @default.
- W1977031068 hasAuthorship W1977031068A5014767382 @default.
- W1977031068 hasAuthorship W1977031068A5047563625 @default.
- W1977031068 hasBestOaLocation W19770310682 @default.
- W1977031068 hasConcept C111472728 @default.
- W1977031068 hasConcept C111919701 @default.
- W1977031068 hasConcept C119857082 @default.
- W1977031068 hasConcept C124101348 @default.
- W1977031068 hasConcept C134306372 @default.
- W1977031068 hasConcept C138885662 @default.
- W1977031068 hasConcept C154945302 @default.
- W1977031068 hasConcept C189950617 @default.
- W1977031068 hasConcept C33923547 @default.
- W1977031068 hasConcept C36464697 @default.
- W1977031068 hasConcept C36503486 @default.
- W1977031068 hasConcept C41008148 @default.
- W1977031068 hasConcept C45235069 @default.
- W1977031068 hasConcept C68339613 @default.
- W1977031068 hasConcept C75553542 @default.
- W1977031068 hasConceptScore W1977031068C111472728 @default.
- W1977031068 hasConceptScore W1977031068C111919701 @default.
- W1977031068 hasConceptScore W1977031068C119857082 @default.
- W1977031068 hasConceptScore W1977031068C124101348 @default.
- W1977031068 hasConceptScore W1977031068C134306372 @default.
- W1977031068 hasConceptScore W1977031068C138885662 @default.
- W1977031068 hasConceptScore W1977031068C154945302 @default.
- W1977031068 hasConceptScore W1977031068C189950617 @default.
- W1977031068 hasConceptScore W1977031068C33923547 @default.
- W1977031068 hasConceptScore W1977031068C36464697 @default.
- W1977031068 hasConceptScore W1977031068C36503486 @default.
- W1977031068 hasConceptScore W1977031068C41008148 @default.
- W1977031068 hasConceptScore W1977031068C45235069 @default.
- W1977031068 hasConceptScore W1977031068C68339613 @default.
- W1977031068 hasConceptScore W1977031068C75553542 @default.
- W1977031068 hasLocation W19770310681 @default.
- W1977031068 hasLocation W19770310682 @default.
- W1977031068 hasLocation W19770310683 @default.
- W1977031068 hasOpenAccess W1977031068 @default.
- W1977031068 hasPrimaryLocation W19770310681 @default.
- W1977031068 hasRelatedWork W2013643406 @default.
- W1977031068 hasRelatedWork W2027972911 @default.
- W1977031068 hasRelatedWork W2058965144 @default.
- W1977031068 hasRelatedWork W2111089054 @default.
- W1977031068 hasRelatedWork W2146343568 @default.
- W1977031068 hasRelatedWork W2150291671 @default.
- W1977031068 hasRelatedWork W2157978810 @default.
- W1977031068 hasRelatedWork W2164382479 @default.
- W1977031068 hasRelatedWork W2597809628 @default.
- W1977031068 hasRelatedWork W98480971 @default.
- W1977031068 isParatext "false" @default.
- W1977031068 isRetracted "false" @default.
- W1977031068 magId "1977031068" @default.
- W1977031068 workType "article" @default.