Matches in SemOpenAlex for { <https://semopenalex.org/work/W2096791657> ?p ?o ?g. }
Showing items 1 to 66 of
66
with 100 items per page.
- W2096791657 abstract "We focus on potential capability of a profit sharing method (PS) in non-Markov multi-agent environments. It is shown that PS has some rationality in non-Markov environments and is also effective in multi-agent environments. However, conventional PS uses only a reward to learn suitable rules. On the other hand. ldquopenalty avoiding rational policy making algorithm (PARP)rdquo is based on PS and uses not only a reward but also penalties. PARP is improved to save memories and to cope with uncertainties, which is known as ldquoimproved penalty avoiding rational policy making algorithm (improved PARP).rdquo There is another critical problem we must cope with when we apply PS based methods to real environments; we need a huge amount of state information and most of states take continuous values. One solution for this problem is to approximate the states with a function approximation method, e.g. tile coding. In this paper, first, we extend improved penalty avoiding rational policy making algorithm to tile coding environments. Then, we compare the extended method with conventional methods to show the effectiveness through an application to a keepaway task in a soccer game." @default.
- W2096791657 created "2016-06-24" @default.
- W2096791657 creator A5007893254 @default.
- W2096791657 creator A5062124983 @default.
- W2096791657 creator A5083921393 @default.
- W2096791657 date "2008-08-01" @default.
- W2096791657 modified "2023-09-23" @default.
- W2096791657 title "Extension of Improved Penalty Avoiding Rational Policy Making algorithm to tile coding environment for keepaway tasks" @default.
- W2096791657 cites W2064306130 @default.
- W2096791657 cites W2104641222 @default.
- W2096791657 doi "https://doi.org/10.1109/sice.2008.4654997" @default.
- W2096791657 hasPublicationYear "2008" @default.
- W2096791657 type Work @default.
- W2096791657 sameAs 2096791657 @default.
- W2096791657 citedByCount "2" @default.
- W2096791657 crossrefType "proceedings-article" @default.
- W2096791657 hasAuthorship W2096791657A5007893254 @default.
- W2096791657 hasAuthorship W2096791657A5062124983 @default.
- W2096791657 hasAuthorship W2096791657A5083921393 @default.
- W2096791657 hasConcept C105795698 @default.
- W2096791657 hasConcept C11413529 @default.
- W2096791657 hasConcept C119857082 @default.
- W2096791657 hasConcept C126255220 @default.
- W2096791657 hasConcept C159886148 @default.
- W2096791657 hasConcept C179518139 @default.
- W2096791657 hasConcept C33923547 @default.
- W2096791657 hasConcept C41008148 @default.
- W2096791657 hasConcept C6180225 @default.
- W2096791657 hasConcept C98763669 @default.
- W2096791657 hasConceptScore W2096791657C105795698 @default.
- W2096791657 hasConceptScore W2096791657C11413529 @default.
- W2096791657 hasConceptScore W2096791657C119857082 @default.
- W2096791657 hasConceptScore W2096791657C126255220 @default.
- W2096791657 hasConceptScore W2096791657C159886148 @default.
- W2096791657 hasConceptScore W2096791657C179518139 @default.
- W2096791657 hasConceptScore W2096791657C33923547 @default.
- W2096791657 hasConceptScore W2096791657C41008148 @default.
- W2096791657 hasConceptScore W2096791657C6180225 @default.
- W2096791657 hasConceptScore W2096791657C98763669 @default.
- W2096791657 hasLocation W20967916571 @default.
- W2096791657 hasOpenAccess W2096791657 @default.
- W2096791657 hasPrimaryLocation W20967916571 @default.
- W2096791657 hasRelatedWork W1140243306 @default.
- W2096791657 hasRelatedWork W136736547 @default.
- W2096791657 hasRelatedWork W1555201376 @default.
- W2096791657 hasRelatedWork W1566901001 @default.
- W2096791657 hasRelatedWork W1652467813 @default.
- W2096791657 hasRelatedWork W1984151138 @default.
- W2096791657 hasRelatedWork W2047793033 @default.
- W2096791657 hasRelatedWork W2076061149 @default.
- W2096791657 hasRelatedWork W2077808043 @default.
- W2096791657 hasRelatedWork W2096790256 @default.
- W2096791657 hasRelatedWork W2114391168 @default.
- W2096791657 hasRelatedWork W2132347775 @default.
- W2096791657 hasRelatedWork W2132484823 @default.
- W2096791657 hasRelatedWork W2134014128 @default.
- W2096791657 hasRelatedWork W2139898596 @default.
- W2096791657 hasRelatedWork W2166495063 @default.
- W2096791657 hasRelatedWork W2604883922 @default.
- W2096791657 hasRelatedWork W36041826 @default.
- W2096791657 hasRelatedWork W68363589 @default.
- W2096791657 hasRelatedWork W2107957074 @default.
- W2096791657 isParatext "false" @default.
- W2096791657 isRetracted "false" @default.
- W2096791657 magId "2096791657" @default.
- W2096791657 workType "article" @default.