Matches in SemOpenAlex for { <https://semopenalex.org/work/W1565638106> ?p ?o ?g. }
Showing items 1 to 77 of 77 with 100 items per page.
- W1565638106 abstract "In this paper we state a generalized form of the policy improvement algorithm for reinforcement learning. This new algorithm can be used to find stochastic policies that optimize single-agent behavior for several environments and reinforcement functions simultaneously. We first introduce a geometric interpretation of policy improvement, define a framework to apply one policy to several environments, and propose the notion of balanced policies. Finally we explain the algorithm and present examples." @default.
- W1565638106 created "2016-06-24" @default.
- W1565638106 creator A5024016986 @default.
- W1565638106 creator A5079672297 @default.
- W1565638106 date "2001-01-01" @default.
- W1565638106 modified "2023-09-27" @default.
- W1565638106 title "Policy Improvement for several Environments Extended Version" @default.
- W1565638106 cites W1542941925 @default.
- W1565638106 cites W1597196612 @default.
- W1565638106 cites W1835254890 @default.
- W1565638106 cites W1963547452 @default.
- W1565638106 cites W2107726111 @default.
- W1565638106 cites W2121863487 @default.
- W1565638106 cites W2312609093 @default.
- W1565638106 cites W2330024298 @default.
- W1565638106 hasPublicationYear "2001" @default.
- W1565638106 type Work @default.
- W1565638106 sameAs 1565638106 @default.
- W1565638106 citedByCount "0" @default.
- W1565638106 crossrefType "journal-article" @default.
- W1565638106 hasAuthorship W1565638106A5024016986 @default.
- W1565638106 hasAuthorship W1565638106A5079672297 @default.
- W1565638106 hasConcept C11413529 @default.
- W1565638106 hasConcept C119857082 @default.
- W1565638106 hasConcept C126255220 @default.
- W1565638106 hasConcept C127413603 @default.
- W1565638106 hasConcept C154945302 @default.
- W1565638106 hasConcept C199360897 @default.
- W1565638106 hasConcept C21547014 @default.
- W1565638106 hasConcept C2778915421 @default.
- W1565638106 hasConcept C2779436431 @default.
- W1565638106 hasConcept C33923547 @default.
- W1565638106 hasConcept C41008148 @default.
- W1565638106 hasConcept C48103436 @default.
- W1565638106 hasConcept C527412718 @default.
- W1565638106 hasConcept C97541855 @default.
- W1565638106 hasConceptScore W1565638106C11413529 @default.
- W1565638106 hasConceptScore W1565638106C119857082 @default.
- W1565638106 hasConceptScore W1565638106C126255220 @default.
- W1565638106 hasConceptScore W1565638106C127413603 @default.
- W1565638106 hasConceptScore W1565638106C154945302 @default.
- W1565638106 hasConceptScore W1565638106C199360897 @default.
- W1565638106 hasConceptScore W1565638106C21547014 @default.
- W1565638106 hasConceptScore W1565638106C2778915421 @default.
- W1565638106 hasConceptScore W1565638106C2779436431 @default.
- W1565638106 hasConceptScore W1565638106C33923547 @default.
- W1565638106 hasConceptScore W1565638106C41008148 @default.
- W1565638106 hasConceptScore W1565638106C48103436 @default.
- W1565638106 hasConceptScore W1565638106C527412718 @default.
- W1565638106 hasConceptScore W1565638106C97541855 @default.
- W1565638106 hasLocation W15656381061 @default.
- W1565638106 hasOpenAccess W1565638106 @default.
- W1565638106 hasPrimaryLocation W15656381061 @default.
- W1565638106 hasRelatedWork W106493965 @default.
- W1565638106 hasRelatedWork W115717799 @default.
- W1565638106 hasRelatedWork W1497976081 @default.
- W1565638106 hasRelatedWork W2130711276 @default.
- W1565638106 hasRelatedWork W2285358897 @default.
- W1565638106 hasRelatedWork W2367922714 @default.
- W1565638106 hasRelatedWork W2379603734 @default.
- W1565638106 hasRelatedWork W2386329118 @default.
- W1565638106 hasRelatedWork W2391666574 @default.
- W1565638106 hasRelatedWork W2472956029 @default.
- W1565638106 hasRelatedWork W2733762270 @default.
- W1565638106 hasRelatedWork W2786230833 @default.
- W1565638106 hasRelatedWork W2945986931 @default.
- W1565638106 hasRelatedWork W2978070926 @default.
- W1565638106 hasRelatedWork W3012381177 @default.
- W1565638106 hasRelatedWork W3035367521 @default.
- W1565638106 hasRelatedWork W3208284117 @default.
- W1565638106 hasRelatedWork W32178094 @default.
- W1565638106 hasRelatedWork W8539471 @default.
- W1565638106 hasRelatedWork W2190150267 @default.
- W1565638106 isParatext "false" @default.
- W1565638106 isRetracted "false" @default.
- W1565638106 magId "1565638106" @default.
- W1565638106 workType "article" @default.