Matches in SemOpenAlex for { <https://semopenalex.org/work/W2386496952> ?p ?o ?g. }
Showing items 1 to 61 of 61 with 100 items per page.
- W2386496952 abstract "RL agents solve sequential decision problems by learning optimal policies for choosing actions. Thus, at the core of RL is the definition of what it means for a policy to be “optimal”. In this paper, a variety of optimality criteria from the dynamic programming literature are discussed, and their suitability and characteristics for RL are examined through some examples. The necessity of devising RL algorithms for the various criteria is also analyzed." @default.
- W2386496952 created "2016-06-24" @default.
- W2386496952 creator A5023298418 @default.
- W2386496952 date "2001-01-01" @default.
- W2386496952 modified "2023-09-25" @default.
- W2386496952 title "Research on Optimality Criteria in Reinforcement Learning" @default.
- W2386496952 hasPublicationYear "2001" @default.
- W2386496952 type Work @default.
- W2386496952 sameAs 2386496952 @default.
- W2386496952 citedByCount "0" @default.
- W2386496952 crossrefType "journal-article" @default.
- W2386496952 hasAuthorship W2386496952A5023298418 @default.
- W2386496952 hasConcept C11413529 @default.
- W2386496952 hasConcept C119857082 @default.
- W2386496952 hasConcept C126255220 @default.
- W2386496952 hasConcept C136197465 @default.
- W2386496952 hasConcept C154945302 @default.
- W2386496952 hasConcept C2164484 @default.
- W2386496952 hasConcept C33923547 @default.
- W2386496952 hasConcept C37404715 @default.
- W2386496952 hasConcept C41008148 @default.
- W2386496952 hasConcept C76155785 @default.
- W2386496952 hasConcept C97541855 @default.
- W2386496952 hasConceptScore W2386496952C11413529 @default.
- W2386496952 hasConceptScore W2386496952C119857082 @default.
- W2386496952 hasConceptScore W2386496952C126255220 @default.
- W2386496952 hasConceptScore W2386496952C136197465 @default.
- W2386496952 hasConceptScore W2386496952C154945302 @default.
- W2386496952 hasConceptScore W2386496952C2164484 @default.
- W2386496952 hasConceptScore W2386496952C33923547 @default.
- W2386496952 hasConceptScore W2386496952C37404715 @default.
- W2386496952 hasConceptScore W2386496952C41008148 @default.
- W2386496952 hasConceptScore W2386496952C76155785 @default.
- W2386496952 hasConceptScore W2386496952C97541855 @default.
- W2386496952 hasLocation W23864969521 @default.
- W2386496952 hasOpenAccess W2386496952 @default.
- W2386496952 hasPrimaryLocation W23864969521 @default.
- W2386496952 hasRelatedWork W1218520990 @default.
- W2386496952 hasRelatedWork W1493919947 @default.
- W2386496952 hasRelatedWork W1624779897 @default.
- W2386496952 hasRelatedWork W1663497315 @default.
- W2386496952 hasRelatedWork W1964895604 @default.
- W2386496952 hasRelatedWork W2029482943 @default.
- W2386496952 hasRelatedWork W2038771780 @default.
- W2386496952 hasRelatedWork W2101242010 @default.
- W2386496952 hasRelatedWork W2136719210 @default.
- W2386496952 hasRelatedWork W2147995533 @default.
- W2386496952 hasRelatedWork W2154023516 @default.
- W2386496952 hasRelatedWork W2289410116 @default.
- W2386496952 hasRelatedWork W2352330563 @default.
- W2386496952 hasRelatedWork W275349574 @default.
- W2386496952 hasRelatedWork W2792838926 @default.
- W2386496952 hasRelatedWork W2899658755 @default.
- W2386496952 hasRelatedWork W3133687343 @default.
- W2386496952 hasRelatedWork W3139043417 @default.
- W2386496952 hasRelatedWork W3167264379 @default.
- W2386496952 hasRelatedWork W3170197122 @default.
- W2386496952 isParatext "false" @default.
- W2386496952 isRetracted "false" @default.
- W2386496952 magId "2386496952" @default.
- W2386496952 workType "article" @default.