Matches in SemOpenAlex for { <https://semopenalex.org/work/W2072697031> ?p ?o ?g. }
- W2072697031 endingPage "54" @default.
- W2072697031 startingPage "36" @default.
- W2072697031 abstract "A crucial trade-off is involved in the design process when function approximation is used in reinforcement learning. Ideally the chosen representation should allow representing as close as possible an approximation of the value function. However, the more expressive the representation the more training data is needed because the space of candidate hypotheses is bigger. A less expressive representation has a smaller hypotheses space and a good candidate can be found faster. The core idea of this paper is the use of a mixed resolution function approximation, that is, the use of a less expressive function approximation to provide useful guidance during learning, and the use of a more expressive function approximation to obtain a final result of high quality. A major question is how to combine the two representations. Two approaches are proposed and evaluated empirically.Request access from your librarian to read this article's full text." @default.
- W2072697031 created "2016-06-24" @default.
- W2072697031 creator A5009587907 @default.
- W2072697031 creator A5052315074 @default.
- W2072697031 date "2009-04-01" @default.
- W2072697031 modified "2023-10-16" @default.
- W2072697031 title "Reinforcement Learning with Reward Shaping and Mixed Resolution Function Approximation" @default.
- W2072697031 cites W1497533895 @default.
- W2072697031 cites W1499408472 @default.
- W2072697031 cites W1507087299 @default.
- W2072697031 cites W1515851193 @default.
- W2072697031 cites W1552830313 @default.
- W2072697031 cites W1569296262 @default.
- W2072697031 cites W1706571876 @default.
- W2072697031 cites W1777239053 @default.
- W2072697031 cites W1987187457 @default.
- W2072697031 cites W2103626435 @default.
- W2072697031 cites W2104641222 @default.
- W2072697031 cites W2109102709 @default.
- W2072697031 cites W2109910161 @default.
- W2072697031 cites W2113913482 @default.
- W2072697031 cites W2121517924 @default.
- W2072697031 cites W2124175081 @default.
- W2072697031 cites W2124491905 @default.
- W2072697031 cites W2144366468 @default.
- W2072697031 cites W2146290970 @default.
- W2072697031 cites W2158548602 @default.
- W2072697031 cites W2158969944 @default.
- W2072697031 cites W2305205647 @default.
- W2072697031 doi "https://doi.org/10.4018/jats.2009040103" @default.
- W2072697031 hasPublicationYear "2009" @default.
- W2072697031 type Work @default.
- W2072697031 sameAs 2072697031 @default.
- W2072697031 citedByCount "2" @default.
- W2072697031 countsByYear W20726970312015 @default.
- W2072697031 countsByYear W20726970312017 @default.
- W2072697031 crossrefType "journal-article" @default.
- W2072697031 hasAuthorship W2072697031A5009587907 @default.
- W2072697031 hasAuthorship W2072697031A5052315074 @default.
- W2072697031 hasConcept C111472728 @default.
- W2072697031 hasConcept C111919701 @default.
- W2072697031 hasConcept C119857082 @default.
- W2072697031 hasConcept C126255220 @default.
- W2072697031 hasConcept C138268822 @default.
- W2072697031 hasConcept C138885662 @default.
- W2072697031 hasConcept C14036430 @default.
- W2072697031 hasConcept C14646407 @default.
- W2072697031 hasConcept C154945302 @default.
- W2072697031 hasConcept C17744445 @default.
- W2072697031 hasConcept C199539241 @default.
- W2072697031 hasConcept C2164484 @default.
- W2072697031 hasConcept C2776291640 @default.
- W2072697031 hasConcept C2776359362 @default.
- W2072697031 hasConcept C2778572836 @default.
- W2072697031 hasConcept C2779530757 @default.
- W2072697031 hasConcept C33923547 @default.
- W2072697031 hasConcept C41008148 @default.
- W2072697031 hasConcept C50644808 @default.
- W2072697031 hasConcept C76155785 @default.
- W2072697031 hasConcept C78458016 @default.
- W2072697031 hasConcept C86803240 @default.
- W2072697031 hasConcept C91873725 @default.
- W2072697031 hasConcept C94625758 @default.
- W2072697031 hasConcept C97541855 @default.
- W2072697031 hasConceptScore W2072697031C111472728 @default.
- W2072697031 hasConceptScore W2072697031C111919701 @default.
- W2072697031 hasConceptScore W2072697031C119857082 @default.
- W2072697031 hasConceptScore W2072697031C126255220 @default.
- W2072697031 hasConceptScore W2072697031C138268822 @default.
- W2072697031 hasConceptScore W2072697031C138885662 @default.
- W2072697031 hasConceptScore W2072697031C14036430 @default.
- W2072697031 hasConceptScore W2072697031C14646407 @default.
- W2072697031 hasConceptScore W2072697031C154945302 @default.
- W2072697031 hasConceptScore W2072697031C17744445 @default.
- W2072697031 hasConceptScore W2072697031C199539241 @default.
- W2072697031 hasConceptScore W2072697031C2164484 @default.
- W2072697031 hasConceptScore W2072697031C2776291640 @default.
- W2072697031 hasConceptScore W2072697031C2776359362 @default.
- W2072697031 hasConceptScore W2072697031C2778572836 @default.
- W2072697031 hasConceptScore W2072697031C2779530757 @default.
- W2072697031 hasConceptScore W2072697031C33923547 @default.
- W2072697031 hasConceptScore W2072697031C41008148 @default.
- W2072697031 hasConceptScore W2072697031C50644808 @default.
- W2072697031 hasConceptScore W2072697031C76155785 @default.
- W2072697031 hasConceptScore W2072697031C78458016 @default.
- W2072697031 hasConceptScore W2072697031C86803240 @default.
- W2072697031 hasConceptScore W2072697031C91873725 @default.
- W2072697031 hasConceptScore W2072697031C94625758 @default.
- W2072697031 hasConceptScore W2072697031C97541855 @default.
- W2072697031 hasIssue "2" @default.
- W2072697031 hasLocation W20726970311 @default.
- W2072697031 hasOpenAccess W2072697031 @default.
- W2072697031 hasPrimaryLocation W20726970311 @default.
- W2072697031 hasRelatedWork W1624593201 @default.
- W2072697031 hasRelatedWork W2025663273 @default.
- W2072697031 hasRelatedWork W2072697031 @default.
- W2072697031 hasRelatedWork W2155027007 @default.
- W2072697031 hasRelatedWork W3022038857 @default.