Matches in SemOpenAlex for { <https://semopenalex.org/work/W1840881174> ?p ?o ?g. }
Showing items 1 to 83 of 83 with 100 items per page.
- W1840881174 endingPage "1280" @default.
- W1840881174 startingPage "1277" @default.
- W1840881174 abstract "The original Q-learning method has difficulty achieving sample efficiency, for example when training a policy to reach a goal within a limited number of time steps. The Dyna-Q agent was therefore proposed to speed up policy learning. However, Dyna-Q does not specify how to build the model, so a lookup table is commonly used as the model. In this paper, we propose an adaptive model learning method based on tree structures and combine it with Q-learning to form a Tree-Based Dyna-Q agent that enhances policy learning. Once the tree-based model has learned an accurate model, a planning method can use it to produce simulated experiences that accelerate value iteration. The agent with the proposed method can thus obtain virtual experiences for updating the policy. The simulation results show that the proposed method noticeably reduces training time." @default.
- W1840881174 created "2016-06-24" @default.
- W1840881174 creator A5001081733 @default.
- W1840881174 creator A5061189209 @default.
- W1840881174 creator A5064655192 @default.
- W1840881174 date "2012-10-04" @default.
- W1840881174 modified "2023-09-28" @default.
- W1840881174 title "Adaptive model learning method for reinforcement learning" @default.
- W1840881174 cites W1491843047 @default.
- W1840881174 cites W1557517019 @default.
- W1840881174 cites W1783866091 @default.
- W1840881174 cites W1893871610 @default.
- W1840881174 cites W2078196735 @default.
- W1840881174 cites W2166265228 @default.
- W1840881174 hasPublicationYear "2012" @default.
- W1840881174 type Work @default.
- W1840881174 sameAs 1840881174 @default.
- W1840881174 citedByCount "2" @default.
- W1840881174 countsByYear W18408811742013 @default.
- W1840881174 countsByYear W18408811742021 @default.
- W1840881174 crossrefType "proceedings-article" @default.
- W1840881174 hasAuthorship W1840881174A5001081733 @default.
- W1840881174 hasAuthorship W1840881174A5061189209 @default.
- W1840881174 hasAuthorship W1840881174A5064655192 @default.
- W1840881174 hasConcept C113174947 @default.
- W1840881174 hasConcept C11413529 @default.
- W1840881174 hasConcept C119857082 @default.
- W1840881174 hasConcept C124101348 @default.
- W1840881174 hasConcept C125014702 @default.
- W1840881174 hasConcept C134306372 @default.
- W1840881174 hasConcept C154945302 @default.
- W1840881174 hasConcept C163797641 @default.
- W1840881174 hasConcept C188116033 @default.
- W1840881174 hasConcept C197855036 @default.
- W1840881174 hasConcept C2779436431 @default.
- W1840881174 hasConcept C33923547 @default.
- W1840881174 hasConcept C41008148 @default.
- W1840881174 hasConcept C45235069 @default.
- W1840881174 hasConcept C97541855 @default.
- W1840881174 hasConceptScore W1840881174C113174947 @default.
- W1840881174 hasConceptScore W1840881174C11413529 @default.
- W1840881174 hasConceptScore W1840881174C119857082 @default.
- W1840881174 hasConceptScore W1840881174C124101348 @default.
- W1840881174 hasConceptScore W1840881174C125014702 @default.
- W1840881174 hasConceptScore W1840881174C134306372 @default.
- W1840881174 hasConceptScore W1840881174C154945302 @default.
- W1840881174 hasConceptScore W1840881174C163797641 @default.
- W1840881174 hasConceptScore W1840881174C188116033 @default.
- W1840881174 hasConceptScore W1840881174C197855036 @default.
- W1840881174 hasConceptScore W1840881174C2779436431 @default.
- W1840881174 hasConceptScore W1840881174C33923547 @default.
- W1840881174 hasConceptScore W1840881174C41008148 @default.
- W1840881174 hasConceptScore W1840881174C45235069 @default.
- W1840881174 hasConceptScore W1840881174C97541855 @default.
- W1840881174 hasLocation W18408811741 @default.
- W1840881174 hasOpenAccess W1840881174 @default.
- W1840881174 hasPrimaryLocation W18408811741 @default.
- W1840881174 hasRelatedWork W1975737528 @default.
- W1840881174 hasRelatedWork W1982781528 @default.
- W1840881174 hasRelatedWork W1985881603 @default.
- W1840881174 hasRelatedWork W2050020749 @default.
- W1840881174 hasRelatedWork W2099242228 @default.
- W1840881174 hasRelatedWork W2114876262 @default.
- W1840881174 hasRelatedWork W2115393638 @default.
- W1840881174 hasRelatedWork W2120153538 @default.
- W1840881174 hasRelatedWork W2146946892 @default.
- W1840881174 hasRelatedWork W2150498751 @default.
- W1840881174 hasRelatedWork W2152585316 @default.
- W1840881174 hasRelatedWork W2356358603 @default.
- W1840881174 hasRelatedWork W2534140487 @default.
- W1840881174 hasRelatedWork W2599481709 @default.
- W1840881174 hasRelatedWork W2808421695 @default.
- W1840881174 hasRelatedWork W2907306720 @default.
- W1840881174 hasRelatedWork W2963170229 @default.
- W1840881174 hasRelatedWork W3017385169 @default.
- W1840881174 hasRelatedWork W3088331655 @default.
- W1840881174 hasRelatedWork W89604033 @default.
- W1840881174 isParatext "false" @default.
- W1840881174 isRetracted "false" @default.
- W1840881174 magId "1840881174" @default.
- W1840881174 workType "article" @default.
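
The abstract of this work describes Dyna-Q as Q-learning combined with a learned model that supplies simulated experiences for planning. The following is a minimal sketch of the baseline table-model Dyna-Q that the abstract mentions, not the tree-based model the paper proposes. The toy chain environment, the function names (`step`, `dyna_q`), and all hyperparameter values are assumptions made for illustration only.

```python
import random

N_STATES = 6      # hypothetical toy chain: states 0..5, state 5 is the goal
ACTIONS = (0, 1)  # 0 = move left, 1 = move right


def step(state, action):
    """Deterministic chain: reward 1.0 on reaching the goal, else 0.0."""
    if action == 0:
        nxt = max(0, state - 1)
    else:
        nxt = min(N_STATES - 1, state + 1)
    done = nxt == N_STATES - 1
    return nxt, (1.0 if done else 0.0), done


def dyna_q(episodes=50, planning_steps=10, alpha=0.5, gamma=0.95,
           epsilon=0.1, seed=0):
    rng = random.Random(seed)
    q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}
    model = {}  # table model: (state, action) -> (next_state, reward, done)

    def greedy(s):
        # Pick a best-valued action, breaking ties at random.
        best = max(q[(s, a)] for a in ACTIONS)
        return rng.choice([a for a in ACTIONS if q[(s, a)] == best])

    def update(s, a, r, s2, done):
        # One-step Q-learning backup.
        target = r if done else r + gamma * max(q[(s2, b)] for b in ACTIONS)
        q[(s, a)] += alpha * (target - q[(s, a)])

    for _ in range(episodes):
        s, done = 0, False
        while not done:
            explore = rng.random() < epsilon
            a = rng.choice(ACTIONS) if explore else greedy(s)
            s2, r, done = step(s, a)
            update(s, a, r, s2, done)        # learn from real experience
            model[(s, a)] = (s2, r, done)    # record the transition
            for _ in range(planning_steps):  # learn from simulated experience
                ps, pa = rng.choice(list(model))
                ps2, pr, pdone = model[(ps, pa)]
                update(ps, pa, pr, ps2, pdone)
            s = s2
    return q


if __name__ == "__main__":
    q_values = dyna_q()
    policy = [max(ACTIONS, key=lambda a: q_values[(s, a)])
              for s in range(N_STATES - 1)]
    print(policy)
```

In this sketch the `model` dictionary plays the role of the table that the abstract says is commonly used. An approach like the one the paper describes would presumably replace that dictionary with a learned tree-based predictor while leaving the planning loop unchanged.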