Matches in SemOpenAlex for { <https://semopenalex.org/work/W2367922714> ?p ?o ?g. }
Showing items 1 to 81 of
81
with 100 items per page.
- W2367922714 abstract "The problem of agent learning to act in an unknown world is both challenging and interesting. Reinforcement learning has been successful at finding optimal control policies through trial-and-error interaction with dynamic environment. Its properties of self-improving and online learning make reinforcement learning become one of most important machine learning methods. The goal of this paper was to provide a comprehensive review of reinforcement learning about theory,algorithms and applications. First of all,this paper surveyed the foundation,model of environment of reinforcement learning. Discussed the convergence and generalization of the algorithms in the next. Then deeply discussed two representative selection of these algorithm,including discounted reward and average reward. Finally,provided some applications of reinforcement learning,and pointed out some challenges and problems of reinforcement learning." @default.
- W2367922714 created "2016-06-24" @default.
- W2367922714 creator A5027880057 @default.
- W2367922714 date "2010-01-01" @default.
- W2367922714 modified "2023-09-25" @default.
- W2367922714 title "Reinforcement learning: survey of recent work" @default.
- W2367922714 hasPublicationYear "2010" @default.
- W2367922714 type Work @default.
- W2367922714 sameAs 2367922714 @default.
- W2367922714 citedByCount "3" @default.
- W2367922714 countsByYear W23679227142015 @default.
- W2367922714 countsByYear W23679227142018 @default.
- W2367922714 crossrefType "journal-article" @default.
- W2367922714 hasAuthorship W2367922714A5027880057 @default.
- W2367922714 hasConcept C115903097 @default.
- W2367922714 hasConcept C119857082 @default.
- W2367922714 hasConcept C134306372 @default.
- W2367922714 hasConcept C154945302 @default.
- W2367922714 hasConcept C15744967 @default.
- W2367922714 hasConcept C162324750 @default.
- W2367922714 hasConcept C177148314 @default.
- W2367922714 hasConcept C188888258 @default.
- W2367922714 hasConcept C199190896 @default.
- W2367922714 hasConcept C19966478 @default.
- W2367922714 hasConcept C2777303404 @default.
- W2367922714 hasConcept C33923547 @default.
- W2367922714 hasConcept C41008148 @default.
- W2367922714 hasConcept C47932503 @default.
- W2367922714 hasConcept C50522688 @default.
- W2367922714 hasConcept C67203356 @default.
- W2367922714 hasConcept C77805123 @default.
- W2367922714 hasConcept C77967617 @default.
- W2367922714 hasConcept C90509273 @default.
- W2367922714 hasConcept C97541855 @default.
- W2367922714 hasConceptScore W2367922714C115903097 @default.
- W2367922714 hasConceptScore W2367922714C119857082 @default.
- W2367922714 hasConceptScore W2367922714C134306372 @default.
- W2367922714 hasConceptScore W2367922714C154945302 @default.
- W2367922714 hasConceptScore W2367922714C15744967 @default.
- W2367922714 hasConceptScore W2367922714C162324750 @default.
- W2367922714 hasConceptScore W2367922714C177148314 @default.
- W2367922714 hasConceptScore W2367922714C188888258 @default.
- W2367922714 hasConceptScore W2367922714C199190896 @default.
- W2367922714 hasConceptScore W2367922714C19966478 @default.
- W2367922714 hasConceptScore W2367922714C2777303404 @default.
- W2367922714 hasConceptScore W2367922714C33923547 @default.
- W2367922714 hasConceptScore W2367922714C41008148 @default.
- W2367922714 hasConceptScore W2367922714C47932503 @default.
- W2367922714 hasConceptScore W2367922714C50522688 @default.
- W2367922714 hasConceptScore W2367922714C67203356 @default.
- W2367922714 hasConceptScore W2367922714C77805123 @default.
- W2367922714 hasConceptScore W2367922714C77967617 @default.
- W2367922714 hasConceptScore W2367922714C90509273 @default.
- W2367922714 hasConceptScore W2367922714C97541855 @default.
- W2367922714 hasLocation W23679227141 @default.
- W2367922714 hasOpenAccess W2367922714 @default.
- W2367922714 hasPrimaryLocation W23679227141 @default.
- W2367922714 hasRelatedWork W1497976081 @default.
- W2367922714 hasRelatedWork W1562437701 @default.
- W2367922714 hasRelatedWork W2123602362 @default.
- W2367922714 hasRelatedWork W2130711276 @default.
- W2367922714 hasRelatedWork W213343370 @default.
- W2367922714 hasRelatedWork W2168438882 @default.
- W2367922714 hasRelatedWork W2379603734 @default.
- W2367922714 hasRelatedWork W2386329118 @default.
- W2367922714 hasRelatedWork W2391666574 @default.
- W2367922714 hasRelatedWork W2615565422 @default.
- W2367922714 hasRelatedWork W2786230833 @default.
- W2367922714 hasRelatedWork W2808546214 @default.
- W2367922714 hasRelatedWork W2941892493 @default.
- W2367922714 hasRelatedWork W2950651907 @default.
- W2367922714 hasRelatedWork W2969320674 @default.
- W2367922714 hasRelatedWork W2975874528 @default.
- W2367922714 hasRelatedWork W2978070926 @default.
- W2367922714 hasRelatedWork W3081310128 @default.
- W2367922714 hasRelatedWork W3143922204 @default.
- W2367922714 hasRelatedWork W2190150267 @default.
- W2367922714 isParatext "false" @default.
- W2367922714 isRetracted "false" @default.
- W2367922714 magId "2367922714" @default.
- W2367922714 workType "article" @default.