Matches in SemOpenAlex for { <https://semopenalex.org/work/W4238510270> ?p ?o ?g. }
Showing items 1 to 63 of
63
with 100 items per page.
- W4238510270 abstract "Reinforcement Learning (RL) is one of the model free machine learning algorithms where the agent learns its behaviours from the environment by actually interacting with it. This is better than the offline planner because the agent actually interacts with the environment to learn its behaviours because it is almost impossible to simulate a real world in a computer. By using the reinforcement learning, the agent learns those extra features which can only be learned in an real world environment hence giving it a learning capability like living organisms because in a real world there are certain parameters which cannot be simulated by a computer. Since the reinforcement learning agent gets its feedback from the environment, it allows the agent to automatically determine its behaviours that are considered ideal within a specified context. Reinforcement learning is deemed important in the field of artificial intelligence as it starts to make breakthrough and benchmarks in various industrial applications. Previously we have analysed the pacman game where the pacman agent is a reflex agent, here, we are trying to make the pacman agent more smarter by applying RL techniques, i.e, Q-learning successfully." @default.
- W4238510270 created "2022-05-12" @default.
- W4238510270 creator A5008995616 @default.
- W4238510270 creator A5039295670 @default.
- W4238510270 creator A5041386941 @default.
- W4238510270 date "2019-10-29" @default.
- W4238510270 modified "2023-09-30" @default.
- W4238510270 title "Reinforcement Learning" @default.
- W4238510270 doi "https://doi.org/10.31219/osf.io/dz6sx" @default.
- W4238510270 hasPublicationYear "2019" @default.
- W4238510270 type Work @default.
- W4238510270 citedByCount "0" @default.
- W4238510270 crossrefType "posted-content" @default.
- W4238510270 hasAuthorship W4238510270A5008995616 @default.
- W4238510270 hasAuthorship W4238510270A5039295670 @default.
- W4238510270 hasAuthorship W4238510270A5041386941 @default.
- W4238510270 hasBestOaLocation W42385102701 @default.
- W4238510270 hasConcept C107457646 @default.
- W4238510270 hasConcept C119857082 @default.
- W4238510270 hasConcept C127413603 @default.
- W4238510270 hasConcept C151730666 @default.
- W4238510270 hasConcept C154945302 @default.
- W4238510270 hasConcept C202444582 @default.
- W4238510270 hasConcept C2776999362 @default.
- W4238510270 hasConcept C2779343474 @default.
- W4238510270 hasConcept C33923547 @default.
- W4238510270 hasConcept C41008148 @default.
- W4238510270 hasConcept C66938386 @default.
- W4238510270 hasConcept C67203356 @default.
- W4238510270 hasConcept C86803240 @default.
- W4238510270 hasConcept C9652623 @default.
- W4238510270 hasConcept C97541855 @default.
- W4238510270 hasConceptScore W4238510270C107457646 @default.
- W4238510270 hasConceptScore W4238510270C119857082 @default.
- W4238510270 hasConceptScore W4238510270C127413603 @default.
- W4238510270 hasConceptScore W4238510270C151730666 @default.
- W4238510270 hasConceptScore W4238510270C154945302 @default.
- W4238510270 hasConceptScore W4238510270C202444582 @default.
- W4238510270 hasConceptScore W4238510270C2776999362 @default.
- W4238510270 hasConceptScore W4238510270C2779343474 @default.
- W4238510270 hasConceptScore W4238510270C33923547 @default.
- W4238510270 hasConceptScore W4238510270C41008148 @default.
- W4238510270 hasConceptScore W4238510270C66938386 @default.
- W4238510270 hasConceptScore W4238510270C67203356 @default.
- W4238510270 hasConceptScore W4238510270C86803240 @default.
- W4238510270 hasConceptScore W4238510270C9652623 @default.
- W4238510270 hasConceptScore W4238510270C97541855 @default.
- W4238510270 hasLocation W42385102701 @default.
- W4238510270 hasOpenAccess W4238510270 @default.
- W4238510270 hasPrimaryLocation W42385102701 @default.
- W4238510270 hasRelatedWork W12522828 @default.
- W4238510270 hasRelatedWork W13936347 @default.
- W4238510270 hasRelatedWork W1650390 @default.
- W4238510270 hasRelatedWork W2337330 @default.
- W4238510270 hasRelatedWork W24270 @default.
- W4238510270 hasRelatedWork W4191668 @default.
- W4238510270 hasRelatedWork W4412456 @default.
- W4238510270 hasRelatedWork W4651166 @default.
- W4238510270 hasRelatedWork W5081013 @default.
- W4238510270 hasRelatedWork W929682 @default.
- W4238510270 isParatext "false" @default.
- W4238510270 isRetracted "false" @default.
- W4238510270 workType "article" @default.