Matches in SemOpenAlex for { <https://semopenalex.org/work/W5547603> ?p ?o ?g. }
Showing items 1 to 83 of
83
with 100 items per page.
- W5547603 abstract "The main goal of this thesis was the evaluation and implementation of two types of reinforcement learning algorithms on a computer-simulated control problem. Reinforcement learning is a branch of machine learning which combines principles of dynamic programming and supervised learning for problem solving. For the benchmark system we chose the cart-pole control problem as it is widely used in this field for testing the efficiency of learning algorithms. Out of the reinforcement learning methods we chose two algorithms for temporal difference learning. This type of learning uses methods of dynamic programming and Monte Carlo methods. The first chosen algorithm is Q-learning, the second is an actor-critic algorithm which is called learning by associative search element and adaptive critic element. In the purpose of achieving our goal, we developed a computer application for the experimental testing of the simulation of learning on a benchmark system. Our aim was to make this tool as modular and reusable as possible. We defined a different method of performance evaluation which was used to evaluate both learning algorithms on a wide set of simulation parameters. We also measured the computational performance of both algorithms." @default.
- W5547603 created "2016-06-24" @default.
- W5547603 creator A5022903627 @default.
- W5547603 date "2011-09-20" @default.
- W5547603 modified "2023-09-27" @default.
- W5547603 title "Reinforcement learning on the cart-pole problem" @default.
- W5547603 hasPublicationYear "2011" @default.
- W5547603 type Work @default.
- W5547603 sameAs 5547603 @default.
- W5547603 citedByCount "0" @default.
- W5547603 crossrefType "dissertation" @default.
- W5547603 hasAuthorship W5547603A5022903627 @default.
- W5547603 hasConcept C101468663 @default.
- W5547603 hasConcept C111919701 @default.
- W5547603 hasConcept C115903097 @default.
- W5547603 hasConcept C119857082 @default.
- W5547603 hasConcept C13280743 @default.
- W5547603 hasConcept C154945302 @default.
- W5547603 hasConcept C177264268 @default.
- W5547603 hasConcept C185798385 @default.
- W5547603 hasConcept C188888258 @default.
- W5547603 hasConcept C196340769 @default.
- W5547603 hasConcept C199190896 @default.
- W5547603 hasConcept C199360897 @default.
- W5547603 hasConcept C19966478 @default.
- W5547603 hasConcept C202444582 @default.
- W5547603 hasConcept C205649164 @default.
- W5547603 hasConcept C24138899 @default.
- W5547603 hasConcept C33923547 @default.
- W5547603 hasConcept C41008148 @default.
- W5547603 hasConcept C77967617 @default.
- W5547603 hasConcept C90509273 @default.
- W5547603 hasConcept C9652623 @default.
- W5547603 hasConcept C97541855 @default.
- W5547603 hasConceptScore W5547603C101468663 @default.
- W5547603 hasConceptScore W5547603C111919701 @default.
- W5547603 hasConceptScore W5547603C115903097 @default.
- W5547603 hasConceptScore W5547603C119857082 @default.
- W5547603 hasConceptScore W5547603C13280743 @default.
- W5547603 hasConceptScore W5547603C154945302 @default.
- W5547603 hasConceptScore W5547603C177264268 @default.
- W5547603 hasConceptScore W5547603C185798385 @default.
- W5547603 hasConceptScore W5547603C188888258 @default.
- W5547603 hasConceptScore W5547603C196340769 @default.
- W5547603 hasConceptScore W5547603C199190896 @default.
- W5547603 hasConceptScore W5547603C199360897 @default.
- W5547603 hasConceptScore W5547603C19966478 @default.
- W5547603 hasConceptScore W5547603C202444582 @default.
- W5547603 hasConceptScore W5547603C205649164 @default.
- W5547603 hasConceptScore W5547603C24138899 @default.
- W5547603 hasConceptScore W5547603C33923547 @default.
- W5547603 hasConceptScore W5547603C41008148 @default.
- W5547603 hasConceptScore W5547603C77967617 @default.
- W5547603 hasConceptScore W5547603C90509273 @default.
- W5547603 hasConceptScore W5547603C9652623 @default.
- W5547603 hasConceptScore W5547603C97541855 @default.
- W5547603 hasLocation W55476031 @default.
- W5547603 hasOpenAccess W5547603 @default.
- W5547603 hasPrimaryLocation W55476031 @default.
- W5547603 hasRelatedWork W1506145880 @default.
- W5547603 hasRelatedWork W1522804074 @default.
- W5547603 hasRelatedWork W1843985569 @default.
- W5547603 hasRelatedWork W2005189114 @default.
- W5547603 hasRelatedWork W2027283105 @default.
- W5547603 hasRelatedWork W2059836092 @default.
- W5547603 hasRelatedWork W2082582741 @default.
- W5547603 hasRelatedWork W2127612629 @default.
- W5547603 hasRelatedWork W212906305 @default.
- W5547603 hasRelatedWork W2143680741 @default.
- W5547603 hasRelatedWork W2526845620 @default.
- W5547603 hasRelatedWork W2557694176 @default.
- W5547603 hasRelatedWork W2558505034 @default.
- W5547603 hasRelatedWork W2610686804 @default.
- W5547603 hasRelatedWork W2965867197 @default.
- W5547603 hasRelatedWork W2967129319 @default.
- W5547603 hasRelatedWork W3112076360 @default.
- W5547603 hasRelatedWork W3132794238 @default.
- W5547603 hasRelatedWork W3135573334 @default.
- W5547603 hasRelatedWork W3001566354 @default.
- W5547603 isParatext "false" @default.
- W5547603 isRetracted "false" @default.
- W5547603 magId "5547603" @default.
- W5547603 workType "dissertation" @default.