SemOpenAlex |

SemOpenAlex

Matches in SemOpenAlex for { <https://semopenalex.org/work/W2985871261> ?p ?o ?g. }

Showing items 1 to 71 of 71 with 100 items per page.

W2985871261 abstract "Machine Learning Control is a control paradigm that applies Artificial Intelligence methods to control problems. Within this domain, the field of Reinforcement Learning (RL) is particularly promising, since it provides a framework in which a control policy does not have to be programmed explicitly, but can be learned by an intelligent controller directly from real-world data, allowing to control systems that are either arduous or even impossible to model analytically. However, in spite of such considerable potential, the RL paradigm poses a number of challenges that effectively hinder its applications in the real-world and in industry. It is therefore critical that research in this field is advanced until RL-based controllers can be practically demonstrated to be real-world feasible and reliable. This thesis report presents the attempts made at applying control strategies based on Reinforcement Learning to solve a precise positioning task with a physical experimental setup. The setup at hand is a magnetic manipulator (magman) characterized by a high degree of nonlinearity. The controller uses the spatially continuous magnetic field generated by four actuators to displace a steel ball, constrained to move in one dimension, towards a reference position. Two different implementations of the Q-learning algorithm (Sutton, Barto, et al., 1998) were deployed. In spite of the good results obtained in a simplified simulated environment, both implementations failed on the experimental setup. The negative outcome of these experiments is mainly due to the fact that, since the task at hand is an accurate positioning task, the reward obtained by the learner while interacting with the environment is too sparse for it to be able to learn a stabilizing control policy. Other factors have presumably contributed to the controllers’ failure, such as the circumstance that the agent does not have access to the full system state information and a sub-optimal tuning of the algorithms’ hyper-parameters. Besides model-free RL, the Value Iteration model-based method was successfully applied both in simulations and with the experimental setup. The present findings suggest that, in order to solve the magman task with model-free RL, more sophisticated algorithms need to be deployed, such as for example an agent that can naturally deal with continuous state and action spaces, as the DDPG algorithm (Lillicrap et al., 2015), with exploration carried out in the parameter-space rather than in the control action space (Plappert et al., 2017), in addition to a more optimal exploitation of the information extracted from the environment, for example using Hindsight Experience Replay (Andrychowicz et al., 2017)." @default.
W2985871261 created "2019-11-22" @default.
W2985871261 creator A5032554584 @default.
W2985871261 date "2019-01-01" @default.
W2985871261 modified "2023-09-26" @default.
W2985871261 title "Online Reinforcement Learning Control of an Electromagnetic Manipulator" @default.
W2985871261 hasPublicationYear "2019" @default.
W2985871261 type Work @default.
W2985871261 sameAs 2985871261 @default.
W2985871261 citedByCount "0" @default.
W2985871261 crossrefType "journal-article" @default.
W2985871261 hasAuthorship W2985871261A5032554584 @default.
W2985871261 hasConcept C127413603 @default.
W2985871261 hasConcept C133731056 @default.
W2985871261 hasConcept C154945302 @default.
W2985871261 hasConcept C172707124 @default.
W2985871261 hasConcept C199360897 @default.
W2985871261 hasConcept C201995342 @default.
W2985871261 hasConcept C202444582 @default.
W2985871261 hasConcept C203479927 @default.
W2985871261 hasConcept C26713055 @default.
W2985871261 hasConcept C2780451532 @default.
W2985871261 hasConcept C33923547 @default.
W2985871261 hasConcept C41008148 @default.
W2985871261 hasConcept C6557445 @default.
W2985871261 hasConcept C86803240 @default.
W2985871261 hasConcept C9652623 @default.
W2985871261 hasConcept C97541855 @default.
W2985871261 hasConceptScore W2985871261C127413603 @default.
W2985871261 hasConceptScore W2985871261C133731056 @default.
W2985871261 hasConceptScore W2985871261C154945302 @default.
W2985871261 hasConceptScore W2985871261C172707124 @default.
W2985871261 hasConceptScore W2985871261C199360897 @default.
W2985871261 hasConceptScore W2985871261C201995342 @default.
W2985871261 hasConceptScore W2985871261C202444582 @default.
W2985871261 hasConceptScore W2985871261C203479927 @default.
W2985871261 hasConceptScore W2985871261C26713055 @default.
W2985871261 hasConceptScore W2985871261C2780451532 @default.
W2985871261 hasConceptScore W2985871261C33923547 @default.
W2985871261 hasConceptScore W2985871261C41008148 @default.
W2985871261 hasConceptScore W2985871261C6557445 @default.
W2985871261 hasConceptScore W2985871261C86803240 @default.
W2985871261 hasConceptScore W2985871261C9652623 @default.
W2985871261 hasConceptScore W2985871261C97541855 @default.
W2985871261 hasLocation W29858712611 @default.
W2985871261 hasOpenAccess W2985871261 @default.
W2985871261 hasPrimaryLocation W29858712611 @default.
W2985871261 hasRelatedWork W1522501859 @default.
W2985871261 hasRelatedWork W1561563290 @default.
W2985871261 hasRelatedWork W1814441753 @default.
W2985871261 hasRelatedWork W1838356191 @default.
W2985871261 hasRelatedWork W1863534978 @default.
W2985871261 hasRelatedWork W2076329865 @default.
W2985871261 hasRelatedWork W2100370041 @default.
W2985871261 hasRelatedWork W2149738224 @default.
W2985871261 hasRelatedWork W2403742485 @default.
W2985871261 hasRelatedWork W2910219310 @default.
W2985871261 hasRelatedWork W2917954210 @default.
W2985871261 hasRelatedWork W2919334316 @default.
W2985871261 hasRelatedWork W2925306934 @default.
W2985871261 hasRelatedWork W2952672470 @default.
W2985871261 hasRelatedWork W3007369745 @default.
W2985871261 hasRelatedWork W3109409708 @default.
W2985871261 hasRelatedWork W3129896193 @default.
W2985871261 hasRelatedWork W3199764406 @default.
W2985871261 hasRelatedWork W3203483019 @default.
W2985871261 hasRelatedWork W76760840 @default.
W2985871261 isParatext "false" @default.
W2985871261 isRetracted "false" @default.
W2985871261 magId "2985871261" @default.
W2985871261 workType "article" @default.