Matches in SemOpenAlex for { <https://semopenalex.org/work/W3128013500> ?p ?o ?g. }
- W3128013500 endingPage "1292" @default.
- W3128013500 startingPage "1292" @default.
- W3128013500 abstract "This article surveys reinforcement learning approaches in social robotics. Reinforcement learning is a framework for decision-making problems in which an agent interacts through trial-and-error with its environment to discover an optimal behavior. Since interaction is a key component in both reinforcement learning and social robotics, it can be a well-suited approach for real-world interactions with physically embodied social robots. The scope of the paper is focused particularly on studies that include social physical robots and real-world human-robot interactions with users. We present a thorough analysis of reinforcement learning approaches in social robotics. In addition to a survey, we categorize existent reinforcement learning approaches based on the used method and the design of the reward mechanisms. Moreover, since communication capability is a prominent feature of social robots, we discuss and group the papers based on the communication medium used for reward formulation. Considering the importance of designing the reward function, we also provide a categorization of the papers based on the nature of the reward. This categorization includes three major themes: interactive reinforcement learning, intrinsically motivated methods, and task performance-driven methods. The benefits and challenges of reinforcement learning in social robotics, evaluation methods of the papers regarding whether or not they use subjective and algorithmic measures, a discussion in the view of real-world reinforcement learning challenges and proposed solutions, the points that remain to be explored, including the approaches that have thus far received less attention is also given in the paper. Thus, this paper aims to become a starting point for researchers interested in using and applying reinforcement learning methods in this particular research field." @default.
- W3128013500 created "2021-02-15" @default.
- W3128013500 creator A5025841244 @default.
- W3128013500 creator A5075949977 @default.
- W3128013500 date "2021-02-11" @default.
- W3128013500 modified "2023-10-06" @default.
- W3128013500 title "Reinforcement Learning Approaches in Social Robotics" @default.
- W3128013500 cites W1453801241 @default.
- W3128013500 cites W1966259872 @default.
- W3128013500 cites W1971191882 @default.
- W3128013500 cites W1977655452 @default.
- W3128013500 cites W1978309251 @default.
- W3128013500 cites W1987867889 @default.
- W3128013500 cites W1991204540 @default.
- W3128013500 cites W1997714027 @default.
- W3128013500 cites W2000514530 @default.
- W3128013500 cites W2005250710 @default.
- W3128013500 cites W2017310186 @default.
- W3128013500 cites W2017697483 @default.
- W3128013500 cites W2021641437 @default.
- W3128013500 cites W2044663075 @default.
- W3128013500 cites W2054908983 @default.
- W3128013500 cites W2056653303 @default.
- W3128013500 cites W2064012529 @default.
- W3128013500 cites W2070629246 @default.
- W3128013500 cites W2074056782 @default.
- W3128013500 cites W2077343054 @default.
- W3128013500 cites W2085240212 @default.
- W3128013500 cites W2085533297 @default.
- W3128013500 cites W2093271548 @default.
- W3128013500 cites W2095436958 @default.
- W3128013500 cites W2099019320 @default.
- W3128013500 cites W2101448098 @default.
- W3128013500 cites W2101915445 @default.
- W3128013500 cites W2105938655 @default.
- W3128013500 cites W2121110499 @default.
- W3128013500 cites W2121517924 @default.
- W3128013500 cites W2124267516 @default.
- W3128013500 cites W2130763675 @default.
- W3128013500 cites W2145339207 @default.
- W3128013500 cites W2151717183 @default.
- W3128013500 cites W2153189295 @default.
- W3128013500 cites W2162932021 @default.
- W3128013500 cites W2168108292 @default.
- W3128013500 cites W2170899200 @default.
- W3128013500 cites W2253319075 @default.
- W3128013500 cites W2401691899 @default.
- W3128013500 cites W2570651606 @default.
- W3128013500 cites W2592373391 @default.
- W3128013500 cites W2735071725 @default.
- W3128013500 cites W2769184731 @default.
- W3128013500 cites W2769208536 @default.
- W3128013500 cites W2769428625 @default.
- W3128013500 cites W2775496038 @default.
- W3128013500 cites W2789863437 @default.
- W3128013500 cites W2794187674 @default.
- W3128013500 cites W2799745602 @default.
- W3128013500 cites W2887320834 @default.
- W3128013500 cites W2894609524 @default.
- W3128013500 cites W2897782875 @default.
- W3128013500 cites W2898626038 @default.
- W3128013500 cites W2910273746 @default.
- W3128013500 cites W2912215636 @default.
- W3128013500 cites W2922038110 @default.
- W3128013500 cites W2944766483 @default.
- W3128013500 cites W2945169717 @default.
- W3128013500 cites W2950929549 @default.
- W3128013500 cites W2951001370 @default.
- W3128013500 cites W2963900541 @default.
- W3128013500 cites W3038282378 @default.
- W3128013500 cites W3098163860 @default.
- W3128013500 cites W3103262232 @default.
- W3128013500 cites W2002288980 @default.
- W3128013500 doi "https://doi.org/10.3390/s21041292" @default.
- W3128013500 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/7918897" @default.
- W3128013500 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/33670257" @default.
- W3128013500 hasPublicationYear "2021" @default.
- W3128013500 type Work @default.
- W3128013500 sameAs 3128013500 @default.
- W3128013500 citedByCount "33" @default.
- W3128013500 countsByYear W31280135002021 @default.
- W3128013500 countsByYear W31280135002022 @default.
- W3128013500 countsByYear W31280135002023 @default.
- W3128013500 crossrefType "journal-article" @default.
- W3128013500 hasAuthorship W3128013500A5025841244 @default.
- W3128013500 hasAuthorship W3128013500A5075949977 @default.
- W3128013500 hasBestOaLocation W31280135001 @default.
- W3128013500 hasConcept C107457646 @default.
- W3128013500 hasConcept C119857082 @default.
- W3128013500 hasConcept C154945302 @default.
- W3128013500 hasConcept C188888258 @default.
- W3128013500 hasConcept C19766214 @default.
- W3128013500 hasConcept C19966478 @default.
- W3128013500 hasConcept C34413123 @default.
- W3128013500 hasConcept C41008148 @default.
- W3128013500 hasConcept C90509273 @default.
- W3128013500 hasConcept C94124525 @default.
- W3128013500 hasConcept C97541855 @default.