Matches in SemOpenAlex for { <https://semopenalex.org/work/W4367000266> ?p ?o ?g. }
Showing items 1 to 69 of 69 with 100 items per page.
- W4367000266 abstract "Reinforcement Learning(RL) has achieved tremendous development in recent years, but still faces significant obstacles in addressing complex real-life problems due to the issues of poor system generalization, low sample efficiency as well as safety and interpretability concerns. The core reason underlying such dilemmas can be attributed to the fact that most of the work has focused on the computational aspect of value functions or policies using a representational model to describe atomic components of rewards, states and actions etc, thus neglecting the rich high-level declarative domain knowledge of facts, relations and rules that can be either provided a priori or acquired through reasoning over time. Recently, there has been a rapidly growing interest in the use of Knowledge Representation and Reasoning(KRR) methods, usually using logical languages, to enable more abstract representation and efficient learning in RL. In this survey, we provide a preliminary overview on these endeavors that leverage the strengths of KRR to help solving various problems in RL, and discuss the challenging open problems and possible directions for future work in this area." @default.
- W4367000266 created "2023-04-27" @default.
- W4367000266 creator A5011500179 @default.
- W4367000266 creator A5017811036 @default.
- W4367000266 creator A5027600524 @default.
- W4367000266 creator A5078337292 @default.
- W4367000266 creator A5087138998 @default.
- W4367000266 date "2023-04-24" @default.
- W4367000266 modified "2023-09-25" @default.
- W4367000266 title "Reinforcement Learning with Knowledge Representation and Reasoning: A Brief Survey" @default.
- W4367000266 doi "https://doi.org/10.48550/arxiv.2304.12090" @default.
- W4367000266 hasPublicationYear "2023" @default.
- W4367000266 type Work @default.
- W4367000266 citedByCount "0" @default.
- W4367000266 crossrefType "posted-content" @default.
- W4367000266 hasAuthorship W4367000266A5011500179 @default.
- W4367000266 hasAuthorship W4367000266A5017811036 @default.
- W4367000266 hasAuthorship W4367000266A5027600524 @default.
- W4367000266 hasAuthorship W4367000266A5078337292 @default.
- W4367000266 hasAuthorship W4367000266A5087138998 @default.
- W4367000266 hasBestOaLocation W43670002661 @default.
- W4367000266 hasConcept C111472728 @default.
- W4367000266 hasConcept C138885662 @default.
- W4367000266 hasConcept C153083717 @default.
- W4367000266 hasConcept C154945302 @default.
- W4367000266 hasConcept C161301231 @default.
- W4367000266 hasConcept C162324750 @default.
- W4367000266 hasConcept C177148314 @default.
- W4367000266 hasConcept C17744445 @default.
- W4367000266 hasConcept C199539241 @default.
- W4367000266 hasConcept C2776359362 @default.
- W4367000266 hasConcept C2781067378 @default.
- W4367000266 hasConcept C41008148 @default.
- W4367000266 hasConcept C539667460 @default.
- W4367000266 hasConcept C75553542 @default.
- W4367000266 hasConcept C94625758 @default.
- W4367000266 hasConcept C97541855 @default.
- W4367000266 hasConceptScore W4367000266C111472728 @default.
- W4367000266 hasConceptScore W4367000266C138885662 @default.
- W4367000266 hasConceptScore W4367000266C153083717 @default.
- W4367000266 hasConceptScore W4367000266C154945302 @default.
- W4367000266 hasConceptScore W4367000266C161301231 @default.
- W4367000266 hasConceptScore W4367000266C162324750 @default.
- W4367000266 hasConceptScore W4367000266C177148314 @default.
- W4367000266 hasConceptScore W4367000266C17744445 @default.
- W4367000266 hasConceptScore W4367000266C199539241 @default.
- W4367000266 hasConceptScore W4367000266C2776359362 @default.
- W4367000266 hasConceptScore W4367000266C2781067378 @default.
- W4367000266 hasConceptScore W4367000266C41008148 @default.
- W4367000266 hasConceptScore W4367000266C539667460 @default.
- W4367000266 hasConceptScore W4367000266C75553542 @default.
- W4367000266 hasConceptScore W4367000266C94625758 @default.
- W4367000266 hasConceptScore W4367000266C97541855 @default.
- W4367000266 hasLocation W43670002661 @default.
- W4367000266 hasOpenAccess W4367000266 @default.
- W4367000266 hasPrimaryLocation W43670002661 @default.
- W4367000266 hasRelatedWork W1889934247 @default.
- W4367000266 hasRelatedWork W1965020329 @default.
- W4367000266 hasRelatedWork W2101885143 @default.
- W4367000266 hasRelatedWork W2135290995 @default.
- W4367000266 hasRelatedWork W2766672297 @default.
- W4367000266 hasRelatedWork W2807340089 @default.
- W4367000266 hasRelatedWork W4281729806 @default.
- W4367000266 hasRelatedWork W4289856193 @default.
- W4367000266 hasRelatedWork W4296563273 @default.
- W4367000266 hasRelatedWork W4315778373 @default.
- W4367000266 isParatext "false" @default.
- W4367000266 isRetracted "false" @default.
- W4367000266 workType "article" @default.