Matches in SemOpenAlex for { <https://semopenalex.org/work/W2811111819> ?p ?o ?g. }
- W2811111819 abstract "One of the key challenges in applying reinforcement learning to real-life problems is that the amount of train-and-error required to learn a good policy increases drastically as the task becomes complex. One potential solution to this problem is to combine reinforcement learning with automated symbol planning and utilize prior knowledge on the domain. However, existing methods have limitations in their applicability and expressiveness. In this paper we propose a hierarchical reinforcement learning method based on abductive symbolic planning. The planner can deal with user-defined evaluation functions and is not based on the Herbrand theorem. Therefore it can utilize prior knowledge of the rewards and can work in a domain where the state space is unknown. We demonstrate empirically that our architecture significantly improves learning efficiency with respect to the amount of training examples on the evaluation domain, in which the state space is unknown and there exist multiple goals." @default.
- W2811111819 created "2018-07-10" @default.
- W2811111819 creator A5046874748 @default.
- W2811111819 creator A5064113904 @default.
- W2811111819 creator A5082122978 @default.
- W2811111819 date "2018-06-28" @default.
- W2811111819 modified "2023-09-23" @default.
- W2811111819 title "Hierarchical Reinforcement Learning with Abductive Planning." @default.
- W2811111819 cites W10344003 @default.
- W2811111819 cites W1591434728 @default.
- W2811111819 cites W1977970897 @default.
- W2811111819 cites W2010087056 @default.
- W2811111819 cites W2099587183 @default.
- W2811111819 cites W2109462874 @default.
- W2811111819 cites W2111316871 @default.
- W2811111819 cites W2134439162 @default.
- W2811111819 cites W2158937425 @default.
- W2811111819 cites W2168477347 @default.
- W2811111819 cites W2203521679 @default.
- W2811111819 cites W2222874733 @default.
- W2811111819 cites W2240338485 @default.
- W2811111819 cites W2335959470 @default.
- W2811111819 cites W2405011436 @default.
- W2811111819 cites W2568315666 @default.
- W2811111819 cites W2729184120 @default.
- W2811111819 cites W2753655233 @default.
- W2811111819 cites W90924478 @default.
- W2811111819 cites W2251681624 @default.
- W2811111819 hasPublicationYear "2018" @default.
- W2811111819 type Work @default.
- W2811111819 sameAs 2811111819 @default.
- W2811111819 citedByCount "4" @default.
- W2811111819 countsByYear W28111118192019 @default.
- W2811111819 countsByYear W28111118192020 @default.
- W2811111819 countsByYear W28111118192021 @default.
- W2811111819 crossrefType "posted-content" @default.
- W2811111819 hasAuthorship W2811111819A5046874748 @default.
- W2811111819 hasAuthorship W2811111819A5064113904 @default.
- W2811111819 hasAuthorship W2811111819A5082122978 @default.
- W2811111819 hasConcept C105795698 @default.
- W2811111819 hasConcept C111919701 @default.
- W2811111819 hasConcept C119857082 @default.
- W2811111819 hasConcept C127413603 @default.
- W2811111819 hasConcept C134306372 @default.
- W2811111819 hasConcept C134400042 @default.
- W2811111819 hasConcept C154945302 @default.
- W2811111819 hasConcept C199360897 @default.
- W2811111819 hasConcept C201995342 @default.
- W2811111819 hasConcept C207685749 @default.
- W2811111819 hasConcept C26517878 @default.
- W2811111819 hasConcept C2776999362 @default.
- W2811111819 hasConcept C2778572836 @default.
- W2811111819 hasConcept C2780451532 @default.
- W2811111819 hasConcept C33923547 @default.
- W2811111819 hasConcept C36503486 @default.
- W2811111819 hasConcept C38652104 @default.
- W2811111819 hasConcept C41008148 @default.
- W2811111819 hasConcept C72434380 @default.
- W2811111819 hasConcept C97541855 @default.
- W2811111819 hasConceptScore W2811111819C105795698 @default.
- W2811111819 hasConceptScore W2811111819C111919701 @default.
- W2811111819 hasConceptScore W2811111819C119857082 @default.
- W2811111819 hasConceptScore W2811111819C127413603 @default.
- W2811111819 hasConceptScore W2811111819C134306372 @default.
- W2811111819 hasConceptScore W2811111819C134400042 @default.
- W2811111819 hasConceptScore W2811111819C154945302 @default.
- W2811111819 hasConceptScore W2811111819C199360897 @default.
- W2811111819 hasConceptScore W2811111819C201995342 @default.
- W2811111819 hasConceptScore W2811111819C207685749 @default.
- W2811111819 hasConceptScore W2811111819C26517878 @default.
- W2811111819 hasConceptScore W2811111819C2776999362 @default.
- W2811111819 hasConceptScore W2811111819C2778572836 @default.
- W2811111819 hasConceptScore W2811111819C2780451532 @default.
- W2811111819 hasConceptScore W2811111819C33923547 @default.
- W2811111819 hasConceptScore W2811111819C36503486 @default.
- W2811111819 hasConceptScore W2811111819C38652104 @default.
- W2811111819 hasConceptScore W2811111819C41008148 @default.
- W2811111819 hasConceptScore W2811111819C72434380 @default.
- W2811111819 hasConceptScore W2811111819C97541855 @default.
- W2811111819 hasLocation W28111118191 @default.
- W2811111819 hasOpenAccess W2811111819 @default.
- W2811111819 hasPrimaryLocation W28111118191 @default.
- W2811111819 hasRelatedWork W2025448855 @default.
- W2811111819 hasRelatedWork W2103064945 @default.
- W2811111819 hasRelatedWork W2109910161 @default.
- W2811111819 hasRelatedWork W2158150115 @default.
- W2811111819 hasRelatedWork W2304525083 @default.
- W2811111819 hasRelatedWork W2399554456 @default.
- W2811111819 hasRelatedWork W276460289 @default.
- W2811111819 hasRelatedWork W2902024803 @default.
- W2811111819 hasRelatedWork W2945709907 @default.
- W2811111819 hasRelatedWork W2946824041 @default.
- W2811111819 hasRelatedWork W2980297462 @default.
- W2811111819 hasRelatedWork W298069310 @default.
- W2811111819 hasRelatedWork W2989941899 @default.
- W2811111819 hasRelatedWork W2993149128 @default.
- W2811111819 hasRelatedWork W3005888089 @default.
- W2811111819 hasRelatedWork W3082320588 @default.
- W2811111819 hasRelatedWork W3084024636 @default.
- W2811111819 hasRelatedWork W3201666735 @default.