Matches in SemOpenAlex for { <https://semopenalex.org/work/W3088362626> ?p ?o ?g. }
Showing items 1 to 94 of 94 with 100 items per page.
- W3088362626 abstract "Many reinforcement learning tasks can benefit from explicit planning based on an internal model of the environment. Previously, such planning components have been incorporated through a neural network that partially aligns with the computational graph of value iteration. Such networks have so far been focused on restrictive environments (e.g. grid-worlds), and modelled the planning procedure only indirectly. We relax these constraints, proposing a graph neural network (GNN) that executes the value iteration (VI) algorithm, across arbitrary environment models, with direct supervision on the intermediate steps of VI. The results indicate that GNNs are able to model value iteration accurately, recovering favourable metrics and policies across a variety of out-of-distribution tests. This suggests that GNN executors with strong supervision are a viable component within deep reinforcement learning systems." @default.
- W3088362626 created "2020-10-01" @default.
- W3088362626 creator A5013115798 @default.
- W3088362626 creator A5067918843 @default.
- W3088362626 creator A5077231313 @default.
- W3088362626 date "2020-09-26" @default.
- W3088362626 modified "2023-09-27" @default.
- W3088362626 title "Graph neural induction of value iteration." @default.
- W3088362626 cites W2002470750 @default.
- W3088362626 cites W2008620264 @default.
- W3088362626 cites W2008857988 @default.
- W3088362626 cites W2076337359 @default.
- W3088362626 cites W2100677568 @default.
- W3088362626 cites W2145339207 @default.
- W3088362626 cites W2258731934 @default.
- W3088362626 cites W2606780347 @default.
- W3088362626 cites W2738675347 @default.
- W3088362626 cites W2766453196 @default.
- W3088362626 cites W2947795716 @default.
- W3088362626 cites W2963072115 @default.
- W3088362626 cites W2995268324 @default.
- W3088362626 cites W3016124664 @default.
- W3088362626 cites W3017835029 @default.
- W3088362626 cites W3023380900 @default.
- W3088362626 cites W3118210634 @default.
- W3088362626 hasPublicationYear "2020" @default.
- W3088362626 type Work @default.
- W3088362626 sameAs 3088362626 @default.
- W3088362626 citedByCount "5" @default.
- W3088362626 countsByYear W30883626262020 @default.
- W3088362626 countsByYear W30883626262021 @default.
- W3088362626 crossrefType "posted-content" @default.
- W3088362626 hasAuthorship W3088362626A5013115798 @default.
- W3088362626 hasAuthorship W3088362626A5067918843 @default.
- W3088362626 hasAuthorship W3088362626A5077231313 @default.
- W3088362626 hasConcept C119857082 @default.
- W3088362626 hasConcept C121332964 @default.
- W3088362626 hasConcept C126255220 @default.
- W3088362626 hasConcept C132525143 @default.
- W3088362626 hasConcept C136197465 @default.
- W3088362626 hasConcept C154945302 @default.
- W3088362626 hasConcept C168167062 @default.
- W3088362626 hasConcept C187691185 @default.
- W3088362626 hasConcept C2524010 @default.
- W3088362626 hasConcept C2776291640 @default.
- W3088362626 hasConcept C33923547 @default.
- W3088362626 hasConcept C41008148 @default.
- W3088362626 hasConcept C50644808 @default.
- W3088362626 hasConcept C80444323 @default.
- W3088362626 hasConcept C97355855 @default.
- W3088362626 hasConcept C97541855 @default.
- W3088362626 hasConceptScore W3088362626C119857082 @default.
- W3088362626 hasConceptScore W3088362626C121332964 @default.
- W3088362626 hasConceptScore W3088362626C126255220 @default.
- W3088362626 hasConceptScore W3088362626C132525143 @default.
- W3088362626 hasConceptScore W3088362626C136197465 @default.
- W3088362626 hasConceptScore W3088362626C154945302 @default.
- W3088362626 hasConceptScore W3088362626C168167062 @default.
- W3088362626 hasConceptScore W3088362626C187691185 @default.
- W3088362626 hasConceptScore W3088362626C2524010 @default.
- W3088362626 hasConceptScore W3088362626C2776291640 @default.
- W3088362626 hasConceptScore W3088362626C33923547 @default.
- W3088362626 hasConceptScore W3088362626C41008148 @default.
- W3088362626 hasConceptScore W3088362626C50644808 @default.
- W3088362626 hasConceptScore W3088362626C80444323 @default.
- W3088362626 hasConceptScore W3088362626C97355855 @default.
- W3088362626 hasConceptScore W3088362626C97541855 @default.
- W3088362626 hasLocation W30883626261 @default.
- W3088362626 hasOpenAccess W3088362626 @default.
- W3088362626 hasPrimaryLocation W30883626261 @default.
- W3088362626 hasRelatedWork W1533269661 @default.
- W3088362626 hasRelatedWork W1538918057 @default.
- W3088362626 hasRelatedWork W1541730457 @default.
- W3088362626 hasRelatedWork W1552148478 @default.
- W3088362626 hasRelatedWork W1588577808 @default.
- W3088362626 hasRelatedWork W2115951219 @default.
- W3088362626 hasRelatedWork W2160808139 @default.
- W3088362626 hasRelatedWork W2353658043 @default.
- W3088362626 hasRelatedWork W2600009170 @default.
- W3088362626 hasRelatedWork W2899685447 @default.
- W3088362626 hasRelatedWork W2950141689 @default.
- W3088362626 hasRelatedWork W2994477615 @default.
- W3088362626 hasRelatedWork W2996347495 @default.
- W3088362626 hasRelatedWork W3021208093 @default.
- W3088362626 hasRelatedWork W3116030074 @default.
- W3088362626 hasRelatedWork W3135875965 @default.
- W3088362626 hasRelatedWork W3136458726 @default.
- W3088362626 hasRelatedWork W3138738770 @default.
- W3088362626 hasRelatedWork W3184838181 @default.
- W3088362626 hasRelatedWork W2083963846 @default.
- W3088362626 isParatext "false" @default.
- W3088362626 isRetracted "false" @default.
- W3088362626 magId "3088362626" @default.
- W3088362626 workType "article" @default.