Matches in SemOpenAlex for { <https://semopenalex.org/work/W1552327263> ?p ?o ?g. }
Showing items 1 to 81 of 81 with 100 items per page.
- W1552327263 endingPage "118" @default.
- W1552327263 startingPage "109" @default.
- W1552327263 abstract "In this paper we present two substantial extensions of Neural Rewards Regression (NRR) [1]. In order to give a less biased estimator of the Bellman Residual and to facilitate the regression character of NRR, we incorporate an improved, Auxiliared Bellman Residual [2] and provide, to the best of our knowledge, the first Neural Network based implementation of the novel Bellman Residual minimisation technique. Furthermore, we extend NRR to Policy Gradient Neural Rewards Regression (PGNRR), where the strategy is directly encoded by a policy network. PGNRR profits from both the data-efficiency of the Rewards Regression approach and the directness of policy search methods. PGNRR further overcomes a crucial drawback of NRR as it extends the accordant problem class considerably by the applicability of continuous action spaces." @default.
- W1552327263 created "2016-06-24" @default.
- W1552327263 creator A5014856404 @default.
- W1552327263 creator A5035246650 @default.
- W1552327263 creator A5072736085 @default.
- W1552327263 date "2007-01-01" @default.
- W1552327263 modified "2023-10-16" @default.
- W1552327263 title "Improving Optimality of Neural Rewards Regression for Data-Efficient Batch Near-Optimal Policy Identification" @default.
- W1552327263 cites W1543217731 @default.
- W1552327263 cites W1557941966 @default.
- W1552327263 cites W1646707810 @default.
- W1552327263 cites W166862392 @default.
- W1552327263 cites W2006278461 @default.
- W1552327263 cites W2076118331 @default.
- W1552327263 cites W2139418546 @default.
- W1552327263 cites W2154890045 @default.
- W1552327263 cites W4236355931 @default.
- W1552327263 cites W4253020087 @default.
- W1552327263 doi "https://doi.org/10.1007/978-3-540-74690-4_12" @default.
- W1552327263 hasPublicationYear "2007" @default.
- W1552327263 type Work @default.
- W1552327263 sameAs 1552327263 @default.
- W1552327263 citedByCount "11" @default.
- W1552327263 countsByYear W15523272632013 @default.
- W1552327263 countsByYear W15523272632017 @default.
- W1552327263 countsByYear W15523272632018 @default.
- W1552327263 countsByYear W15523272632020 @default.
- W1552327263 countsByYear W15523272632021 @default.
- W1552327263 crossrefType "book-chapter" @default.
- W1552327263 hasAuthorship W1552327263A5014856404 @default.
- W1552327263 hasAuthorship W1552327263A5035246650 @default.
- W1552327263 hasAuthorship W1552327263A5072736085 @default.
- W1552327263 hasConcept C105795698 @default.
- W1552327263 hasConcept C11413529 @default.
- W1552327263 hasConcept C116834253 @default.
- W1552327263 hasConcept C119857082 @default.
- W1552327263 hasConcept C126255220 @default.
- W1552327263 hasConcept C152877465 @default.
- W1552327263 hasConcept C154945302 @default.
- W1552327263 hasConcept C155512373 @default.
- W1552327263 hasConcept C185429906 @default.
- W1552327263 hasConcept C33923547 @default.
- W1552327263 hasConcept C41008148 @default.
- W1552327263 hasConcept C50644808 @default.
- W1552327263 hasConcept C59822182 @default.
- W1552327263 hasConcept C83546350 @default.
- W1552327263 hasConcept C86803240 @default.
- W1552327263 hasConceptScore W1552327263C105795698 @default.
- W1552327263 hasConceptScore W1552327263C11413529 @default.
- W1552327263 hasConceptScore W1552327263C116834253 @default.
- W1552327263 hasConceptScore W1552327263C119857082 @default.
- W1552327263 hasConceptScore W1552327263C126255220 @default.
- W1552327263 hasConceptScore W1552327263C152877465 @default.
- W1552327263 hasConceptScore W1552327263C154945302 @default.
- W1552327263 hasConceptScore W1552327263C155512373 @default.
- W1552327263 hasConceptScore W1552327263C185429906 @default.
- W1552327263 hasConceptScore W1552327263C33923547 @default.
- W1552327263 hasConceptScore W1552327263C41008148 @default.
- W1552327263 hasConceptScore W1552327263C50644808 @default.
- W1552327263 hasConceptScore W1552327263C59822182 @default.
- W1552327263 hasConceptScore W1552327263C83546350 @default.
- W1552327263 hasConceptScore W1552327263C86803240 @default.
- W1552327263 hasLocation W15523272631 @default.
- W1552327263 hasOpenAccess W1552327263 @default.
- W1552327263 hasPrimaryLocation W15523272631 @default.
- W1552327263 hasRelatedWork W1970158984 @default.
- W1552327263 hasRelatedWork W2012241321 @default.
- W1552327263 hasRelatedWork W2069198726 @default.
- W1552327263 hasRelatedWork W2072034916 @default.
- W1552327263 hasRelatedWork W2135164884 @default.
- W1552327263 hasRelatedWork W2358152617 @default.
- W1552327263 hasRelatedWork W2359645249 @default.
- W1552327263 hasRelatedWork W2389674502 @default.
- W1552327263 hasRelatedWork W2807555463 @default.
- W1552327263 hasRelatedWork W3135327272 @default.
- W1552327263 isParatext "false" @default.
- W1552327263 isRetracted "false" @default.
- W1552327263 magId "1552327263" @default.
- W1552327263 workType "book-chapter" @default.