Matches in SemOpenAlex for { <https://semopenalex.org/work/W2994838927> ?p ?o ?g. }
- W2994838927 abstract "Infinite horizon off-policy policy evaluation is a highly challenging task due to the excessively large variance of typical importance sampling (IS) estimators. Recently, Liu et al. (2018) proposed an approach that significantly reduces the variance of infinite-horizon off-policy evaluation by estimating the stationary density ratio, but at the cost of introducing potentially high risks due to the error in density ratio estimation. In this paper, we develop a bias-reduced augmentation of their method, which can take advantage of a learned value function to obtain higher accuracy. Our method is doubly robust in that the bias vanishes when either the density ratio or value function estimation is perfect. In general, when either of them is accurate, the bias can also be reduced. Both theoretical and empirical results show that our method yields significant advantages over previous methods." @default.
- W2994838927 created "2019-12-26" @default.
- W2994838927 creator A5005799900 @default.
- W2994838927 creator A5048079999 @default.
- W2994838927 creator A5054850777 @default.
- W2994838927 creator A5067027007 @default.
- W2994838927 creator A5074605808 @default.
- W2994838927 date "2020-04-30" @default.
- W2994838927 modified "2023-09-24" @default.
- W2994838927 title "Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation" @default.
- W2994838927 cites W1745373831 @default.
- W2994838927 cites W191658262 @default.
- W2994838927 cites W1971712327 @default.
- W2994838927 cites W2069739265 @default.
- W2994838927 cites W2098432798 @default.
- W2994838927 cites W2119567691 @default.
- W2994838927 cites W2122124659 @default.
- W2994838927 cites W2134689794 @default.
- W2994838927 cites W2138909795 @default.
- W2994838927 cites W2312609093 @default.
- W2994838927 cites W2395162158 @default.
- W2994838927 cites W242065599 @default.
- W2994838927 cites W2484199440 @default.
- W2994838927 cites W2562316362 @default.
- W2994838927 cites W2592754049 @default.
- W2994838927 cites W2605323209 @default.
- W2994838927 cites W2806985155 @default.
- W2994838927 cites W2904789544 @default.
- W2994838927 cites W2936304709 @default.
- W2994838927 cites W2945624305 @default.
- W2994838927 cites W2962785510 @default.
- W2994838927 cites W2962802563 @default.
- W2994838927 cites W2963323139 @default.
- W2994838927 cites W2963882293 @default.
- W2994838927 cites W2964055673 @default.
- W2994838927 cites W2964068481 @default.
- W2994838927 cites W2964297722 @default.
- W2994838927 cites W2971026276 @default.
- W2994838927 cites W2972536280 @default.
- W2994838927 cites W2991522342 @default.
- W2994838927 cites W3085521361 @default.
- W2994838927 hasPublicationYear "2020" @default.
- W2994838927 type Work @default.
- W2994838927 sameAs 2994838927 @default.
- W2994838927 citedByCount "20" @default.
- W2994838927 countsByYear W29948389272019 @default.
- W2994838927 countsByYear W29948389272020 @default.
- W2994838927 countsByYear W29948389272021 @default.
- W2994838927 crossrefType "proceedings-article" @default.
- W2994838927 hasAuthorship W2994838927A5005799900 @default.
- W2994838927 hasAuthorship W2994838927A5048079999 @default.
- W2994838927 hasAuthorship W2994838927A5054850777 @default.
- W2994838927 hasAuthorship W2994838927A5067027007 @default.
- W2994838927 hasAuthorship W2994838927A5074605808 @default.
- W2994838927 hasConcept C104317684 @default.
- W2994838927 hasConcept C105795698 @default.
- W2994838927 hasConcept C111335779 @default.
- W2994838927 hasConcept C121955636 @default.
- W2994838927 hasConcept C126255220 @default.
- W2994838927 hasConcept C14036430 @default.
- W2994838927 hasConcept C149782125 @default.
- W2994838927 hasConcept C159176650 @default.
- W2994838927 hasConcept C162324750 @default.
- W2994838927 hasConcept C185429906 @default.
- W2994838927 hasConcept C185592680 @default.
- W2994838927 hasConcept C19499675 @default.
- W2994838927 hasConcept C196083921 @default.
- W2994838927 hasConcept C2524010 @default.
- W2994838927 hasConcept C2776291640 @default.
- W2994838927 hasConcept C33923547 @default.
- W2994838927 hasConcept C41008148 @default.
- W2994838927 hasConcept C55493867 @default.
- W2994838927 hasConcept C62644790 @default.
- W2994838927 hasConcept C63479239 @default.
- W2994838927 hasConcept C78458016 @default.
- W2994838927 hasConcept C86803240 @default.
- W2994838927 hasConceptScore W2994838927C104317684 @default.
- W2994838927 hasConceptScore W2994838927C105795698 @default.
- W2994838927 hasConceptScore W2994838927C111335779 @default.
- W2994838927 hasConceptScore W2994838927C121955636 @default.
- W2994838927 hasConceptScore W2994838927C126255220 @default.
- W2994838927 hasConceptScore W2994838927C14036430 @default.
- W2994838927 hasConceptScore W2994838927C149782125 @default.
- W2994838927 hasConceptScore W2994838927C159176650 @default.
- W2994838927 hasConceptScore W2994838927C162324750 @default.
- W2994838927 hasConceptScore W2994838927C185429906 @default.
- W2994838927 hasConceptScore W2994838927C185592680 @default.
- W2994838927 hasConceptScore W2994838927C19499675 @default.
- W2994838927 hasConceptScore W2994838927C196083921 @default.
- W2994838927 hasConceptScore W2994838927C2524010 @default.
- W2994838927 hasConceptScore W2994838927C2776291640 @default.
- W2994838927 hasConceptScore W2994838927C33923547 @default.
- W2994838927 hasConceptScore W2994838927C41008148 @default.
- W2994838927 hasConceptScore W2994838927C55493867 @default.
- W2994838927 hasConceptScore W2994838927C62644790 @default.
- W2994838927 hasConceptScore W2994838927C63479239 @default.
- W2994838927 hasConceptScore W2994838927C78458016 @default.
- W2994838927 hasConceptScore W2994838927C86803240 @default.
- W2994838927 hasLocation W29948389271 @default.
- W2994838927 hasOpenAccess W2994838927 @default.