Matches in SemOpenAlex for { <https://semopenalex.org/work/W3093848209> ?p ?o ?g. }
- W3093848209 abstract "The goal of off-policy evaluation (OPE) is to evaluate a new policy using historical data obtained via a behavior policy. However, because the contextual bandit algorithm updates the policy based on past observations, the samples are not independent and identically distributed (i.i.d.). This paper tackles this problem by constructing an estimator from a martingale difference sequence (MDS) for the dependent samples. In the data-generating process, we do not assume the convergence of the policy, but the policy uses the same conditional probability of choosing an action during a certain period. Then, we derive an asymptotically normal estimator of the value of an evaluation policy. As another advantage of our method, the batch-based approach simultaneously solves the deficient support problem. Using benchmark and real-world datasets, we experimentally confirm the effectiveness of the proposed method." @default.
- W3093848209 created "2020-10-29" @default.
- W3093848209 creator A5083089288 @default.
- W3093848209 creator A5086434718 @default.
- W3093848209 date "2020-10-23" @default.
- W3093848209 modified "2023-09-27" @default.
- W3093848209 title "Off-Policy Evaluation of Bandit Algorithm from Dependent Samples under Batch Update Policy." @default.
- W3093848209 cites W131280298 @default.
- W3093848209 cites W1487320471 @default.
- W3093848209 cites W1835900096 @default.
- W3093848209 cites W1958090791 @default.
- W3093848209 cites W1973240854 @default.
- W3093848209 cites W1977314434 @default.
- W3093848209 cites W2020160576 @default.
- W3093848209 cites W2039811614 @default.
- W3093848209 cites W2079597004 @default.
- W3093848209 cites W2107822634 @default.
- W3093848209 cites W2112420033 @default.
- W3093848209 cites W2118502261 @default.
- W3093848209 cites W2121863487 @default.
- W3093848209 cites W2123697620 @default.
- W3093848209 cites W2288211471 @default.
- W3093848209 cites W2622003161 @default.
- W3093848209 cites W2796930163 @default.
- W3093848209 cites W2890951405 @default.
- W3093848209 cites W2946442465 @default.
- W3093848209 cites W2962785510 @default.
- W3093848209 cites W2963323139 @default.
- W3093848209 cites W2964297722 @default.
- W3093848209 cites W2970045071 @default.
- W3093848209 cites W2987109054 @default.
- W3093848209 cites W2996233539 @default.
- W3093848209 cites W3004901471 @default.
- W3093848209 cites W3005539805 @default.
- W3093848209 cites W3006948422 @default.
- W3093848209 cites W3034339257 @default.
- W3093848209 cites W3099117208 @default.
- W3093848209 cites W3122193054 @default.
- W3093848209 cites W3146166473 @default.
- W3093848209 cites W67506904 @default.
- W3093848209 hasPublicationYear "2020" @default.
- W3093848209 type Work @default.
- W3093848209 sameAs 3093848209 @default.
- W3093848209 citedByCount "2" @default.
- W3093848209 countsByYear W30938482092020 @default.
- W3093848209 countsByYear W30938482092021 @default.
- W3093848209 crossrefType "posted-content" @default.
- W3093848209 hasAuthorship W3093848209A5083089288 @default.
- W3093848209 hasAuthorship W3093848209A5086434718 @default.
- W3093848209 hasConcept C105795698 @default.
- W3093848209 hasConcept C111919701 @default.
- W3093848209 hasConcept C11413529 @default.
- W3093848209 hasConcept C122123141 @default.
- W3093848209 hasConcept C126255220 @default.
- W3093848209 hasConcept C13280743 @default.
- W3093848209 hasConcept C141513077 @default.
- W3093848209 hasConcept C162324750 @default.
- W3093848209 hasConcept C185429906 @default.
- W3093848209 hasConcept C185798385 @default.
- W3093848209 hasConcept C205649164 @default.
- W3093848209 hasConcept C2777303404 @default.
- W3093848209 hasConcept C28826006 @default.
- W3093848209 hasConcept C33923547 @default.
- W3093848209 hasConcept C41008148 @default.
- W3093848209 hasConcept C48406656 @default.
- W3093848209 hasConcept C50522688 @default.
- W3093848209 hasConcept C98045186 @default.
- W3093848209 hasConceptScore W3093848209C105795698 @default.
- W3093848209 hasConceptScore W3093848209C111919701 @default.
- W3093848209 hasConceptScore W3093848209C11413529 @default.
- W3093848209 hasConceptScore W3093848209C122123141 @default.
- W3093848209 hasConceptScore W3093848209C126255220 @default.
- W3093848209 hasConceptScore W3093848209C13280743 @default.
- W3093848209 hasConceptScore W3093848209C141513077 @default.
- W3093848209 hasConceptScore W3093848209C162324750 @default.
- W3093848209 hasConceptScore W3093848209C185429906 @default.
- W3093848209 hasConceptScore W3093848209C185798385 @default.
- W3093848209 hasConceptScore W3093848209C205649164 @default.
- W3093848209 hasConceptScore W3093848209C2777303404 @default.
- W3093848209 hasConceptScore W3093848209C28826006 @default.
- W3093848209 hasConceptScore W3093848209C33923547 @default.
- W3093848209 hasConceptScore W3093848209C41008148 @default.
- W3093848209 hasConceptScore W3093848209C48406656 @default.
- W3093848209 hasConceptScore W3093848209C50522688 @default.
- W3093848209 hasConceptScore W3093848209C98045186 @default.
- W3093848209 hasLocation W30938482091 @default.
- W3093848209 hasOpenAccess W3093848209 @default.
- W3093848209 hasPrimaryLocation W30938482091 @default.
- W3093848209 hasRelatedWork W191658262 @default.
- W3093848209 hasRelatedWork W2010356296 @default.
- W3093848209 hasRelatedWork W2606656360 @default.
- W3093848209 hasRelatedWork W2756036450 @default.
- W3093848209 hasRelatedWork W2765274790 @default.
- W3093848209 hasRelatedWork W277306631 @default.
- W3093848209 hasRelatedWork W2951886426 @default.
- W3093848209 hasRelatedWork W2952514467 @default.
- W3093848209 hasRelatedWork W2952668684 @default.
- W3093848209 hasRelatedWork W2963057120 @default.
- W3093848209 hasRelatedWork W2996181810 @default.
- W3093848209 hasRelatedWork W3005539805 @default.
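
The abstract in this record describes logged bandit data in which the behavior policy keeps the same action probabilities within a period and changes only between periods. As a rough illustration of that data layout, the sketch below computes a plain inverse probability weighting (IPW) estimate of an evaluation policy's value, using each batch's own logged action probabilities. It is not the estimator proposed in the paper, and it ignores contexts for brevity. The names `batch_ipw_value`, `batches`, and `eval_policy` are hypothetical and do not come from the record.

```python
def batch_ipw_value(batches, eval_policy):
    """Plain IPW value estimate over batched bandit logs (illustrative only).

    batches: list of (behavior_probs, samples) pairs, where behavior_probs[a]
             is the probability the behavior policy assigned to action a
             throughout that batch, and samples is a list of (action, reward).
    eval_policy: eval_policy[a] is the evaluation policy's probability of a.
    Assumption: no contexts, and behavior_probs[a] > 0 for every logged action.
    """
    total = 0.0
    count = 0
    for behavior_probs, samples in batches:
        for action, reward in samples:
            # Reweight each reward by the ratio of evaluation to behavior
            # probability, using the probabilities fixed for this batch.
            weight = eval_policy[action] / behavior_probs[action]
            total += weight * reward
            count += 1
    if count == 0:
        raise ValueError("no logged samples")
    return total / count


# Two hand-made batches; the behavior policy changes between them.
batches = [
    ([0.5, 0.5], [(0, 1.0), (1, 0.0)]),
    ([0.25, 0.75], [(1, 1.0), (1, 1.0)]),
]
eval_policy = [0.5, 0.5]
estimate = batch_ipw_value(batches, eval_policy)
print(estimate)
```

Keeping the behavior probabilities at the batch level, rather than per sample, mirrors the batch-update setting in the abstract: every sample in a batch shares one set of logged probabilities.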