Matches in SemOpenAlex for { <https://semopenalex.org/work/W1809653203> ?p ?o ?g. }
- W1809653203 abstract "We study decision making in environments where the reward is only partially observed, but can be modeled as a function of an action and an observed context. This setting, known as contextual bandits, encompasses a wide variety of applications including health-care policy and Internet advertising. A central task is evaluation of a new policy given historic data consisting of contexts, actions and received rewards. The key challenge is that the past data typically does not faithfully represent proportions of actions taken by a new policy. Previous approaches rely either on models of rewards or models of the past policy. The former are plagued by a large bias whereas the latter have a large variance. In this work, we leverage the strength and overcome the weaknesses of the two approaches by applying the doubly robust technique to the problems of policy evaluation and optimization. We prove that this approach yields accurate value estimates when we have either a good (but not necessarily consistent) model of rewards or a good (but not necessarily consistent) model of past policy. Extensive empirical comparison demonstrates that the doubly robust approach uniformly improves over existing techniques, achieving both lower variance in value estimation and better policies. As such, we expect the doubly robust approach to become common practice." @default.
- W1809653203 created "2016-06-24" @default.
- W1809653203 creator A5005003250 @default.
- W1809653203 creator A5054850777 @default.
- W1809653203 creator A5089372170 @default.
- W1809653203 date "2011-03-23" @default.
- W1809653203 modified "2023-10-01" @default.
- W1809653203 title "Doubly Robust Policy Evaluation and Learning" @default.
- W1809653203 cites W1488797257 @default.
- W1809653203 cites W1990966354 @default.
- W1809653203 cites W2001947543 @default.
- W1809653203 cites W2020160576 @default.
- W1809653203 cites W2077902449 @default.
- W1809653203 cites W2121878111 @default.
- W1809653203 cites W2162651021 @default.
- W1809653203 cites W2168639902 @default.
- W1809653203 cites W2293743194 @default.
- W1809653203 cites W2519411794 @default.
- W1809653203 cites W2951403958 @default.
- W1809653203 cites W3120740533 @default.
- W1809653203 hasPublicationYear "2011" @default.
- W1809653203 type Work @default.
- W1809653203 sameAs 1809653203 @default.
- W1809653203 citedByCount "204" @default.
- W1809653203 countsByYear W18096532032012 @default.
- W1809653203 countsByYear W18096532032013 @default.
- W1809653203 countsByYear W18096532032014 @default.
- W1809653203 countsByYear W18096532032015 @default.
- W1809653203 countsByYear W18096532032016 @default.
- W1809653203 countsByYear W18096532032017 @default.
- W1809653203 countsByYear W18096532032018 @default.
- W1809653203 countsByYear W18096532032019 @default.
- W1809653203 countsByYear W18096532032020 @default.
- W1809653203 countsByYear W18096532032021 @default.
- W1809653203 crossrefType "posted-content" @default.
- W1809653203 hasAuthorship W1809653203A5005003250 @default.
- W1809653203 hasAuthorship W1809653203A5054850777 @default.
- W1809653203 hasAuthorship W1809653203A5089372170 @default.
- W1809653203 hasConcept C119857082 @default.
- W1809653203 hasConcept C121332964 @default.
- W1809653203 hasConcept C121955636 @default.
- W1809653203 hasConcept C136197465 @default.
- W1809653203 hasConcept C149782125 @default.
- W1809653203 hasConcept C151730666 @default.
- W1809653203 hasConcept C153083717 @default.
- W1809653203 hasConcept C154945302 @default.
- W1809653203 hasConcept C162324750 @default.
- W1809653203 hasConcept C187736073 @default.
- W1809653203 hasConcept C196083921 @default.
- W1809653203 hasConcept C2776291640 @default.
- W1809653203 hasConcept C2779343474 @default.
- W1809653203 hasConcept C2780451532 @default.
- W1809653203 hasConcept C2780791683 @default.
- W1809653203 hasConcept C41008148 @default.
- W1809653203 hasConcept C62520636 @default.
- W1809653203 hasConcept C86803240 @default.
- W1809653203 hasConcept C97541855 @default.
- W1809653203 hasConceptScore W1809653203C119857082 @default.
- W1809653203 hasConceptScore W1809653203C121332964 @default.
- W1809653203 hasConceptScore W1809653203C121955636 @default.
- W1809653203 hasConceptScore W1809653203C136197465 @default.
- W1809653203 hasConceptScore W1809653203C149782125 @default.
- W1809653203 hasConceptScore W1809653203C151730666 @default.
- W1809653203 hasConceptScore W1809653203C153083717 @default.
- W1809653203 hasConceptScore W1809653203C154945302 @default.
- W1809653203 hasConceptScore W1809653203C162324750 @default.
- W1809653203 hasConceptScore W1809653203C187736073 @default.
- W1809653203 hasConceptScore W1809653203C196083921 @default.
- W1809653203 hasConceptScore W1809653203C2776291640 @default.
- W1809653203 hasConceptScore W1809653203C2779343474 @default.
- W1809653203 hasConceptScore W1809653203C2780451532 @default.
- W1809653203 hasConceptScore W1809653203C2780791683 @default.
- W1809653203 hasConceptScore W1809653203C41008148 @default.
- W1809653203 hasConceptScore W1809653203C62520636 @default.
- W1809653203 hasConceptScore W1809653203C86803240 @default.
- W1809653203 hasConceptScore W1809653203C97541855 @default.
- W1809653203 hasLocation W18096532031 @default.
- W1809653203 hasOpenAccess W1809653203 @default.
- W1809653203 hasPrimaryLocation W18096532031 @default.
- W1809653203 hasRelatedWork W1514587017 @default.
- W1809653203 hasRelatedWork W1835900096 @default.
- W1809653203 hasRelatedWork W2001947543 @default.
- W1809653203 hasRelatedWork W2020160576 @default.
- W1809653203 hasRelatedWork W2039811614 @default.
- W1809653203 hasRelatedWork W2064903582 @default.
- W1809653203 hasRelatedWork W2098258765 @default.
- W1809653203 hasRelatedWork W2112420033 @default.
- W1809653203 hasRelatedWork W2119850747 @default.
- W1809653203 hasRelatedWork W2121863487 @default.
- W1809653203 hasRelatedWork W2122124659 @default.
- W1809653203 hasRelatedWork W2132917208 @default.
- W1809653203 hasRelatedWork W2137370054 @default.
- W1809653203 hasRelatedWork W2138909795 @default.
- W1809653203 hasRelatedWork W2150291618 @default.
- W1809653203 hasRelatedWork W2188353343 @default.
- W1809653203 hasRelatedWork W2962785510 @default.
- W1809653203 hasRelatedWork W2962802563 @default.
- W1809653203 hasRelatedWork W2963323139 @default.
- W1809653203 hasRelatedWork W2964068481 @default.
- W1809653203 isParatext "false" @default.