Matches in SemOpenAlex for { <https://semopenalex.org/work/W3128736802> ?p ?o ?g. }
Showing items 1 to 95 of
95
with 100 items per page.
- W3128736802 abstract "This paper explores multi-armed bandit (MAB) strategies in very short horizon scenarios, i.e., when the bandit strategy is only allowed very few interactions with the environment. This is an understudied setting in the MAB literature with many applications in the context of games, such as player modeling. Specifically, we pursue three different ideas. First, we explore the use of regression oracles, which replace the simple average used in strategies such as epsilon-greedy with linear regression models. Second, we examine different exploration patterns such as forced exploration phases. Finally, we introduce a new variant of the UCB1 strategy called UCBT that has interesting properties and no tunable parameters. We present experimental results in a domain motivated by exergames, where the goal is to maximize a player's daily steps. Our results show that the combination of epsilon-greedy or epsilon-decreasing with regression oracles outperforms all other tested strategies in the short horizon setting." @default.
- W3128736802 created "2021-02-15" @default.
- W3128736802 creator A5017306840 @default.
- W3128736802 creator A5038686010 @default.
- W3128736802 creator A5086741338 @default.
- W3128736802 date "2021-02-10" @default.
- W3128736802 modified "2023-09-28" @default.
- W3128736802 title "Regression Oracles and Exploration Strategies for Short-Horizon Multi-Armed Bandits" @default.
- W3128736802 cites W1500931416 @default.
- W3128736802 cites W1998498767 @default.
- W3128736802 cites W2003274974 @default.
- W3128736802 cites W2009551863 @default.
- W3128736802 cites W2039522160 @default.
- W3128736802 cites W2056921512 @default.
- W3128736802 cites W2126316555 @default.
- W3128736802 cites W2146772569 @default.
- W3128736802 cites W2152074827 @default.
- W3128736802 cites W2168405694 @default.
- W3128736802 cites W2187642949 @default.
- W3128736802 cites W2519411794 @default.
- W3128736802 cites W2724523279 @default.
- W3128736802 cites W2898638584 @default.
- W3128736802 cites W2962821829 @default.
- W3128736802 cites W3008642523 @default.
- W3128736802 cites W3088725440 @default.
- W3128736802 cites W3209034279 @default.
- W3128736802 hasPublicationYear "2021" @default.
- W3128736802 type Work @default.
- W3128736802 sameAs 3128736802 @default.
- W3128736802 citedByCount "0" @default.
- W3128736802 crossrefType "posted-content" @default.
- W3128736802 hasAuthorship W3128736802A5017306840 @default.
- W3128736802 hasAuthorship W3128736802A5038686010 @default.
- W3128736802 hasAuthorship W3128736802A5086741338 @default.
- W3128736802 hasConcept C105795698 @default.
- W3128736802 hasConcept C111472728 @default.
- W3128736802 hasConcept C119857082 @default.
- W3128736802 hasConcept C126255220 @default.
- W3128736802 hasConcept C134306372 @default.
- W3128736802 hasConcept C138885662 @default.
- W3128736802 hasConcept C154945302 @default.
- W3128736802 hasConcept C159176650 @default.
- W3128736802 hasConcept C166957645 @default.
- W3128736802 hasConcept C205649164 @default.
- W3128736802 hasConcept C2524010 @default.
- W3128736802 hasConcept C2779343474 @default.
- W3128736802 hasConcept C2780586882 @default.
- W3128736802 hasConcept C28761237 @default.
- W3128736802 hasConcept C33923547 @default.
- W3128736802 hasConcept C36503486 @default.
- W3128736802 hasConcept C41008148 @default.
- W3128736802 hasConcept C83546350 @default.
- W3128736802 hasConceptScore W3128736802C105795698 @default.
- W3128736802 hasConceptScore W3128736802C111472728 @default.
- W3128736802 hasConceptScore W3128736802C119857082 @default.
- W3128736802 hasConceptScore W3128736802C126255220 @default.
- W3128736802 hasConceptScore W3128736802C134306372 @default.
- W3128736802 hasConceptScore W3128736802C138885662 @default.
- W3128736802 hasConceptScore W3128736802C154945302 @default.
- W3128736802 hasConceptScore W3128736802C159176650 @default.
- W3128736802 hasConceptScore W3128736802C166957645 @default.
- W3128736802 hasConceptScore W3128736802C205649164 @default.
- W3128736802 hasConceptScore W3128736802C2524010 @default.
- W3128736802 hasConceptScore W3128736802C2779343474 @default.
- W3128736802 hasConceptScore W3128736802C2780586882 @default.
- W3128736802 hasConceptScore W3128736802C28761237 @default.
- W3128736802 hasConceptScore W3128736802C33923547 @default.
- W3128736802 hasConceptScore W3128736802C36503486 @default.
- W3128736802 hasConceptScore W3128736802C41008148 @default.
- W3128736802 hasConceptScore W3128736802C83546350 @default.
- W3128736802 hasOpenAccess W3128736802 @default.
- W3128736802 hasRelatedWork W1605676990 @default.
- W3128736802 hasRelatedWork W189728362 @default.
- W3128736802 hasRelatedWork W2052471706 @default.
- W3128736802 hasRelatedWork W2129384778 @default.
- W3128736802 hasRelatedWork W2158807713 @default.
- W3128736802 hasRelatedWork W2298941664 @default.
- W3128736802 hasRelatedWork W2602425879 @default.
- W3128736802 hasRelatedWork W2606637533 @default.
- W3128736802 hasRelatedWork W268671474 @default.
- W3128736802 hasRelatedWork W2791100102 @default.
- W3128736802 hasRelatedWork W2899689262 @default.
- W3128736802 hasRelatedWork W2951862331 @default.
- W3128736802 hasRelatedWork W3012001585 @default.
- W3128736802 hasRelatedWork W3030192008 @default.
- W3128736802 hasRelatedWork W3046246549 @default.
- W3128736802 hasRelatedWork W3094031648 @default.
- W3128736802 hasRelatedWork W3097741917 @default.
- W3128736802 hasRelatedWork W3114769327 @default.
- W3128736802 hasRelatedWork W3121822496 @default.
- W3128736802 hasRelatedWork W3159979150 @default.
- W3128736802 isParatext "false" @default.
- W3128736802 isRetracted "false" @default.
- W3128736802 magId "3128736802" @default.
- W3128736802 workType "article" @default.