Matches in SemOpenAlex for { <https://semopenalex.org/work/W66652580> ?p ?o ?g. }
- W66652580 endingPage "1012" @default.
- W66652580 startingPage "1005" @default.
- W66652580 abstract "Agent evaluation in stochastic domains can be difficult. The commonplace approach of Monte Carlo evaluation can involve a prohibitive number of simulations when the variance of the outcome is high. In such domains, variance reduction techniques are necessary, but these techniques require careful encoding of domain knowledge. This paper introduces baseline as a simple approach to creating low variance estimators for zero-sum multi-agent domains with high outcome variance. The baseline method leverages the self play of any available agent to produce a control variate for variance reduction, subverting any extra complexity inherent with traditional approaches. The baseline method is also applicable in situations where existing techniques either require extensive implementation overhead or simply cannot be applied. Experimental variance reduction results are shown for both cases using the baseline method. Baseline is shown to surpass state-of-the-art techniques in three-player computer poker and is competitive in two-player computer poker games. Baseline also shows variance reduction in human poker and in a mock Ad Auction tournament from the Trading Agent Competition, domains where variance reduction methods are not typically employed." @default.
- W66652580 created "2016-06-24" @default.
- W66652580 creator A5007019092 @default.
- W66652580 creator A5019796688 @default.
- W66652580 creator A5081163135 @default.
- W66652580 date "2013-05-06" @default.
- W66652580 modified "2023-09-24" @default.
- W66652580 title "Baseline: practical control variates for agent evaluation in zero-sum domains" @default.
- W66652580 cites W1542261886 @default.
- W66652580 cites W1602773783 @default.
- W66652580 cites W1625390266 @default.
- W66652580 cites W2014932765 @default.
- W66652580 cites W2107105626 @default.
- W66652580 cites W2127052460 @default.
- W66652580 cites W2139515132 @default.
- W66652580 cites W2145901173 @default.
- W66652580 cites W82448155 @default.
- W66652580 cites W2131600418 @default.
- W66652580 doi "https://doi.org/10.5555/2484920.2485079" @default.
- W66652580 hasPublicationYear "2013" @default.
- W66652580 type Work @default.
- W66652580 sameAs 66652580 @default.
- W66652580 citedByCount "4" @default.
- W66652580 countsByYear W666525802014 @default.
- W66652580 countsByYear W666525802020 @default.
- W66652580 countsByYear W666525802021 @default.
- W66652580 crossrefType "proceedings-article" @default.
- W66652580 hasAuthorship W66652580A5007019092 @default.
- W66652580 hasAuthorship W66652580A5019796688 @default.
- W66652580 hasAuthorship W66652580A5081163135 @default.
- W66652580 hasConcept C105795698 @default.
- W66652580 hasConcept C107673813 @default.
- W66652580 hasConcept C111335779 @default.
- W66652580 hasConcept C111350023 @default.
- W66652580 hasConcept C111368507 @default.
- W66652580 hasConcept C111919701 @default.
- W66652580 hasConcept C11413529 @default.
- W66652580 hasConcept C121683094 @default.
- W66652580 hasConcept C121955636 @default.
- W66652580 hasConcept C126255220 @default.
- W66652580 hasConcept C12725497 @default.
- W66652580 hasConcept C127313418 @default.
- W66652580 hasConcept C13153151 @default.
- W66652580 hasConcept C134306372 @default.
- W66652580 hasConcept C144133560 @default.
- W66652580 hasConcept C154945302 @default.
- W66652580 hasConcept C185429906 @default.
- W66652580 hasConcept C19499675 @default.
- W66652580 hasConcept C196083921 @default.
- W66652580 hasConcept C2524010 @default.
- W66652580 hasConcept C2779960059 @default.
- W66652580 hasConcept C33923547 @default.
- W66652580 hasConcept C36503486 @default.
- W66652580 hasConcept C41008148 @default.
- W66652580 hasConcept C62644790 @default.
- W66652580 hasConceptScore W66652580C105795698 @default.
- W66652580 hasConceptScore W66652580C107673813 @default.
- W66652580 hasConceptScore W66652580C111335779 @default.
- W66652580 hasConceptScore W66652580C111350023 @default.
- W66652580 hasConceptScore W66652580C111368507 @default.
- W66652580 hasConceptScore W66652580C111919701 @default.
- W66652580 hasConceptScore W66652580C11413529 @default.
- W66652580 hasConceptScore W66652580C121683094 @default.
- W66652580 hasConceptScore W66652580C121955636 @default.
- W66652580 hasConceptScore W66652580C126255220 @default.
- W66652580 hasConceptScore W66652580C12725497 @default.
- W66652580 hasConceptScore W66652580C127313418 @default.
- W66652580 hasConceptScore W66652580C13153151 @default.
- W66652580 hasConceptScore W66652580C134306372 @default.
- W66652580 hasConceptScore W66652580C144133560 @default.
- W66652580 hasConceptScore W66652580C154945302 @default.
- W66652580 hasConceptScore W66652580C185429906 @default.
- W66652580 hasConceptScore W66652580C19499675 @default.
- W66652580 hasConceptScore W66652580C196083921 @default.
- W66652580 hasConceptScore W66652580C2524010 @default.
- W66652580 hasConceptScore W66652580C2779960059 @default.
- W66652580 hasConceptScore W66652580C33923547 @default.
- W66652580 hasConceptScore W66652580C36503486 @default.
- W66652580 hasConceptScore W66652580C41008148 @default.
- W66652580 hasConceptScore W66652580C62644790 @default.
- W66652580 hasLocation W666525801 @default.
- W66652580 hasOpenAccess W66652580 @default.
- W66652580 hasPrimaryLocation W666525801 @default.
- W66652580 hasRelatedWork W1511670714 @default.
- W66652580 hasRelatedWork W1966748858 @default.
- W66652580 hasRelatedWork W1966992081 @default.
- W66652580 hasRelatedWork W1976253807 @default.
- W66652580 hasRelatedWork W2021367230 @default.
- W66652580 hasRelatedWork W2116541721 @default.
- W66652580 hasRelatedWork W2116780995 @default.
- W66652580 hasRelatedWork W2181164912 @default.
- W66652580 hasRelatedWork W2517816274 @default.
- W66652580 hasRelatedWork W2535357955 @default.
- W66652580 hasRelatedWork W2559830408 @default.
- W66652580 hasRelatedWork W2618438111 @default.
- W66652580 hasRelatedWork W2778783904 @default.
- W66652580 hasRelatedWork W2989664601 @default.
- W66652580 hasRelatedWork W2994787779 @default.