Matches in SemOpenAlex for { <https://semopenalex.org/work/W3089678889> ?p ?o ?g. }
- W3089678889 endingPage "542" @default.
- W3089678889 startingPage "528" @default.
- W3089678889 abstract "Recently, AlphaZero has achieved landmark results in deep reinforcement learning, by providing a single self-play architecture that learned three different games at super human level. AlphaZero is a large and complicated system with many parameters, and success requires much compute power and fine-tuning. Reproducing results in other games is a challenge, and many researchers are looking for ways to improve results while reducing computational demands. AlphaZero's design is purely based on self-play and makes no use of labeled expert data ordomain specific enhancements; it is designed to learn from scratch. We propose a novel approach to deal with this cold-start problem by employing simple search enhancements at the beginning phase of self-play training, namely Rollout, Rapid Action Value Estimate (RAVE) and dynamically weighted combinations of these with the neural network, and Rolling Horizon Evolutionary Algorithms (RHEA). Our experiments indicate that most of these enhancements improve the performance of their baseline player in three different (small) board games, with especially RAVE based variants playing strongly." @default.
- W3089678889 created "2020-10-08" @default.
- W3089678889 creator A5062774048 @default.
- W3089678889 creator A5085542421 @default.
- W3089678889 creator A5090366405 @default.
- W3089678889 date "2020-01-01" @default.
- W3089678889 modified "2023-10-17" @default.
- W3089678889 title "Warm-Start AlphaZero Self-play Search Enhancements" @default.
- W3089678889 cites W1499747339 @default.
- W3089678889 cites W1587022413 @default.
- W3089678889 cites W1714211023 @default.
- W3089678889 cites W17749007 @default.
- W3089678889 cites W1977989560 @default.
- W3089678889 cites W1997840820 @default.
- W3089678889 cites W1999319334 @default.
- W3089678889 cites W2020135152 @default.
- W3089678889 cites W2076063813 @default.
- W3089678889 cites W2088043394 @default.
- W3089678889 cites W2092667854 @default.
- W3089678889 cites W2126316555 @default.
- W3089678889 cites W2128262600 @default.
- W3089678889 cites W2132994929 @default.
- W3089678889 cites W2133067606 @default.
- W3089678889 cites W2145339207 @default.
- W3089678889 cites W2257979135 @default.
- W3089678889 cites W2316978694 @default.
- W3089678889 cites W2766447205 @default.
- W3089678889 cites W2767922802 @default.
- W3089678889 cites W2787259794 @default.
- W3089678889 cites W2787567977 @default.
- W3089678889 cites W2890208098 @default.
- W3089678889 cites W2902907165 @default.
- W3089678889 cites W2962781071 @default.
- W3089678889 cites W2977978457 @default.
- W3089678889 cites W2982316857 @default.
- W3089678889 cites W3006868496 @default.
- W3089678889 cites W3093602542 @default.
- W3089678889 cites W4243655891 @default.
- W3089678889 cites W4362203700 @default.
- W3089678889 doi "https://doi.org/10.1007/978-3-030-58115-2_37" @default.
- W3089678889 hasPublicationYear "2020" @default.
- W3089678889 type Work @default.
- W3089678889 sameAs 3089678889 @default.
- W3089678889 citedByCount "5" @default.
- W3089678889 countsByYear W30896788892020 @default.
- W3089678889 countsByYear W30896788892021 @default.
- W3089678889 countsByYear W30896788892022 @default.
- W3089678889 countsByYear W30896788892023 @default.
- W3089678889 crossrefType "book-chapter" @default.
- W3089678889 hasAuthorship W3089678889A5062774048 @default.
- W3089678889 hasAuthorship W3089678889A5085542421 @default.
- W3089678889 hasAuthorship W3089678889A5090366405 @default.
- W3089678889 hasBestOaLocation W30896788892 @default.
- W3089678889 hasConcept C111368507 @default.
- W3089678889 hasConcept C111472728 @default.
- W3089678889 hasConcept C111919701 @default.
- W3089678889 hasConcept C119857082 @default.
- W3089678889 hasConcept C121332964 @default.
- W3089678889 hasConcept C12725497 @default.
- W3089678889 hasConcept C127313418 @default.
- W3089678889 hasConcept C138885662 @default.
- W3089678889 hasConcept C154945302 @default.
- W3089678889 hasConcept C2780586882 @default.
- W3089678889 hasConcept C2780791683 @default.
- W3089678889 hasConcept C2781235140 @default.
- W3089678889 hasConcept C41008148 @default.
- W3089678889 hasConcept C50644808 @default.
- W3089678889 hasConcept C62520636 @default.
- W3089678889 hasConcept C97541855 @default.
- W3089678889 hasConceptScore W3089678889C111368507 @default.
- W3089678889 hasConceptScore W3089678889C111472728 @default.
- W3089678889 hasConceptScore W3089678889C111919701 @default.
- W3089678889 hasConceptScore W3089678889C119857082 @default.
- W3089678889 hasConceptScore W3089678889C121332964 @default.
- W3089678889 hasConceptScore W3089678889C12725497 @default.
- W3089678889 hasConceptScore W3089678889C127313418 @default.
- W3089678889 hasConceptScore W3089678889C138885662 @default.
- W3089678889 hasConceptScore W3089678889C154945302 @default.
- W3089678889 hasConceptScore W3089678889C2780586882 @default.
- W3089678889 hasConceptScore W3089678889C2780791683 @default.
- W3089678889 hasConceptScore W3089678889C2781235140 @default.
- W3089678889 hasConceptScore W3089678889C41008148 @default.
- W3089678889 hasConceptScore W3089678889C50644808 @default.
- W3089678889 hasConceptScore W3089678889C62520636 @default.
- W3089678889 hasConceptScore W3089678889C97541855 @default.
- W3089678889 hasLocation W30896788891 @default.
- W3089678889 hasLocation W30896788892 @default.
- W3089678889 hasLocation W30896788893 @default.
- W3089678889 hasOpenAccess W3089678889 @default.
- W3089678889 hasPrimaryLocation W30896788891 @default.
- W3089678889 hasRelatedWork W1562959674 @default.
- W3089678889 hasRelatedWork W2065109233 @default.
- W3089678889 hasRelatedWork W2923653485 @default.
- W3089678889 hasRelatedWork W3022038857 @default.
- W3089678889 hasRelatedWork W3119508709 @default.
- W3089678889 hasRelatedWork W4224998860 @default.
- W3089678889 hasRelatedWork W4281658507 @default.
- W3089678889 hasRelatedWork W4318719223 @default.