Matches in SemOpenAlex for { <https://semopenalex.org/work/W2890735789> ?p ?o ?g. }
Showing items 1 to 85 of
85
with 100 items per page.
- W2890735789 abstract "While deep reinforcement learning (DRL) has led to numerous successes in recent years, reproducing these successes can be extremely challenging. One reproducibility challenge particularly relevant to DRL is nondeterminism in the training process, which can substantially affect the results. Motivated by this challenge, we study the positive impacts of deterministic implementations in eliminating nondeterminism in training. To do so, we consider the particular case of the deep Q-learning algorithm, for which we produce a deterministic implementation by identifying and controlling all sources of nondeterminism in the training process. One by one, we then allow individual sources of nondeterminism to affect our otherwise deterministic implementation, and measure the impact of each source on the variance in performance. We find that individual sources of nondeterminism can substantially impact the performance of agent, illustrating the benefits of deterministic implementations. In addition, we also discuss the important role of deterministic implementations in achieving exact replicability of results." @default.
- W2890735789 created "2018-09-27" @default.
- W2890735789 creator A5001594330 @default.
- W2890735789 creator A5020644058 @default.
- W2890735789 creator A5060981901 @default.
- W2890735789 date "2018-09-15" @default.
- W2890735789 modified "2023-09-27" @default.
- W2890735789 title "Deterministic Implementations for Reproducibility in Deep Reinforcement Learning" @default.
- W2890735789 cites W1658008008 @default.
- W2890735789 cites W1757796397 @default.
- W2890735789 cites W1971700225 @default.
- W2890735789 cites W1992288515 @default.
- W2890735789 cites W2100506021 @default.
- W2890735789 cites W2130126144 @default.
- W2890735789 cites W2145339207 @default.
- W2890735789 cites W2472803348 @default.
- W2890735789 cites W2501465840 @default.
- W2890735789 cites W2726187156 @default.
- W2890735789 cites W2747402019 @default.
- W2890735789 cites W2754517384 @default.
- W2890735789 cites W2781585732 @default.
- W2890735789 cites W2809256243 @default.
- W2890735789 cites W2899771611 @default.
- W2890735789 cites W2963403143 @default.
- W2890735789 hasPublicationYear "2018" @default.
- W2890735789 type Work @default.
- W2890735789 sameAs 2890735789 @default.
- W2890735789 citedByCount "17" @default.
- W2890735789 countsByYear W28907357892018 @default.
- W2890735789 countsByYear W28907357892019 @default.
- W2890735789 countsByYear W28907357892020 @default.
- W2890735789 countsByYear W28907357892021 @default.
- W2890735789 crossrefType "posted-content" @default.
- W2890735789 hasAuthorship W2890735789A5001594330 @default.
- W2890735789 hasAuthorship W2890735789A5020644058 @default.
- W2890735789 hasAuthorship W2890735789A5060981901 @default.
- W2890735789 hasConcept C115903868 @default.
- W2890735789 hasConcept C119857082 @default.
- W2890735789 hasConcept C121955636 @default.
- W2890735789 hasConcept C144133560 @default.
- W2890735789 hasConcept C154945302 @default.
- W2890735789 hasConcept C196083921 @default.
- W2890735789 hasConcept C199360897 @default.
- W2890735789 hasConcept C26713055 @default.
- W2890735789 hasConcept C41008148 @default.
- W2890735789 hasConcept C97541855 @default.
- W2890735789 hasConcept C98045186 @default.
- W2890735789 hasConceptScore W2890735789C115903868 @default.
- W2890735789 hasConceptScore W2890735789C119857082 @default.
- W2890735789 hasConceptScore W2890735789C121955636 @default.
- W2890735789 hasConceptScore W2890735789C144133560 @default.
- W2890735789 hasConceptScore W2890735789C154945302 @default.
- W2890735789 hasConceptScore W2890735789C196083921 @default.
- W2890735789 hasConceptScore W2890735789C199360897 @default.
- W2890735789 hasConceptScore W2890735789C26713055 @default.
- W2890735789 hasConceptScore W2890735789C41008148 @default.
- W2890735789 hasConceptScore W2890735789C97541855 @default.
- W2890735789 hasConceptScore W2890735789C98045186 @default.
- W2890735789 hasLocation W28907357891 @default.
- W2890735789 hasOpenAccess W2890735789 @default.
- W2890735789 hasPrimaryLocation W28907357891 @default.
- W2890735789 hasRelatedWork W1534477342 @default.
- W2890735789 hasRelatedWork W1663973292 @default.
- W2890735789 hasRelatedWork W1821462560 @default.
- W2890735789 hasRelatedWork W2121863487 @default.
- W2890735789 hasRelatedWork W2145339207 @default.
- W2890735789 hasRelatedWork W2155027007 @default.
- W2890735789 hasRelatedWork W2173248099 @default.
- W2890735789 hasRelatedWork W2257979135 @default.
- W2890735789 hasRelatedWork W2557283755 @default.
- W2890735789 hasRelatedWork W2736601468 @default.
- W2890735789 hasRelatedWork W2747402019 @default.
- W2890735789 hasRelatedWork W2766447205 @default.
- W2890735789 hasRelatedWork W2809256243 @default.
- W2890735789 hasRelatedWork W2905342215 @default.
- W2890735789 hasRelatedWork W2949117887 @default.
- W2890735789 hasRelatedWork W2963120839 @default.
- W2890735789 hasRelatedWork W2963403143 @default.
- W2890735789 hasRelatedWork W2963748792 @default.
- W2890735789 hasRelatedWork W3126445456 @default.
- W2890735789 hasRelatedWork W3166713958 @default.
- W2890735789 isParatext "false" @default.
- W2890735789 isRetracted "false" @default.
- W2890735789 magId "2890735789" @default.
- W2890735789 workType "article" @default.