Matches in SemOpenAlex for { <https://semopenalex.org/work/W2913403708> ?p ?o ?g. }
- W2913403708 abstract "Many deep reinforcement learning algorithms contain inductive biases that sculpt the agent's objective and its interface to the environment. These inductive biases can take many forms, including domain knowledge and pretuned hyper-parameters. In general, there is a trade-off between generality and performance when algorithms use such biases. Stronger biases can lead to faster learning, but weaker biases can potentially lead to more general algorithms. This trade-off is important because inductive biases are not free; substantial effort may be required to obtain relevant domain knowledge or to tune hyper-parameters effectively. In this paper, we re-examine several domain-specific components that bias the objective and the environmental interface of common deep reinforcement learning agents. We investigated whether the performance deteriorates when these components are replaced with adaptive solutions from the literature. In our experiments, performance sometimes decreased with the adaptive components, as one might expect when comparing to components crafted for the domain, but sometimes the adaptive components performed better. We investigated the main benefit of having fewer domain-specific components, by comparing the learning performance of the two systems on a different set of continuous control problems, without additional tuning of either system. As hypothesized, the system with adaptive components performed better on many of the new tasks." @default.
- W2913403708 created "2019-02-21" @default.
- W2913403708 creator A5033135596 @default.
- W2913403708 creator A5036908874 @default.
- W2913403708 creator A5054065284 @default.
- W2913403708 creator A5091771290 @default.
- W2913403708 date "2019-07-05" @default.
- W2913403708 modified "2023-09-27" @default.
- W2913403708 title "On Inductive Biases in Deep Reinforcement Learning" @default.
- W2913403708 cites W1494114146 @default.
- W2913403708 cites W2064675550 @default.
- W2913403708 cites W2100677568 @default.
- W2913403708 cites W2119076496 @default.
- W2913403708 cites W2119717200 @default.
- W2913403708 cites W2145339207 @default.
- W2913403708 cites W2155968351 @default.
- W2913403708 cites W2157864803 @default.
- W2913403708 cites W2160371091 @default.
- W2913403708 cites W2166107799 @default.
- W2913403708 cites W2173564293 @default.
- W2913403708 cites W2201467212 @default.
- W2913403708 cites W2257979135 @default.
- W2913403708 cites W2341171179 @default.
- W2913403708 cites W2402302915 @default.
- W2913403708 cites W2480004914 @default.
- W2913403708 cites W2551887912 @default.
- W2913403708 cites W2605102581 @default.
- W2913403708 cites W2785693718 @default.
- W2913403708 cites W2803767077 @default.
- W2913403708 cites W2891076394 @default.
- W2913403708 cites W2963211300 @default.
- W2913403708 cites W2963296584 @default.
- W2913403708 cites W2963403143 @default.
- W2913403708 cites W2963985863 @default.
- W2913403708 cites W2964121744 @default.
- W2913403708 cites W2964227312 @default.
- W2913403708 cites W3037211759 @default.
- W2913403708 hasPublicationYear "2019" @default.
- W2913403708 type Work @default.
- W2913403708 sameAs 2913403708 @default.
- W2913403708 citedByCount "12" @default.
- W2913403708 countsByYear W29134037082019 @default.
- W2913403708 countsByYear W29134037082020 @default.
- W2913403708 countsByYear W29134037082021 @default.
- W2913403708 crossrefType "posted-content" @default.
- W2913403708 hasAuthorship W2913403708A5033135596 @default.
- W2913403708 hasAuthorship W2913403708A5036908874 @default.
- W2913403708 hasAuthorship W2913403708A5054065284 @default.
- W2913403708 hasAuthorship W2913403708A5091771290 @default.
- W2913403708 hasConcept C113843644 @default.
- W2913403708 hasConcept C119857082 @default.
- W2913403708 hasConcept C127413603 @default.
- W2913403708 hasConcept C129307140 @default.
- W2913403708 hasConcept C134306372 @default.
- W2913403708 hasConcept C154945302 @default.
- W2913403708 hasConcept C15744967 @default.
- W2913403708 hasConcept C157915830 @default.
- W2913403708 hasConcept C173608175 @default.
- W2913403708 hasConcept C177264268 @default.
- W2913403708 hasConcept C197352929 @default.
- W2913403708 hasConcept C199360897 @default.
- W2913403708 hasConcept C201995342 @default.
- W2913403708 hasConcept C2780451532 @default.
- W2913403708 hasConcept C2780767217 @default.
- W2913403708 hasConcept C28006648 @default.
- W2913403708 hasConcept C33923547 @default.
- W2913403708 hasConcept C36503486 @default.
- W2913403708 hasConcept C41008148 @default.
- W2913403708 hasConcept C542102704 @default.
- W2913403708 hasConcept C97541855 @default.
- W2913403708 hasConceptScore W2913403708C113843644 @default.
- W2913403708 hasConceptScore W2913403708C119857082 @default.
- W2913403708 hasConceptScore W2913403708C127413603 @default.
- W2913403708 hasConceptScore W2913403708C129307140 @default.
- W2913403708 hasConceptScore W2913403708C134306372 @default.
- W2913403708 hasConceptScore W2913403708C154945302 @default.
- W2913403708 hasConceptScore W2913403708C15744967 @default.
- W2913403708 hasConceptScore W2913403708C157915830 @default.
- W2913403708 hasConceptScore W2913403708C173608175 @default.
- W2913403708 hasConceptScore W2913403708C177264268 @default.
- W2913403708 hasConceptScore W2913403708C197352929 @default.
- W2913403708 hasConceptScore W2913403708C199360897 @default.
- W2913403708 hasConceptScore W2913403708C201995342 @default.
- W2913403708 hasConceptScore W2913403708C2780451532 @default.
- W2913403708 hasConceptScore W2913403708C2780767217 @default.
- W2913403708 hasConceptScore W2913403708C28006648 @default.
- W2913403708 hasConceptScore W2913403708C33923547 @default.
- W2913403708 hasConceptScore W2913403708C36503486 @default.
- W2913403708 hasConceptScore W2913403708C41008148 @default.
- W2913403708 hasConceptScore W2913403708C542102704 @default.
- W2913403708 hasConceptScore W2913403708C97541855 @default.
- W2913403708 hasLocation W29134037081 @default.
- W2913403708 hasOpenAccess W2913403708 @default.
- W2913403708 hasPrimaryLocation W29134037081 @default.
- W2913403708 hasRelatedWork W1598796067 @default.
- W2913403708 hasRelatedWork W2121863487 @default.
- W2913403708 hasRelatedWork W2145339207 @default.
- W2913403708 hasRelatedWork W2158782408 @default.
- W2913403708 hasRelatedWork W2273879499 @default.
- W2913403708 hasRelatedWork W2736601468 @default.