Matches in SemOpenAlex for { <https://semopenalex.org/work/W2105486945> ?p ?o ?g. }
- W2105486945 endingPage "784" @default.
- W2105486945 startingPage "777" @default.
- W2105486945 abstract "A key component of any reinforcement learning algorithm is the underlying representation used by the agent. While reinforcement learning (RL) agents have typically relied on hand-coded state representations, there has been a growing interest in learning this representation. While inputs to an agent are typically fixed (i.e., state variables represent sensors on a robot), it is desirable to automatically determine the optimal relative scaling of such inputs, as well as to diminish the impact of irrelevant features. This work introduces Holler, a novel distance metric learning algorithm, and combines it with an existing instance-based RL algorithm to achieve precisely these goals. The algorithms' success is highlighted via empirical measurements on a set of six tasks within the mountain car domain." @default.
- W2105486945 created "2016-06-24" @default.
- W2105486945 creator A5021516103 @default.
- W2105486945 creator A5049134079 @default.
- W2105486945 creator A5070914351 @default.
- W2105486945 date "2011-05-02" @default.
- W2105486945 modified "2023-09-29" @default.
- W2105486945 title "Metric learning for reinforcement learning agents" @default.
- W2105486945 cites W1495858509 @default.
- W2105486945 cites W1508997545 @default.
- W2105486945 cites W1515851193 @default.
- W2105486945 cites W1531663008 @default.
- W2105486945 cites W1585603966 @default.
- W2105486945 cites W1828381662 @default.
- W2105486945 cites W1988344179 @default.
- W2105486945 cites W2074586650 @default.
- W2105486945 cites W2087400813 @default.
- W2105486945 cites W2104752854 @default.
- W2105486945 cites W2106053110 @default.
- W2105486945 cites W2110292307 @default.
- W2105486945 cites W2113913482 @default.
- W2105486945 cites W2117154949 @default.
- W2105486945 cites W2118979615 @default.
- W2105486945 cites W2130556178 @default.
- W2105486945 cites W2131831090 @default.
- W2105486945 cites W2133120480 @default.
- W2105486945 cites W2134540127 @default.
- W2105486945 cites W2138326839 @default.
- W2105486945 cites W2148825261 @default.
- W2105486945 cites W2157998899 @default.
- W2105486945 cites W2158150115 @default.
- W2105486945 cites W2161795906 @default.
- W2105486945 cites W2169495281 @default.
- W2105486945 doi "https://doi.org/10.5555/2031678.2031728" @default.
- W2105486945 hasPublicationYear "2011" @default.
- W2105486945 type Work @default.
- W2105486945 sameAs 2105486945 @default.
- W2105486945 citedByCount "13" @default.
- W2105486945 countsByYear W21054869452012 @default.
- W2105486945 countsByYear W21054869452013 @default.
- W2105486945 countsByYear W21054869452014 @default.
- W2105486945 countsByYear W21054869452015 @default.
- W2105486945 countsByYear W21054869452016 @default.
- W2105486945 countsByYear W21054869452017 @default.
- W2105486945 countsByYear W21054869452019 @default.
- W2105486945 countsByYear W21054869452021 @default.
- W2105486945 crossrefType "proceedings-article" @default.
- W2105486945 hasAuthorship W2105486945A5021516103 @default.
- W2105486945 hasAuthorship W2105486945A5049134079 @default.
- W2105486945 hasAuthorship W2105486945A5070914351 @default.
- W2105486945 hasConcept C119857082 @default.
- W2105486945 hasConcept C127413603 @default.
- W2105486945 hasConcept C154945302 @default.
- W2105486945 hasConcept C176217482 @default.
- W2105486945 hasConcept C177264268 @default.
- W2105486945 hasConcept C17744445 @default.
- W2105486945 hasConcept C199190896 @default.
- W2105486945 hasConcept C199360897 @default.
- W2105486945 hasConcept C199539241 @default.
- W2105486945 hasConcept C21547014 @default.
- W2105486945 hasConcept C26517878 @default.
- W2105486945 hasConcept C2776359362 @default.
- W2105486945 hasConcept C38652104 @default.
- W2105486945 hasConcept C41008148 @default.
- W2105486945 hasConcept C90509273 @default.
- W2105486945 hasConcept C94625758 @default.
- W2105486945 hasConcept C97541855 @default.
- W2105486945 hasConceptScore W2105486945C119857082 @default.
- W2105486945 hasConceptScore W2105486945C127413603 @default.
- W2105486945 hasConceptScore W2105486945C154945302 @default.
- W2105486945 hasConceptScore W2105486945C176217482 @default.
- W2105486945 hasConceptScore W2105486945C177264268 @default.
- W2105486945 hasConceptScore W2105486945C17744445 @default.
- W2105486945 hasConceptScore W2105486945C199190896 @default.
- W2105486945 hasConceptScore W2105486945C199360897 @default.
- W2105486945 hasConceptScore W2105486945C199539241 @default.
- W2105486945 hasConceptScore W2105486945C21547014 @default.
- W2105486945 hasConceptScore W2105486945C26517878 @default.
- W2105486945 hasConceptScore W2105486945C2776359362 @default.
- W2105486945 hasConceptScore W2105486945C38652104 @default.
- W2105486945 hasConceptScore W2105486945C41008148 @default.
- W2105486945 hasConceptScore W2105486945C90509273 @default.
- W2105486945 hasConceptScore W2105486945C94625758 @default.
- W2105486945 hasConceptScore W2105486945C97541855 @default.
- W2105486945 hasLocation W21054869451 @default.
- W2105486945 hasOpenAccess W2105486945 @default.
- W2105486945 hasPrimaryLocation W21054869451 @default.
- W2105486945 hasRelatedWork W116417374 @default.
- W2105486945 hasRelatedWork W1526807135 @default.
- W2105486945 hasRelatedWork W1986014385 @default.
- W2105486945 hasRelatedWork W2003342825 @default.
- W2105486945 hasRelatedWork W2104727569 @default.
- W2105486945 hasRelatedWork W2106053110 @default.
- W2105486945 hasRelatedWork W2117154949 @default.
- W2105486945 hasRelatedWork W2118979615 @default.
- W2105486945 hasRelatedWork W2154023516 @default.
- W2105486945 hasRelatedWork W2164998010 @default.
- W2105486945 hasRelatedWork W2166265228 @default.