Matches in SemOpenAlex for { <https://semopenalex.org/work/W1775685989> ?p ?o ?g. }
- W1775685989 endingPage "1170" @default.
- W1775685989 startingPage "1169" @default.
- W1775685989 abstract "Traditionally, research in the reinforcement learning (RL) community has been devoted to developing domain-independent algorithms such as SARSA [13], Q-learning [16], prioritized sweeping [8], or LSPI [6], that are designed to work for any given state space and action space. However, the modus operandi in RL research has been for a human expert to re-code each learning environment, including defining the actions and state features, as well as specifying the algorithm to be used. Typically each new RL experiment is run by explicitly calling a new program (even when learning can be biased by previous learning experiences, as in transfer learning [10, 15, 14]). Thus, while standards have developed for describing and testing individual RL algorithms (e.g., RL-Glue [17]), no such standards have developed for the problem of describing complete tasks to a preexisting agent." @default.
- W1775685989 created "2016-06-24" @default.
- W1775685989 creator A5001594330 @default.
- W1775685989 creator A5063398337 @default.
- W1775685989 creator A5074285715 @default.
- W1775685989 date "2009-05-10" @default.
- W1775685989 modified "2023-09-22" @default.
- W1775685989 title "A task specification language for bootstrap learning" @default.
- W1775685989 cites W1488730473 @default.
- W1775685989 cites W1505937442 @default.
- W1775685989 cites W1515851193 @default.
- W1775685989 cites W1747702904 @default.
- W1775685989 cites W2042492924 @default.
- W1775685989 cites W2048226872 @default.
- W1775685989 cites W2070301851 @default.
- W1775685989 cites W2100495367 @default.
- W1775685989 cites W2104641222 @default.
- W1775685989 cites W2109910161 @default.
- W1775685989 cites W2116064496 @default.
- W1775685989 cites W2121863487 @default.
- W1775685989 cites W2126565096 @default.
- W1775685989 cites W2130005627 @default.
- W1775685989 cites W2136922672 @default.
- W1775685989 cites W2140584963 @default.
- W1775685989 cites W2141559645 @default.
- W1775685989 cites W2143435603 @default.
- W1775685989 cites W2148112459 @default.
- W1775685989 cites W2153353285 @default.
- W1775685989 cites W2160371091 @default.
- W1775685989 cites W2963705262 @default.
- W1775685989 cites W3011120880 @default.
- W1775685989 cites W36691172 @default.
- W1775685989 hasPublicationYear "2009" @default.
- W1775685989 type Work @default.
- W1775685989 sameAs 1775685989 @default.
- W1775685989 citedByCount "2" @default.
- W1775685989 countsByYear W17756859892021 @default.
- W1775685989 countsByYear W17756859892022 @default.
- W1775685989 crossrefType "proceedings-article" @default.
- W1775685989 hasAuthorship W1775685989A5001594330 @default.
- W1775685989 hasAuthorship W1775685989A5063398337 @default.
- W1775685989 hasAuthorship W1775685989A5074285715 @default.
- W1775685989 hasConcept C105795698 @default.
- W1775685989 hasConcept C119857082 @default.
- W1775685989 hasConcept C134306372 @default.
- W1775685989 hasConcept C150899416 @default.
- W1775685989 hasConcept C154945302 @default.
- W1775685989 hasConcept C162324750 @default.
- W1775685989 hasConcept C177264268 @default.
- W1775685989 hasConcept C187736073 @default.
- W1775685989 hasConcept C199360897 @default.
- W1775685989 hasConcept C2776760102 @default.
- W1775685989 hasConcept C2780451532 @default.
- W1775685989 hasConcept C33923547 @default.
- W1775685989 hasConcept C36503486 @default.
- W1775685989 hasConcept C41008148 @default.
- W1775685989 hasConcept C48103436 @default.
- W1775685989 hasConcept C72434380 @default.
- W1775685989 hasConcept C97541855 @default.
- W1775685989 hasConceptScore W1775685989C105795698 @default.
- W1775685989 hasConceptScore W1775685989C119857082 @default.
- W1775685989 hasConceptScore W1775685989C134306372 @default.
- W1775685989 hasConceptScore W1775685989C150899416 @default.
- W1775685989 hasConceptScore W1775685989C154945302 @default.
- W1775685989 hasConceptScore W1775685989C162324750 @default.
- W1775685989 hasConceptScore W1775685989C177264268 @default.
- W1775685989 hasConceptScore W1775685989C187736073 @default.
- W1775685989 hasConceptScore W1775685989C199360897 @default.
- W1775685989 hasConceptScore W1775685989C2776760102 @default.
- W1775685989 hasConceptScore W1775685989C2780451532 @default.
- W1775685989 hasConceptScore W1775685989C33923547 @default.
- W1775685989 hasConceptScore W1775685989C36503486 @default.
- W1775685989 hasConceptScore W1775685989C41008148 @default.
- W1775685989 hasConceptScore W1775685989C48103436 @default.
- W1775685989 hasConceptScore W1775685989C72434380 @default.
- W1775685989 hasConceptScore W1775685989C97541855 @default.
- W1775685989 hasLocation W17756859891 @default.
- W1775685989 hasOpenAccess W1775685989 @default.
- W1775685989 hasPrimaryLocation W17756859891 @default.
- W1775685989 hasRelatedWork W1604959332 @default.
- W1775685989 hasRelatedWork W1822705290 @default.
- W1775685989 hasRelatedWork W1846689811 @default.
- W1775685989 hasRelatedWork W2020573190 @default.
- W1775685989 hasRelatedWork W2029130131 @default.
- W1775685989 hasRelatedWork W2132073301 @default.
- W1775685989 hasRelatedWork W2610686804 @default.
- W1775685989 hasRelatedWork W2634239194 @default.
- W1775685989 hasRelatedWork W2736068270 @default.
- W1775685989 hasRelatedWork W2795347172 @default.
- W1775685989 hasRelatedWork W2896183040 @default.
- W1775685989 hasRelatedWork W2964241178 @default.
- W1775685989 hasRelatedWork W3098457197 @default.
- W1775685989 hasRelatedWork W3101283005 @default.
- W1775685989 hasRelatedWork W3109198942 @default.
- W1775685989 hasRelatedWork W3112076360 @default.
- W1775685989 hasRelatedWork W3114060346 @default.
- W1775685989 hasRelatedWork W3175458185 @default.
- W1775685989 hasRelatedWork W3214048558 @default.