Matches in SemOpenAlex for { <https://semopenalex.org/work/W2964227312> ?p ?o ?g. }
- W2964227312 abstract "Temporal abstraction is key to scaling up learning and planning in reinforcement learning. While planning with temporally extended actions is well understood, creating such abstractions autonomously from data has remained challenging.We tackle this problem in the framework of options [Sutton,Precup and Singh, 1999; Precup, 2000]. We derive policy gradient theorems for options and propose a new option-critic architecture capable of learning both the internal policies and the termination conditions of options, in tandem with the policy over options, and without the need to provide any additional rewards or subgoals. Experimental results in both discrete and continuous environments showcase the flexibility and efficiency of the framework." @default.
- W2964227312 created "2019-07-30" @default.
- W2964227312 creator A5025745431 @default.
- W2964227312 creator A5065836447 @default.
- W2964227312 creator A5067918843 @default.
- W2964227312 date "2017-02-13" @default.
- W2964227312 modified "2023-10-16" @default.
- W2964227312 title "The Option-Critic Architecture" @default.
- W2964227312 cites W1536990779 @default.
- W2964227312 cites W1556824961 @default.
- W2964227312 cites W1569296262 @default.
- W2964227312 cites W1585861384 @default.
- W2964227312 cites W1757796397 @default.
- W2964227312 cites W182014611 @default.
- W2964227312 cites W189510620 @default.
- W2964227312 cites W1968768508 @default.
- W2964227312 cites W1980035368 @default.
- W2964227312 cites W1985291828 @default.
- W2964227312 cites W199552065 @default.
- W2964227312 cites W2097828232 @default.
- W2964227312 cites W2100677568 @default.
- W2964227312 cites W2109910161 @default.
- W2964227312 cites W2114537044 @default.
- W2964227312 cites W2119567691 @default.
- W2964227312 cites W2119717200 @default.
- W2964227312 cites W2121517924 @default.
- W2964227312 cites W2130535800 @default.
- W2964227312 cites W2143435603 @default.
- W2964227312 cites W2155027007 @default.
- W2964227312 cites W2165150801 @default.
- W2964227312 cites W2168640731 @default.
- W2964227312 cites W2217025414 @default.
- W2964227312 cites W2394928340 @default.
- W2964227312 cites W2912453235 @default.
- W2964227312 cites W3011120880 @default.
- W2964227312 cites W3139377883 @default.
- W2964227312 cites W59183349 @default.
- W2964227312 doi "https://doi.org/10.1609/aaai.v31i1.10916" @default.
- W2964227312 hasPublicationYear "2017" @default.
- W2964227312 type Work @default.
- W2964227312 sameAs 2964227312 @default.
- W2964227312 citedByCount "352" @default.
- W2964227312 countsByYear W29642273122012 @default.
- W2964227312 countsByYear W29642273122016 @default.
- W2964227312 countsByYear W29642273122017 @default.
- W2964227312 countsByYear W29642273122018 @default.
- W2964227312 countsByYear W29642273122019 @default.
- W2964227312 countsByYear W29642273122020 @default.
- W2964227312 countsByYear W29642273122021 @default.
- W2964227312 countsByYear W29642273122022 @default.
- W2964227312 countsByYear W29642273122023 @default.
- W2964227312 crossrefType "journal-article" @default.
- W2964227312 hasAuthorship W2964227312A5025745431 @default.
- W2964227312 hasAuthorship W2964227312A5065836447 @default.
- W2964227312 hasAuthorship W2964227312A5067918843 @default.
- W2964227312 hasBestOaLocation W29642273121 @default.
- W2964227312 hasConcept C111472728 @default.
- W2964227312 hasConcept C123657996 @default.
- W2964227312 hasConcept C124304363 @default.
- W2964227312 hasConcept C138885662 @default.
- W2964227312 hasConcept C142362112 @default.
- W2964227312 hasConcept C153349607 @default.
- W2964227312 hasConcept C154945302 @default.
- W2964227312 hasConcept C162324750 @default.
- W2964227312 hasConcept C187736073 @default.
- W2964227312 hasConcept C26517878 @default.
- W2964227312 hasConcept C2780598303 @default.
- W2964227312 hasConcept C38652104 @default.
- W2964227312 hasConcept C41008148 @default.
- W2964227312 hasConcept C97541855 @default.
- W2964227312 hasConceptScore W2964227312C111472728 @default.
- W2964227312 hasConceptScore W2964227312C123657996 @default.
- W2964227312 hasConceptScore W2964227312C124304363 @default.
- W2964227312 hasConceptScore W2964227312C138885662 @default.
- W2964227312 hasConceptScore W2964227312C142362112 @default.
- W2964227312 hasConceptScore W2964227312C153349607 @default.
- W2964227312 hasConceptScore W2964227312C154945302 @default.
- W2964227312 hasConceptScore W2964227312C162324750 @default.
- W2964227312 hasConceptScore W2964227312C187736073 @default.
- W2964227312 hasConceptScore W2964227312C26517878 @default.
- W2964227312 hasConceptScore W2964227312C2780598303 @default.
- W2964227312 hasConceptScore W2964227312C38652104 @default.
- W2964227312 hasConceptScore W2964227312C41008148 @default.
- W2964227312 hasConceptScore W2964227312C97541855 @default.
- W2964227312 hasIssue "1" @default.
- W2964227312 hasLocation W29642273121 @default.
- W2964227312 hasLocation W29642273122 @default.
- W2964227312 hasOpenAccess W2964227312 @default.
- W2964227312 hasPrimaryLocation W29642273121 @default.
- W2964227312 hasRelatedWork W2068700777 @default.
- W2964227312 hasRelatedWork W2523728418 @default.
- W2964227312 hasRelatedWork W2923653485 @default.
- W2964227312 hasRelatedWork W2952472710 @default.
- W2964227312 hasRelatedWork W2957776456 @default.
- W2964227312 hasRelatedWork W4206669594 @default.
- W2964227312 hasRelatedWork W4224287422 @default.
- W2964227312 hasRelatedWork W4255994452 @default.
- W2964227312 hasRelatedWork W4300093329 @default.
- W2964227312 hasRelatedWork W4319773215 @default.
- W2964227312 hasVolume "31" @default.