Matches in SemOpenAlex for { <https://semopenalex.org/work/W3199500857> ?p ?o ?g. }
Showing items 1 to 82 of
82
with 100 items per page.
- W3199500857 abstract "Reinforcement learning (RL) is a popular machine learning paradigm for game playing, robotics control, and other sequential decision tasks. However, RL agents often have long learning times with high data requirements because they begin by acting randomly. In order to better learn in complex tasks, we argue that an external teacher can often significantly help the RL agent learn. OpenAI Gym is a common framework for RL research, including a large number of standard environments and agents, making RL research significantly more accessible. This article introduces our new open-source RL framework, the Human Input Parsing Platform for Openai Gym (HIPPO Gym), and the design decisions that went into its creation. The goal of this platform is to facilitate human-RL research, making human-in-the-loop RL more accessible, including learning from demonstrations, learning from feedback, or curriculum learning. In addition, all experiments can be conducted over the internet without any additional software needed on the client’s computer, making experiments at scale significantly easier." @default.
- W3199500857 created "2021-09-27" @default.
- W3199500857 creator A5007545490 @default.
- W3199500857 creator A5017774638 @default.
- W3199500857 creator A5052222351 @default.
- W3199500857 creator A5070914351 @default.
- W3199500857 date "2021-09-19" @default.
- W3199500857 modified "2023-09-23" @default.
- W3199500857 title "Improving reinforcement learning with human assistance: an argument for human subject studies with HIPPO Gym" @default.
- W3199500857 cites W1969685488 @default.
- W3199500857 cites W1986014385 @default.
- W3199500857 cites W2042651776 @default.
- W3199500857 cites W2071611837 @default.
- W3199500857 cites W2074056782 @default.
- W3199500857 cites W2084777029 @default.
- W3199500857 cites W2093271548 @default.
- W3199500857 cites W2103285838 @default.
- W3199500857 cites W2145339207 @default.
- W3199500857 cites W2156869222 @default.
- W3199500857 cites W2157289187 @default.
- W3199500857 cites W2158782408 @default.
- W3199500857 cites W2264897026 @default.
- W3199500857 cites W2740302738 @default.
- W3199500857 cites W2788862220 @default.
- W3199500857 cites W2883246952 @default.
- W3199500857 cites W2916904544 @default.
- W3199500857 cites W2962710194 @default.
- W3199500857 cites W2963390684 @default.
- W3199500857 cites W2964915587 @default.
- W3199500857 cites W2988177603 @default.
- W3199500857 cites W2996868001 @default.
- W3199500857 cites W3082925502 @default.
- W3199500857 cites W3099324303 @default.
- W3199500857 cites W3103078407 @default.
- W3199500857 cites W3103780890 @default.
- W3199500857 cites W3116073702 @default.
- W3199500857 cites W3200372272 @default.
- W3199500857 cites W4240968524 @default.
- W3199500857 cites W4246078117 @default.
- W3199500857 cites W8222043 @default.
- W3199500857 cites W1968668104 @default.
- W3199500857 doi "https://doi.org/10.1007/s00521-021-06375-y" @default.
- W3199500857 hasPublicationYear "2021" @default.
- W3199500857 type Work @default.
- W3199500857 sameAs 3199500857 @default.
- W3199500857 citedByCount "1" @default.
- W3199500857 countsByYear W31995008572023 @default.
- W3199500857 crossrefType "journal-article" @default.
- W3199500857 hasAuthorship W3199500857A5007545490 @default.
- W3199500857 hasAuthorship W3199500857A5017774638 @default.
- W3199500857 hasAuthorship W3199500857A5052222351 @default.
- W3199500857 hasAuthorship W3199500857A5070914351 @default.
- W3199500857 hasBestOaLocation W31995008572 @default.
- W3199500857 hasConcept C107457646 @default.
- W3199500857 hasConcept C154945302 @default.
- W3199500857 hasConcept C41008148 @default.
- W3199500857 hasConcept C97541855 @default.
- W3199500857 hasConceptScore W3199500857C107457646 @default.
- W3199500857 hasConceptScore W3199500857C154945302 @default.
- W3199500857 hasConceptScore W3199500857C41008148 @default.
- W3199500857 hasConceptScore W3199500857C97541855 @default.
- W3199500857 hasFunder F4320309949 @default.
- W3199500857 hasFunder F4320314212 @default.
- W3199500857 hasFunder F4320321487 @default.
- W3199500857 hasLocation W31995008571 @default.
- W3199500857 hasLocation W31995008572 @default.
- W3199500857 hasOpenAccess W3199500857 @default.
- W3199500857 hasPrimaryLocation W31995008571 @default.
- W3199500857 hasRelatedWork W260766989 @default.
- W3199500857 hasRelatedWork W2959276766 @default.
- W3199500857 hasRelatedWork W3005560120 @default.
- W3199500857 hasRelatedWork W3037422413 @default.
- W3199500857 hasRelatedWork W3139193008 @default.
- W3199500857 hasRelatedWork W3173482257 @default.
- W3199500857 hasRelatedWork W3209094908 @default.
- W3199500857 hasRelatedWork W4206669594 @default.
- W3199500857 hasRelatedWork W4210912933 @default.
- W3199500857 hasRelatedWork W4295941380 @default.
- W3199500857 isParatext "false" @default.
- W3199500857 isRetracted "false" @default.
- W3199500857 magId "3199500857" @default.
- W3199500857 workType "article" @default.