Matches in SemOpenAlex for { <https://semopenalex.org/work/W1998556751> ?p ?o ?g. }
Showing items 1 to 77 of 77 with 100 items per page.
- W1998556751 endingPage "894" @default.
- W1998556751 startingPage "875" @default.
- W1998556751 abstract "This article is related to the research effort of constructing an intelligent agent, i.e., a computer system that is able to sense its environment (world), reason utilizing its internal knowledge and execute actions upon the world (act). The specific part of this effort presented in this article is reinforcement learning, i.e., the process of acquiring new knowledge based upon an evaluative feedback, called reinforcement, received by the agent through interactions with the world. This article has two objectives: (1) to give a compact overview of reinforcement learning, and (2) to show that the evolution of the reinforcement learning paradigm has been driven by the need for more efficient learning through the addition of more structure to the learning agent. Therefore, both main ideas of reinforcement learning are introduced, and structural solutions to reinforcement learning are reviewed. Several architectural enhancements of the RL paradigm are discussed. These include incorporation of state information in the learning process, architectural solutions to learning with delayed reinforcement, dealing with structurally changing worlds through utilization of multiple models of the world, and focusing attention of the learning agent through active perception. The paper closes with an overview of directions for applications and for future research in this area. © 1993 John Wiley & Sons, Inc." @default.
- W1998556751 created "2016-06-24" @default.
- W1998556751 creator A5003723197 @default.
- W1998556751 creator A5008675858 @default.
- W1998556751 date "1993-01-01" @default.
- W1998556751 modified "2023-10-17" @default.
- W1998556751 title "Reinforcement learning: Architectures and algorithms" @default.
- W1998556751 cites W2064018461 @default.
- W1998556751 cites W2091565802 @default.
- W1998556751 doi "https://doi.org/10.1002/int.4550080805" @default.
- W1998556751 hasPublicationYear "1993" @default.
- W1998556751 type Work @default.
- W1998556751 sameAs 1998556751 @default.
- W1998556751 citedByCount "11" @default.
- W1998556751 countsByYear W19985567512017 @default.
- W1998556751 countsByYear W19985567512019 @default.
- W1998556751 countsByYear W19985567512020 @default.
- W1998556751 countsByYear W19985567512023 @default.
- W1998556751 crossrefType "journal-article" @default.
- W1998556751 hasAuthorship W1998556751A5003723197 @default.
- W1998556751 hasAuthorship W1998556751A5008675858 @default.
- W1998556751 hasBestOaLocation W19985567511 @default.
- W1998556751 hasConcept C111919701 @default.
- W1998556751 hasConcept C11413529 @default.
- W1998556751 hasConcept C123657996 @default.
- W1998556751 hasConcept C127413603 @default.
- W1998556751 hasConcept C142362112 @default.
- W1998556751 hasConcept C153349607 @default.
- W1998556751 hasConcept C154945302 @default.
- W1998556751 hasConcept C15744967 @default.
- W1998556751 hasConcept C169760540 @default.
- W1998556751 hasConcept C26760741 @default.
- W1998556751 hasConcept C41008148 @default.
- W1998556751 hasConcept C47932503 @default.
- W1998556751 hasConcept C48103436 @default.
- W1998556751 hasConcept C66938386 @default.
- W1998556751 hasConcept C67203356 @default.
- W1998556751 hasConcept C97541855 @default.
- W1998556751 hasConcept C98045186 @default.
- W1998556751 hasConceptScore W1998556751C111919701 @default.
- W1998556751 hasConceptScore W1998556751C11413529 @default.
- W1998556751 hasConceptScore W1998556751C123657996 @default.
- W1998556751 hasConceptScore W1998556751C127413603 @default.
- W1998556751 hasConceptScore W1998556751C142362112 @default.
- W1998556751 hasConceptScore W1998556751C153349607 @default.
- W1998556751 hasConceptScore W1998556751C154945302 @default.
- W1998556751 hasConceptScore W1998556751C15744967 @default.
- W1998556751 hasConceptScore W1998556751C169760540 @default.
- W1998556751 hasConceptScore W1998556751C26760741 @default.
- W1998556751 hasConceptScore W1998556751C41008148 @default.
- W1998556751 hasConceptScore W1998556751C47932503 @default.
- W1998556751 hasConceptScore W1998556751C48103436 @default.
- W1998556751 hasConceptScore W1998556751C66938386 @default.
- W1998556751 hasConceptScore W1998556751C67203356 @default.
- W1998556751 hasConceptScore W1998556751C97541855 @default.
- W1998556751 hasConceptScore W1998556751C98045186 @default.
- W1998556751 hasIssue "8" @default.
- W1998556751 hasLocation W19985567511 @default.
- W1998556751 hasOpenAccess W1998556751 @default.
- W1998556751 hasPrimaryLocation W19985567511 @default.
- W1998556751 hasRelatedWork W1882507001 @default.
- W1998556751 hasRelatedWork W1997664188 @default.
- W1998556751 hasRelatedWork W2061783822 @default.
- W1998556751 hasRelatedWork W2381453893 @default.
- W1998556751 hasRelatedWork W2909304650 @default.
- W1998556751 hasRelatedWork W3005560120 @default.
- W1998556751 hasRelatedWork W3009457412 @default.
- W1998556751 hasRelatedWork W3123425514 @default.
- W1998556751 hasRelatedWork W3143922204 @default.
- W1998556751 hasRelatedWork W4225393484 @default.
- W1998556751 hasVolume "8" @default.
- W1998556751 isParatext "false" @default.
- W1998556751 isRetracted "false" @default.
- W1998556751 magId "1998556751" @default.
- W1998556751 workType "article" @default.