Matches in SemOpenAlex for { <https://semopenalex.org/work/W2100096361> ?p ?o ?g. }
- W2100096361 abstract "Partially observable environments pose a major challenge to the application of reinforcement learning algorithms. In such environments, due to the Markov property frequently being violated in the system state representation, situations can occur where an agent has insufficient information to decide on the optimal action. In such cases, it is necessary to determine when information gathering actions should be executed, that is, when the agent needs to reduce uncertainty about the current state before deciding on how to act. One possible solution that has been proposed in past research is to manually code rules for execution of information gathering actions in the policy using heuristic (and likely faulty) knowledge. However, such a solution requires explicit expert knowledge about actions which are information gathering. In this paper a flexible solution is proposed which automatically learns when to execute information gathering actions and furthermore to automatically discover which actions gather information. We present an evaluation in the Robo{C}up Keep{A}way domain that empirically shows the robustness of the proposed approach and its success in learning under varying degrees of partial observability. Hence, it eliminates the need for hand-coded rules, is flexible in different situations and does not require knowledge about information gathering actions." @default.
- W2100096361 created "2016-06-24" @default.
- W2100096361 creator A5009587907 @default.
- W2100096361 creator A5048451922 @default.
- W2100096361 creator A5052315074 @default.
- W2100096361 date "2009-01-01" @default.
- W2100096361 modified "2023-09-27" @default.
- W2100096361 title "Reinforcement Learning in RoboCup KeepAway with Partial Observability" @default.
- W2100096361 cites W1515851193 @default.
- W2100096361 cites W1555477527 @default.
- W2100096361 cites W1586030051 @default.
- W2100096361 cites W1987187457 @default.
- W2100096361 cites W2013614847 @default.
- W2100096361 cites W2041367235 @default.
- W2100096361 cites W2098432798 @default.
- W2100096361 cites W2104641222 @default.
- W2100096361 cites W2119567691 @default.
- W2100096361 cites W2121863487 @default.
- W2100096361 cites W2122410182 @default.
- W2100096361 cites W2145790759 @default.
- W2100096361 cites W2155791599 @default.
- W2100096361 cites W2168359464 @default.
- W2100096361 doi "https://doi.org/10.1109/wi-iat.2009.151" @default.
- W2100096361 hasPublicationYear "2009" @default.
- W2100096361 type Work @default.
- W2100096361 sameAs 2100096361 @default.
- W2100096361 citedByCount "1" @default.
- W2100096361 countsByYear W21000963612013 @default.
- W2100096361 crossrefType "proceedings-article" @default.
- W2100096361 hasAuthorship W2100096361A5009587907 @default.
- W2100096361 hasAuthorship W2100096361A5048451922 @default.
- W2100096361 hasAuthorship W2100096361A5052315074 @default.
- W2100096361 hasConcept C104317684 @default.
- W2100096361 hasConcept C111472728 @default.
- W2100096361 hasConcept C113336015 @default.
- W2100096361 hasConcept C119857082 @default.
- W2100096361 hasConcept C121332964 @default.
- W2100096361 hasConcept C138885662 @default.
- W2100096361 hasConcept C154945302 @default.
- W2100096361 hasConcept C162324750 @default.
- W2100096361 hasConcept C163836022 @default.
- W2100096361 hasConcept C17098449 @default.
- W2100096361 hasConcept C173801870 @default.
- W2100096361 hasConcept C175444787 @default.
- W2100096361 hasConcept C185592680 @default.
- W2100096361 hasConcept C189950617 @default.
- W2100096361 hasConcept C2780791683 @default.
- W2100096361 hasConcept C28826006 @default.
- W2100096361 hasConcept C33923547 @default.
- W2100096361 hasConcept C36299963 @default.
- W2100096361 hasConcept C41008148 @default.
- W2100096361 hasConcept C55493867 @default.
- W2100096361 hasConcept C62520636 @default.
- W2100096361 hasConcept C63479239 @default.
- W2100096361 hasConcept C97541855 @default.
- W2100096361 hasConcept C98763669 @default.
- W2100096361 hasConceptScore W2100096361C104317684 @default.
- W2100096361 hasConceptScore W2100096361C111472728 @default.
- W2100096361 hasConceptScore W2100096361C113336015 @default.
- W2100096361 hasConceptScore W2100096361C119857082 @default.
- W2100096361 hasConceptScore W2100096361C121332964 @default.
- W2100096361 hasConceptScore W2100096361C138885662 @default.
- W2100096361 hasConceptScore W2100096361C154945302 @default.
- W2100096361 hasConceptScore W2100096361C162324750 @default.
- W2100096361 hasConceptScore W2100096361C163836022 @default.
- W2100096361 hasConceptScore W2100096361C17098449 @default.
- W2100096361 hasConceptScore W2100096361C173801870 @default.
- W2100096361 hasConceptScore W2100096361C175444787 @default.
- W2100096361 hasConceptScore W2100096361C185592680 @default.
- W2100096361 hasConceptScore W2100096361C189950617 @default.
- W2100096361 hasConceptScore W2100096361C2780791683 @default.
- W2100096361 hasConceptScore W2100096361C28826006 @default.
- W2100096361 hasConceptScore W2100096361C33923547 @default.
- W2100096361 hasConceptScore W2100096361C36299963 @default.
- W2100096361 hasConceptScore W2100096361C41008148 @default.
- W2100096361 hasConceptScore W2100096361C55493867 @default.
- W2100096361 hasConceptScore W2100096361C62520636 @default.
- W2100096361 hasConceptScore W2100096361C63479239 @default.
- W2100096361 hasConceptScore W2100096361C97541855 @default.
- W2100096361 hasConceptScore W2100096361C98763669 @default.
- W2100096361 hasLocation W21000963611 @default.
- W2100096361 hasOpenAccess W2100096361 @default.
- W2100096361 hasPrimaryLocation W21000963611 @default.
- W2100096361 hasRelatedWork W1925600676 @default.
- W2100096361 hasRelatedWork W1950622696 @default.
- W2100096361 hasRelatedWork W1951274542 @default.
- W2100096361 hasRelatedWork W2024877309 @default.
- W2100096361 hasRelatedWork W2103064945 @default.
- W2100096361 hasRelatedWork W2119972318 @default.
- W2100096361 hasRelatedWork W2128467312 @default.
- W2100096361 hasRelatedWork W2138615822 @default.
- W2100096361 hasRelatedWork W2144059919 @default.
- W2100096361 hasRelatedWork W2153427071 @default.
- W2100096361 hasRelatedWork W2156734147 @default.
- W2100096361 hasRelatedWork W2230791532 @default.
- W2100096361 hasRelatedWork W2272929109 @default.
- W2100096361 hasRelatedWork W2284610481 @default.
- W2100096361 hasRelatedWork W2399554456 @default.
- W2100096361 hasRelatedWork W2403130487 @default.
- W2100096361 hasRelatedWork W2569873438 @default.