Matches in SemOpenAlex for { <https://semopenalex.org/work/W2912681837> ?p ?o ?g. }
- W2912681837 endingPage "632" @default.
- W2912681837 startingPage "621" @default.
- W2912681837 abstract "In this paper, a novel incremental learning algorithm is presented for reinforcement learning (RL) in dynamic environments, where the rewards of state-action pairs may change over time. The proposed incremental RL (IRL) algorithm learns from the dynamic environments without making any assumptions or having any prior knowledge about the ever-changing environment. First, IRL generates a detector-agent to detect the changed part of the environment (drift environment) by executing a virtual RL process. Then, the agent gives priority to the drift environment and its neighbor environment for iteratively updating their state-action value functions using new rewards by dynamic programming. After the prioritized sweeping process, IRL restarts a canonical learning process to obtain a new optimal policy adapting to the new environment. The novelty is that IRL fuses the new information into the existing knowledge system incrementally as well as weakening the conflict between them. The IRL algorithm is compared to two direct approaches and various state-of-the-art transfer learning methods for classical maze navigation problems and an intelligent warehouse with multiple robots. The experimental results verify that IRL can effectively improve the adaptability and efficiency of RL algorithms in dynamic environments." @default.
- W2912681837 created "2019-02-21" @default.
- W2912681837 creator A5000582423 @default.
- W2912681837 creator A5030229878 @default.
- W2912681837 creator A5051368351 @default.
- W2912681837 creator A5054800972 @default.
- W2912681837 creator A5059358901 @default.
- W2912681837 date "2019-04-01" @default.
- W2912681837 modified "2023-10-11" @default.
- W2912681837 title "Incremental Reinforcement Learning With Prioritized Sweeping for Dynamic Environments" @default.
- W2912681837 cites W1557517019 @default.
- W2912681837 cites W1969302761 @default.
- W2912681837 cites W1979930838 @default.
- W2912681837 cites W1985602045 @default.
- W2912681837 cites W1994955764 @default.
- W2912681837 cites W1997641400 @default.
- W2912681837 cites W2012060221 @default.
- W2912681837 cites W2020573190 @default.
- W2912681837 cites W2032277247 @default.
- W2912681837 cites W2056584142 @default.
- W2912681837 cites W2059836092 @default.
- W2912681837 cites W2063640990 @default.
- W2912681837 cites W2092886985 @default.
- W2912681837 cites W2106304233 @default.
- W2912681837 cites W2110422826 @default.
- W2912681837 cites W2117941808 @default.
- W2912681837 cites W2125627762 @default.
- W2912681837 cites W2129427976 @default.
- W2912681837 cites W2132269536 @default.
- W2912681837 cites W2139047213 @default.
- W2912681837 cites W2142394576 @default.
- W2912681837 cites W2145339207 @default.
- W2912681837 cites W2147585405 @default.
- W2912681837 cites W2148511931 @default.
- W2912681837 cites W2156413953 @default.
- W2912681837 cites W2160512933 @default.
- W2912681837 cites W2161308859 @default.
- W2912681837 cites W2199825521 @default.
- W2912681837 cites W2259258048 @default.
- W2912681837 cites W2429251376 @default.
- W2912681837 cites W2430202818 @default.
- W2912681837 cites W2526845620 @default.
- W2912681837 cites W2766447205 @default.
- W2912681837 cites W2808421695 @default.
- W2912681837 cites W3041202696 @default.
- W2912681837 cites W4245108548 @default.
- W2912681837 cites W778742492 @default.
- W2912681837 doi "https://doi.org/10.1109/tmech.2019.2899365" @default.
- W2912681837 hasPublicationYear "2019" @default.
- W2912681837 type Work @default.
- W2912681837 sameAs 2912681837 @default.
- W2912681837 citedByCount "37" @default.
- W2912681837 countsByYear W29126818372018 @default.
- W2912681837 countsByYear W29126818372019 @default.
- W2912681837 countsByYear W29126818372020 @default.
- W2912681837 countsByYear W29126818372021 @default.
- W2912681837 countsByYear W29126818372022 @default.
- W2912681837 countsByYear W29126818372023 @default.
- W2912681837 crossrefType "journal-article" @default.
- W2912681837 hasAuthorship W2912681837A5000582423 @default.
- W2912681837 hasAuthorship W2912681837A5030229878 @default.
- W2912681837 hasAuthorship W2912681837A5051368351 @default.
- W2912681837 hasAuthorship W2912681837A5054800972 @default.
- W2912681837 hasAuthorship W2912681837A5059358901 @default.
- W2912681837 hasConcept C111919701 @default.
- W2912681837 hasConcept C11413529 @default.
- W2912681837 hasConcept C119857082 @default.
- W2912681837 hasConcept C121332964 @default.
- W2912681837 hasConcept C138885662 @default.
- W2912681837 hasConcept C154945302 @default.
- W2912681837 hasConcept C177606310 @default.
- W2912681837 hasConcept C18903297 @default.
- W2912681837 hasConcept C27206212 @default.
- W2912681837 hasConcept C2778738651 @default.
- W2912681837 hasConcept C2778924833 @default.
- W2912681837 hasConcept C2780791683 @default.
- W2912681837 hasConcept C37404715 @default.
- W2912681837 hasConcept C41008148 @default.
- W2912681837 hasConcept C48103436 @default.
- W2912681837 hasConcept C62520636 @default.
- W2912681837 hasConcept C86803240 @default.
- W2912681837 hasConcept C90509273 @default.
- W2912681837 hasConcept C97541855 @default.
- W2912681837 hasConcept C98045186 @default.
- W2912681837 hasConceptScore W2912681837C111919701 @default.
- W2912681837 hasConceptScore W2912681837C11413529 @default.
- W2912681837 hasConceptScore W2912681837C119857082 @default.
- W2912681837 hasConceptScore W2912681837C121332964 @default.
- W2912681837 hasConceptScore W2912681837C138885662 @default.
- W2912681837 hasConceptScore W2912681837C154945302 @default.
- W2912681837 hasConceptScore W2912681837C177606310 @default.
- W2912681837 hasConceptScore W2912681837C18903297 @default.
- W2912681837 hasConceptScore W2912681837C27206212 @default.
- W2912681837 hasConceptScore W2912681837C2778738651 @default.
- W2912681837 hasConceptScore W2912681837C2778924833 @default.
- W2912681837 hasConceptScore W2912681837C2780791683 @default.
- W2912681837 hasConceptScore W2912681837C37404715 @default.
- W2912681837 hasConceptScore W2912681837C41008148 @default.