Matches in SemOpenAlex for { <https://semopenalex.org/work/W2889711700> ?p ?o ?g. }
- W2889711700 endingPage "1205" @default.
- W2889711700 startingPage "1186" @default.
- W2889711700 abstract "This paper presents a safe learning framework that employs an adaptive model learning algorithm together with barrier certificates for systems with possibly nonstationary agent dynamics. To extract the dynamic structure of the model, we use a sparse optimization technique. We use the learned model in combination with control barrier certificates which constrain policies (feedback controllers) in order to maintain safety, which refers to avoiding particular undesirable regions of the state space. Under certain conditions, recovery of safety in the sense of Lyapunov stability after violations of safety due to the nonstationarity is guaranteed. In addition, we reformulate an action-value function approximation to make any kernel-based nonlinear function estimation method applicable to our adaptive learning framework. Lastly, solutions to the barrier-certified policy optimization are guaranteed to be globally optimal, ensuring the greedy policy improvement under mild conditions. The resulting framework is validated via simulations of a quadrotor, which has previously been used under stationarity assumptions in the safe learnings literature, and is then tested on a real robot, the brushbot, whose dynamics is unknown, highly complex and nonstationary." @default.
- W2889711700 created "2018-09-27" @default.
- W2889711700 creator A5012278873 @default.
- W2889711700 creator A5017389000 @default.
- W2889711700 creator A5072586461 @default.
- W2889711700 creator A5091125105 @default.
- W2889711700 date "2019-10-01" @default.
- W2889711700 modified "2023-09-29" @default.
- W2889711700 title "Barrier-Certified Adaptive Reinforcement Learning With Applications to Brushbot Navigation" @default.
- W2889711700 cites W1506085041 @default.
- W2889711700 cites W1565176583 @default.
- W2889711700 cites W1677330902 @default.
- W2889711700 cites W1972149633 @default.
- W2889711700 cites W1977655452 @default.
- W2889711700 cites W1986014385 @default.
- W2889711700 cites W1986280275 @default.
- W2889711700 cites W1993288992 @default.
- W2889711700 cites W1996625075 @default.
- W2889711700 cites W2005778669 @default.
- W2889711700 cites W2014914755 @default.
- W2889711700 cites W2015904350 @default.
- W2889711700 cites W2018980815 @default.
- W2889711700 cites W2031863147 @default.
- W2889711700 cites W2042680115 @default.
- W2889711700 cites W2046513829 @default.
- W2889711700 cites W2051671655 @default.
- W2889711700 cites W2053572490 @default.
- W2889711700 cites W2060248504 @default.
- W2889711700 cites W2098102888 @default.
- W2889711700 cites W2100484286 @default.
- W2889711700 cites W2108995755 @default.
- W2889711700 cites W2118556122 @default.
- W2889711700 cites W2124039037 @default.
- W2889711700 cites W2148024708 @default.
- W2889711700 cites W2156974606 @default.
- W2889711700 cites W2162053828 @default.
- W2889711700 cites W2165726932 @default.
- W2889711700 cites W2344310085 @default.
- W2889711700 cites W2481926318 @default.
- W2889711700 cites W2588802774 @default.
- W2889711700 cites W2620840602 @default.
- W2889711700 cites W2730929966 @default.
- W2889711700 cites W2735010720 @default.
- W2889711700 cites W2910221532 @default.
- W2889711700 cites W2963148914 @default.
- W2889711700 cites W2964138223 @default.
- W2889711700 cites W3098713169 @default.
- W2889711700 cites W3098925401 @default.
- W2889711700 cites W3135791089 @default.
- W2889711700 cites W4214717370 @default.
- W2889711700 cites W4292249799 @default.
- W2889711700 cites W2025321626 @default.
- W2889711700 doi "https://doi.org/10.1109/tro.2019.2920206" @default.
- W2889711700 hasPublicationYear "2019" @default.
- W2889711700 type Work @default.
- W2889711700 sameAs 2889711700 @default.
- W2889711700 citedByCount "43" @default.
- W2889711700 countsByYear W28897117002019 @default.
- W2889711700 countsByYear W28897117002020 @default.
- W2889711700 countsByYear W28897117002021 @default.
- W2889711700 countsByYear W28897117002022 @default.
- W2889711700 countsByYear W28897117002023 @default.
- W2889711700 crossrefType "journal-article" @default.
- W2889711700 hasAuthorship W2889711700A5012278873 @default.
- W2889711700 hasAuthorship W2889711700A5017389000 @default.
- W2889711700 hasAuthorship W2889711700A5072586461 @default.
- W2889711700 hasAuthorship W2889711700A5091125105 @default.
- W2889711700 hasBestOaLocation W28897117001 @default.
- W2889711700 hasConcept C107464732 @default.
- W2889711700 hasConcept C112972136 @default.
- W2889711700 hasConcept C114614502 @default.
- W2889711700 hasConcept C119857082 @default.
- W2889711700 hasConcept C121332964 @default.
- W2889711700 hasConcept C126255220 @default.
- W2889711700 hasConcept C127413603 @default.
- W2889711700 hasConcept C133731056 @default.
- W2889711700 hasConcept C14646407 @default.
- W2889711700 hasConcept C154945302 @default.
- W2889711700 hasConcept C158622935 @default.
- W2889711700 hasConcept C2775924081 @default.
- W2889711700 hasConcept C33923547 @default.
- W2889711700 hasConcept C41008148 @default.
- W2889711700 hasConcept C47446073 @default.
- W2889711700 hasConcept C60640748 @default.
- W2889711700 hasConcept C62520636 @default.
- W2889711700 hasConcept C74193536 @default.
- W2889711700 hasConcept C77405623 @default.
- W2889711700 hasConcept C97541855 @default.
- W2889711700 hasConceptScore W2889711700C107464732 @default.
- W2889711700 hasConceptScore W2889711700C112972136 @default.
- W2889711700 hasConceptScore W2889711700C114614502 @default.
- W2889711700 hasConceptScore W2889711700C119857082 @default.
- W2889711700 hasConceptScore W2889711700C121332964 @default.
- W2889711700 hasConceptScore W2889711700C126255220 @default.
- W2889711700 hasConceptScore W2889711700C127413603 @default.
- W2889711700 hasConceptScore W2889711700C133731056 @default.
- W2889711700 hasConceptScore W2889711700C14646407 @default.
- W2889711700 hasConceptScore W2889711700C154945302 @default.