Matches in SemOpenAlex for { <https://semopenalex.org/work/W4386953140> ?p ?o ?g. }
Showing items 1 to 84 of
84
with 100 items per page.
- W4386953140 abstract "The work solves the problem of valve stem position control using a tabular, off-policy Monte Carlo reinforcement learning. The work proposes a model of control valve, involving the stem friction. The control problem is seen as a finite Markov decision process. The reinforcement learning algorithm is a slight modification of the Sutton–Barto Monte Carlo policy estimation method. Such method learns value functions and optimal policies from experience in the form of learning episodes. The modification involves shortening of learning episodes, for their more economical exploitation. As the original algorithm, the method can be used to learn optimal behavior directly from interaction with the environment, with no model of the environment’s dynamics. It is easy and efficient to focus such a method on a small subset of states. Such method does not bootstrap and therefore it is less harmed by violations of the Markov property. The proposed control method has been verified by a series of computer simulations, where it is compared with a three-state bang-bang controller. The simulations show that the proposed controller performs stable and as desired. The proposed method requires very few tuning hyperparameters, which are easy to tune. The target policy learns in few minutes, when interacting with the valve model. The work provides insights on possibilities and limitations of the Monte Carlo learning method, which is still unsettled." @default.
- W4386953140 created "2023-09-23" @default.
- W4386953140 creator A5003093339 @default.
- W4386953140 date "2023-07-19" @default.
- W4386953140 modified "2023-09-27" @default.
- W4386953140 title "Tabular Monte Carlo Reinforcement Learning for Valve Position Control" @default.
- W4386953140 cites W152727074 @default.
- W4386953140 cites W1992526709 @default.
- W4386953140 cites W2053269063 @default.
- W4386953140 cites W2064815992 @default.
- W4386953140 cites W2066260126 @default.
- W4386953140 cites W2068538749 @default.
- W4386953140 cites W2530839267 @default.
- W4386953140 cites W2613233762 @default.
- W4386953140 cites W2754517384 @default.
- W4386953140 cites W2809540795 @default.
- W4386953140 cites W2897090992 @default.
- W4386953140 cites W3100789280 @default.
- W4386953140 cites W3149836039 @default.
- W4386953140 cites W3194869541 @default.
- W4386953140 cites W4220747123 @default.
- W4386953140 doi "https://doi.org/10.1109/iceccme57830.2023.10253420" @default.
- W4386953140 hasPublicationYear "2023" @default.
- W4386953140 type Work @default.
- W4386953140 citedByCount "0" @default.
- W4386953140 crossrefType "proceedings-article" @default.
- W4386953140 hasAuthorship W4386953140A5003093339 @default.
- W4386953140 hasConcept C10138342 @default.
- W4386953140 hasConcept C105795698 @default.
- W4386953140 hasConcept C106189395 @default.
- W4386953140 hasConcept C126255220 @default.
- W4386953140 hasConcept C154945302 @default.
- W4386953140 hasConcept C159886148 @default.
- W4386953140 hasConcept C162324750 @default.
- W4386953140 hasConcept C188116033 @default.
- W4386953140 hasConcept C19499675 @default.
- W4386953140 hasConcept C198082294 @default.
- W4386953140 hasConcept C203479927 @default.
- W4386953140 hasConcept C2775924081 @default.
- W4386953140 hasConcept C33923547 @default.
- W4386953140 hasConcept C41008148 @default.
- W4386953140 hasConcept C46149586 @default.
- W4386953140 hasConcept C47446073 @default.
- W4386953140 hasConcept C6557445 @default.
- W4386953140 hasConcept C8642999 @default.
- W4386953140 hasConcept C86803240 @default.
- W4386953140 hasConcept C97541855 @default.
- W4386953140 hasConceptScore W4386953140C10138342 @default.
- W4386953140 hasConceptScore W4386953140C105795698 @default.
- W4386953140 hasConceptScore W4386953140C106189395 @default.
- W4386953140 hasConceptScore W4386953140C126255220 @default.
- W4386953140 hasConceptScore W4386953140C154945302 @default.
- W4386953140 hasConceptScore W4386953140C159886148 @default.
- W4386953140 hasConceptScore W4386953140C162324750 @default.
- W4386953140 hasConceptScore W4386953140C188116033 @default.
- W4386953140 hasConceptScore W4386953140C19499675 @default.
- W4386953140 hasConceptScore W4386953140C198082294 @default.
- W4386953140 hasConceptScore W4386953140C203479927 @default.
- W4386953140 hasConceptScore W4386953140C2775924081 @default.
- W4386953140 hasConceptScore W4386953140C33923547 @default.
- W4386953140 hasConceptScore W4386953140C41008148 @default.
- W4386953140 hasConceptScore W4386953140C46149586 @default.
- W4386953140 hasConceptScore W4386953140C47446073 @default.
- W4386953140 hasConceptScore W4386953140C6557445 @default.
- W4386953140 hasConceptScore W4386953140C8642999 @default.
- W4386953140 hasConceptScore W4386953140C86803240 @default.
- W4386953140 hasConceptScore W4386953140C97541855 @default.
- W4386953140 hasFunder F4320335322 @default.
- W4386953140 hasLocation W43869531401 @default.
- W4386953140 hasOpenAccess W4386953140 @default.
- W4386953140 hasPrimaryLocation W43869531401 @default.
- W4386953140 hasRelatedWork W1511927616 @default.
- W4386953140 hasRelatedWork W1556532828 @default.
- W4386953140 hasRelatedWork W2146763310 @default.
- W4386953140 hasRelatedWork W2182304831 @default.
- W4386953140 hasRelatedWork W2937181779 @default.
- W4386953140 hasRelatedWork W3167472281 @default.
- W4386953140 hasRelatedWork W4287719180 @default.
- W4386953140 hasRelatedWork W4382935469 @default.
- W4386953140 hasRelatedWork W4386874825 @default.
- W4386953140 hasRelatedWork W4386953140 @default.
- W4386953140 isParatext "false" @default.
- W4386953140 isRetracted "false" @default.
- W4386953140 workType "article" @default.