Matches in SemOpenAlex for { <https://semopenalex.org/work/W2033976720> ?p ?o ?g. }
- W2033976720 abstract "This thesis considers three complications that arise from applying reinforcement learning to a real-world application. In the process of using reinforcement learning to build an adaptive electronic market-maker, we find the sparsity of data, the partial observability of the domain, and the multiple objectives of the agent to cause serious problems for existing reinforcement learning algorithms. We employ importance sampling (likelihood ratios) to achieve good performance in partially observable Markov decision processes with few data. Our importance sampling estimator requires no knowledge about the environment and places few restrictions on the method of collecting data. It can be used efficiently with reactive controllers, finite-state controllers, or policies with function approximation. We present theoretical analyses of the estimator and incorporate it into a reinforcement learning algorithm. Additionally, this method provides a complete return surface which can be used to balance multiple objectives dynamically. We demonstrate the need for multiple goals in a variety of applications and natural solutions based on our sampling method. The thesis concludes with example results from employing our algorithm to the domain of automated electronic market-making. Thesis Supervisor: Tomaso Poggio Title: Professor of Brain and Cognitive Science" @default.
- W2033976720 created "2016-06-24" @default.
- W2033976720 creator A5001833084 @default.
- W2033976720 creator A5091230938 @default.
- W2033976720 date "2001-01-01" @default.
- W2033976720 modified "2023-10-07" @default.
- W2033976720 title "Importance sampling for reinforcement learning with multiple objectives" @default.
- W2033976720 cites W1514587017 @default.
- W2033976720 cites W1515891729 @default.
- W2033976720 cites W1554663460 @default.
- W2033976720 cites W1557760016 @default.
- W2033976720 cites W1576452626 @default.
- W2033976720 cites W1593706890 @default.
- W2033976720 cites W1596364083 @default.
- W2033976720 cites W1599491685 @default.
- W2033976720 cites W1600046456 @default.
- W2033976720 cites W1601974704 @default.
- W2033976720 cites W1640774615 @default.
- W2033976720 cites W1657542410 @default.
- W2033976720 cites W1748709110 @default.
- W2033976720 cites W1981329484 @default.
- W2033976720 cites W1985808284 @default.
- W2033976720 cites W2000836282 @default.
- W2033976720 cites W2025268472 @default.
- W2033976720 cites W2043881376 @default.
- W2033976720 cites W2057565703 @default.
- W2033976720 cites W2058209938 @default.
- W2033976720 cites W2094509095 @default.
- W2033976720 cites W2107726111 @default.
- W2033976720 cites W2109690032 @default.
- W2033976720 cites W2117428849 @default.
- W2033976720 cites W2119717200 @default.
- W2033976720 cites W2121863487 @default.
- W2033976720 cites W2125074935 @default.
- W2033976720 cites W2125838338 @default.
- W2033976720 cites W2133727275 @default.
- W2033976720 cites W2137466452 @default.
- W2033976720 cites W2147632348 @default.
- W2033976720 cites W2149126181 @default.
- W2033976720 cites W2151726636 @default.
- W2033976720 cites W2155027007 @default.
- W2033976720 cites W2156737235 @default.
- W2033976720 cites W2158145505 @default.
- W2033976720 cites W2160067530 @default.
- W2033976720 cites W2161521419 @default.
- W2033976720 cites W2164056559 @default.
- W2033976720 cites W6242441 @default.
- W2033976720 hasPublicationYear "2001" @default.
- W2033976720 type Work @default.
- W2033976720 sameAs 2033976720 @default.
- W2033976720 citedByCount "38" @default.
- W2033976720 countsByYear W20339767202013 @default.
- W2033976720 countsByYear W20339767202014 @default.
- W2033976720 countsByYear W20339767202015 @default.
- W2033976720 countsByYear W20339767202016 @default.
- W2033976720 countsByYear W20339767202017 @default.
- W2033976720 countsByYear W20339767202018 @default.
- W2033976720 countsByYear W20339767202019 @default.
- W2033976720 countsByYear W20339767202020 @default.
- W2033976720 countsByYear W20339767202021 @default.
- W2033976720 crossrefType "dissertation" @default.
- W2033976720 hasAuthorship W2033976720A5001833084 @default.
- W2033976720 hasAuthorship W2033976720A5091230938 @default.
- W2033976720 hasConcept C105795698 @default.
- W2033976720 hasConcept C106131492 @default.
- W2033976720 hasConcept C106189395 @default.
- W2033976720 hasConcept C119857082 @default.
- W2033976720 hasConcept C126255220 @default.
- W2033976720 hasConcept C134306372 @default.
- W2033976720 hasConcept C140779682 @default.
- W2033976720 hasConcept C154945302 @default.
- W2033976720 hasConcept C159886148 @default.
- W2033976720 hasConcept C163836022 @default.
- W2033976720 hasConcept C17098449 @default.
- W2033976720 hasConcept C17744445 @default.
- W2033976720 hasConcept C185429906 @default.
- W2033976720 hasConcept C199539241 @default.
- W2033976720 hasConcept C2779110517 @default.
- W2033976720 hasConcept C28826006 @default.
- W2033976720 hasConcept C31972630 @default.
- W2033976720 hasConcept C33923547 @default.
- W2033976720 hasConcept C36299963 @default.
- W2033976720 hasConcept C36503486 @default.
- W2033976720 hasConcept C41008148 @default.
- W2033976720 hasConcept C97541855 @default.
- W2033976720 hasConcept C98763669 @default.
- W2033976720 hasConceptScore W2033976720C105795698 @default.
- W2033976720 hasConceptScore W2033976720C106131492 @default.
- W2033976720 hasConceptScore W2033976720C106189395 @default.
- W2033976720 hasConceptScore W2033976720C119857082 @default.
- W2033976720 hasConceptScore W2033976720C126255220 @default.
- W2033976720 hasConceptScore W2033976720C134306372 @default.
- W2033976720 hasConceptScore W2033976720C140779682 @default.
- W2033976720 hasConceptScore W2033976720C154945302 @default.
- W2033976720 hasConceptScore W2033976720C159886148 @default.
- W2033976720 hasConceptScore W2033976720C163836022 @default.
- W2033976720 hasConceptScore W2033976720C17098449 @default.
- W2033976720 hasConceptScore W2033976720C17744445 @default.
- W2033976720 hasConceptScore W2033976720C185429906 @default.
- W2033976720 hasConceptScore W2033976720C199539241 @default.