Matches in SemOpenAlex for { <https://semopenalex.org/work/W4313703382> ?p ?o ?g. }
- W4313703382 endingPage "334" @default.
- W4313703382 startingPage "318" @default.
- W4313703382 abstract "Dialogue policy learning (DPL) is a key component in a task-oriented dialogue (TOD) system. Its goal is to decide the next action of the dialogue system, given the dialogue state at each turn based on a learned dialogue policy. Reinforcement learning (RL) is widely used to optimize this dialogue policy. In the learning process, the user is regarded as the environment and the system as the agent. In this paper, we present an overview of the recent advances and challenges in dialogue policy from the perspective of RL. More specifically, we identify the problems and summarize corresponding solutions for RL-based dialogue policy learning. In addition, we provide a comprehensive survey of applying RL to DPL by categorizing recent methods into five basic elements in RL. We believe this survey can shed light on future research in DPL." @default.
- W4313703382 created "2023-01-08" @default.
- W4313703382 creator A5008208316 @default.
- W4313703382 creator A5009639089 @default.
- W4313703382 creator A5068169387 @default.
- W4313703382 creator A5077637321 @default.
- W4313703382 date "2023-01-07" @default.
- W4313703382 modified "2023-09-30" @default.
- W4313703382 title "A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning" @default.
- W4313703382 cites W143053999 @default.
- W4313703382 cites W1681299129 @default.
- W4313703382 cites W1975244201 @default.
- W4313703382 cites W1999874108 @default.
- W4313703382 cites W2005814556 @default.
- W4313703382 cites W2021151961 @default.
- W4313703382 cites W2022398498 @default.
- W4313703382 cites W2031571562 @default.
- W4313703382 cites W2062175565 @default.
- W4313703382 cites W2064675550 @default.
- W4313703382 cites W2089327077 @default.
- W4313703382 cites W2106547558 @default.
- W4313703382 cites W2108366050 @default.
- W4313703382 cites W2109038907 @default.
- W4313703382 cites W2109910161 @default.
- W4313703382 cites W2120045257 @default.
- W4313703382 cites W2121517924 @default.
- W4313703382 cites W2125612430 @default.
- W4313703382 cites W2132997613 @default.
- W4313703382 cites W2151814822 @default.
- W4313703382 cites W2152342063 @default.
- W4313703382 cites W2162046402 @default.
- W4313703382 cites W2165698076 @default.
- W4313703382 cites W2171486046 @default.
- W4313703382 cites W2257979135 @default.
- W4313703382 cites W2396863940 @default.
- W4313703382 cites W2603550564 @default.
- W4313703382 cites W2739936944 @default.
- W4313703382 cites W2765111838 @default.
- W4313703382 cites W2798494119 @default.
- W4313703382 cites W2806936550 @default.
- W4313703382 cites W2810840719 @default.
- W4313703382 cites W2889165300 @default.
- W4313703382 cites W2889186204 @default.
- W4313703382 cites W2892176966 @default.
- W4313703382 cites W2900082869 @default.
- W4313703382 cites W2915295540 @default.
- W4313703382 cites W2949476504 @default.
- W4313703382 cites W2951805158 @default.
- W4313703382 cites W2953071719 @default.
- W4313703382 cites W2962776342 @default.
- W4313703382 cites W2962852262 @default.
- W4313703382 cites W2962886331 @default.
- W4313703382 cites W2963043030 @default.
- W4313703382 cites W2963064439 @default.
- W4313703382 cites W2963140401 @default.
- W4313703382 cites W2963246392 @default.
- W4313703382 cites W2963252944 @default.
- W4313703382 cites W2963295373 @default.
- W4313703382 cites W2963306198 @default.
- W4313703382 cites W2963433587 @default.
- W4313703382 cites W2963963856 @default.
- W4313703382 cites W2963993502 @default.
- W4313703382 cites W2964018203 @default.
- W4313703382 cites W2964044380 @default.
- W4313703382 cites W2964080167 @default.
- W4313703382 cites W2964101860 @default.
- W4313703382 cites W2964180249 @default.
- W4313703382 cites W2964227312 @default.
- W4313703382 cites W2970828515 @default.
- W4313703382 cites W2970866659 @default.
- W4313703382 cites W2979372603 @default.
- W4313703382 cites W2979400990 @default.
- W4313703382 cites W2997108628 @default.
- W4313703382 cites W3034782127 @default.
- W4313703382 cites W3034930293 @default.
- W4313703382 cites W3035337525 @default.
- W4313703382 cites W3035597485 @default.
- W4313703382 cites W3088273075 @default.
- W4313703382 cites W3099293669 @default.
- W4313703382 cites W3101282510 @default.
- W4313703382 cites W3104546989 @default.
- W4313703382 cites W3105184920 @default.
- W4313703382 cites W3176706409 @default.
- W4313703382 cites W3214586773 @default.
- W4313703382 cites W4214717370 @default.
- W4313703382 cites W4290742115 @default.
- W4313703382 cites W4312609624 @default.
- W4313703382 cites W3034330559 @default.
- W4313703382 doi "https://doi.org/10.1007/s11633-022-1347-y" @default.
- W4313703382 hasPublicationYear "2023" @default.
- W4313703382 type Work @default.
- W4313703382 citedByCount "0" @default.
- W4313703382 crossrefType "journal-article" @default.
- W4313703382 hasAuthorship W4313703382A5008208316 @default.
- W4313703382 hasAuthorship W4313703382A5009639089 @default.
- W4313703382 hasAuthorship W4313703382A5068169387 @default.
- W4313703382 hasAuthorship W4313703382A5077637321 @default.
- W4313703382 hasBestOaLocation W43137033821 @default.