Matches in SemOpenAlex for { <https://semopenalex.org/work/W4378907438> ?p ?o ?g. }
Showing items 1 to 81 of
81
with 100 items per page.
- W4378907438 endingPage "58538" @default.
- W4378907438 startingPage "58532" @default.
- W4378907438 abstract "Several real-world problems are modeled as multi-objective sequential decision-making problems with multiple competing objectives, and multi-objective reinforcement learning (MORL) has garnered attention as a solution to this problem. One of the challenges in obtaining the desired policy using MORL is that the priorities (hereafter, weights) for each objective must be designed in advance to scalarize the reward vector. Determining weights through trial-and-error burdens system designers, and methods to estimate weights are needed. The existing methods use inverse reinforcement learning (IRL), which is not scalable because it requires reinforcement learning several times until an optimal policy is obtained. This study proposes a weight interval estimation (WInter) method using adversarial IRL (AIRL). AIRL is a scalable framework that reduces the computational complexity of IRL by simultaneously estimating rewards and policies. WInter estimates the weight interval using the expert neighborhoods obtained during AIRL training. We successfully estimated the weight interval through experiments in a benchmark environment for multi-objective sequential decision-making problems in a continuous state space while reducing computational complexity compared to the existing methods." @default.
- W4378907438 created "2023-06-01" @default.
- W4378907438 creator A5025528905 @default.
- W4378907438 creator A5047470756 @default.
- W4378907438 date "2023-01-01" @default.
- W4378907438 modified "2023-09-23" @default.
- W4378907438 title "Objective Weight Interval Estimation Using Adversarial Inverse Reinforcement Learning" @default.
- W4378907438 cites W1999874108 @default.
- W4378907438 cites W2012612381 @default.
- W4378907438 cites W2031571562 @default.
- W4378907438 cites W2102847492 @default.
- W4378907438 cites W2158782408 @default.
- W4378907438 cites W2889973018 @default.
- W4378907438 cites W4285419493 @default.
- W4378907438 cites W4286447866 @default.
- W4378907438 cites W4302774934 @default.
- W4378907438 doi "https://doi.org/10.1109/access.2023.3281593" @default.
- W4378907438 hasPublicationYear "2023" @default.
- W4378907438 type Work @default.
- W4378907438 citedByCount "0" @default.
- W4378907438 crossrefType "journal-article" @default.
- W4378907438 hasAuthorship W4378907438A5025528905 @default.
- W4378907438 hasAuthorship W4378907438A5047470756 @default.
- W4378907438 hasBestOaLocation W43789074381 @default.
- W4378907438 hasConcept C105795698 @default.
- W4378907438 hasConcept C11413529 @default.
- W4378907438 hasConcept C114614502 @default.
- W4378907438 hasConcept C119857082 @default.
- W4378907438 hasConcept C126255220 @default.
- W4378907438 hasConcept C13280743 @default.
- W4378907438 hasConcept C154945302 @default.
- W4378907438 hasConcept C179799912 @default.
- W4378907438 hasConcept C185798385 @default.
- W4378907438 hasConcept C205167067 @default.
- W4378907438 hasConcept C205649164 @default.
- W4378907438 hasConcept C2778067643 @default.
- W4378907438 hasConcept C33923547 @default.
- W4378907438 hasConcept C37736160 @default.
- W4378907438 hasConcept C41008148 @default.
- W4378907438 hasConcept C44249647 @default.
- W4378907438 hasConcept C48044578 @default.
- W4378907438 hasConcept C77088390 @default.
- W4378907438 hasConcept C97541855 @default.
- W4378907438 hasConceptScore W4378907438C105795698 @default.
- W4378907438 hasConceptScore W4378907438C11413529 @default.
- W4378907438 hasConceptScore W4378907438C114614502 @default.
- W4378907438 hasConceptScore W4378907438C119857082 @default.
- W4378907438 hasConceptScore W4378907438C126255220 @default.
- W4378907438 hasConceptScore W4378907438C13280743 @default.
- W4378907438 hasConceptScore W4378907438C154945302 @default.
- W4378907438 hasConceptScore W4378907438C179799912 @default.
- W4378907438 hasConceptScore W4378907438C185798385 @default.
- W4378907438 hasConceptScore W4378907438C205167067 @default.
- W4378907438 hasConceptScore W4378907438C205649164 @default.
- W4378907438 hasConceptScore W4378907438C2778067643 @default.
- W4378907438 hasConceptScore W4378907438C33923547 @default.
- W4378907438 hasConceptScore W4378907438C37736160 @default.
- W4378907438 hasConceptScore W4378907438C41008148 @default.
- W4378907438 hasConceptScore W4378907438C44249647 @default.
- W4378907438 hasConceptScore W4378907438C48044578 @default.
- W4378907438 hasConceptScore W4378907438C77088390 @default.
- W4378907438 hasConceptScore W4378907438C97541855 @default.
- W4378907438 hasLocation W43789074381 @default.
- W4378907438 hasOpenAccess W4378907438 @default.
- W4378907438 hasPrimaryLocation W43789074381 @default.
- W4378907438 hasRelatedWork W112744582 @default.
- W4378907438 hasRelatedWork W1992807924 @default.
- W4378907438 hasRelatedWork W2789601449 @default.
- W4378907438 hasRelatedWork W3022038857 @default.
- W4378907438 hasRelatedWork W3095449511 @default.
- W4378907438 hasRelatedWork W3132110306 @default.
- W4378907438 hasRelatedWork W3171774521 @default.
- W4378907438 hasRelatedWork W4286893825 @default.
- W4378907438 hasRelatedWork W4319083788 @default.
- W4378907438 hasRelatedWork W4379255972 @default.
- W4378907438 hasVolume "11" @default.
- W4378907438 isParatext "false" @default.
- W4378907438 isRetracted "false" @default.
- W4378907438 workType "article" @default.