Matches in SemOpenAlex for { <https://semopenalex.org/work/W4280491053> ?p ?o ?g. }
Showing items 1 to 94 of
94
with 100 items per page.
- W4280491053 endingPage "392" @default.
- W4280491053 startingPage "381" @default.
- W4280491053 abstract "In the world, most of the successes are results of long-term efforts. The reward of success is extremely high, but before that, a long-term investment process is required. People who are “myopic” only value short-term rewards and are unwilling to make early-stage investments, so they hardly get the ultimate success and the corresponding high rewards. Similarly, for a reinforcement learning (RL) model with long-delay rewards, the discount rate determines the strength of agent's “farsightedness”. In order to enable the trained agent to make a chain of correct choices and succeed finally, the feasible region of the discount rate is obtained through mathematical derivation in this paper firstly. It satisfies the “farsightedness” requirement of agent. Afterwards, in order to avoid the complicated problem of solving implicit equations in the process of choosing feasible solutions, a simple method is explored and verified by theoreti cal demonstration and mathematical experiments. Then, a series of RL experiments are designed and implemented to verify the validity of theory. Finally, the model is extended from the finite process to the infinite process. The validity of the extended model is verified by theories and experiments. The whole research not only reveals the significance of the discount rate, but also provides a theoretical basis as well as a practical method for the choice of discount rate in future researches." @default.
- W4280491053 created "2022-05-22" @default.
- W4280491053 creator A5008636323 @default.
- W4280491053 creator A5016595592 @default.
- W4280491053 creator A5040918429 @default.
- W4280491053 date "2022-04-01" @default.
- W4280491053 modified "2023-10-18" @default.
- W4280491053 title "Choice of discount rate in reinforcement learning with long-delay rewards" @default.
- W4280491053 cites W1453801241 @default.
- W4280491053 cites W1992640612 @default.
- W4280491053 cites W2030265113 @default.
- W4280491053 cites W2052919428 @default.
- W4280491053 cites W2110004219 @default.
- W4280491053 cites W2119579995 @default.
- W4280491053 cites W2257979135 @default.
- W4280491053 cites W2580475959 @default.
- W4280491053 cites W2754397685 @default.
- W4280491053 cites W2772526503 @default.
- W4280491053 cites W2810602713 @default.
- W4280491053 cites W2963948533 @default.
- W4280491053 cites W2979097258 @default.
- W4280491053 cites W3024896014 @default.
- W4280491053 cites W3034748593 @default.
- W4280491053 cites W3040879766 @default.
- W4280491053 cites W3042408539 @default.
- W4280491053 cites W3047515465 @default.
- W4280491053 cites W3084860706 @default.
- W4280491053 cites W3088314378 @default.
- W4280491053 cites W3089817617 @default.
- W4280491053 cites W3121153516 @default.
- W4280491053 doi "https://doi.org/10.23919/jsee.2022.000040" @default.
- W4280491053 hasPublicationYear "2022" @default.
- W4280491053 type Work @default.
- W4280491053 citedByCount "0" @default.
- W4280491053 crossrefType "journal-article" @default.
- W4280491053 hasAuthorship W4280491053A5008636323 @default.
- W4280491053 hasAuthorship W4280491053A5016595592 @default.
- W4280491053 hasAuthorship W4280491053A5040918429 @default.
- W4280491053 hasBestOaLocation W42804910531 @default.
- W4280491053 hasConcept C10138342 @default.
- W4280491053 hasConcept C111919701 @default.
- W4280491053 hasConcept C121332964 @default.
- W4280491053 hasConcept C126255220 @default.
- W4280491053 hasConcept C127413603 @default.
- W4280491053 hasConcept C154945302 @default.
- W4280491053 hasConcept C162324750 @default.
- W4280491053 hasConcept C182306322 @default.
- W4280491053 hasConcept C33923547 @default.
- W4280491053 hasConcept C41008148 @default.
- W4280491053 hasConcept C6177178 @default.
- W4280491053 hasConcept C61797465 @default.
- W4280491053 hasConcept C62520636 @default.
- W4280491053 hasConcept C66938386 @default.
- W4280491053 hasConcept C67203356 @default.
- W4280491053 hasConcept C97541855 @default.
- W4280491053 hasConcept C98045186 @default.
- W4280491053 hasConceptScore W4280491053C10138342 @default.
- W4280491053 hasConceptScore W4280491053C111919701 @default.
- W4280491053 hasConceptScore W4280491053C121332964 @default.
- W4280491053 hasConceptScore W4280491053C126255220 @default.
- W4280491053 hasConceptScore W4280491053C127413603 @default.
- W4280491053 hasConceptScore W4280491053C154945302 @default.
- W4280491053 hasConceptScore W4280491053C162324750 @default.
- W4280491053 hasConceptScore W4280491053C182306322 @default.
- W4280491053 hasConceptScore W4280491053C33923547 @default.
- W4280491053 hasConceptScore W4280491053C41008148 @default.
- W4280491053 hasConceptScore W4280491053C6177178 @default.
- W4280491053 hasConceptScore W4280491053C61797465 @default.
- W4280491053 hasConceptScore W4280491053C62520636 @default.
- W4280491053 hasConceptScore W4280491053C66938386 @default.
- W4280491053 hasConceptScore W4280491053C67203356 @default.
- W4280491053 hasConceptScore W4280491053C97541855 @default.
- W4280491053 hasConceptScore W4280491053C98045186 @default.
- W4280491053 hasFunder F4320321001 @default.
- W4280491053 hasIssue "2" @default.
- W4280491053 hasLocation W42804910531 @default.
- W4280491053 hasOpenAccess W4280491053 @default.
- W4280491053 hasPrimaryLocation W42804910531 @default.
- W4280491053 hasRelatedWork W2185410470 @default.
- W4280491053 hasRelatedWork W260766989 @default.
- W4280491053 hasRelatedWork W2909304650 @default.
- W4280491053 hasRelatedWork W2959276766 @default.
- W4280491053 hasRelatedWork W3074294383 @default.
- W4280491053 hasRelatedWork W3111983280 @default.
- W4280491053 hasRelatedWork W3139193008 @default.
- W4280491053 hasRelatedWork W3164468573 @default.
- W4280491053 hasRelatedWork W4206669594 @default.
- W4280491053 hasRelatedWork W4295941380 @default.
- W4280491053 hasVolume "33" @default.
- W4280491053 isParatext "false" @default.
- W4280491053 isRetracted "false" @default.
- W4280491053 workType "article" @default.