Matches in SemOpenAlex for { <https://semopenalex.org/work/W4385433564> ?p ?o ?g. }
Showing items 1 to 95 of 95 with 100 items per page.
- W4385433564 endingPage "173" @default.
- W4385433564 startingPage "156" @default.
- W4385433564 abstract "The online 3D packing problem has received increasing attention in recent years due to its practical value. However, the problem itself possesses some peculiar properties, such as sequential decision-making and the large size of the state space, which have made the use of reinforcement learning with Markov decision processes a popular approach for solving this problem. In this paper, we focus on the problem of high variance in value estimation caused by reward uncertainty in the presence of highly uncertain dynamics. To address this, we propose a solution based on auxiliary tasks and intrinsic rewards for the online 3D bin packing problem, guided by a binary-valued network, to assist the agent in learning the policy within the framework of actor-critic deep reinforcement learning. Specifically, the maintenance of two-valued networks and the utilization of multi-valued network estimates are employed to replace the original value estimates, aiming to provide better guidance for the learning of policy networks. Experimentally, it has been demonstrated that our model can achieve more robust learning and outperform previous works in terms of performance." @default.
- W4385433564 created "2023-08-01" @default.
- W4385433564 creator A5054037584 @default.
- W4385433564 creator A5063051310 @default.
- W4385433564 date "2023-01-01" @default.
- W4385433564 modified "2023-10-16" @default.
- W4385433564 title "Online 3D Packing Problem Based on Bi-Value Guidance" @default.
- W4385433564 cites W1966211620 @default.
- W4385433564 cites W2107726111 @default.
- W4385433564 cites W2145339207 @default.
- W4385433564 cites W2257979135 @default.
- W4385433564 cites W2746553466 @default.
- W4385433564 cites W2897236562 @default.
- W4385433564 cites W2902907165 @default.
- W4385433564 cites W3170112077 @default.
- W4385433564 cites W3175494061 @default.
- W4385433564 cites W3209308771 @default.
- W4385433564 cites W4210542744 @default.
- W4385433564 cites W4280545054 @default.
- W4385433564 doi "https://doi.org/10.4236/jcc.2023.117010" @default.
- W4385433564 hasPublicationYear "2023" @default.
- W4385433564 type Work @default.
- W4385433564 citedByCount "0" @default.
- W4385433564 crossrefType "journal-article" @default.
- W4385433564 hasAuthorship W4385433564A5054037584 @default.
- W4385433564 hasAuthorship W4385433564A5063051310 @default.
- W4385433564 hasBestOaLocation W43854335641 @default.
- W4385433564 hasConcept C105795698 @default.
- W4385433564 hasConcept C106189395 @default.
- W4385433564 hasConcept C111919701 @default.
- W4385433564 hasConcept C119857082 @default.
- W4385433564 hasConcept C120665830 @default.
- W4385433564 hasConcept C121332964 @default.
- W4385433564 hasConcept C121955636 @default.
- W4385433564 hasConcept C126255220 @default.
- W4385433564 hasConcept C136764020 @default.
- W4385433564 hasConcept C144133560 @default.
- W4385433564 hasConcept C154945302 @default.
- W4385433564 hasConcept C159886148 @default.
- W4385433564 hasConcept C162853370 @default.
- W4385433564 hasConcept C192209626 @default.
- W4385433564 hasConcept C196083921 @default.
- W4385433564 hasConcept C2776291640 @default.
- W4385433564 hasConcept C2778572836 @default.
- W4385433564 hasConcept C2986087404 @default.
- W4385433564 hasConcept C33923547 @default.
- W4385433564 hasConcept C41008148 @default.
- W4385433564 hasConcept C4216890 @default.
- W4385433564 hasConcept C89249532 @default.
- W4385433564 hasConcept C97541855 @default.
- W4385433564 hasConcept C98763669 @default.
- W4385433564 hasConceptScore W4385433564C105795698 @default.
- W4385433564 hasConceptScore W4385433564C106189395 @default.
- W4385433564 hasConceptScore W4385433564C111919701 @default.
- W4385433564 hasConceptScore W4385433564C119857082 @default.
- W4385433564 hasConceptScore W4385433564C120665830 @default.
- W4385433564 hasConceptScore W4385433564C121332964 @default.
- W4385433564 hasConceptScore W4385433564C121955636 @default.
- W4385433564 hasConceptScore W4385433564C126255220 @default.
- W4385433564 hasConceptScore W4385433564C136764020 @default.
- W4385433564 hasConceptScore W4385433564C144133560 @default.
- W4385433564 hasConceptScore W4385433564C154945302 @default.
- W4385433564 hasConceptScore W4385433564C159886148 @default.
- W4385433564 hasConceptScore W4385433564C162853370 @default.
- W4385433564 hasConceptScore W4385433564C192209626 @default.
- W4385433564 hasConceptScore W4385433564C196083921 @default.
- W4385433564 hasConceptScore W4385433564C2776291640 @default.
- W4385433564 hasConceptScore W4385433564C2778572836 @default.
- W4385433564 hasConceptScore W4385433564C2986087404 @default.
- W4385433564 hasConceptScore W4385433564C33923547 @default.
- W4385433564 hasConceptScore W4385433564C41008148 @default.
- W4385433564 hasConceptScore W4385433564C4216890 @default.
- W4385433564 hasConceptScore W4385433564C89249532 @default.
- W4385433564 hasConceptScore W4385433564C97541855 @default.
- W4385433564 hasConceptScore W4385433564C98763669 @default.
- W4385433564 hasIssue "07" @default.
- W4385433564 hasLocation W43854335641 @default.
- W4385433564 hasOpenAccess W4385433564 @default.
- W4385433564 hasPrimaryLocation W43854335641 @default.
- W4385433564 hasRelatedWork W1556532828 @default.
- W4385433564 hasRelatedWork W1574991376 @default.
- W4385433564 hasRelatedWork W1985560493 @default.
- W4385433564 hasRelatedWork W1991138660 @default.
- W4385433564 hasRelatedWork W2146763310 @default.
- W4385433564 hasRelatedWork W2566307933 @default.
- W4385433564 hasRelatedWork W2937181779 @default.
- W4385433564 hasRelatedWork W3119377649 @default.
- W4385433564 hasRelatedWork W3198564127 @default.
- W4385433564 hasRelatedWork W4319083788 @default.
- W4385433564 hasVolume "11" @default.
- W4385433564 isParatext "false" @default.
- W4385433564 isRetracted "false" @default.
- W4385433564 workType "article" @default.