Matches in SemOpenAlex for { <https://semopenalex.org/work/W3205656755> ?p ?o ?g. }
- W3205656755 endingPage "184" @default.
- W3205656755 startingPage "171" @default.
- W3205656755 abstract "In this paper, we study the outage probability minimizing problem in a two-hop cooperative relay network. To reduce outage probability, existing studies propose many schemes for relay selection and power allocation, which are usually based on the assumption of exact channel state information (CSI). However, it is difficult to obtain perfect instantaneous CSI in practical situations where channel states change rapidly, and thus traditional methods would not perform well. Considering these factors, we turn to the emerging reinforcement learning (RL) methods for solutions. RL methods do not need any prior knowledge of CSI, but use neural network for approximation and decision after interacting with communication environment. Nevertheless, conventional RL methods, including most deep reinforcement learning (DRL) methods, cannot perform well when the search space is too large. In addition, non-stationarity is a common problem when using hierarchical reinforcement learning (HRL), which is caused by the changing behavior in different hierarchies. Therefore, we first propose a DRL framework with an outage-based reward function, which is then used as a baseline. Then, we further design an HRL framework and training algorithm. By decomposing relay selection and power allocation into two hierarchical optimization objectives, and combining on- policy and off-policy methods in the HRL framework, our method successfully address the sparse reward and non-stationary problem. Simulation results reveal that compared with traditional DRL method, the proposed HRL training algorithm can converge faster and reduce the outage probability by 8% in two-hop relay network with the same outage threshold." @default.
- W3205656755 created "2021-10-25" @default.
- W3205656755 creator A5034541026 @default.
- W3205656755 creator A5037894061 @default.
- W3205656755 creator A5080763885 @default.
- W3205656755 creator A5082580072 @default.
- W3205656755 date "2022-01-01" @default.
- W3205656755 modified "2023-10-15" @default.
- W3205656755 title "Hierarchical Reinforcement Learning for Relay Selection and Power Optimization in Two-Hop Cooperative Relay Network" @default.
- W3205656755 cites W1980310952 @default.
- W3205656755 cites W2033644422 @default.
- W3205656755 cites W2084779837 @default.
- W3205656755 cites W2108427351 @default.
- W3205656755 cites W2120448104 @default.
- W3205656755 cites W2126080547 @default.
- W3205656755 cites W2137809208 @default.
- W3205656755 cites W2145339207 @default.
- W3205656755 cites W2315140361 @default.
- W3205656755 cites W2344445507 @default.
- W3205656755 cites W2565358133 @default.
- W3205656755 cites W2599312801 @default.
- W3205656755 cites W2791110587 @default.
- W3205656755 cites W2791781667 @default.
- W3205656755 cites W2890860720 @default.
- W3205656755 cites W2901561492 @default.
- W3205656755 cites W2904491988 @default.
- W3205656755 cites W2929863751 @default.
- W3205656755 cites W2942184420 @default.
- W3205656755 cites W2947622545 @default.
- W3205656755 cites W2954168748 @default.
- W3205656755 cites W2960729702 @default.
- W3205656755 cites W2963761387 @default.
- W3205656755 cites W2978207438 @default.
- W3205656755 cites W2981559750 @default.
- W3205656755 cites W3000245973 @default.
- W3205656755 cites W3002189898 @default.
- W3205656755 cites W3020522931 @default.
- W3205656755 cites W3041207562 @default.
- W3205656755 cites W3054258700 @default.
- W3205656755 cites W3085955789 @default.
- W3205656755 cites W3100839090 @default.
- W3205656755 cites W3124919228 @default.
- W3205656755 cites W3124943657 @default.
- W3205656755 doi "https://doi.org/10.1109/tcomm.2021.3119689" @default.
- W3205656755 hasPublicationYear "2022" @default.
- W3205656755 type Work @default.
- W3205656755 sameAs 3205656755 @default.
- W3205656755 citedByCount "7" @default.
- W3205656755 countsByYear W32056567552022 @default.
- W3205656755 countsByYear W32056567552023 @default.
- W3205656755 crossrefType "journal-article" @default.
- W3205656755 hasAuthorship W3205656755A5034541026 @default.
- W3205656755 hasAuthorship W3205656755A5037894061 @default.
- W3205656755 hasAuthorship W3205656755A5080763885 @default.
- W3205656755 hasAuthorship W3205656755A5082580072 @default.
- W3205656755 hasBestOaLocation W32056567552 @default.
- W3205656755 hasConcept C11413529 @default.
- W3205656755 hasConcept C121332964 @default.
- W3205656755 hasConcept C126255220 @default.
- W3205656755 hasConcept C127162648 @default.
- W3205656755 hasConcept C137836250 @default.
- W3205656755 hasConcept C148063708 @default.
- W3205656755 hasConcept C154945302 @default.
- W3205656755 hasConcept C163258240 @default.
- W3205656755 hasConcept C25906391 @default.
- W3205656755 hasConcept C2778156585 @default.
- W3205656755 hasConcept C31258907 @default.
- W3205656755 hasConcept C33923547 @default.
- W3205656755 hasConcept C41008148 @default.
- W3205656755 hasConcept C50644808 @default.
- W3205656755 hasConcept C555944384 @default.
- W3205656755 hasConcept C62520636 @default.
- W3205656755 hasConcept C76155785 @default.
- W3205656755 hasConcept C81917197 @default.
- W3205656755 hasConcept C97541855 @default.
- W3205656755 hasConceptScore W3205656755C11413529 @default.
- W3205656755 hasConceptScore W3205656755C121332964 @default.
- W3205656755 hasConceptScore W3205656755C126255220 @default.
- W3205656755 hasConceptScore W3205656755C127162648 @default.
- W3205656755 hasConceptScore W3205656755C137836250 @default.
- W3205656755 hasConceptScore W3205656755C148063708 @default.
- W3205656755 hasConceptScore W3205656755C154945302 @default.
- W3205656755 hasConceptScore W3205656755C163258240 @default.
- W3205656755 hasConceptScore W3205656755C25906391 @default.
- W3205656755 hasConceptScore W3205656755C2778156585 @default.
- W3205656755 hasConceptScore W3205656755C31258907 @default.
- W3205656755 hasConceptScore W3205656755C33923547 @default.
- W3205656755 hasConceptScore W3205656755C41008148 @default.
- W3205656755 hasConceptScore W3205656755C50644808 @default.
- W3205656755 hasConceptScore W3205656755C555944384 @default.
- W3205656755 hasConceptScore W3205656755C62520636 @default.
- W3205656755 hasConceptScore W3205656755C76155785 @default.
- W3205656755 hasConceptScore W3205656755C81917197 @default.
- W3205656755 hasConceptScore W3205656755C97541855 @default.
- W3205656755 hasFunder F4320321001 @default.
- W3205656755 hasFunder F4320321885 @default.
- W3205656755 hasIssue "1" @default.
- W3205656755 hasLocation W32056567551 @default.