TY - JOUR
T1 - Antenna selection in energy harvesting relaying networks using Q-learning algorithm
AU - Ouyang, Daliang
AU - Zhao, Rui
AU - Li, Yuanjian
AU - Guo, Rongxin
AU - Wang, Yi
N1 - Publisher Copyright:
© 2013 China Institute of Communications.
PY - 2021/4
Y1 - 2021/4
N2 - In this paper, a novel opportunistic scheduling (OS) scheme with antenna selection (AS) for the energy harvesting (EH) cooperative communication system where the relay can harvest energy from the source transmission is proposed. In this considered scheme, we take into both traditional mathematical analysis and reinforcement learning (RL) scenarios with the power splitting (PS) factor constraint. For the case of traditional mathematical analysis of a fixed-PS factor, we derive an exact closed-form expressions for the ergodic capacity and outage probability in general signal-to-noise ratio (SNR) regime. Then, we combine the optimal PS factor with performance metrics to achieve the optimal transmission performance. Subsequently, based on the optimized PS factor, a RL technique called as Q-learning (QL) algorithm is proposed to derive the optimal antenna selection strategy. To highlight the performance advantage of the proposed QL with training the received SNR at the destination, we also examine the scenario of QL scheme with training channel between the relay and the destination. The results illustrate that, the optimized scheme is always superior to the fixed-PS factor scheme. In addition, a better system parameter setting with QL significantly outperforms the traditional mathematical analysis scheme.
AB - In this paper, a novel opportunistic scheduling (OS) scheme with antenna selection (AS) for the energy harvesting (EH) cooperative communication system where the relay can harvest energy from the source transmission is proposed. In this considered scheme, we take into both traditional mathematical analysis and reinforcement learning (RL) scenarios with the power splitting (PS) factor constraint. For the case of traditional mathematical analysis of a fixed-PS factor, we derive an exact closed-form expressions for the ergodic capacity and outage probability in general signal-to-noise ratio (SNR) regime. Then, we combine the optimal PS factor with performance metrics to achieve the optimal transmission performance. Subsequently, based on the optimized PS factor, a RL technique called as Q-learning (QL) algorithm is proposed to derive the optimal antenna selection strategy. To highlight the performance advantage of the proposed QL with training the received SNR at the destination, we also examine the scenario of QL scheme with training channel between the relay and the destination. The results illustrate that, the optimized scheme is always superior to the fixed-PS factor scheme. In addition, a better system parameter setting with QL significantly outperforms the traditional mathematical analysis scheme.
KW - antenna selection
KW - ergodic capacity
KW - optimal PS factor
KW - outage probability
KW - Q-learning
UR - http://www.scopus.com/inward/record.url?scp=85105142620&partnerID=8YFLogxK
U2 - 10.23919/JCC.2021.04.005
DO - 10.23919/JCC.2021.04.005
M3 - Article
AN - SCOPUS:85105142620
SN - 1673-5447
VL - 18
SP - 64
EP - 75
JO - China Communications
JF - China Communications
IS - 4
M1 - 9416921
ER -