TY - JOUR
T1 - Deep Reinforcement Learning Assisted Hybrid Five-Variable Modulation Scheme for DAB Converters to Reduce RMS Current and Expand ZVS Operation
AU - Feng, Zhichen
AU - Wen, Huiqing
AU - Han, Xu
AU - Bu, Qinglei
AU - Zhu, Yinxiao
AU - Wang, Guangyu
N1 - Publisher Copyright:
© 1986-2012 IEEE.
PY - 2024/7/1
Y1 - 2024/7/1
N2 - In order to enhance the conversion efficiency of dual-active-bridge converters, a hybrid five-variable (HFV) modulation strategy with the aid of deep reinforcement learning technique is proposed in this article. Specifically, deep deterministic policy gradient (DDPG) algorithm is employed to train the agent with the purpose of the lower root mean square (RMS) current and the zero voltage switching (ZVS) operation of all power switches under various conditions. Moreover, due to the complexity of HFV modulation with five control DoF, frequency-domain analysis is used to directly derive the operating expressions of the converter. Based on these, the trained DDPG agent can output the optimal values of control variables, which can lower the RMS current and achieve the widest ZVS operation with reduced computation burden. A thorough comparison among related works in terms of the RMS current and ZVS range was presented. Finally, the effectiveness of the proposed method is verified by experiments on a built prototype.
AB - In order to enhance the conversion efficiency of dual-active-bridge converters, a hybrid five-variable (HFV) modulation strategy with the aid of deep reinforcement learning technique is proposed in this article. Specifically, deep deterministic policy gradient (DDPG) algorithm is employed to train the agent with the purpose of the lower root mean square (RMS) current and the zero voltage switching (ZVS) operation of all power switches under various conditions. Moreover, due to the complexity of HFV modulation with five control DoF, frequency-domain analysis is used to directly derive the operating expressions of the converter. Based on these, the trained DDPG agent can output the optimal values of control variables, which can lower the RMS current and achieve the widest ZVS operation with reduced computation burden. A thorough comparison among related works in terms of the RMS current and ZVS range was presented. Finally, the effectiveness of the proposed method is verified by experiments on a built prototype.
KW - Deep deterministic policy gradient (DDPG)
KW - deep reinforcement learning (DRL)
KW - dual-active-bridge (DAB)
KW - root mean square (RMS) current
KW - zero voltage switching (ZVS)
UR - http://www.scopus.com/inward/record.url?scp=85191333591&partnerID=8YFLogxK
U2 - 10.1109/TPEL.2024.3392759
DO - 10.1109/TPEL.2024.3392759
M3 - Article
AN - SCOPUS:85191333591
SN - 0885-8993
VL - 39
SP - 8114
EP - 8128
JO - IEEE Transactions on Power Electronics
JF - IEEE Transactions on Power Electronics
IS - 7
ER -