TY - JOUR
T1 - A Deep Reinforcement Learning Framework for 3L-NPC-DAB Converters With Multiple-Degree-of-Freedom Phase-Shift Control
AU - Feng, Zhichen
AU - Wen, Huiqing
AU - Han, Xu
AU - Wang, Guangyu
AU - Zhu, Yinxiao
AU - Rodrigues, Jose
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2025
Y1 - 2025
N2 - The three-level neutral-point-clamped dual-active-bridge (3L-NPC-DAB) converter offers several advantages over the conventional two-level DAB converter, including enhanced control flexibility and the ability to operate with lower voltage-rated power switches. However, with the increase in the number of power devices and multi-objective control requirements, traditional phase-shift control cannot meet the requirements. In order to further improve the conversion efficiency, this study introduces a deep reinforcement learning (DRL) optimization scheme based on the five control degrees of freedom (5-DoF) technique. The proposed scheme utilizes the deep deterministic policy gradient (DDPG) algorithm to minimize power losses and determine optimal control solutions. Through training, the DDPG agent acts as a predictor, enabling effective control decisions for maximizing conversion efficiency across various operating conditions. The effectiveness of the proposed method is validated through experimental results obtained from a laboratory proto-type.
AB - The three-level neutral-point-clamped dual-active-bridge (3L-NPC-DAB) converter offers several advantages over the conventional two-level DAB converter, including enhanced control flexibility and the ability to operate with lower voltage-rated power switches. However, with the increase in the number of power devices and multi-objective control requirements, traditional phase-shift control cannot meet the requirements. In order to further improve the conversion efficiency, this study introduces a deep reinforcement learning (DRL) optimization scheme based on the five control degrees of freedom (5-DoF) technique. The proposed scheme utilizes the deep deterministic policy gradient (DDPG) algorithm to minimize power losses and determine optimal control solutions. Through training, the DDPG agent acts as a predictor, enabling effective control decisions for maximizing conversion efficiency across various operating conditions. The effectiveness of the proposed method is validated through experimental results obtained from a laboratory proto-type.
KW - Deep Reinforcement Learning (DRL)
KW - Dual-Active-Bridge (DAB)
KW - efficiency optimization control
KW - Three-Level
UR - http://www.scopus.com/inward/record.url?scp=85218950585&partnerID=8YFLogxK
U2 - 10.1109/TTE.2025.3545834
DO - 10.1109/TTE.2025.3545834
M3 - Article
AN - SCOPUS:85218950585
SN - 2332-7782
JO - IEEE Transactions on Transportation Electrification
JF - IEEE Transactions on Transportation Electrification
ER -