TY - JOUR
T1 - Optimizing Portfolios with Surrender Variable Annuities: A Deep Reinforcement Learning Approach
AU - Huang, Huifang
AU - Jin, Zhuo
AU - Wu, Fuke
AU - Yang, Hailiang
N1 - Publisher Copyright:
© 2025 The Author(s)
PY - 2026/1
Y1 - 2026/1
N2 - This paper investigates a portfolio optimization problem for an investor on asset allocation among risk-free asset, risky asset, and surrender variable annuity contracts featuring guaranteed minimum death benefit and guaranteed minimum maturity benefit subject to mortality and surrender risk. The investor's objective is to maximize the expected utility of the bequest at death or the expected utility of assets at contract maturity. On each trading day before the investor's death, the investor can adjust the allocation between risk-free and risky assets, invest in a new surrender variable annuity product. Especially, the policyholder may exercise partial or full surrender options for any existing variable annuity contract. This dynamic adjustment creates a high-dimensional state and action space, making traditional optimization methods inadequate. To address this, we utilize the Lee-Carter model to analyze Australian demographic data, predict mortality risk, simulate surrender risk based on market changes, and estimate the fair pricing of variable annuity contracts in the portfolio. Subsequently, we introduce a deep reinforcement learning algorithm within a simulated trading environment that independently models the dynamic behavior of various assets and underlying indices. The algorithm utilizes neural networks to analyze high-dimensional state variables and leverages the interactive capabilities of the agent to flexibly adapt to asset fluctuations, dynamically optimizing investment allocation. Additionally, we prove the global convergence of the algorithm under standard assumptions and validate its effectiveness in managing the complexities of high-dimensional portfolios, particularly in capturing mortality, surrender, and financial risks. Numerical experiments further demonstrate the stability and robustness of the algorithm, showcasing its advantages in complex insurance and financial scenarios.
AB - This paper investigates a portfolio optimization problem for an investor on asset allocation among risk-free asset, risky asset, and surrender variable annuity contracts featuring guaranteed minimum death benefit and guaranteed minimum maturity benefit subject to mortality and surrender risk. The investor's objective is to maximize the expected utility of the bequest at death or the expected utility of assets at contract maturity. On each trading day before the investor's death, the investor can adjust the allocation between risk-free and risky assets, invest in a new surrender variable annuity product. Especially, the policyholder may exercise partial or full surrender options for any existing variable annuity contract. This dynamic adjustment creates a high-dimensional state and action space, making traditional optimization methods inadequate. To address this, we utilize the Lee-Carter model to analyze Australian demographic data, predict mortality risk, simulate surrender risk based on market changes, and estimate the fair pricing of variable annuity contracts in the portfolio. Subsequently, we introduce a deep reinforcement learning algorithm within a simulated trading environment that independently models the dynamic behavior of various assets and underlying indices. The algorithm utilizes neural networks to analyze high-dimensional state variables and leverages the interactive capabilities of the agent to flexibly adapt to asset fluctuations, dynamically optimizing investment allocation. Additionally, we prove the global convergence of the algorithm under standard assumptions and validate its effectiveness in managing the complexities of high-dimensional portfolios, particularly in capturing mortality, surrender, and financial risks. Numerical experiments further demonstrate the stability and robustness of the algorithm, showcasing its advantages in complex insurance and financial scenarios.
KW - Deep reinforcement learning
KW - Guaranteed minimum death benefit
KW - Guaranteed minimum maturity benefit
KW - Optimal portfolio
KW - Surrender
KW - Variable annuity
UR - https://www.scopus.com/pages/publications/105022594689
U2 - 10.1016/j.insmatheco.2025.103177
DO - 10.1016/j.insmatheco.2025.103177
M3 - Article
SN - 0167-6687
VL - 126
JO - Insurance: Mathematics and Economics
JF - Insurance: Mathematics and Economics
M1 - 103177
ER -