TY - JOUR
T1 - A Multimodal Deep Reinforcement Learning Framework for Dynamic Pricing Optimisation in E-Commerce
AU - Liu, Yuchen
AU - Cui, Tianxiang
AU - Li, Gangmin
AU - Payne, Terry R.
AU - Yue, Yong
AU - Man, Ka Lok
N1 - Publisher Copyright:
© 2026 The Authors.
PY - 2026
Y1 - 2026
N2 - With the rapid expansion of e-commerce platforms, there is an increasing demand for pricing strategies that jointly optimise sales performance and profitability under highly competitive markets. To address this challenge, we propose a unified deep reinforcement learning (DRL) framework built upon ItaNet (Image, Text, and Attributes Fusion Network) for dynamic pricing in Amazon’s marketplace. By integrating multimodal product context into the DRL architecture, the framework enables pricing agents to reason over heterogeneous product characteristics and demand signals, leading to more adaptive pricing decisions. Extensive simulations across multiple DRL algorithms are conducted and compared against a range of strong baseline strategies. The results show that our approach achieves an average cumulative profit margin growth exceeding 16.7% aggregated across products and time steps, while incurring only a modest 0.3%–2% reduction in sales volume. This indicates a net improvement in total gross profit rather than gains driven by aggressive price undercutting. Overall, the findings demonstrate that ItaNet-based DRL policies deliver a more balanced and economically sustainable pricing strategy than traditional sales-driven approaches. The constructed dataset spans over 40 feature dimensions across 27 Amazon product categories, incorporating textual, visual, and numerical signals. Compared with commonly used review-centric datasets, this multimodal dataset better reflects real-world pricing environments and provides a strong foundation for future research on dynamic pricing and multimodal recommender systems.
AB - With the rapid expansion of e-commerce platforms, there is an increasing demand for pricing strategies that jointly optimise sales performance and profitability under highly competitive markets. To address this challenge, we propose a unified deep reinforcement learning (DRL) framework built upon ItaNet (Image, Text, and Attributes Fusion Network) for dynamic pricing in Amazon’s marketplace. By integrating multimodal product context into the DRL architecture, the framework enables pricing agents to reason over heterogeneous product characteristics and demand signals, leading to more adaptive pricing decisions. Extensive simulations across multiple DRL algorithms are conducted and compared against a range of strong baseline strategies. The results show that our approach achieves an average cumulative profit margin growth exceeding 16.7% aggregated across products and time steps, while incurring only a modest 0.3%–2% reduction in sales volume. This indicates a net improvement in total gross profit rather than gains driven by aggressive price undercutting. Overall, the findings demonstrate that ItaNet-based DRL policies deliver a more balanced and economically sustainable pricing strategy than traditional sales-driven approaches. The constructed dataset spans over 40 feature dimensions across 27 Amazon product categories, incorporating textual, visual, and numerical signals. Compared with commonly used review-centric datasets, this multimodal dataset better reflects real-world pricing environments and provides a strong foundation for future research on dynamic pricing and multimodal recommender systems.
KW - deep reinforcement learning
KW - dynamic pricing
KW - E-commerce pricing
KW - multimodal model
UR - https://www.scopus.com/pages/publications/105034772123
U2 - 10.1109/ACCESS.2026.3680140
DO - 10.1109/ACCESS.2026.3680140
M3 - Article
AN - SCOPUS:105034772123
SN - 2169-3536
VL - 14
SP - 57358
EP - 57386
JO - IEEE Access
JF - IEEE Access
ER -