Evaluating and Selecting Deep Reinforcement Learning Models for Optimal Dynamic Pricing: A Systematic Comparison of PPO, DDPG, and SAC

Yuchen Liu; Ka Lok Man; Gangmin Li; Terry R. Payne; Yong Yue

doi:10.1145/3640824.3640871

Yuchen Liu, Ka Lok Man^*, Gangmin Li, Terry R. Payne, Yong Yue

^*Corresponding author for this work

Department of Computing

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

7 Citations (Scopus)

Abstract

Given the plethora of available solutions, choosing an appropriate Deep Reinforcement Learning (DRL) model for dynamic pricing poses a significant challenge for practitioners. While many DRL solutions claim superior performance, a standardized framework for their evaluation is lacking. Addressing this gap, we introduce a novel framework and a set of metrics to select and assess DRL models systematically. To validate the utility of our framework, we critically compared three representative DRL models, emphasizing their performance in dynamic pricing tasks. Further ensuring the robustness of our assessment, we benchmarked these models against a well-established human agent policy. The DRL model that emerged as the most effective was rigorously tested on an Amazon dataset, demonstrating a notable performance boost of 5.64%. Our findings underscore the value of our proposed metrics and framework in guiding practitioners towards the most suitable DRL solution for dynamic pricing.

Original language	English
Title of host publication	Proceedings - 2024 8th International Conference on Control Engineering and Artificial Intelligence, CCEAI 2024
Editors	Wenqiang Zhang, Yong Yue, Marek Ogiela
Publisher	Association for Computing Machinery
Pages	215-219
Number of pages	5
ISBN (Electronic)	9798400707971
DOIs	https://doi.org/10.1145/3640824.3640871
Publication status	Published - 26 Jan 2024
Event	8th International Conference on Control Engineering and Artificial Intelligence, CCEAI 2024 - Shanghai, China; Duration: 26 Jan 2024 → 28 Jan 2024

Publication series

Name	ACM International Conference Proceeding Series

Conference

Conference	8th International Conference on Control Engineering and Artificial Intelligence, CCEAI 2024
Country/Territory	China
City	Shanghai
Period	26/01/24 → 28/01/24

Keywords

DDPG (Deep Deterministic Policy Gradient)
Deep Reinforcement Learning (DRL)
Dynamic Pricing
E-commerce
Inventory Management
Markov Decision Process
Model Evaluation
PPO (Proximal Policy Optimization)
Price Elasticity of Demand
SAC (Soft Actor-Critic)

Access to Document

10.1145/3640824.3640871

Cite this

Liu, Y., Man, K. L., Li, G., Payne, T. R., & Yue, Y. (2024). Evaluating and Selecting Deep Reinforcement Learning Models for Optimal Dynamic Pricing: A Systematic Comparison of PPO, DDPG, and SAC. In W. Zhang, Y. Yue, & M. Ogiela (Eds.), Proceedings - 2024 8th International Conference on Control Engineering and Artificial Intelligence, CCEAI 2024 (pp. 215-219). (ACM International Conference Proceeding Series). Association for Computing Machinery. https://doi.org/10.1145/3640824.3640871

Liu, Yuchen ; Man, Ka Lok ; Li, Gangmin et al. / Evaluating and Selecting Deep Reinforcement Learning Models for Optimal Dynamic Pricing : A Systematic Comparison of PPO, DDPG, and SAC. Proceedings - 2024 8th International Conference on Control Engineering and Artificial Intelligence, CCEAI 2024. editor / Wenqiang Zhang ; Yong Yue ; Marek Ogiela. Association for Computing Machinery, 2024. pp. 215-219 (ACM International Conference Proceeding Series).

@inproceedings{e6673f02a2164500a553116c80aaf339,

title = "Evaluating and Selecting Deep Reinforcement Learning Models for Optimal Dynamic Pricing: A Systematic Comparison of PPO, DDPG, and SAC",

abstract = "Given the plethora of available solutions, choosing an appropriate Deep Reinforcement Learning (DRL) model for dynamic pricing poses a significant challenge for practitioners. While many DRL solutions claim superior performance, a standardized framework for their evaluation is lacking. Addressing this gap, we introduce a novel framework and a set of metrics to select and assess DRL models systematically. To validate the utility of our framework, we critically compared three representative DRL models, emphasizing their performance in dynamic pricing tasks. Further ensuring the robustness of our assessment, we benchmarked these models against a well-established human agent policy. The DRL model that emerged as the most effective was rigorously tested on an Amazon dataset, demonstrating a notable performance boost of 5.64%. Our findings underscore the value of our proposed metrics and framework in guiding practitioners towards the most suitable DRL solution for dynamic pricing.",

keywords = "DDPG (Deep Deterministic Policy Gradient), Deep Reinforcement Learning (DRL), Dynamic Pricing, E-commerce, Inventory Management, Markov Decision Process, Model Evaluation, PPO (Proximal Policy Optimization), Price Elasticity of Demand, SAC (Soft Actor-Critic)",

author = "Yuchen Liu and Man, {Ka Lok} and Gangmin Li and Payne, {Terry R.} and Yong Yue",

note = "Publisher Copyright: {\textcopyright} 2024 ACM.; 8th International Conference on Control Engineering and Artificial Intelligence, CCEAI 2024 ; Conference date: 26-01-2024 Through 28-01-2024",

year = "2024",

month = jan,

day = "26",

doi = "10.1145/3640824.3640871",

language = "English",

series = "ACM International Conference Proceeding Series",

publisher = "Association for Computing Machinery",

pages = "215--219",

editor = "Wenqiang Zhang and Yong Yue and Marek Ogiela",

booktitle = "Proceedings - 2024 8th International Conference on Control Engineering and Artificial Intelligence, CCEAI 2024",

}

Liu, Y, Man, KL, Li, G, Payne, TR & Yue, Y 2024, Evaluating and Selecting Deep Reinforcement Learning Models for Optimal Dynamic Pricing: A Systematic Comparison of PPO, DDPG, and SAC. in W Zhang, Y Yue & M Ogiela (eds), Proceedings - 2024 8th International Conference on Control Engineering and Artificial Intelligence, CCEAI 2024. ACM International Conference Proceeding Series, Association for Computing Machinery, pp. 215-219, 8th International Conference on Control Engineering and Artificial Intelligence, CCEAI 2024, Shanghai, China, 26/01/24. https://doi.org/10.1145/3640824.3640871

Evaluating and Selecting Deep Reinforcement Learning Models for Optimal Dynamic Pricing: A Systematic Comparison of PPO, DDPG, and SAC. / Liu, Yuchen; Man, Ka Lok; Li, Gangmin et al.
Proceedings - 2024 8th International Conference on Control Engineering and Artificial Intelligence, CCEAI 2024. ed. / Wenqiang Zhang; Yong Yue; Marek Ogiela. Association for Computing Machinery, 2024. p. 215-219 (ACM International Conference Proceeding Series).

TY - GEN

T1 - Evaluating and Selecting Deep Reinforcement Learning Models for Optimal Dynamic Pricing

T2 - 8th International Conference on Control Engineering and Artificial Intelligence, CCEAI 2024

AU - Liu, Yuchen

AU - Man, Ka Lok

AU - Li, Gangmin

AU - Payne, Terry R.

AU - Yue, Yong

PY - 2024/1/26

Y1 - 2024/1/26

AB - Given the plethora of available solutions, choosing an appropriate Deep Reinforcement Learning (DRL) model for dynamic pricing poses a significant challenge for practitioners. While many DRL solutions claim superior performance, a standardized framework for their evaluation is lacking. Addressing this gap, we introduce a novel framework and a set of metrics to select and assess DRL models systematically. To validate the utility of our framework, we critically compared three representative DRL models, emphasizing their performance in dynamic pricing tasks. Further ensuring the robustness of our assessment, we benchmarked these models against a well-established human agent policy. The DRL model that emerged as the most effective was rigorously tested on an Amazon dataset, demonstrating a notable performance boost of 5.64%. Our findings underscore the value of our proposed metrics and framework in guiding practitioners towards the most suitable DRL solution for dynamic pricing.

KW - DDPG (Deep Deterministic Policy Gradient)

KW - Deep Reinforcement Learning (DRL)

KW - Dynamic Pricing

KW - E-commerce

KW - Inventory Management

KW - Markov Decision Process

KW - Model Evaluation

KW - PPO (Proximal Policy Optimization)

KW - Price Elasticity of Demand

KW - SAC (Soft Actor-Critic)

UR - http://www.scopus.com/inward/record.url?scp=85188251454&partnerID=8YFLogxK

U2 - 10.1145/3640824.3640871

DO - 10.1145/3640824.3640871

M3 - Conference Proceeding

AN - SCOPUS:85188251454

T3 - ACM International Conference Proceeding Series

SP - 215

EP - 219

BT - Proceedings - 2024 8th International Conference on Control Engineering and Artificial Intelligence, CCEAI 2024

A2 - Zhang, Wenqiang

A2 - Yue, Yong

A2 - Ogiela, Marek

PB - Association for Computing Machinery

Y2 - 26 January 2024 through 28 January 2024

ER -

Liu Y, Man KL, Li G, Payne TR, Yue Y. Evaluating and Selecting Deep Reinforcement Learning Models for Optimal Dynamic Pricing: A Systematic Comparison of PPO, DDPG, and SAC. In Zhang W, Yue Y, Ogiela M, editors, Proceedings - 2024 8th International Conference on Control Engineering and Artificial Intelligence, CCEAI 2024. Association for Computing Machinery. 2024. p. 215-219. (ACM International Conference Proceeding Series). doi: 10.1145/3640824.3640871
