TY - JOUR
T1 - Optimizing E-Commerce with Ensemble Learning and Iterative Clustering for Superior Product Selection
AU - Liu, Yuchen
AU - Wang, Meng
AU - Li, Gangmin
AU - Payne, Terry R.
AU - Yue, Yong
AU - Man, Ka Lok
N1 - Publisher Copyright:
© 2024 Korean Society for Internet Information. All rights reserved.
PY - 2024/10/31
Y1 - 2024/10/31
N2 - With the continuous growth of e-commerce sales, a robust product selection model is essential to maintain competitiveness and meet consumer demand. Current research primarily focuses on single models for sales prediction and lacks an integrated approach to sales forecasting and product selection. This paper proposes a comprehensive framework (VN-CPC) that combines sales forecasting with product selection to address these issues. We integrate a series of classical machine learning models, including Tree Models (XGBoost, LightGBM, CatBoost), Support Vector Machine (SVM), Bayesian Ridge, and Artificial Neural Networks (ANN), using a voting mechanism to determine the optimal weighting scheme. Our method demonstrates a lower Root Mean Square Error (RMSE) on collected Amazon data than individual models and other ensemble models. Furthermore, we employ a three-tiered clustering model: Initial Clustering, Refinement Clustering, and Final Clustering, based on our predictive model to refine product selection to specific categories. This integrated forecasting and selection framework can be more effectively applied in the dynamic e-commerce environment. It provides a robust tool for businesses to optimize their product offerings and stay ahead in a competitive market.
AB - With the continuous growth of e-commerce sales, a robust product selection model is essential to maintain competitiveness and meet consumer demand. Current research primarily focuses on single models for sales prediction and lacks an integrated approach to sales forecasting and product selection. This paper proposes a comprehensive framework (VN-CPC) that combines sales forecasting with product selection to address these issues. We integrate a series of classical machine learning models, including Tree Models (XGBoost, LightGBM, CatBoost), Support Vector Machine (SVM), Bayesian Ridge, and Artificial Neural Networks (ANN), using a voting mechanism to determine the optimal weighting scheme. Our method demonstrates a lower Root Mean Square Error (RMSE) on collected Amazon data than individual models and other ensemble models. Furthermore, we employ a three-tiered clustering model: Initial Clustering, Refinement Clustering, and Final Clustering, based on our predictive model to refine product selection to specific categories. This integrated forecasting and selection framework can be more effectively applied in the dynamic e-commerce environment. It provides a robust tool for businesses to optimize their product offerings and stay ahead in a competitive market.
KW - E-commerce Sales Forecasting
KW - Ensemble Machine Learning Models
KW - Iterative Clustering Analysis
KW - Product Attribute Extraction
KW - Product Selection Strategy
UR - http://www.scopus.com/inward/record.url?scp=85208717696&partnerID=8YFLogxK
U2 - 10.3837/tiis.2024.10.001
DO - 10.3837/tiis.2024.10.001
M3 - Article
AN - SCOPUS:85208717696
SN - 1976-7277
VL - 18
SP - 2818
EP - 2839
JO - KSII Transactions on Internet and Information Systems
JF - KSII Transactions on Internet and Information Systems
IS - 10
ER -