A Q-learning-based approach for deploying dynamic service function chains

Jian Sun; Guanhua Huang; Gang Sun; Hongfang Yu; Arun Kumar Sangaiah; Victor Chang

doi:10.3390/sym10110646

Jian Sun, Guanhua Huang, Gang Sun*, Hongfang Yu, Arun Kumar Sangaiah, Victor Chang

*Corresponding author for this work

International Business School Suzhou

Research output: Contribution to journal › Article › peer-review

33 Citations (Scopus)

Abstract

As the size and service requirements of today's networks gradually increase, large numbers of proprietary devices are deployed, which leads to network complexity and information security crises and makes the management of network services and service providers increasingly difficult. Network function virtualization (NFV) technology is one solution to this problem. NFV separates network functions from hardware and deploys them as software on a common server. NFV can be used to improve service flexibility and isolate the services provided for each user, thus guaranteeing the security of user data. The use of NFV technology therefore raises many problems worth studying. For example, when there is a free choice of network path, one problem is how to choose a service function chain (SFC) that both meets the requirements and offers the service provider maximum profit. Most existing solutions are either heuristic algorithms with high time efficiency or integer linear programming (ILP) algorithms with high accuracy. It is necessary to design an algorithm that symmetrically considers both time efficiency and accuracy. In this paper, we propose the Q-learning Framework Hybrid Module algorithm (QLFHM), which incorporates reinforcement learning to solve this SFC deployment problem in dynamic networks. The reinforcement learning module in QLFHM is responsible for outputting alternative paths, while the load balancing module in QLFHM is responsible for picking the optimal solution from them. The results of a comparative simulation experiment on a dynamic network topology show that the proposed algorithm can output an approximately optimal solution in a relatively short time while also considering the network load balance. Thus, it achieves the goal of maximizing the benefit to the service provider.
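
The two-module idea in the abstract (a reinforcement learning stage that proposes alternative paths, then a load balancing stage that picks among them) can be illustrated with a toy sketch. This is not the paper's QLFHM implementation: the five-node topology, the link utilization values, the reward scheme (+10 on reaching the destination, -1 per hop), the hyperparameters, and the function names `train_q`, `candidate_paths`, and `pick_least_loaded` are all assumptions made for illustration, and the sketch ignores VNF placement and profit entirely.

```python
import random

# Hypothetical topology: node -> {neighbor: link utilization in [0, 1]}.
GRAPH = {
    "A": {"B": 0.9, "C": 0.2},
    "B": {"A": 0.9, "D": 0.8},
    "C": {"A": 0.2, "D": 0.3, "E": 0.1},
    "D": {"B": 0.8, "C": 0.3, "E": 0.4},
    "E": {"C": 0.1, "D": 0.4},
}


def train_q(graph, dst, episodes=2000, alpha=0.5, gamma=0.9, eps=0.2, seed=0):
    """Tabular Q-learning: state = current node, action = next-hop neighbor."""
    rng = random.Random(seed)
    q = {s: {a: 0.0 for a in nbrs} for s, nbrs in graph.items()}
    starts = [s for s in graph if s != dst]
    for _ in range(episodes):
        state = rng.choice(starts)
        for _ in range(20):  # cap episode length
            if rng.random() < eps:
                action = rng.choice(list(graph[state]))  # explore
            else:
                action = max(q[state], key=q[state].get)  # exploit
            # Assumed reward: +10 for reaching dst, -1 for any other hop.
            reward = 10.0 if action == dst else -1.0
            future = 0.0 if action == dst else max(q[action].values())
            q[state][action] += alpha * (reward + gamma * future - q[state][action])
            state = action
            if state == dst:
                break
    return q


def candidate_paths(graph, q, src, dst, k=2):
    """Take the k best first hops from src, then follow Q greedily without loops."""
    first_hops = sorted(q[src], key=q[src].get, reverse=True)[:k]
    paths = []
    for hop in first_hops:
        path = [src, hop]
        while path[-1] != dst:
            here = path[-1]
            options = [n for n in graph[here] if n not in path]
            if not options:
                break  # dead end: discard this candidate
            path.append(max(options, key=lambda n: q[here][n]))
        if path[-1] == dst:
            paths.append(path)
    return paths


def pick_least_loaded(graph, paths):
    """Load balancing step: choose the path whose busiest link is least utilized."""
    def bottleneck(path):
        return max(graph[u][v] for u, v in zip(path, path[1:]))
    return min(paths, key=bottleneck)


if __name__ == "__main__":
    q_table = train_q(GRAPH, dst="D")
    options = candidate_paths(GRAPH, q_table, src="A", dst="D")
    print(pick_least_loaded(GRAPH, options))
```

In this toy setting both A-B-D and A-C-D are two-hop candidates with nearly equal Q-values, so the learned values alone do not separate them; the second stage breaks the tie by preferring the path whose most utilized link is lighter.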

Original language	English
Article number	646
Journal	Symmetry
Volume	10
Issue number	11
DOIs	https://doi.org/10.3390/sym10110646
Publication status	Published - 16 Nov 2018

Keywords

Load balancing
Network function virtualization
Reinforcement learning
Security
Service function chain

Cite this

@article{fe357d8a0c9b4e5f9df691d0317ead2a,

title = "A Q-learning-based approach for deploying dynamic service function chains",

abstract = "As the size and service requirements of today's networks gradually increase, large numbers of proprietary devices are deployed, which leads to network complexity, information security crises and makes network service and service provider management increasingly difficult. Network function virtualization (NFV) technology is one solution to this problem. NFV separates network functions from hardware and deploys them as software on a common server. NFV can be used to improve service flexibility and isolate the services provided for each user, thus guaranteeing the security of user data. Therefore, the use of NFV technology includes many problems worth studying. For example, when there is a free choice of network path, one problem is how to choose a service function chain (SFC) that both meets the requirements and offers the service provider maximum profit. Most existing solutions are heuristic algorithms with high time efficiency, or integer linear programming (ILP) algorithms with high accuracy. It's necessary to design an algorithm that symmetrically considers both time efficiency and accuracy. In this paper, we propose the Q-learning Framework Hybrid Module algorithm (QLFHM), which includes reinforcement learning to solve this SFC deployment problem in dynamic networks. The reinforcement learning module in QLFHM is responsible for the output of alternative paths, while the load balancing module in QLFHM is responsible for picking the optimal solution from them. The results of a comparison simulation experiment on a dynamic network topology show that the proposed algorithm can output the approximate optimal solution in a relatively short time while also considering the network load balance. Thus, it achieves the goal of maximizing the benefit to the service provider.",

keywords = "Load balancing, Network function virtualization, Reinforcement learning, Security, Service function chain",

author = "Jian Sun and Guanhua Huang and Gang Sun and Hongfang Yu and Sangaiah, {Arun Kumar} and Victor Chang",

note = "Funding Information: This research was funded by the National Natural Science Foundation of China (grant number 61571098), the Fundamental Research Funds for the Central Universities (grant number ZYGX2016J217), and the 111 Project (grant number B14039). Publisher Copyright: {\textcopyright} 2018 by the authors.",

year = "2018",

month = nov,

day = "16",

doi = "10.3390/sym10110646",

language = "English",

volume = "10",

journal = "Symmetry",

issn = "2073-8994",

number = "11",

}

TY - JOUR

T1 - A Q-learning-based approach for deploying dynamic service function chains

AU - Sun, Jian

AU - Huang, Guanhua

AU - Sun, Gang

AU - Yu, Hongfang

AU - Sangaiah, Arun Kumar

AU - Chang, Victor

N1 - Funding Information: This research was funded by the National Natural Science Foundation of China (grant number 61571098), the Fundamental Research Funds for the Central Universities (grant number ZYGX2016J217), and the 111 Project (grant number B14039). Publisher Copyright: © 2018 by the authors.

PY - 2018/11/16

Y1 - 2018/11/16

AB - As the size and service requirements of today's networks gradually increase, large numbers of proprietary devices are deployed, which leads to network complexity, information security crises and makes network service and service provider management increasingly difficult. Network function virtualization (NFV) technology is one solution to this problem. NFV separates network functions from hardware and deploys them as software on a common server. NFV can be used to improve service flexibility and isolate the services provided for each user, thus guaranteeing the security of user data. Therefore, the use of NFV technology includes many problems worth studying. For example, when there is a free choice of network path, one problem is how to choose a service function chain (SFC) that both meets the requirements and offers the service provider maximum profit. Most existing solutions are heuristic algorithms with high time efficiency, or integer linear programming (ILP) algorithms with high accuracy. It's necessary to design an algorithm that symmetrically considers both time efficiency and accuracy. In this paper, we propose the Q-learning Framework Hybrid Module algorithm (QLFHM), which includes reinforcement learning to solve this SFC deployment problem in dynamic networks. The reinforcement learning module in QLFHM is responsible for the output of alternative paths, while the load balancing module in QLFHM is responsible for picking the optimal solution from them. The results of a comparison simulation experiment on a dynamic network topology show that the proposed algorithm can output the approximate optimal solution in a relatively short time while also considering the network load balance. Thus, it achieves the goal of maximizing the benefit to the service provider.

KW - Load balancing

KW - Network function virtualization

KW - Reinforcement learning

KW - Security

KW - Service function chain

UR - http://www.scopus.com/inward/record.url?scp=85057859140&partnerID=8YFLogxK

U2 - 10.3390/sym10110646

DO - 10.3390/sym10110646

M3 - Article

AN - SCOPUS:85057859140

SN - 2073-8994

VL - 10

JO - Symmetry

JF - Symmetry

IS - 11

M1 - 646

ER -
