Bayesian adversarial multi-node bandit for optimal smart grid protection against cyber attacks

Jianyu Xu; Bin Liu; Huadong Mo; Daoyi Dong

doi:10.1016/j.automatica.2021.109551

Bayesian adversarial multi-node bandit for optimal smart grid protection against cyber attacks

Jianyu Xu, Bin Liu, Huadong Mo^*, Daoyi Dong

^*Corresponding author for this work

Department of Intelligent Operations and Marketing

Research output: Contribution to journal › Article › peer-review

25 Citations (Scopus)

Abstract

The cyber security of smart grids has become one of key problems in developing reliable modern power and energy systems. This paper introduces a non-stationary adversarial cost with a variation constraint for smart grids and enables us to investigate the problem of optimal smart grid protection against cyber attacks in a relatively practical scenario. In particular, a Bayesian multi-node bandit (MNB) model with adversarial costs is constructed and a new regret function is defined for this model. An algorithm called Thompson–Hedge algorithm is presented to solve the problem and the superior performance of the proposed algorithm is proven in terms of the convergence rate of the regret function. The applicability of the algorithm to real smart grid scenarios is verified and the performance of the algorithm is also demonstrated by numerical examples.

Original language	English
Article number	109551
Journal	Automatica
Volume	128
DOIs	https://doi.org/10.1016/j.automatica.2021.109551
Publication status	Published - Jun 2021

Keywords

Bayesian updating
Cyber attack
Multi-node bandit
Reinforcement learning
Smart grid

Access to Document

10.1016/j.automatica.2021.109551

Cite this

@article{823f521e06634db78fc0ae1182a9367f,

title = "Bayesian adversarial multi-node bandit for optimal smart grid protection against cyber attacks",

abstract = "The cyber security of smart grids has become one of key problems in developing reliable modern power and energy systems. This paper introduces a non-stationary adversarial cost with a variation constraint for smart grids and enables us to investigate the problem of optimal smart grid protection against cyber attacks in a relatively practical scenario. In particular, a Bayesian multi-node bandit (MNB) model with adversarial costs is constructed and a new regret function is defined for this model. An algorithm called Thompson–Hedge algorithm is presented to solve the problem and the superior performance of the proposed algorithm is proven in terms of the convergence rate of the regret function. The applicability of the algorithm to real smart grid scenarios is verified and the performance of the algorithm is also demonstrated by numerical examples.",

keywords = "Bayesian updating, Cyber attack, Multi-node bandit, Reinforcement learning, Smart grid",

author = "Jianyu Xu and Bin Liu and Huadong Mo and Daoyi Dong",

note = "Publisher Copyright: {\textcopyright} 2021 Elsevier Ltd",

year = "2021",

month = jun,

doi = "10.1016/j.automatica.2021.109551",

language = "English",

volume = "128",

journal = "Automatica",

issn = "0005-1098",

}

TY - JOUR

T1 - Bayesian adversarial multi-node bandit for optimal smart grid protection against cyber attacks

AU - Xu, Jianyu

AU - Liu, Bin

AU - Mo, Huadong

AU - Dong, Daoyi

PY - 2021/6

Y1 - 2021/6

N2 - The cyber security of smart grids has become one of key problems in developing reliable modern power and energy systems. This paper introduces a non-stationary adversarial cost with a variation constraint for smart grids and enables us to investigate the problem of optimal smart grid protection against cyber attacks in a relatively practical scenario. In particular, a Bayesian multi-node bandit (MNB) model with adversarial costs is constructed and a new regret function is defined for this model. An algorithm called Thompson–Hedge algorithm is presented to solve the problem and the superior performance of the proposed algorithm is proven in terms of the convergence rate of the regret function. The applicability of the algorithm to real smart grid scenarios is verified and the performance of the algorithm is also demonstrated by numerical examples.

AB - The cyber security of smart grids has become one of key problems in developing reliable modern power and energy systems. This paper introduces a non-stationary adversarial cost with a variation constraint for smart grids and enables us to investigate the problem of optimal smart grid protection against cyber attacks in a relatively practical scenario. In particular, a Bayesian multi-node bandit (MNB) model with adversarial costs is constructed and a new regret function is defined for this model. An algorithm called Thompson–Hedge algorithm is presented to solve the problem and the superior performance of the proposed algorithm is proven in terms of the convergence rate of the regret function. The applicability of the algorithm to real smart grid scenarios is verified and the performance of the algorithm is also demonstrated by numerical examples.

KW - Bayesian updating

KW - Cyber attack

KW - Multi-node bandit

KW - Reinforcement learning

KW - Smart grid

UR - http://www.scopus.com/inward/record.url?scp=85101998352&partnerID=8YFLogxK

U2 - 10.1016/j.automatica.2021.109551

DO - 10.1016/j.automatica.2021.109551

M3 - Article

AN - SCOPUS:85101998352

SN - 0005-1098

VL - 128

JO - Automatica

JF - Automatica

M1 - 109551

ER -

Bayesian adversarial multi-node bandit for optimal smart grid protection against cyber attacks

Abstract

Keywords

Access to Document

Other files and links

Cite this