Abnormal event detection from videos using a two-stream recurrent variational autoencoder

Shiyang Yan; Jeremy S. Smith; Wenjin Lu; Bailing Zhang

doi:10.1109/TCDS.2018.2883368

Abnormal event detection from videos using a two-stream recurrent variational autoencoder

Shiyang Yan^*, Jeremy S. Smith, Wenjin Lu, Bailing Zhang

^*Corresponding author for this work

Department of Intelligent Science

Research output: Contribution to journal › Article › peer-review

91 Citations (Scopus)

Abstract

With the massive deployment of distributed video surveillance systems, the automatic detection of abnormal events in video streams has become an urgent need. An abnormal event can be considered as a deviation from the regular scene; however, the distribution of normal and abnormal events is severely imbalanced, since the abnormal events do not frequently occur. To make use of a large number of video surveillance videos of regular scenes, we propose a semi-supervised learning scheme, which only uses the data that contains the ordinary scenes. The proposed model has a two-stream structure that is composed of the appearance and motion streams. For each stream, a recurrent variational autoencoder can model the probabilistic distribution of the normal data in a semi-supervised learning scheme. The appearance and motion features from the two streams can provide complementary information to describe this probabilistic distribution. Comprehensive experiments validate the effectiveness of our proposed scheme on several public benchmark data sets which include the Avenue, the Ped1, the Ped2, the Subway-entry, and the Subway-exit.

Original language	English
Article number	8543857
Pages (from-to)	30-42
Number of pages	13
Journal	IEEE Transactions on Cognitive and Developmental Systems
Volume	12
Issue number	1
DOIs	https://doi.org/10.1109/TCDS.2018.2883368
Publication status	Published - Mar 2020

Keywords

Abnormal event detection
convolutional long-short term memory (LSTM)
reconstruction error probability
two-stream fusion
variational autoencoder (VAE)

Access to Document

10.1109/TCDS.2018.2883368

Cite this

@article{4639245b4c7c420da2930d9d3121ed27,

title = "Abnormal event detection from videos using a two-stream recurrent variational autoencoder",

abstract = "With the massive deployment of distributed video surveillance systems, the automatic detection of abnormal events in video streams has become an urgent need. An abnormal event can be considered as a deviation from the regular scene; however, the distribution of normal and abnormal events is severely imbalanced, since the abnormal events do not frequently occur. To make use of a large number of video surveillance videos of regular scenes, we propose a semi-supervised learning scheme, which only uses the data that contains the ordinary scenes. The proposed model has a two-stream structure that is composed of the appearance and motion streams. For each stream, a recurrent variational autoencoder can model the probabilistic distribution of the normal data in a semi-supervised learning scheme. The appearance and motion features from the two streams can provide complementary information to describe this probabilistic distribution. Comprehensive experiments validate the effectiveness of our proposed scheme on several public benchmark data sets which include the Avenue, the Ped1, the Ped2, the Subway-entry, and the Subway-exit.",

keywords = "Abnormal event detection, convolutional long-short term memory (LSTM), reconstruction error probability, two-stream fusion, variational autoencoder (VAE)",

author = "Shiyang Yan and Smith, {Jeremy S.} and Wenjin Lu and Bailing Zhang",

note = "Publisher Copyright: {\textcopyright} 2016 IEEE.",

year = "2020",

month = mar,

doi = "10.1109/TCDS.2018.2883368",

language = "English",

volume = "12",

pages = "30--42",

journal = "IEEE Transactions on Cognitive and Developmental Systems",

issn = "2379-8920",

number = "1",

}

TY - JOUR

T1 - Abnormal event detection from videos using a two-stream recurrent variational autoencoder

AU - Yan, Shiyang

AU - Smith, Jeremy S.

AU - Lu, Wenjin

AU - Zhang, Bailing

PY - 2020/3

Y1 - 2020/3

N2 - With the massive deployment of distributed video surveillance systems, the automatic detection of abnormal events in video streams has become an urgent need. An abnormal event can be considered as a deviation from the regular scene; however, the distribution of normal and abnormal events is severely imbalanced, since the abnormal events do not frequently occur. To make use of a large number of video surveillance videos of regular scenes, we propose a semi-supervised learning scheme, which only uses the data that contains the ordinary scenes. The proposed model has a two-stream structure that is composed of the appearance and motion streams. For each stream, a recurrent variational autoencoder can model the probabilistic distribution of the normal data in a semi-supervised learning scheme. The appearance and motion features from the two streams can provide complementary information to describe this probabilistic distribution. Comprehensive experiments validate the effectiveness of our proposed scheme on several public benchmark data sets which include the Avenue, the Ped1, the Ped2, the Subway-entry, and the Subway-exit.

AB - With the massive deployment of distributed video surveillance systems, the automatic detection of abnormal events in video streams has become an urgent need. An abnormal event can be considered as a deviation from the regular scene; however, the distribution of normal and abnormal events is severely imbalanced, since the abnormal events do not frequently occur. To make use of a large number of video surveillance videos of regular scenes, we propose a semi-supervised learning scheme, which only uses the data that contains the ordinary scenes. The proposed model has a two-stream structure that is composed of the appearance and motion streams. For each stream, a recurrent variational autoencoder can model the probabilistic distribution of the normal data in a semi-supervised learning scheme. The appearance and motion features from the two streams can provide complementary information to describe this probabilistic distribution. Comprehensive experiments validate the effectiveness of our proposed scheme on several public benchmark data sets which include the Avenue, the Ped1, the Ped2, the Subway-entry, and the Subway-exit.

KW - Abnormal event detection

KW - convolutional long-short term memory (LSTM)

KW - reconstruction error probability

KW - two-stream fusion

KW - variational autoencoder (VAE)

UR - http://www.scopus.com/inward/record.url?scp=85057363406&partnerID=8YFLogxK

U2 - 10.1109/TCDS.2018.2883368

DO - 10.1109/TCDS.2018.2883368

M3 - Article

AN - SCOPUS:85057363406

SN - 2379-8920

VL - 12

SP - 30

EP - 42

JO - IEEE Transactions on Cognitive and Developmental Systems

JF - IEEE Transactions on Cognitive and Developmental Systems

IS - 1

M1 - 8543857

ER -

Abnormal event detection from videos using a two-stream recurrent variational autoencoder

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this