TY - JOUR
T1 - ICC
T2 - Intra-Cluster Contraction for Pedestrian Anomaly Detection under One-Class Classification Setting
AU - Wang, Xiaolei
AU - Wang, Xiaoyang
AU - Bai, Huihui
AU - Lim, Eng Gee
AU - Xiao, Jimin
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2026/3
Y1 - 2026/3
N2 - Video anomaly detection (VAD) focuses on identifying unusual or unexpected events within video sequences. However, fully supervised learning approaches are ill-suited for this task because of the scarcity of anomaly examples. To tackle this challenge, one-class classification (OCC) has emerged as an effective solution for VAD. Under the OCC framework, the training phase requires only normal samples to build a detection model. Therefore, the core of VAD in the OCC setting lies in effectively modeling the normality of pedestrian behaviors during training. Recent studies often rely on appearance features for accurate feature modeling. Nevertheless, an over reliance on appearance-only features makes the detection system insensitive to pedestrian movement anomalies. In this study, we combine general scene-appearance features with a simple velocity representation derived from optical flow to achieve high-performance VAD. We posit that velocity information, when combined with a well-designed distribution modeling, can effectively describe pedestrian activities. Given the diversity of normal pedestrian motion, we employ a multi-modal probabilistic model to learn the distribution of velocity normality during training. Directly fitting on the original velocity features, however, leads to mixed clusters and dispersed cluster boundaries. This impedes the separation between normal and abnormal samples. To address this, we propose an Intra-Cluster Contraction (ICC) module. The ICC maps the extracted features to the neighborhoods of their local high-density centers, forcing more compact local clusters. This significantly enhances the distinction between normal and abnormal samples. For deep scene-appearance features, we also apply the ICC to adjust the feature distribution and design a distance-based density estimator to detect appearance anomalies. Supported by both appearance and motion information, our method demonstrates high adaptability to complex and variable scenarios. It achieves competitive results on three benchmark datasets.
AB - Video anomaly detection (VAD) focuses on identifying unusual or unexpected events within video sequences. However, fully supervised learning approaches are ill-suited for this task because of the scarcity of anomaly examples. To tackle this challenge, one-class classification (OCC) has emerged as an effective solution for VAD. Under the OCC framework, the training phase requires only normal samples to build a detection model. Therefore, the core of VAD in the OCC setting lies in effectively modeling the normality of pedestrian behaviors during training. Recent studies often rely on appearance features for accurate feature modeling. Nevertheless, an over reliance on appearance-only features makes the detection system insensitive to pedestrian movement anomalies. In this study, we combine general scene-appearance features with a simple velocity representation derived from optical flow to achieve high-performance VAD. We posit that velocity information, when combined with a well-designed distribution modeling, can effectively describe pedestrian activities. Given the diversity of normal pedestrian motion, we employ a multi-modal probabilistic model to learn the distribution of velocity normality during training. Directly fitting on the original velocity features, however, leads to mixed clusters and dispersed cluster boundaries. This impedes the separation between normal and abnormal samples. To address this, we propose an Intra-Cluster Contraction (ICC) module. The ICC maps the extracted features to the neighborhoods of their local high-density centers, forcing more compact local clusters. This significantly enhances the distinction between normal and abnormal samples. For deep scene-appearance features, we also apply the ICC to adjust the feature distribution and design a distance-based density estimator to detect appearance anomalies. Supported by both appearance and motion information, our method demonstrates high adaptability to complex and variable scenarios. It achieves competitive results on three benchmark datasets.
KW - Cluster Contraction
KW - Multi-modal Model
KW - One-Class Classification
KW - Optical Flow
KW - Video Anomaly Detection
UR - https://www.scopus.com/pages/publications/105014026977
U2 - 10.1109/TAI.2025.3601079
DO - 10.1109/TAI.2025.3601079
M3 - Article
AN - SCOPUS:105014026977
SN - 2691-4581
JO - IEEE Transactions on Artificial Intelligence
JF - IEEE Transactions on Artificial Intelligence
ER -