A Hybrid Video Anomaly Detection Framework via Memory-Augmented Flow Reconstruction and Flow-Guided Frame Prediction | IEEE Conference Publication | IEEE Xplore

A Hybrid Video Anomaly Detection Framework via Memory-Augmented Flow Reconstruction and Flow-Guided Frame Prediction


Abstract:

In this paper, we propose HF2-VAD, a Hybrid framework that integrates Flow reconstruction and Frame prediction seamlessly to handle Video Anomaly Detection. Firstly, we d...Show More

Abstract:

In this paper, we propose HF2-VAD, a Hybrid framework that integrates Flow reconstruction and Frame prediction seamlessly to handle Video Anomaly Detection. Firstly, we design the network of ML-MemAE-SC (Multi-Level Memory modules in an Autoencoder with Skip Connections) to memorize normal patterns for optical flow reconstruction so that abnormal events can be sensitively identified with larger flow reconstruction errors. More importantly, conditioned on the reconstructed flows, we then employ a Conditional Variational Autoencoder (CVAE), which captures the high correlation between video frame and optical flow, to predict the next frame given several previous frames. By CVAE, the quality of flow reconstruction essentially influences that of frame prediction. Therefore, poorly reconstructed optical flows of abnormal events further deteriorate the quality of the final predicted future frame, making the anomalies more detectable. Experimental results demonstrate the effectiveness of the proposed method. Code is available at https://github.com/LiUzHiAn/hf2vad.
Date of Conference: 10-17 October 2021
Date Added to IEEE Xplore: 28 February 2022
ISBN Information:

ISSN Information:

Conference Location: Montreal, QC, Canada

Funding Agency:

References is not available for this document.

1. Introduction

Video Anomaly Detection (VAD) refers to the identification of events that do not conform to expected behaviors [3] in a video, with one example shown in Figure 1. This is an open and very challenging task as abnormal events usually much less happen than normal ones and the forms of abnormal events are unbounded in practical applications [25]. Obviously, it is impossible to collect all kinds of abnormal data in advance. Therefore, a typical solution to video anomaly detection is to train an unsupervised learning model on normal data, and those events or activities that are recognized by the trained model as outliers are then deemed as anomalies.

Select All
1.
Amit Adam, Ehud Rivlin, Ilan Shimshoni and Daviv Reinitz, "Robust real-time unusual event detection using multiple fixed-location monitors", IEEE transactions on pattern analysis and machine intelligence, vol. 30, no. 3, pp. 555-560, 2008.
2.
Yannick Benezeth, P-M Jodoin, Venkatesh Saligrama and Christophe Rosenberger, "Abnormal events detection based on spatio-temporal co-occurences", 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 2458-2465, 2009.
3.
Varun Chandola, Arindam Banerjee and Vipin Kumar, "Anomaly detection: A survey", ACM computing surveys, vol. 41, no. 3, pp. 1-58, 2009.
4.
Bin Ding, Chengjiang Long, Ling Zhang and Chunxia Xiao, "Argan: Attentive recurrent generative adversarial network for shadow detection and removal", Proceedings of the IEEE International Conference on Computer Vision, 2019.
5.
Xinzhi Dong, Chengjiang Long, Wenju Xu and Chunxia Xiao, "Dual graph convolutional networks with transformer and curriculum learning for image captioning", Proceedings of the ACM International Conference on Multimedia, 2021.
6.
Patrick Esser, Ekaterina Sutter and Bjorn Ommer, "A varia-¨ tional u-net for conditional appearance and shape generation", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8857-8866, 2018.
7.
Yaxiang Fan, Gongjian Wen, Deren Li, Shaohua Qiu, Martin D Levine and Fei Xiao, "Video anomaly detection and localization via gaussian mixture fully convolutional variational autoencoder", Computer Vision and Image Understanding, pp. 102920, 2020.
8.
Dong Gong, Lingqiao Liu, Vuong Le, Budhaditya Saha, Moussa Reda Mansour, Svetha Venkatesh, et al., "Memorizing normality to detect anomaly: Memory-augmented deep autoencoder for unsupervised anomaly detection", Proceedings of the IEEE International Conference on Computer Vision, pp. 1705-1714, 2019.
9.
Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, et al., "Generative adversarial nets", Advances in neural information processing systems, pp. 27, 2014.
10.
Alex Graves, Greg Wayne, Malcolm Reynolds, Tim Harley, Ivo Danihelka, Agnieszka Grabska-Barwińska, Sergio Gómez Colmenarejo, Edward Grefenstette, Tiago Ramalho, John Agapiou et al., "Hybrid computing using a neural network with dynamic external memory", Nature, vol. 538, no. 7626, pp. 471-476, 2016.
11.
Mahmudul Hasan, Jonghyun Choi, Jan Neumann, Amit K Roy-Chowdhury and Larry S Davis, "Learning temporal regularity in video sequences", Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 733-742, 2016.
12.
Tao Hu, Chengjiang Long and Chunxia Xiao, Crd-cgan: Category-consistent and relativistic constraints for diverse text-to-image generation, 2021.
13.
Tao Hu, Chengjiang Long and Chunxia Xiao, "A novel visual representation on text using diverse conditional gan for visual recognition", IEEE Transactions on Image Processing, vol. 30, pp. 3499-3512, 2021.
14.
Gang Hua, Chengjiang Long, Ming Yang and Yan Gao, "Collaborative active learning of a kernel machine ensemble for recognition", IEEE International Conference on Computer Vision, 2013.
15.
Gang Hua, Chengjiang Long, Ming Yang and Yan Gao, "Collaborative active visual recognition from crowds: A distributed ensemble approach", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 40, no. 3, pp. 582-594, 2018.
16.
Eddy Ilg, Nikolaus Mayer, Tonmoy Saikia, Margret Keuper, Alexey Dosovitskiy and Thomas Brox, "Flownet 2.0: Evolution of optical flow estimation with deep networks", Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2462-2470, 2017.
17.
Sergey Ioffe and Christian Szegedy, Batch normalization: Accelerating deep network training by reducing internal covariate shift, 2015.
18.
Ashraful Islam, Chengjiang Long, Arslan Basharat and Anthony Hoogs, "Doa-gan: Dual-order attentive generative adversarial network for image copy-move forgery detection and localization", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2020.
19.
Ashraful Islam, Chengjiang Long and Richard Radke, "A hybrid attention mechanism for weakly-supervised temporal action localization", AAAI Conference on Artificial Intelligence, 2021.
20.
Jaechul Kim and Kristen Grauman, "Observe locally infer globally: a space-time mrf for detecting abnormal activities with incremental updates", 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 2921-2928, 2009.
21.
Diederik P Kingma and Jimmy Ba, Adam: A method for stochastic optimization, 2014.
22.
Diederik P Kingma and Max Welling, Auto-encoding variational bayes, 2013.
23.
Yann LeCun, Corinna Cortes and CJ Burges, "Mnist handwritten digit database", pp. 2, 2010, [online] Available: http://yann.lecun.com/exdb/mnist.
24.
Bo Li, Sam Leroux and Pieter Simoens, Decoupled appearance and motion learning for efficient anomaly detection in surveillance video, 2020.
25.
Wen Liu, Weixin Luo, Dongze Lian and Shenghua Gao, "Future frame prediction for anomaly detection–a new baseline", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6536-6545, 2018.
26.
Chengjiang Long and Gang Hua, "Multi-class multiannotator active learning with robust gaussian process for visual recognition", Proceedings of the IEEE International Conference on Computer Vision, 2015.
27.
Chengjiang Long and Gang Hua, "Correlational gaussian processes for cross-domain visual recognition", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017.
28.
Chengjiang Long, Gang Hua and Ashish Kapoor, "A joint gaussian process model for active visual recognition with expertise estimation in crowdsourcing", International Journal of Computer Vision, vol. 116, no. 2, pp. 136-160, 2016.
29.
Cewu Lu, Jianping Shi and Jiaya Jia, "Abnormal event detection at 150 fps in matlab", Proceedings of the IEEE international conference on computer vision, pp. 2720-2727, 2013.
30.
Yiwei Lu, K Mahesh Kumar, Seyed shahabeddin Nabavi and Yang Wang, "Future frame prediction using convolutional vrnn for anomaly detection", 2019 16th IEEE International Conference on Advanced Video and Signal Based Surveillance, pp. 1-8, 2019.
Contact IEEE to Subscribe

References

References is not available for this document.