I. Introduction
Perception of the world requires interpreting motion across different time scales. Detecting moving objects in video streams is fundamental to many computer vision applications, from video surveillance and monitoring to self-driving cars. Moving object detection provides focus of attention for follow-up processes such as tracking, classification, recognition, and behavior analysis. The task is challenging because of background clutter, numerous distractors, and changing imaging conditions such as camera motion, degraded environmental conditions, weather, dynamic backgrounds, illumination changes, shadows, and camouflage effects. Many approaches and pipelines have been proposed to perform moving object detection and to address these challenges [1]–[3]. Earlier approaches typically consisted of hand-crafted solutions with little adaptation to difficult scenarios and often relied on complex procedures tailored to specific challenges.

In recent years, advances in deep learning, combined with the increased availability of training data and of affordable high-end computing resources such as GPUs, have led to impressive results in various computer vision tasks. Transfer learning with state-of-the-art CNN models (e.g., VGG-16, ResNet-18) pretrained on large benchmark datasets allows feature extraction modules to be built for new tasks with only minor modifications and limited training. Autoencoders are a popular deep learning architecture for segmentation tasks: the features extracted by the encoder module, through a series of convolution and pooling layers, are upsampled by the decoder module to recover the original spatial resolution of the input image.
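The encoder–decoder idea above can be illustrated with a minimal NumPy sketch; this is not the network proposed here, only a toy showing how pooling halves the spatial resolution at each encoder stage and how upsampling in the decoder restores the input resolution (the helper names `max_pool2x2` and `upsample2x` are illustrative):

```python
import numpy as np

def max_pool2x2(x):
    # 2x2 max pooling: halves spatial resolution, as in an encoder stage
    h, w = x.shape
    return x.reshape(h // 2, 2, w // 2, 2).max(axis=(1, 3))

def upsample2x(x):
    # nearest-neighbour upsampling: doubles resolution, as in a decoder stage
    return x.repeat(2, axis=0).repeat(2, axis=1)

img = np.arange(64, dtype=float).reshape(8, 8)   # toy 8x8 "feature map"
enc = max_pool2x2(max_pool2x2(img))              # encoder: 8x8 -> 4x4 -> 2x2
dec = upsample2x(upsample2x(enc))                # decoder: 2x2 -> 4x4 -> 8x8
assert dec.shape == img.shape                    # original resolution recovered
```

In a real segmentation autoencoder the pooling stages are interleaved with learned convolutions, and the decoder typically uses learned transposed convolutions rather than fixed nearest-neighbour upsampling, but the resolution bookkeeping is the same.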
Sample change detection results from CDnet-2014, showing the original image (row 1), the ground-truth mask (row 2), and the proposed MU-net2 mask (row 3), for three sample videos: highway (baseline), frame 1100 (col. 1); canoe (dynamic background), frame 960 (col. 2); peopleInShade (shadow), frame 1100 (col. 3).