Loading [MathJax]/extensions/MathMenu.js
Argus++: Robust Real-time Activity Detection for Unconstrained Video Streams with Overlapping Cube Proposals | IEEE Conference Publication | IEEE Xplore

Argus++: Robust Real-time Activity Detection for Unconstrained Video Streams with Overlapping Cube Proposals


Abstract:

Activity detection is one of the attractive computer vision tasks to exploit the video streams captured by widely installed cameras. Although achieving impressive perform...Show More

Abstract:

Activity detection is one of the attractive computer vision tasks to exploit the video streams captured by widely installed cameras. Although achieving impressive performance, conventional activity detection algorithms are usually designed under certain constraints, such as using trimmed and/or object-centered video clips as inputs. Therefore, they failed to deal with the multi-scale multi-instance cases in real-world unconstrained video streams, which are untrimmed and have large field-of-views. Real-time requirements for streaming analysis also mark brute force expansion of them unfeasible. To overcome these issues, we propose Argus++, a robust real-time activity detection system for analyzing unconstrained video streams. The design of Argus++ introduces overlapping spatio-temporal cubes as an intermediate concept of activity proposals to ensure coverage and completeness of activity detection through over-sampling. The overall system is optimized for real-time processing on standalone consumer-level hardware. Extensive experiments on different surveillance and driving scenarios demonstrated its superior performance in a series of activity detection benchmarks, including CVPR ActivityNet ActEV 2021, NIST ActEV SDL UF/KF, TRECVID ActEV 2020/2021, and ICCV ROAD 2021.
Date of Conference: 04-08 January 2022
Date Added to IEEE Xplore: 15 February 2022
ISBN Information:

ISSN Information:

Conference Location: Waikoloa, HI, USA

Funding Agency:


1. Introduction

Nowadays, activity detection has drawn a fast-growing attention in both industry and research fields. Activity detection in extended videos [4], [15] is widely applied for public safety in indoor and outdoor scenarios. Activity detection on streaming videos captured by in-vehicle cameras is applied for vision-based autonomous driving. The development of these applications brings several challenges. First, most of these systems take unconstrained videos as input, which are recorded in large field-of-views where multi-object and multi-activity occur simultaneously and continuously over time. Second, the unconstrained videos in real world are in multiple scenarios and under multiple conditions, e.g. in dynamically changed road environments from day to night in autonomous driving [21]. Third, efficient algorithms are demanded for real-time processing and responding of streaming video.

Contact IEEE to Subscribe

References

References is not available for this document.