I. Introduction
Current smart cities are equipped with numerous optical sensors or cameras for data sensing, event detection and recognition, and autonomous event reporting, supporting application domains such as healthcare, security, recommender systems, and surveillance. The enormous volume of data collected by such a dense network of vision sensors creates significant difficulties in analyzing and processing video data to identify events of interest. Thus, video summarization (VS), which automatically extracts a brief yet informative summary of these videos, has recently attracted intense attention across many applications. The current literature contains VS methods based on supervised and unsupervised learning, statistical features, object detection, and action and activity recognition, which are briefly summarized in the subsequent paragraphs.