Loading [MathJax]/extensions/MathMenu.js
A Weakly-Supervised Cross-Domain Query Framework for Video Camouflage Object Detection | IEEE Journals & Magazine | IEEE Xplore

A Weakly-Supervised Cross-Domain Query Framework for Video Camouflage Object Detection


Abstract:

VCOD (Video Camouflage Object Detection) is a crucial security technology that identifies camouflaged objects in videos, bolstering security measures across diverse appli...Show More

Abstract:

VCOD (Video Camouflage Object Detection) is a crucial security technology that identifies camouflaged objects in videos, bolstering security measures across diverse applications. On one hand, appearance-based VCOD methods face challenges because camouflaged appearances cause objects to blend into their surroundings, and current VCOD methods typically utilize optical flow to represent motion information. However, over-reliance on accurate estimation renders the model overly fragile. On the other hand, there is a shortage of effectively annotated camouflaged video datasets, coupled with the time-consuming and labor-intensive annotation process, severely constraining the development of this field. To address this, we propose a novel weakly-supervised framework for VCOD based on cross-domain querying of preceding and succeeding frames. Specifically, we propose a time-efficient and labor-saving manual annotation approach based on large visual models to rapidly generate pseudo-labels. Furthermore, we design a network based on Spatio-Temporal Memory (STM) that performs cross-modal feature querying with the current frame against preceding and succeeding frames to acquire useful information, thereby enhancing the focus on temporal information. Extensive experiments conducted on two common VCOD datasets have proven the effectiveness of our method, achieving state-of-the-art performance on the challenging camouflaged video data.
Page(s): 1506 - 1518
Date of Publication: 30 September 2024

ISSN Information:

Funding Agency:

No metrics found for this document.

I. Introduction

Video camouflaged Object Detection (VCOD) represents an advanced technology within the field of computer vision, aimed at identifying objects in video sequences that are highly integrated with their background environment. This technology plays a crucial role across various domains, particularly in applications such as security surveillance [1], agricultural pest detection [2], [3], military applications [4], medical diagnostics [5], [6], and wildlife conservation [7], showcasing its broad application potential.

Usage
Select a Year
2025

View as

Total usage sinceSep 2024:229
010203040506070JanFebMarAprMayJunJulAugSepOctNovDec134964000000000
Year Total:126
Data is updated monthly. Usage includes PDF downloads and HTML views.
Contact IEEE to Subscribe

References

References is not available for this document.