Conferences >2020 IEEE Winter Conference o...

UnOVOST: Unsupervised Offline Video Object Segmentation and Tracking

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

We address Unsupervised Video Object Segmentation (UVOS), the task of automatically generating accurate pixel masks for salient objects in a video sequence and of trackin...Show More

Metadata

Abstract:

We address Unsupervised Video Object Segmentation (UVOS), the task of automatically generating accurate pixel masks for salient objects in a video sequence and of tracking these objects consistently through time, without any input about which objects should be tracked. Towards solving this task, we present UnOVOST (Unsupervised Offline Video Object Segmentation and Tracking) as a simple and generic algorithm which is able to track and segment a large variety of objects. This algorithm builds up tracks in a number stages, first grouping segments into short tracklets that are spatio-temporally consistent, before merging these tracklets into long-term consistent object tracks based on their visual similarity. In order to achieve this we introduce a novel tracklet-based Forest Path Cutting data association algorithm which builds up a decision forest of track hypotheses before cutting this forest into paths that form long-term consistent object tracks. When evaluating our approach on the DAVIS 2017 Unsupervised dataset we obtain state-of-the-art performance with a mean ℱ score of 67.9% on the val, 58% on the test-dev and 56.4% on the test-challenge benchmarks, obtaining first place in the DAVIS 2019 Unsupervised Video Object Segmentation Challenge. UnOVOST even performs competitively with many semi-supervised video object segmentation algorithms even though it is not given any input as to which objects should be tracked and segmented.

Published in: 2020 IEEE Winter Conference on Applications of Computer Vision (WACV)

Date of Conference: 01-05 March 2020

Date Added to IEEE Xplore: 14 May 2020

ISBN Information:

ISSN Information:

DOI: 10.1109/WACV45572.2020.9093285

Conference Location: Snowmass, CO, USA

Contents

1. Introduction

Video Object Segmentation (VOS) aims at automatically generating accurate pixel masks for objects in each frame of a video, then associating those proposed object pixel masks in the successive frames to obtain temporally consistent tracks. VOS has mostly been tackled in a semi-supervised fashion [19], [32], [30], where the object masks of the objects to be tracked in the first-frame are given, and only those objects need to be tracked and segmented throughout the rest of the video.

References is not available for this document.

UnOVOST: Unsupervised Offline Video Object Segmentation and Tracking

Abstract:

Metadata

Abstract:

ISSN Information:

1. Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?

UnOVOST: Unsupervised Offline Video Object Segmentation and Tracking

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

1. Introduction

References