Journals & Magazines >IEEE Transactions on Cybernet... >Volume: 50 Issue: 9

SG-One: Similarity Guidance Network for One-Shot Semantic Segmentation

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

One-shot image semantic segmentation poses a challenging task of recognizing the object regions from unseen categories with only one annotated example as supervision. In ...Show More

Metadata

Abstract:

One-shot image semantic segmentation poses a challenging task of recognizing the object regions from unseen categories with only one annotated example as supervision. In this article, we propose a simple yet effective similarity guidance network to tackle the one-shot (SG-One) segmentation problem. We aim at predicting the segmentation mask of a query image with the reference to one densely labeled support image of the same category. To obtain the robust representative feature of the support image, we first adopt a masked average pooling strategy for producing the guidance features by only taking the pixels belonging to the support image into account. We then leverage the cosine similarity to build the relationship between the guidance features and features of pixels from the query image. In this way, the possibilities embedded in the produced similarity maps can be adopted to guide the process of segmenting objects. Furthermore, our SG-One is a unified framework that can efficiently process both support and query images within one network and be learned in an end-to-end manner. We conduct extensive experiments on Pascal VOC 2012. In particular, our SG-One achieves the mIoU score of 46.3%, surpassing the baseline methods.

Published in: IEEE Transactions on Cybernetics ( Volume: 50, Issue: 9, September 2020)

Page(s): 3855 - 3865

Date of Publication: 04 June 2020

ISSN Information:

PubMed ID: 32497014

DOI: 10.1109/TCYB.2020.2992433

Funding Agency:

Contents

I. Introduction

Object semantic segmentation (OSS) aims at predicting the class label of each pixel. Deep neural networks have achieved tremendous success on the OSS tasks, such as U-net [1], FCN [2], and Mask R-CNN [3]. However, these algorithms trained with full annotations require many investments to expensive labeling tasks. To reduce the budget, a promising alternative approach is to apply weak annotations for learning a decent network of segmentation. For example, previous works have implemented image-level labels [4]–[6]; scribbles [7]–[9]; bounding boxes [10], [11]; and points [12]–[14] as cheaper supervision information whereas the main disadvantage of these weakly supervised methods is the lack of the ability for generalizing the learned models to unseen classes. For instance, if a network is trained to segment dogs using thousands of images containing various breeds of dogs, it will not be able to segment bikes without retraining the network using many images containing bikes.

References is not available for this document.

MIT Libraries

MIT Libraries

SG-One: Similarity Guidance Network for One-Shot Semantic Segmentation

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

I. Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?

MIT Libraries

MIT Libraries

SG-One: Similarity Guidance Network for One-Shot Semantic Segmentation

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

I. Introduction

References