Conferences >2019 IEEE/CVF Conference on C...

Grid R-CNN

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

This paper proposes a novel object detection framework named Grid R-CNN, which adopts a grid guided localization mechanism for accurate object detection. Different from t...Show More

Metadata

Abstract:

This paper proposes a novel object detection framework named Grid R-CNN, which adopts a grid guided localization mechanism for accurate object detection. Different from the traditional regression based methods, the Grid R-CNN captures the spatial information explicitly and enjoys the position sensitive property of fully convolutional architecture. Instead of using only two independent points, we design a multi-point supervision formulation to encode more clues in order to reduce the impact of inaccurate prediction of specific points. To take the full advantage of the correlation of points in a grid, we propose a two-stage information fusion strategy to fuse feature maps of neighbor grid points. The grid guided localization approach is easy to be extended to different state-of-the-art detection frameworks. Grid R-CNN leads to high quality object localization, and experiments demonstrate that it achieves a 4.1% AP gain at IoU=0.8 and a 10.0% AP gain at IoU=0.9 on COCO benchmark compared to Faster R-CNN with Res50 backbone and FPN architecture.

Published in: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Date of Conference: 15-20 June 2019

Date Added to IEEE Xplore: 09 January 2020

ISBN Information:

ISSN Information:

DOI: 10.1109/CVPR.2019.00754

Conference Location: Long Beach, CA, USA

Contents

1. Introduction

Object detection task can be decomposed into object classification and localization. In recent years, many deep convolutional neural networks (CNN) based detection frameworks are proposed and achieve state-of-the-art results [9], [8], [25], [17], [11], [1]. Although these methods improve the detection performance in many different aspects, their bounding box localization modules are similar. Typical bounding box localization module is a regression branch, which is designed as several fully connected layers and takes in high-level feature maps to predict the offset of the candidate box (proposal or predefined anchor).

References is not available for this document.

MIT Libraries

MIT Libraries

Grid R-CNN

Abstract:

Metadata

Abstract:

ISSN Information:

1. Introduction

References

IEEE Account

Purchase Details

Profile Information

Need Help?

MIT Libraries

MIT Libraries

Grid R-CNN

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

1. Introduction

References