I. Introduction
Despite their great success, the decision-making processes of current deep models lack interpretability, which hinders their adoption in high-stakes domains such as healthcare and finance. Many post-hoc explainability methods [1], [2], [3] have been proposed to decipher the predictions of deep models. However, these methods cannot provide sufficient detail to explain the complicated decision pathway of a black-box model. Instead of explaining a black-box model, many works [4], [5], [6] aim to construct a self-explainable model. Chen et al. [6] proposed a transparent model that replaces the conventional extractive reasoning process with a case-based reasoning process, which compares the input features with learned visual feature vectors, called "prototypes", and makes predictions based on their similarity. Owing to the transparency of the case-based reasoning architecture, prototypes have also been extended to other problems, including hierarchical classification and zero-shot classification [7], [8].

However, humans tend to recognize objects by hierarchically comparing features of different granularities [9], [10], as shown at the top of Figure 1, a process that most current deep models fail to imitate. Instead, current deep models typically make predictions over all classes in a single pass, as shown at the bottom of Figure 1. Predicting all classes in a single layer, without distinguishing levels of granularity, prevents the model from extracting discriminative features and makes its decision-making process harder for humans to understand.
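To make the case-based reasoning step concrete, the following is a minimal sketch of a prototype classifier in PyTorch, assuming a generic CNN backbone; the class name `PrototypeClassifier`, the distance-to-similarity mapping, and all dimensions are illustrative choices rather than the exact architecture of [6]. Spatial feature patches are compared against learned prototype vectors, the best match per prototype is kept, and a linear layer combines the resulting similarity scores into class logits.

```python
import torch
import torch.nn as nn


class PrototypeClassifier(nn.Module):
    """Minimal, hypothetical sketch of prototype-based case-based reasoning."""

    def __init__(self, backbone: nn.Module, feat_dim: int,
                 num_prototypes: int, num_classes: int):
        super().__init__()
        self.backbone = backbone                          # any CNN feature extractor
        # Learned prototype vectors, one similarity score per prototype.
        self.prototypes = nn.Parameter(torch.randn(num_prototypes, feat_dim))
        # Linear layer turning prototype similarities into class logits.
        self.classifier = nn.Linear(num_prototypes, num_classes, bias=False)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        feats = self.backbone(x)                          # (B, feat_dim, H, W)
        B, D, H, W = feats.shape
        patches = feats.permute(0, 2, 3, 1).reshape(B, H * W, D)
        # Squared L2 distance between every spatial patch and every prototype.
        dists = ((patches.unsqueeze(2) - self.prototypes) ** 2).sum(dim=-1)
        # Map distance to similarity (small distance -> large similarity) and
        # keep the best-matching patch for each prototype.
        sims = torch.log((dists + 1.0) / (dists + 1e-4)).max(dim=1).values
        return self.classifier(sims)                      # (B, num_classes)


# Toy usage: a single conv layer stands in for a real backbone.
backbone = nn.Conv2d(3, 64, kernel_size=3, padding=1)
model = PrototypeClassifier(backbone, feat_dim=64, num_prototypes=30, num_classes=10)
logits = model(torch.randn(2, 3, 32, 32))                 # shape (2, 10)
```

Because each class logit is a weighted sum of similarities to named prototypes, the prediction can be traced back to the training cases those prototypes resemble, which is the source of the architecture's transparency.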