1. Introduction
Humans can effectively utilize prior knowledge to easily learn new concepts given just a few examples. Fewshot learning [11], [20], [15] aims to acquire some transferable knowledge like humans, where a classifier is able to generalize to new classes when given only one or a few labeled examples of each class, i. e., one-or few-shot. In this paper, we focus on the ability of learning how to compare, namely metric-based methods. Metric-based method-s [2], [11], [22], [23], [25] often consist of a feature extractor and a metric module. Given an unlabeled query image and a few labeled sample images, the feature extractor first generates embeddings for all input images, and then the metric module measures distances between the query embedding and sample embeddings to give a recognition result.
Two situations where the comparison ability of RN will be limited. The top row shows the two compared images, and the bottom row shows their extracted features, where blue areas represent the response of corresponding semantic objects. (a) The convolutional kernel fails to involve the two objects. (b) The convolutional kernel fails to involve the same fine-grained features.