I. Introduction
As an active microwave detection system, a synthetic aperture radar (SAR) plays an indispensable role in various application fields on the account of its unique operating characteristics of all-weather, all-day, and high resolution. Automatic target recognition (ATR) is a basic application in the SAR community. Yet, it appears to be one of the most challenging tasks for SAR image interpretation [1]. After decades of unremitting exploration and efforts, scholars have made proud breakthroughs in SAR target recognition [2], [3], [4]. Broadly speaking, these significant achievements can be grouped into three main streams, i.e., template-based [5], model-based [6], and learning-based one [7], [8], [9]. Among them, the template-based method infers the class attribute of the target through calculating the similarity between the template of each category and the sample to be recognized. The model-based method is dedicated to constructing a sophisticated physical model for each category of the target, which can effectively characterize the properties of the target. Different from the first two categories of methods, a learning-based method no longer extracts the features of the target in the image domain, but mines the discriminative information of the target in the predefined transformation domain, whereas, no matter in which domain, the existing methods rely severely upon handcrafted features, thereby often encountering challenges in complex SAR application scenarios.