I. Introduction
Mental health problems, including autism spectrum disorder (ASD) and attention-deficit hyperactivity disorder (ADHD), have become a global concern, affecting over 17% of the adult population [1]. Over the past decades, computer-aided diagnosis (CAD) approaches have been developed to analyze high-resolution medical images, such as functional magnetic resonance imaging (fMRI) [2], to help address the escalating shortage of psychiatrists. fMRI provides a noninvasive, safe measure of brain activity by detecting small changes in blood flow. With advances in graphics processing units and data-driven intelligence, deep learning models have been reported to achieve outstanding performance in CAD [3]. The successful application of deep learning models usually relies on a large number of labeled training samples [4].