I. Introduction
Major depressive disorder (MDD) [1], also known as depression, is a complex and severe mental illness that has gained widespread attention [2]. Patients suffering from depression may experience persistent low mood, diminished interest, sleep disturbances, and appetite dysregulation, significantly impacting their quality of life and potentially leading to suicidal ideation [3]. However, with the use of medication and psychotherapy, depressive symptoms can be alleviated and improved [4], [5]. Therefore, early assessment of the severity of depression is crucial in mitigating the adverse effects of these symptoms. In clinical practice, the assessment of depression typically relies on experienced doctors conducting structured [6] or semistructured interviews and utilizing standardized self-rating scales for evaluating the severity of depression, e.g., PHQ-8 [7]. Nevertheless, this approach is subjective, labor-intensive, and time-consuming, especially considering the persistent increase in the number of individuals affected by depression. The task of depression assessment can be regarded as a specific information processing task. In recent years, due to the successful results achieved by deep learning in information processing [8], [9], [10], automatic depression assessment based on deep learning has emerged as a new possibility, offering an objective, convenient, and efficient auxiliary diagnostic approach by analyzing depression cues in multimodal data. Currently, significant research on depression assessment based on deep learning has been proposed [11], [12], [13], [14], providing an up-and-coming promising prospect for achieving automatic depression analysis.