Loading [MathJax]/extensions/MathMenu.js
Interpretation of Depression Detection Models via Feature Selection Methods | IEEE Journals & Magazine | IEEE Xplore

Interpretation of Depression Detection Models via Feature Selection Methods


Abstract:

Given the prevalence of depression worldwide and its major impact on society, several studies employed artificial intelligence modelling to automatically detect and asses...Show More

Abstract:

Given the prevalence of depression worldwide and its major impact on society, several studies employed artificial intelligence modelling to automatically detect and assess depression. However, interpretation of these models and cues are rarely discussed in detail in the AI community, but have received increased attention lately. In this article, we aim to analyse the commonly selected features using a proposed framework of several feature selection methods and their effect on the classification results, which will provide an interpretation of the depression detection model. The developed framework aggregates and selects the most promising features for modelling depression detection from 38 feature selection algorithms of different categories. Using three real-world depression datasets, 902 behavioural cues were extracted from speech behaviour, speech prosody, eye movement and head pose. To verify the generalisability of the proposed framework, we applied the entire process to depression datasets individually and when combined. The results from the proposed framework showed that speech behaviour features (e.g. pauses) are the most distinctive features of the depression detection model. From the speech prosody modality, the strongest feature groups were F0, HNR, formants, and MFCC, while for the eye activity modality they were left-right eye movement and gaze direction, and for the head modality it was yaw head movement. Modelling depression detection using the selected features (even though there are only 9 features) outperformed using all features in all the individual and combined datasets. Our feature selection framework did not only provide an interpretation of the model, but was also able to produce a higher accuracy of depression detection with a small number of features in varied datasets. This could help to reduce the processing time needed to extract features and creating the model.
Published in: IEEE Transactions on Affective Computing ( Volume: 14, Issue: 1, 01 Jan.-March 2023)
Page(s): 133 - 152
Date of Publication: 10 November 2020

ISSN Information:

PubMed ID: 36938342

Funding Agency:

References is not available for this document.

1 Introduction

According to the World Health Organisation (WHO), major depressive disorders are an increasing global issue that leads to devastating consequences [1]. A person living with depression suffers enormously and functions poorly in everyday life tasks. Depression is a major contributor to the overall global burden of disease and is the leading cause of disability worldwide. Depression is strongly linked with non-communicable disorders, such as diabetes and heart disease, increased risk of substance use disorders, and at its worst, it can lead to suicide. Even though treatments for depression are effective, only 10 percent of depressed patients receive such treatments, where one of the barriers to effective care is inaccurate diagnoses. Misdiagnosed and untreated depression does not only affect the sufferer at a personal level, but also affects the employer and the government at an economic level.

Select All
1.
T. Vos et al., "Global regional and national incidence prevalence and years lived with disability for 310 diseases and injuries 1990–2015: A systematic analysis for the global burden of disease study 2015", Lancet, vol. 388, no. 10053, pp. 1545-1602, 2016.
2.
A. Pampouchidou et al., "Automatic assessment of depression based on visual cues: A systematic review", IEEE Trans. Affective Comput., vol. 10, no. 4, pp. 445-470, 2019.
3.
N. Cummins, S. Scherer, J. Krajewski, S. Schnieder, J. Epps and T. F. Quatieri, "A review of depression and suicide risk assessment using speech analysis", Speech Commun., vol. 71, pp. 10-49, 2015.
4.
Y. Zhu, Y. Shang, Z. Shao and G. Guo, "Automated depression diagnosis based on deep networks to encode facial appearance and dynamics", IEEE Trans. Affective Comput., vol. 9, no. 4, pp. 578-584, 2018.
5.
Y. Suhara, Y. Xu and A. S. Pentland, "DeepMood: Forecasting depressed mood based on self-reported histories via recurrent neural networks", Proc. 26th Int. Conf. World Wide Web, pp. 715-724, 2017.
6.
T. Alhanai, M. Ghassemi and J. Glass, "Detecting depression with audio/text sequence modeling of interviews", Proc. Annu. Conf. Int. Speech Commun. Assoc., pp. 1716-1720, 2018.
7.
M. A. H. M. Sadan, A. Pampouchidou and F. Meriaudeau, "Deep learning techniques for depression assessment", Proc. Int. Conf. Intell. Adv. Syst., pp. 1-5, 2018.
8.
L. He, D. Jiang and H. Sahli, "Automatic depression analysis using dynamic facial appearance descriptor and Dirichlet process fisher encoding", IEEE Trans. Multimedia, vol. 21, no. 6, pp. 1476-1486, Jun. 2019.
9.
L. Wen, X. Li, G. Guo and Y. Zhu, "Automated depression diagnosis based on facial dynamic analysis and sparse coding", IEEE Trans. Inf. Forensics Security, vol. 10, no. 7, pp. 1432-1441, Jul. 2015.
10.
A. Mendiratta et al., Automatic Detection of Depressive States from Speech, Cham, Switzerland:Springer, pp. 301-314, 2018.
11.
L. He and C. Cao, "Automated depression analysis using convolutional neural networks from speech", J. Biomed. Informat., vol. 83, pp. 103-111, 2018.
12.
S. Song, L. Shen and M. Valstar, "Human behaviour-based automatic depression analysis using hand-crafted statistics and deep learned spectral features", Proc. 13th IEEE Int. Conf. Autom. Face Gesture Recognit., pp. 158-165, 2018.
13.
A. Kacem, Z. Hammal, M. Daoudi and J. Cohn, "Detecting depression severity by interpretable representations of motion dynamics", Proc. 13th IEEE Int. Conf. Autom. Face Gesture Recognit., pp. 739-745, 2018.
14.
K. Yu, X. Wu, W. Ding and J. Pei, "Towards scalable and accurate online feature selection for big data", Proc. IEEE Int. Conf. Data Mining, pp. 660-669, 2014.
15.
K. Yu, X. Wu, W. Ding and J. Pei, "Scalable and accurate online feature selection for big data", ACM Trans. Knowl. Discov. Data, vol. 11, no. 2, pp. 1-16, Dec. 2016.
16.
X. Wu, K. Yu, H. Wang and W. Ding, "Online streaming feature selection", Proc. 27th Int. Conf. Mach. Learn., pp. 1159-1166, 2010.
17.
X. Wu, K. Yu, W. Ding, H. Wang and X. Zhu, "Online feature selection with streaming features", IEEE Trans. Pattern Anal. Mach. Intell., vol. 35, no. 5, pp. 1178-1192, May 2013.
18.
J. Zhou, D. Foster, R. Stine and L. Ungar, "Streamwise feature selection", J. Mach. Learn. Res., vol. 7, pp. 1861-1885, 2006.
19.
I. Tsamardinos, L. E. Brown and C. F. Aliferis, "The max-min hill-climbing Bayesian network structure learning algorithm", Mach. Learn., vol. 65, no. 1, pp. 31-78, Oct. 2006.
20.
C. F. Aliferis, A. Statnikov, I. Tsamardinos, S. Mani and X. D. Koutsoukos, "Local causal and Markov blanket induction for causal discovery and feature selection for classification part I: Algorithms and empirical evaluation", J. Mach. Learn. Res., vol. 11, pp. 171-234, Mar. 2010.
21.
R. Diaz-Uriarte and S. Alvarez de Andres, "Variable selection from random forests: Application to gene expression data", 2015.
22.
H. Deng and G. Runger, "Feature selection via regularized trees", Proc. Int. Joint Conf. Neural Netw., pp. 1-8, 2012.
23.
H. Deng and G. Runger, "Gene selection with guided regularized random forest", Pattern Recognit., vol. 46, no. 12, pp. 3483-3489, 2013.
24.
B. Gregorutti, B. Michel and P. Saint-Pierre, "Grouped variable importance with random forests and application to multiple functional data analysis", Comput. Statist. Data Anal., vol. 90, pp. 15-35, 2015.
25.
Z. Xu, G. Huang, K. Q. Weinberger and A. X. Zheng, "Gradient boosted feature selection", Proc. 20th ACM SIGKDD Int. Conf. Knowl. Discov. Data Mining, pp. 522-531, 2014.
26.
G. Roffo, S. Melzi and M. Cristani, "Infinite feature selection", Proc. IEEE Int. Conf. Comput. Vis., pp. 4202-4210, 2015.
27.
G. Roffo, S. Melzi, U. Castellani and A. Vinciarelli, "Infinite latent feature selection: A probabilistic latent graph-based ranking approach", Proc. IEEE Int. Conf. Comput. Vis., pp. 1407-1415, 2017.
28.
G. Roffo and S. Melzi, "Features selection via eigenvector centrality", Proc. New Front. Mining Complex Patterns, 2016.
29.
Q. Shen and A. Chouchoulas, "A modular approach to generating fuzzy rules with reduced attributes for the monitoring of complex systems", Eng. Appl. Artif. Intell., vol. 13, no. 3, pp. 263-278, 2000.
30.
A. Janusz and D. Ślezak, "Random probes in computation and assessment of approximate reducts", Proc. Rough Sets Intell. Syst. Paradigms, pp. 53-64, 2014.
Contact IEEE to Subscribe

References

References is not available for this document.