Predicting mental disorder from noisyquestionnaires: an anomaly detection approach based on keywordextraction and machine learning techniques

The early warning of mental disorders is of great importance for the psychological well-being of college students. The accuracy of conventional scaling methods on questionnaires is generally low in predicting mental disorders, as the questionnaires contain much noise, and the processing on the questionnaires is rudimentary. To address this problem, we propose a novel anomaly detection framework on questionnaires, which represents each questionnaire as a document, and applies keyword extraction and machine learning techniques to detect abnormal questionnaires. We also propose a new keyword statistic for the calculation of option significance and three interpretable machine learning models for the calculation of question significance. Experiments demonstrate the effectiveness of our proposed methods.

[1]  R. Likert “Technique for the Measurement of Attitudes, A” , 2022, The SAGE Encyclopedia of Research Design.

[2]  Amir Hussain,et al.  A novel approach to stance detection in social media tweets by fusing ranked lists and sentiments , 2021, Inf. Fusion.

[3]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[4]  Norman Blaikie,et al.  Analyzing Quantitative Data , 2012 .

[5]  S. Jamieson Likert scales: how to (ab)use them , 2004, Medical education.

[6]  A. Beck,et al.  An inventory for measuring depression. , 1961, Archives of general psychiatry.

[7]  Wayne Usher,et al.  Predicting Australia’s university students’ mental health status , 2019, Health promotion international.

[8]  P. Frazier,et al.  Understanding stress as an impediment to academic performance , 2018, Journal of American college health : J of ACH.

[9]  Vladimir Vapnik,et al.  Support-vector networks , 2004, Machine Learning.

[10]  C. Bhat,et al.  Predicting the mental health of college students with psychological capital , 2018, Journal of mental health.

[11]  Jameson K. Hirsch,et al.  Depression, Loneliness, and Suicide Risk among Latino College Students: A Test of a Psychosocial Interaction Model. , 2018, Social work.

[12]  D. Russell,et al.  The revised UCLA Loneliness Scale: concurrent and discriminant validity evidence. , 1980, Journal of personality and social psychology.

[13]  O. Morera,et al.  Does body dissatisfaction predict mental health outcomes in a sample of predominantly Hispanic college students , 2009 .

[14]  Ali Daud,et al.  Finding rising stars through hot topics detection , 2021, Future Gener. Comput. Syst..

[15]  Susan K. Johnson,et al.  Physical and Psychological Health Predict Adherence to an Online Mindfulness Program for College Students , 2020 .

[16]  G. Norman Likert scales, levels of measurement and the “laws” of statistics , 2010, Advances in health sciences education : theory and practice.

[17]  Wenjun Cao,et al.  The psychological impact of the COVID-19 epidemic on college students in China , 2020, Psychiatry Research.

[18]  Jesus J. Caban,et al.  A Neural Network Based Model for Predicting Psychological Conditions , 2015, BIH.

[19]  R. Bonner,et al.  Toward a predictive model of suicidal ideation and behavior: some preliminary data in college students. , 1987, Suicide & life-threatening behavior.

[20]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[21]  Lawrence D. Jackel,et al.  Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[22]  F. Prevatt,et al.  Variables predicting the mental health status of Chinese college students. , 2008, Asian Journal of Psychiatry.

[23]  Clement T. Yu,et al.  A theory of term importance in automatic text analysis , 1974, J. Am. Soc. Inf. Sci..

[24]  R Beiter,et al.  The prevalence and correlates of depression, anxiety, and stress in a sample of college students. , 2015, Journal of affective disorders.

[25]  Fabrice Alizon,et al.  Keyword extraction: Issues and methods , 2019, Natural Language Engineering.

[26]  S. Finch,et al.  Prevalence and socio-demographic correlates of psychological distress among students at an Australian university , 2016 .

[27]  A. Rich,et al.  Causes of Depression in College Students: A Cross-Lagged Panel Correlational Analysis , 1987 .

[28]  Peter D. Turney Learning Algorithms for Keyphrase Extraction , 2000, Information Retrieval.