Few-shot learning of Parkinson's disease speech data with optimal convolution sparse kernel transfer learning

Abstract The classification of Parkinson’s disease speech data is useful and popular. However, the existing public Parkinson’s disease(PD) speech datasets are characterized by small sample sizes, and the possible reason is that labeled speech data from PD patients are scarce. To solve the few-shot problem, a PD classification algorithm based on sparse kernel transfer learning combined with simultaneous sample and feature selection is proposed in this paper. Sparse kernel transfer learning is used to extract the effective structural information of PD speech features from public datasets as source domain data, and the fast alternating direction method of multipliers (ADMM) iteration is improved to enhance the information extraction performance. First, features are extracted from a public speech dataset to construct a feature dataset as the source domain. Then, the PD target domain, including the training and test datasets, is encoded by convolution sparse coding, which can extract more in-depth information. Next, simultaneous optimization is implemented. To further improve the classification performance, a convolution kernel optimization mechanism is designed. In the experimental section, two representative PD speech datasets are used for verification; the first dataset is a frequently used public dataset, and the second dataset is constructed by the authors. Over ten relevant algorithms are compared with the proposed method. The results show that the proposed algorithm achieves obvious improvements in terms of classification accuracy. The study also found that the improvements are considerable when compared with nontransfer learning approaches, demonstrating that the proposed transfer learning approach is more effective and has an acceptable time cost.

[1]  Xiao Huang,et al.  Empirical Bayes Transfer Learning for Uncertainty Characterization in Predicting Parkinson’s Disease Severity , 2018, IISE Transactions on Healthcare Systems Engineering.

[2]  Xiaoming Yuan,et al.  A proximal point algorithm revisit on the alternating direction method of multipliers , 2013 .

[3]  Hang Chang,et al.  Unsupervised Transfer Learning via Multi-Scale Convolutional Sparse Coding for Biomedical Applications , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Najim Dehak,et al.  LSTM Siamese Network for Parkinson’s Disease Detection from Speech , 2019, 2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP).

[5]  Jianhua Gu,et al.  A New Hybrid Intelligent Framework for Predicting Parkinson’s Disease , 2017, IEEE Access.

[6]  Joachim Denzler,et al.  Deep Learning on Small Datasets without Pre-Training using Cosine Loss , 2019, 2020 IEEE Winter Conference on Applications of Computer Vision (WACV).

[7]  Yongming Li,et al.  [Research on Diagnosis Algorithm of Parkinson’s Disease Based on Speech Sample Multi-edit and Random Forest]. , 2016, Sheng wu yi xue gong cheng xue za zhi = Journal of biomedical engineering = Shengwu yixue gongchengxue zazhi.

[8]  Jean Schoentgen,et al.  On the harmonic-to-noise ratio as an acoustic cue of vocal timbre of Parkinson speakers , 2017, Biomed. Signal Process. Control..

[9]  Haewon Byeon,et al.  Characteristics of Hypokinetic Dysarthria Patients' Speech based on Sustained Vowel Phonation and Connected Speech , 2016 .

[10]  Vishal M. Patel,et al.  Convolutional Sparse and Low-Rank Coding-Based Image Decomposition , 2017, IEEE Transactions on Image Processing.

[11]  Graham W. Taylor,et al.  Deconvolutional networks , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[12]  Jirí Mekyska,et al.  Prosodic analysis of neutral, stress-modified and rhymed speech in patients with Parkinson's disease , 2016, Comput. Methods Programs Biomed..

[13]  Amir Shmuel,et al.  Performance of machine learning methods in diagnosing Parkinson’s disease based on dysphonia measures , 2017, Biomedical Engineering Letters.

[14]  Fikret S. Gürgen,et al.  Collection and Analysis of a Parkinson Speech Dataset With Multiple Types of Sound Recordings , 2013, IEEE Journal of Biomedical and Health Informatics.

[15]  Yun Li,et al.  Stable dysphonia measures selection for Parkinson speech rehabilitation via diversity regularized ensemble , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[16]  J. Benge,et al.  Brief Report: Knowledge of, Interest in, and Willingness to Try Behavioral Interventions in Individuals With Parkinson's Disease. , 2018, Advances in mind-body medicine.

[17]  Filip Sroubek,et al.  Fast convolutional sparse coding using matrix inversion lemma , 2016, Digit. Signal Process..

[18]  Y. Zhang Can a Smartphone Diagnose Parkinson Disease? A Deep Neural Network Method and Telediagnosis System Implementation , 2017, Parkinson's disease.

[19]  Alexandre Mendes,et al.  Evolutionary Wavelet Neural Network ensembles for breast cancer and Parkinson’s disease prediction , 2018, PloS one.

[20]  Juan Ignacio Godino-Llorente,et al.  Towards the identification of Idiopathic Parkinson’s Disease from the speech. New articulatory kinetic biomarkers , 2017, PloS one.

[21]  Anders P. Eriksson,et al.  Fast Convolutional Sparse Coding , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[22]  Ashkan Sami,et al.  A Multiple-Classifier Framework for Parkinson's Disease Detection Based on Various Vocal Tests , 2016, International journal of telemedicine and applications.

[23]  Marcos Faúndez-Zanuy,et al.  Perceptual Features as Markers of Parkinson's Disease: The Issue of Clinical Interpretability , 2022, Recent Advances in Nonlinear Speech Processing.

[24]  B. Bloem,et al.  Being in control of Parkinson’s disease: A qualitative study of community-dwelling patients’ coping with changes in care , 2018, The European journal of general practice.

[25]  Habib Benali,et al.  Automatic detection of early stages of Parkinson's disease through acoustic voice analysis with mel-frequency cepstral coefficients , 2017, 2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP).

[26]  Yongming Li,et al.  Classification of Parkinson’s disease utilizing multi-edit nearest-neighbor and ensemble learning algorithms with speech samples , 2016, BioMedical Engineering OnLine.

[27]  Fethullah Karabiber,et al.  A Machine Learning System for the Diagnosis of Parkinson’s Disease from Speech Signals and Its Application to Multiple Speech Signal Types , 2016, Arabian Journal for Science and Engineering.

[28]  Ping Wang,et al.  Simultaneous learning of speech feature and segment for classification of Parkinson disease , 2017, 2017 IEEE 19th International Conference on e-Health Networking, Applications and Services (Healthcom).

[29]  Shivajirao M. Jadhav,et al.  Feature Ensemble Learning Based on Sparse Autoencoders for Diagnosis of Parkinson’s Disease , 2018, Advances in Intelligent Systems and Computing.

[30]  K. Forrest,et al.  The Impact of Lexical Characteristics and Noise on Intelligibility of Parkinsonian Speech. , 2018, Journal of speech, language, and hearing research : JSLHR.

[31]  Jakub Nalepa,et al.  Selecting training sets for support vector machines: a review , 2018, Artificial Intelligence Review.

[32]  Guandong Xu,et al.  Refining Parkinson’s neurological disorder identification through deep transfer learning , 2019, Neural Computing and Applications.

[33]  Ahmed Hammouch,et al.  Analysis of multiple types of voice recordings in cepstral domain using MFCC for discriminating between patients with Parkinson’s disease and healthy people , 2016, International Journal of Speech Technology.

[34]  Yongming Li,et al.  [Combining speech sample and feature bilateral selection algorithm for classification of Parkinson's disease]. , 2018, Sheng wu yi xue gong cheng xue za zhi = Journal of biomedical engineering = Shengwu yixue gongchengxue zazhi.

[35]  Stephen P. Boyd,et al.  Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers , 2011, Found. Trends Mach. Learn..

[36]  Vijay Parsa,et al.  Assessment of Amplified Parkinsonian Speech Quality Using Deep Learning , 2018, 2018 IEEE Canadian Conference on Electrical & Computer Engineering (CCECE).

[37]  J. Paralič,et al.  Parkinson's disease patients classification based on the speech signals , 2017, 2017 IEEE 15th International Symposium on Applied Machine Intelligence and Informatics (SAMI).

[38]  Qionghai Dai,et al.  Convolutional Sparse Coding for RGB+NIR Imaging , 2018, IEEE Transactions on Image Processing.

[39]  Brendt Wohlberg,et al.  Efficient Algorithms for Convolutional Sparse Representations , 2016, IEEE Transactions on Image Processing.

[40]  Mahmut Kaya,et al.  Classification of Parkinson speech data by metric learning , 2017, 2017 International Artificial Intelligence and Data Processing Symposium (IDAP).

[41]  Ahmed Hammouch,et al.  Using Human Factor Cepstral Coefficient on Multiple Types of Voice Recordings for Detecting Patients with Parkinson's Disease , 2017 .

[42]  Sai-Ho Ling,et al.  An Efficient Diagnosis System for Parkinson's Disease Using Deep Belief Network , 2017 .

[43]  Maria Grazia Busà,et al.  Speech Rhythm in Parkinson's Disease: A Study on Italian , 2016, INTERSPEECH.

[44]  Jesús Francisco Vargas-Bonilla,et al.  Parkinson's Disease Progression Assessment from Speech Using GMM-UBM , 2016, INTERSPEECH.