Automatic speech recognition using interlaced derivative pattern for cloud based healthcare system

Cloud computing brings several advantages, such as flexibility, scalability, and ubiquity, to data acquisition, storage, and transmission. These advantages can greatly benefit remote healthcare, among other applications. This paper proposes a cloud-based framework for speech-enabled healthcare. In the proposed framework, a patient, or any healthy person seeking medical assistance, can send a request by speech command. The commands are managed and processed on a cloud server. Any doctor with proper authentication can receive the request and, after analyzing it, assist the patient or person. This paper also proposes a new feature extraction technique, the interlaced derivative pattern (IDP), for the automatic speech recognition (ASR) system deployed on the cloud server. The IDP exploits the relative Mel-filter bank coefficients along different neighborhood directions in the speech signal. Experimental results show that the proposed IDP-based ASR system performs reasonably well even when the speech is transmitted via smartphones.
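
The idea of comparing Mel-filter bank coefficients along different neighborhood directions can be illustrated with a small sketch. This is not the IDP algorithm as defined in the paper, which the abstract does not specify; it is a simplified, hypothetical directional-comparison encoder. It assumes the input is a frames-by-bands grid of log Mel-filter bank values, and the function name `directional_codes` and the four offsets in `DIRECTIONS` are illustrative choices.

```python
from typing import List, Sequence

# Hypothetical neighbor offsets (d_frame, d_band) for four directions.
# These are an assumption for illustration, not taken from the paper.
DIRECTIONS = [(0, 1), (-1, 1), (-1, 0), (-1, -1)]


def directional_codes(mel: Sequence[Sequence[float]]) -> List[List[int]]:
    """Encode each interior cell of a frames-by-bands grid as a 4-bit code.

    Bit k is set when the neighbor in direction k is greater than or
    equal to the center value, so the code captures relative (not
    absolute) coefficient values around each time-frequency cell.
    """
    n_frames = len(mel)
    n_bands = len(mel[0]) if n_frames else 0
    codes: List[List[int]] = []
    # Skip the border so every neighbor index stays inside the grid.
    for t in range(1, n_frames - 1):
        row: List[int] = []
        for b in range(1, n_bands - 1):
            center = mel[t][b]
            code = 0
            for bit, (dt, db) in enumerate(DIRECTIONS):
                if mel[t + dt][b + db] >= center:
                    code |= 1 << bit
            row.append(code)
        codes.append(row)
    return codes


if __name__ == "__main__":
    # Toy 3x3 grid standing in for log Mel-filter bank energies.
    toy = [[1.0, 2.0, 3.0],
           [4.0, 5.0, 6.0],
           [7.0, 8.0, 9.0]]
    print(directional_codes(toy))
```

In a recognizer, codes like these would typically be summarized (for example as histograms per frame or per block) before being passed to a classifier; that step is omitted here because the abstract does not describe it.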
