Automatic speech recognition using interlaced derivative pattern for cloud based healthcare system

Cloud computing offers flexibility, scalability, and ubiquity in data acquisition, storage, and transmission. These properties can greatly benefit remote healthcare, among other applications. This paper proposes a cloud-based framework for speech-enabled healthcare. In the proposed framework, a patient, or any healthy person seeking medical assistance, can send a request by speech command. The commands are managed and processed on the cloud server. Any doctor with proper authentication can receive the request and, after analyzing it, assist the patient or person. This paper also proposes a new feature extraction technique, the interlaced derivative pattern (IDP), for the automatic speech recognition (ASR) system deployed on the cloud server. The IDP exploits the relative Mel-filter bank coefficients along different neighborhood directions in the speech signal. Experimental results show that the proposed IDP-based ASR system performs reasonably well even when the speech is transmitted via smartphones.
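To make the idea of comparing Mel-filter bank coefficients along different neighborhood directions more concrete, the following is a minimal sketch of an IDP-style encoding. It is not the paper's exact formulation. It assumes the input is a log Mel-filter bank matrix of shape (frames, bands), uses first-order differences along four directions (0, 45, 90, and 135 degrees), and sets a bit whenever the directional derivative changes sign between a cell and one of its eight neighbors. The names `DIRECTIONS`, `NEIGHBORS`, `directional_derivative`, and `idp_like_codes` are hypothetical and chosen only for illustration.

```python
import numpy as np

# Assumed (d_frame, d_band) offsets for the 0, 45, 90 and 135 degree directions.
DIRECTIONS = [(0, 1), (-1, 1), (-1, 0), (-1, -1)]

# The eight neighbors of a cell; neighbor k is compared using direction k % 4,
# which is what "interlaces" the four derivative maps into a single code.
NEIGHBORS = [(0, 1), (-1, 1), (-1, 0), (-1, -1),
             (0, -1), (1, -1), (1, 0), (1, 1)]


def shifted(matrix, offset):
    """Return the matrix whose cell (r, c) holds matrix[r + dr, c + dc] (edge-padded)."""
    rows, cols = matrix.shape
    dr, dc = offset
    padded = np.pad(matrix, 1, mode="edge")
    return padded[1 + dr:1 + dr + rows, 1 + dc:1 + dc + cols]


def directional_derivative(fbank, offset):
    """First-order difference between each cell and its neighbor at `offset`."""
    return fbank - shifted(fbank, offset)


def idp_like_codes(fbank):
    """Encode each time-frequency cell as an 8-bit pattern (illustrative only)."""
    derivs = [directional_derivative(fbank, o) for o in DIRECTIONS]
    codes = np.zeros(fbank.shape, dtype=np.uint8)
    for bit, offset in enumerate(NEIGHBORS):
        d = derivs[bit % 4]
        # Bit is set when the derivative at the neighbor does not share
        # the (nonzero) sign of the derivative at the center cell.
        sign_change = (d * shifted(d, offset)) <= 0
        codes |= sign_change.astype(np.uint8) << bit
    return codes


# Usage with synthetic data standing in for real log Mel-filter bank output.
rng = np.random.default_rng(0)
fbank = rng.normal(size=(50, 24))        # 50 frames, 24 Mel bands (assumed sizes)
codes = idp_like_codes(fbank)
histogram = np.bincount(codes.ravel(), minlength=256)  # one possible feature vector
```

A histogram of the codes is only one possible way to turn the patterns into a fixed-length feature vector; the abstract does not say how the features are pooled or which classifier consumes them.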
