Healthcare Big Data Voice Pathology Assessment Framework

The fast-growing healthcare big data plays an important role in healthcare service providing. Healthcare big data comprise data from different structured, semi-structured, and unstructured sources. These data sources vary in terms of heterogeneity, volume, variety, velocity, and value that traditional frameworks, algorithms, tools, and techniques are not fully capable of handling. Therefore, a framework is required that facilitates collection, extraction, storage, classification, processing, and modeling of this vast heterogeneous volume of data. This paper proposes a healthcare big data framework using voice pathology assessment (VPA) as a case study. In the proposed VPA system, two robust features, MPEG-7 low-level audio and the interlaced derivative pattern, are used for processing the voice or speech signals. The machine learning algorithms in the form of a support vector machine, an extreme learning machine, and a Gaussian mixture model are used as the classifier. In the experiments, the proposed VPA system shows its efficiency in terms of accuracy and time requirement.

[1]  Jake Luo,et al.  Big Data Application in Biomedical Research and Health Care: A Literature Review , 2016, Biomedical informatics insights.

[2]  Yunhao Liu,et al.  Big Data: A Survey , 2014, Mob. Networks Appl..

[3]  Cécile Paris,et al.  We Feel: Mapping Emotion on Twitter , 2015, IEEE Journal of Biomedical and Health Informatics.

[4]  Pedro Gómez Vilda,et al.  Dimensionality Reduction of a Pathological Voice Quality Assessment System Based on Gaussian Mixture Models and Short-Term Cepstral Parameters , 2006, IEEE Transactions on Biomedical Engineering.

[5]  Jiasong Mu,et al.  Throat polyp detection based on compressed big data of voice with support vector machine algorithm , 2014, EURASIP Journal on Advances in Signal Processing.

[6]  Ghulam Muhammad,et al.  An Investigation of Multidimensional Voice Program Parameters in Three Different Databases for Voice Pathology Detection and Classification. , 2017, Journal of voice : official journal of the Voice Foundation.

[7]  M. Shamim Hossain,et al.  Big Data-Driven Service Composition Using Parallel Clustered Particle Swarm Optimization in Mobile Environment , 2016, IEEE Transactions on Services Computing.

[8]  Ghulam Muhammad,et al.  Environment Recognition Using Selected MPEG-7 Audio Features and Mel-Frequency Cepstral Coefficients , 2010, 2010 Fifth International Conference on Digital Telecommunications.

[9]  Emad A. Mohammed,et al.  Applications of the MapReduce programming framework to clinical big data analysis: current landscape and future trends , 2014, BioData Mining.

[10]  Rajiv Ranjan,et al.  Trustworthy Processing of Healthcare Big Data in Hybrid Clouds , 2015, IEEE Cloud Computing.

[11]  Muhammad Ghulam,et al.  Voice pathology detection using interlaced derivative pattern on glottal source excitation , 2017, Biomed. Signal Process. Control..

[12]  Muhammad Ghulam Automatic speech recognition using interlaced derivative pattern for cloud based healthcare system , 2015, Cluster Computing.

[13]  Alistair A. Young,et al.  Big Heart Data: Advancing Health Informatics Through Data Sharing in Cardiovascular Imaging , 2015, IEEE Journal of Biomedical and Health Informatics.

[14]  Muhammad Ghulam,et al.  Pathological voice detection and binary classification using MPEG-7 audio features , 2014, Biomed. Signal Process. Control..

[15]  M. Shamim Hossain,et al.  Cloud-assisted Industrial Internet of Things (IIoT) - Enabled framework for health monitoring , 2016, Comput. Networks.

[16]  Carmen C. Y. Poon,et al.  Big Data for Health , 2015, IEEE Journal of Biomedical and Health Informatics.

[17]  Min Chen,et al.  Smart Clothing: Connecting Human with Clouds and Big Data for Sustainable Health Monitoring , 2016, Mobile Networks and Applications.

[18]  Peter J. Hunter,et al.  Big Data, Big Knowledge: Big Data for Personalized Healthcare , 2015, IEEE Journal of Biomedical and Health Informatics.

[19]  D. Jamieson,et al.  Identification of pathological voices using glottal noise measures. , 2000, Journal of speech, language, and hearing research : JSLHR.

[20]  E. A. Mary Anita,et al.  A Survey of Big Data Analytics in Healthcare and Government , 2015 .

[21]  Houbing Song,et al.  Mobile Cloud Computing Model and Big Data Analysis for Healthcare Applications , 2016, IEEE Access.

[22]  Min Chen,et al.  Disease Prediction by Machine Learning Over Big Data From Healthcare Communities , 2017, IEEE Access.

[23]  Ameneh Shobeirinejad,et al.  Gender Classification Using Interlaced Derivative Patterns , 2010, 2010 20th International Conference on Pattern Recognition.

[24]  Aileni Raluca Maria,et al.  Cloud computing for big data from biomedical sensors monitoring, storage and analyze , 2015, 2015 Conference Grid, Cloud & High Performance Computing in Science (ROLCG).

[25]  Ghulam Muhammad Automatic speech recognition using interlaced derivative pattern for cloud based healthcare system , 2015 .

[26]  Hongming Zhou,et al.  Extreme Learning Machine for Regression and Multiclass Classification , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[27]  Altug Akay,et al.  Mining Social Media Big Data for Health , 2015 .

[28]  M. Shamim Hossain,et al.  Simultaneously aided diagnosis model for outpatient departments via healthcare big data analytics , 2018, Multimedia Tools and Applications.