Voice activity detection using optimal window overlapping especially over health-care infrastructure

Purpose: This paper introduces an unsupervised algorithm for voice activity detection based on maximum margin clustering, i.e. a support vector machine formulation. K-means clustering was later applied to the speech activity detection problem, but that application did not permit the identification of voice activity, which is critical for speech recognition.

Design/methodology/approach: The authors propose a voice activity detector based on K-means clustering that permits sliding-window detection of voice and noise. The detector first requires an initial pause (a non-speech segment) for initialization. The system initialized by the algorithm is intended to run on health-care infrastructure and to provide a platform for health-care professionals to detect the clear voice of patients.

Findings: Evaluation on many recordings from the NOISEX-92 database indicates that the average non-speech hit rates across signal-to-noise ratios are higher than those of modern voice activity detection methods.

Originality/value: The research work is original.
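
The idea of K-means detection over overlapping windows, initialized from an initial pause, can be illustrated with a minimal sketch. This is not the authors' implementation: the log-energy feature, the frame length, the hop size, the number of pause frames and the function names `frame_log_energy` and `kmeans_vad` are all assumptions chosen for illustration.

```python
import numpy as np

def frame_log_energy(signal, frame_len=400, hop=200):
    """Log energy of overlapping frames (hop < frame_len gives the overlap)."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    energies = np.empty(n_frames)
    for i in range(n_frames):
        frame = signal[i * hop : i * hop + frame_len]
        energies[i] = np.log(np.mean(frame ** 2) + 1e-12)
    return energies

def kmeans_vad(signal, frame_len=400, hop=200, init_pause_frames=5, n_iter=20):
    """Label each frame as speech (True) or noise (False) with 2-means.

    Assumption: the first `init_pause_frames` frames are a pause, so their
    mean feature seeds the noise centroid; the loudest frame seeds speech.
    """
    feats = frame_log_energy(signal, frame_len, hop)
    noise_c = feats[:init_pause_frames].mean()
    speech_c = feats.max()
    for _ in range(n_iter):
        # Assignment step: nearest centroid in the 1-D feature space.
        is_speech = np.abs(feats - speech_c) < np.abs(feats - noise_c)
        if is_speech.all() or not is_speech.any():
            break  # one cluster is empty, centroids cannot be updated
        # Update step: each centroid becomes the mean of its members.
        new_noise = feats[~is_speech].mean()
        new_speech = feats[is_speech].mean()
        if np.isclose(new_noise, noise_c) and np.isclose(new_speech, speech_c):
            break  # converged
        noise_c, speech_c = new_noise, new_speech
    # Final assignment with the last centroids.
    return np.abs(feats - speech_c) < np.abs(feats - noise_c)
```

With the default values, consecutive frames overlap by 50 percent; the choice of overlap is exactly the kind of parameter the title refers to, and the values here are placeholders rather than the optimum reported by the paper.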
