Gender Recognition by Voice using an Improved Self-Labeled Algorithm

Speech recognition has various applications including human to machine interaction, sorting of telephone calls by gender categorization, video categorization with tagging and so on. Currently, machine learning is a popular trend which has been widely utilized in various fields and applications, exploiting the recent development in digital technologies and the advantage of storage capabilities from electronic media. Recently, research focuses on the combination of ensemble learning techniques with the semi-supervised learning framework aiming to build more accurate classifiers. In this paper, we focus on gender recognition by voice utilizing a new ensemble semi-supervised self-labeled algorithm. Our preliminary numerical experiments demonstrate the classification efficiency of the proposed algorithm in terms of accuracy, leading to the development of stable and robust predictive models.

[1]  Avrim Blum,et al.  The Bottleneck , 2021, Monopsony Capitalism.

[2]  Zhi-Hua Zhou,et al.  SETRED: Self-training with Editing , 2005, PAKDD.

[3]  Ninad Bhatt,et al.  Classification Techniques for Speech Recognition: A Review , 2015 .

[4]  Zhi-Hua Zhou,et al.  Tri-training: exploiting unlabeled data using three classifiers , 2005, IEEE Transactions on Knowledge and Data Engineering.

[5]  Sherali Zeadally,et al.  Handling big data: research challenges and future directions , 2016, The Journal of Supercomputing.

[6]  Daniel S. Kermany,et al.  Identifying Medical Diagnoses and Treatable Diseases by Image-Based Deep Learning , 2018, Cell.

[7]  Jindrich Matousek,et al.  Experiment with GMM-Based Artefact Localization in Czech Synthetic Speech , 2015, TSD.

[8]  Tassos A. Mikropoulos,et al.  An Ensemble-Based Semi-Supervised Approach for Predicting Students’ Performance , 2018 .

[9]  Gaurav Aggarwal,et al.  Speech Feature Extraction for Gender Recognition , 2016 .

[10]  Panayiotis E. Pintelas,et al.  On Ensemble SSL Algorithms for Credit Scoring Problem , 2018, Informatics.

[11]  J. L. Hodges,et al.  Rank Methods for Combination of Independent Experiments in Analysis of Variance , 1962 .

[13]  Ghazaala Yasmin,et al.  Discrimination of male and female voice using occurrence pattern of spectral flux , 2017, 2017 International Conference on Intelligent Computing, Instrumentation and Control Technologies (ICICICT).

[14]  Yan Zhou,et al.  Democratic co-learning , 2004, 16th IEEE International Conference on Tools with Artificial Intelligence.

[15]  Luiz Moutinho,et al.  The effect of voice emotion response on brand recall by gender , 2017 .

[16]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[17]  Igor Bisio,et al.  SPECTRA: A SPEech proCessing plaTform as smaRtphone Application , 2015, 2015 IEEE International Conference on Communications (ICC).

[18]  Zhi-Hua Zhou,et al.  Improve Computer-Aided Diagnosis With Machine Learning Techniques Using Undiagnosed Samples , 2007, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[19]  Ali Osman Cibikdiken,et al.  Voice Gender Recognition Using Deep Learning , 2016 .

[20]  Seyyed Mohammad Reza Hashemi,et al.  A Review of Some Semi-Supervised Learning Methods , 2016 .

[21]  Philip S. Yu,et al.  Top 10 algorithms in data mining , 2007, Knowledge and Information Systems.

[22]  Natália D. Aredes,et al.  Radiology Data from The Cancer Genome Atlas Stomach Adenocarcinoma [TCGA-STAD] collection , 2016 .

[23]  Jiří Přibil,et al.  GMM-based speaker age and gender classification in Czech and Slovak , 2017 .

[24]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[25]  Louis ten Bosch,et al.  Speaker normalization for automatic speech recognition — An on-line approach , 1998, 9th European Signal Processing Conference (EUSIPCO 1998).

[26]  H. Finner On a Monotonicity Problem in Step-Down Multiple Test Procedures , 1993 .

[27]  Constantinos Kolias,et al.  RuleMR: Classification rule discovery with MapReduce , 2014, 2014 IEEE International Conference on Big Data (Big Data).

[28]  Panayiotis E. Pintelas,et al.  An Ensemble SSL Algorithm for Efficient Chest X-Ray Image Classification , 2018, J. Imaging.

[29]  David Yarowsky,et al.  Unsupervised Word Sense Disambiguation Rivaling Supervised Methods , 1995, ACL.

[30]  Francisco Herrera,et al.  Self-labeled techniques for semi-supervised learning: taxonomy, software and empirical study , 2015, Knowledge and Information Systems.

[31]  Oludayo O. Olugbara,et al.  Gender Voice Recognition Using Random Forest Recursive Feature Elimination with Gradient Boosting Machines , 2018, 2018 International Conference on Advances in Big Data, Computing and Data Communication Systems (icABCD).

[32]  Andreas Holzinger,et al.  Introduction to MAchine Learning & Knowledge Extraction (MAKE) , 2017, Mach. Learn. Knowl. Extr..

[33]  Friedhelm Schwenker,et al.  Combining Committee-Based Semi-Supervised Learning and Active Learning , 2010, Journal of Computer Science and Technology.

[34]  Massimo Ferri,et al.  Why Topology for Machine Learning and Knowledge Extraction? , 2018, Mach. Learn. Knowl. Extr..

[35]  Liming Chen,et al.  A general audio classifier based on human perception motivated model , 2007, Multimedia Tools and Applications.

[36]  Piotr Dziurzanski,et al.  An analysis of the influence of acoustical adverse conditions on speaker gender identification , 2014, XXII Annual Pacific Voice Conference (PVC).

[37]  Eduardo R. Hruschka,et al.  A Survey and Comparative Study of Tweet Sentiment Analysis via Semi-Supervised Learning , 2016, ACM Comput. Surv..

[38]  Elisabeth André,et al.  Improving Automatic Emotion Recognition from Speech via Gender Differentiaion , 2006, LREC.

[39]  Jindrich Matousek,et al.  GMM-based speaker gender and age classification after voice conversion , 2016, 2016 First International Workshop on Sensing, Processing and Learning for Intelligent Machines (SPLINE).

[40]  Aiko M. Hormann,et al.  Programs for Machine Learning. Part I , 1962, Inf. Control..