Age and Gender Recognition from Speech Using Deep Neural Networks

[1]  Giuseppe Riccardi,et al.  How may I help you? , 1997, Speech Commun..

[2]  Yoonjung Kang,et al.  The effect of speech rate on age estimation in conversational speech , 2020 .

[3]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[4]  Hugo Van hamme,et al.  Age Estimation from Telephone Speech using i-vectors , 2012, INTERSPEECH.

[5]  Héctor A. Sánchez-Hevia,et al.  Convolutional-recurrent Neural Network for Age and Gender Prediction from Speech , 2019, 2019 Signal Processing Symposium (SPSympo).

[6]  Sayak Paul,et al.  A review of deep learning with special emphasis on architectures, applications and recent trends , 2020, Knowl. Based Syst..

[7]  Jinwon Lee,et al.  A Fully Convolutional Neural Network for Speech Enhancement , 2016, INTERSPEECH.

[8]  John H. L. Hansen,et al.  Improved Gender Independent Speaker Recognition Using Convolutional Neural Network Based Bottleneck Features , 2017, INTERSPEECH.

[9]  Najim Dehak,et al.  Age Estimation in Short Speech Utterances Based on LSTM Recurrent Neural Networks , 2018, IEEE Access.

[10]  Yong Xu,et al.  Large-Scale Weakly Supervised Audio Classification Using Gated Convolutional Neural Network , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[11]  Ngoc Thang Vu,et al.  Attentive Convolutional Neural Network Based Speech Emotion Recognition: A Study on the Impact of Input Features, Signal Length, and Acted Speech , 2017, INTERSPEECH.

[12]  Tuomas Virtanen,et al.  Convolutional recurrent neural networks for bird audio detection , 2017, 2017 25th European Signal Processing Conference (EUSIPCO).

[13]  Chitralekha Bhat,et al.  Deploying usable speech enabled IVR systems for mass use , 2013, 2013 International Conference on Human Computer Interactions (ICHCI).

[14]  Sung Wook Baik,et al.  Speech Emotion Recognition from Spectrograms with Deep Convolutional Neural Network , 2017, 2017 International Conference on Platform Technology and Service (PlatCon).

[15]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[16]  Dimitris Pappas,et al.  Anger detection in call center dialogues , 2015, 2015 6th IEEE International Conference on Cognitive Infocommunications (CogInfoCom).

[17]  W. Pitts,et al.  How we know universals; the perception of auditory and visual forms. , 1947, The Bulletin of mathematical biophysics.

[18]  DeLiang Wang,et al.  TCNN: Temporal Convolutional Neural Network for Real-time Speech Enhancement in the Time Domain , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[19]  Manuel Rosa-Zurera,et al.  Precision Maximization in Anger Detection in Interactive Voice Response Systems , 2018 .

[20]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[21]  Keikichi Hirose,et al.  Automatic estimation of one's age with his/her speech based upon acoustic modeling techniques of speakers , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[22]  Deepu Vijayasenan,et al.  A Deep Neural Network Based End to End Model for Joint Height and Age Estimation from Short Duration Speech , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[23]  Xindong Wu,et al.  Object Detection With Deep Learning: A Review , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[24]  Florian Metze,et al.  Comparison of Four Approaches to Age and Gender Recognition for Telephone Applications , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[25]  Vladlen Koltun,et al.  An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling , 2018, ArXiv.

[26]  Hiroshi M Sasaki,et al.  Defining fundamental steps in the assembly of the Drosophila RNAi enzyme complex , 2015, Nature.