Non-negative matrix factorization algorithms for blind source sepertion in speech recognition

The performance of the Speech recognition degrades in the presence of the multiple sources/speakers or unwanted signals such as noise. To separate the source from the other signals called as Blind Source Separation many algorithms are proposed in the literature such as Independent Component Analysis (ICA), Principle Component Analysis (PCA), Non-Negative matrix Factorization (NMF). In this paper we provide the theoretical study of the different algorithms for NMF factorization such as Least Square Error (LSE) divergence, Kullback-Leibler (KL) divergence, Itakura-saito (IS) divergence, Non-negative hidden Markov model(N-HMM), Bayesian NMF, NMF with Automatic Relevance Determinant and Complex NMF applicable for the 2-dimensional data matrix. The performance evaluation of the supervised learning and un-supervised learning is evaluated.

[1]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[2]  Jérôme Idier,et al.  Algorithms for Nonnegative Matrix Factorization with the β-Divergence , 2010, Neural Computation.

[3]  Shivashankar,et al.  Improvement of speed in data collection rate in tree based wireless sensor network , 2016, 2016 IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT).

[4]  Bhiksha Raj,et al.  Non-negative Hidden Markov Modeling of Audio with Application to Source Separation , 2010, LVA/ICA.

[5]  Shivashankar,et al.  Opportunistic routing technique for minimized energy consumption for relay node selection in wireless sensor networks , 2016, 2016 IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT).

[6]  Roberto Togneri,et al.  Time-Frequency Masking: Linking Blind Source Separation and Robust Speech Recognition , 2008 .

[7]  Hirokazu Kameoka,et al.  Formulations and algorithms for multichannel complex NMF , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[8]  Shivashankar,et al.  An efficient routing algorithm based on ant colony optimisation for VANETs , 2016, 2016 IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT).

[9]  Vincent Y. F. Tan,et al.  Automatic Relevance Determination in Nonnegative Matrix Factorization with the /spl beta/-Divergence , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Ole Winther,et al.  Bayesian Non-negative Matrix Factorization , 2009, ICA.

[11]  Jean-Francois Cardoso,et al.  Blind signal separation: statistical principles , 1998, Proc. IEEE.

[12]  Te-Won Lee,et al.  A Maximum Likelihood Approach to Single-channel Source Separation , 2003, J. Mach. Learn. Res..

[13]  C. Févotte,et al.  Automatic Relevance Determination in Nonnegative Matrix Factorization with the-Divergence , 2011 .

[14]  Abdul Rafay Khatri,et al.  ATPG method with a hybrid compaction technique for combinational digital systems , 2016, 2016 SAI Computing Conference (SAI).

[15]  P. Rajendra Prasad,et al.  Zone based hierarchical energy efficient clustering scheme for WSN , 2016, 2016 IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT).

[16]  Francis Bach,et al.  Online algorithms for nonnegative matrix factorization with the Itakura-Saito divergence , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[17]  Hirokazu Kameoka,et al.  Underdetermined BSS with multichannel complex NMF assuming W-disjoint orthogonality of source , 2011, TENCON 2011 - 2011 IEEE Region 10 Conference.

[18]  S. S. Kumar,et al.  Non-negative matrix based optimization scheme for blind source separation in automatic speech recognition system , 2016, 2016 International Conference on Communication and Electronics Systems (ICCES).

[19]  Hirokazu Kameoka,et al.  Complex NMF: A new sparse representation for acoustic signals , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[20]  Emmanuel Vincent,et al.  Enforcing Harmonicity and Smoothness in Bayesian Non-Negative Matrix Factorization Applied to Polyphonic Music Transcription , 2010, IEEE Transactions on Audio, Speech, and Language Processing.