Signal compression and enhancement using a new orthogonal-polynomial-based discrete transform

Discrete orthogonal functions are important tools in digital signal processing. These functions received considerable attention in the last few decades. This study proposes a new set of orthogonal functions called discrete Krawtchouk–Tchebichef transform (DKTT). Two traditional orthogonal polynomials, namely, Krawtchouk and Tchebichef, are combined to form DKTT. The theoretical and mathematical frameworks of the proposed transform are provided. DKTT was tested using speech and image signals from a well-known database under clean and noisy environments. DKTT was applied in a speech enhancement algorithm to evaluate the efficient removal of noise from speech signal. The performance of DKTT was compared with that of standard transforms. Different types of distance (similarity index) and objective measures in terms of image quality, speech quality, and speech intelligibility assessments were used for comparison. Experimental tests show that DKTT exhibited remarkable achievements and excellent results in signal compression and speech enhancement. Therefore, DKTT can be considered as a new set of orthogonal functions for futuristic applications of signal processing.

[1]  R. Mukundan,et al.  Some computational aspects of discrete orthonormal moments , 2004, IEEE Transactions on Image Processing.

[2]  Abd. Rahman Ramli,et al.  Image edge detection operators based on orthogonal polynomials , 2017 .

[3]  Michael Unser,et al.  Wavelets, filterbanks, and the Karhunen-Loève transform , 1998, 9th European Signal Processing Conference (EUSIPCO 1998).

[4]  Victor H. Moll,et al.  Irresistible Integrals: Symbolics, Analysis and Experiments in the Evaluation of Integrals , 2004 .

[5]  Majid Ahmadi,et al.  Pattern recognition with moment invariants: A comparative study and new results , 1991, Pattern Recognit..

[6]  Syed Abdul Rahman Al-Haddad,et al.  Low-Distortion MMSE Speech Enhancement Estimator Based on Laplacian Prior , 2017, IEEE Access.

[7]  Yi Hu,et al.  Subjective comparison and evaluation of speech enhancement algorithms , 2007, Speech Commun..

[8]  Roland T. Chin,et al.  On Image Analysis by the Methods of Moments , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Jan Flusser,et al.  Pattern recognition by affine moment invariants , 1993, Pattern Recognit..

[10]  Raveendran Paramesran,et al.  New orthogonal polynomials for speech signal and image processing , 2012, IET Signal Process..

[11]  Atul Puri,et al.  Video coding using the MPEG-2 compression standard , 1993, Other Conferences.

[12]  Steven W. Smith,et al.  The Scientist and Engineer's Guide to Digital Signal Processing , 1997 .

[13]  N. Ahmed,et al.  Discrete Cosine Transform , 1996 .

[14]  Wissam A. Jassim,et al.  Speech quality assessment using 2D neurogram orthogonal moments , 2016, Speech Commun..

[15]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[16]  Kim-Han Thung,et al.  Content-based image quality metric using similarity measure of moment vectors , 2012, Pattern Recognit..

[17]  Huijun Ding,et al.  Speech enhancement in transform domain. , 2011 .

[18]  Soo Ngee Koh,et al.  Noisy speech enhancement using discrete cosine transform , 1998, Speech Commun..

[19]  Yi Hu,et al.  Evaluation of Objective Quality Measures for Speech Enhancement , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[20]  Raveendran Paramesran,et al.  Image analysis by Krawtchouk moments , 2003, IEEE Trans. Image Process..

[21]  Raveendran Paramesran,et al.  Enhancing noisy speech signals using orthogonal moments , 2014, IET Signal Process..

[22]  Hafiz Imtiaz,et al.  A DCT-based noisy speech enhancement method using Teager energy operator , 2013, 2013 5th International Conference on Knowledge and Smart Technology (KST).

[23]  Sim Heng Ong,et al.  Image Analysis by Tchebichef Moments , 2001, IEEE Trans. Image Process..

[24]  Ronald E. Crochiere,et al.  A weighted overlap-add method of short-time Fourier analysis/Synthesis , 1980 .

[25]  Wenyi Wang,et al.  Hiding depth information in compressed 2D image/video using reversible watermarking , 2015, Multimedia Tools and Applications.

[26]  Mrinal Nandi,et al.  On commutativity of Discrete Fourier Transform , 2015, Inf. Process. Lett..

[27]  H. Shu,et al.  General form for obtaining discrete orthogonal moments , 2010 .

[28]  Soo Ngee Koh,et al.  Low distortion speech enhancement , 2000 .

[29]  Md. Kamrul Hasan,et al.  MMSE estimator for speech enhancement considering the constructive and destructive interference of noise , 2010 .

[30]  Wen-Hsiang Tsai,et al.  Moment-preserving edge detection and its application to image data compression , 1993 .

[31]  Ching-Ta Lu Reduction of musical residual noise for speech enhancement using masking properties and optimal smoothing , 2007, Pattern Recognit. Lett..

[32]  Athanassios N. Skodras,et al.  A fast picture compression technique , 1994 .

[33]  Zhang Chunhong,et al.  Digital Watermarking Based on Discrete Cosine Transformation , 2010, 2010 International Forum on Information Technology and Applications.

[34]  Miikka Vilermo,et al.  Energy Compaction Property of the MDCT in Comparison with Other Transforms , 2000 .

[35]  Raveendran Paramesran,et al.  Face Recognition Using Discrete Tchebichef-Krawtchouk Transform , 2012, 2012 IEEE International Symposium on Multimedia.

[36]  Syed Abdul Rahman Al-Haddad,et al.  On Computational Aspects of Tchebichef Polynomials for Higher Polynomial Order , 2017, IEEE Access.

[37]  James M Kates,et al.  Coherence and the speech intelligibility index. , 2004, The Journal of the Acoustical Society of America.

[38]  Raveendran Paramesran,et al.  Image Analysis Using Hahn Moments , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39]  David Zhang,et al.  FSIM: A Feature Similarity Index for Image Quality Assessment , 2011, IEEE Transactions on Image Processing.

[40]  Andries P. Hekstra,et al.  Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).