Low-Complexity Hardware Design for Fast Solving LSPs With Coordinated Polynomial Solution

This paper presents a low-complexity algorithm and the corresponding hardware based on the coordinated polynomial solutions for solving line spectrum pairs (LSPs). To improve the computation of LSPs, the enhanced Tschirnhaus transform (ETT) is proposed to accelerate the coordinated polynomial solution. The proposed ETT can replace fractional multiplication with addition and shift operations, so unnecessary operations are avoided. To further simplify the hardware of the ETT, three designs are presented: the preprocessing block (PPB), the iterative root-finding block (IRFB), and the closed-form solution block (CFSB). The PPB provides a design with less gate counts that can effectively transform LPCs into general-form polynomials. Such polynomials can be further decomposed into roots using the proposed IRFB based on the Birge-Vieta method. A pipeline-recursive framework is implemented in the IRFB to save calculations. To improve hardware utilization, this paper also analyzes the coefficients relationship of the ETT by introducing the data dependency graph to design the proposed functional blocks in CFSB. The experimental results show that the proposed hardware achieves a 40-fold improvement in throughput and reduces 1.16% of gate counts at the hardware synthesis level; the chip area is 1.29 mm2. The precision analysis indicates the average log spectral distance is 0.310. Moreover, the ETT in the proposed hardware only requires 29.9% of multiplication compared with the original one. Such results reveal that the proposed work is superior to the baseline work, thereby demonstrating the effectiveness of the proposed design.

[1]  Carlo Magi,et al.  Properties of line spectrum pair polynomials: a review , 2006 .

[2]  Chung-Hsien Wu,et al.  A novel two-level method for the computation of the LSP frequencies using a decimation-in-degree algorithm , 1997, IEEE Trans. Speech Audio Process..

[3]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[4]  Shi-Huang Chen,et al.  The Use of k-Means Algorithm to Compute the Line Spectrum Pair Frequencies with Tschirnhaus Transform , 2010, 2010 Sixth International Conference on Intelligent Information Hiding and Multimedia Signal Processing.

[5]  Biing-Hwang Juang,et al.  Optimal quantization of LSP parameters , 1993, IEEE Trans. Speech Audio Process..

[6]  M. J. Narasimha,et al.  Speech over VoIP Networks: Advanced Signal Processing and System Implementation , 2012, IEEE Circuits and Systems Magazine.

[7]  Wai C. Chu,et al.  Speech Coding Algorithms , 2003 .

[8]  Douglas O'Shaughnessy,et al.  A novel feature extractor employing regularized MVDR spectrum estimator and subband spectrum enhancement technique , 2013, 2013 8th International Workshop on Systems, Signal Processing and their Applications (WoSSPA).

[9]  Jhing-Fa Wang,et al.  Chip design of portable speech memopad suitable for persons with visual disabilities , 2002, IEEE Trans. Speech Audio Process..

[10]  A. Mabrouk,et al.  A low-complexity, hardware architecture for a parametric, real-time, LSF speech decoder , 2011, 2011 IEEE 15th International Symposium on Consumer Electronics (ISCE).

[11]  Cao Wei,et al.  A high-performance reconfigurable VLSI architecture for vbsme in H.264 , 2008, IEEE Transactions on Consumer Electronics.

[12]  Sid-Ahmed Selouani,et al.  A noise-robust front-end for distributed speech recognition in mobile communications , 2007, Int. J. Speech Technol..

[13]  Kai Wang,et al.  Parameter Generation Considering LSP Ordering Property for HMM-Based Speech Synthesis , 2012, IEEE Signal Processing Letters.

[14]  Richard G. Lyons Multiplier-Free Divide, Square Root, and Log Algorithms , 2012 .

[15]  Ravi P. Ramachandran,et al.  VLSI architecture for the efficient computation of line spectral frequencies , 2003, Proceedings of the 2003 International Symposium on Circuits and Systems, 2003. ISCAS '03..

[16]  Shi-Huang Chen,et al.  The computation of line spectrum pair frequencies using Tschirnhaus transform , 2009, 2009 IEEE International Symposium on Circuits and Systems.

[17]  Allen Gersho,et al.  Hybrid coding: combined harmonic and waveform coding of speech at 4 kb/s , 2001, IEEE Trans. Speech Audio Process..

[18]  A. Gray,et al.  Distance measures for speech processing , 1976 .

[19]  Begnaud Francis Hildebrand,et al.  Introduction to numerical analysis: 2nd edition , 1987 .

[20]  Joseph Rothweiler On polynomial reduction in the computation of LSP frequencies , 1999, IEEE Trans. Speech Audio Process..

[21]  R.P. Ramachandran,et al.  ASIC design for the efficient computation of line spectral frequencies using Chebyshev polynomial series , 2007, 2007 50th Midwest Symposium on Circuits and Systems.

[22]  M. Omair Ahmad,et al.  Fast computation of the discrete Fourier transform of real data , 1997, IEEE Trans. Signal Process..

[23]  Hiromasa Fujihara,et al.  A musical mood trajectory estimation method using lyrics and acoustic features , 2011, MIRUM '11.

[24]  Goutam Saha,et al.  On the use of perceptual Line Spectral pairs Frequencies and higher-order residual moments for Speaker Identification , 2010, Int. J. Biom..

[25]  Feng Li,et al.  Multiplier-Free Divide, Square Root, and Log Algorithms [DSP Tips and Tricks] , 2011, IEEE Signal Processing Magazine.

[26]  S. S. Yedlapalli Transforming real linear prediction coefficients to line spectral representations with a real FFT , 2005, IEEE Transactions on Speech and Audio Processing.

[27]  Gil-Jin Jang,et al.  Efficient spectrum estimation of noise using line spectral pairs for robust speech recognition , 2011 .

[28]  Shi-Huang Chen,et al.  The computation of line spectrum pair using modified complex-free Ferrari formula , 2005, 2005 International Symposium on Intelligent Signal Processing and Communication Systems.

[29]  Kai Wang,et al.  Speech parameter generation considering LSP ordering property for HMM-based speech synthesis , 2012, 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO).

[30]  Turaj Zakizadeh Shabestary,et al.  LSP quantization by a union of locally trained codebooks , 2005, IEEE Transactions on Speech and Audio Processing.

[31]  Peter Kabal,et al.  The computation of line spectral frequencies using Chebyshev polynomials , 1986, IEEE Trans. Acoust. Speech Signal Process..

[32]  R.P. Ramachandran,et al.  ASIC implementation of efficient line spectral frequency computation for speech coding applications , 2003, 2003 46th Midwest Symposium on Circuits and Systems.

[33]  F. Itakura Line spectrum representation of linear predictor coefficients of speech signals , 1975 .

[34]  Biing-Hwang Juang,et al.  Line spectrum pair (LSP) and speech data compression , 1984, ICASSP.