Throughput Scaling Of Convolution For Error-Tolerant Multimedia Applications

Convolution and cross-correlation are the basis of filtering and pattern or template matching in multimedia signal processing. We propose two throughput scaling options for any one-dimensional convolution kernel in programmable processors by adjusting the imprecision (distortion) of computation. Our approach is based on scalar quantization, followed by two forms of tight packing in floating-point (one of which is proposed in this paper) that allow for concurrent calculation of multiple results. We illustrate how our approach can operate as an optional pre- and post-processing layer for off-the-shelf optimized convolution routines. This is useful for multimedia applications that are tolerant to processing imprecision and for cases where the input signals are inherently noisy (error tolerant multimedia applications). Indicative experimental results with a digital music matching system and an MPEG-7 audio descriptor system demonstrate that the proposed approach offers up to 175% increase in processing throughput against optimized (full-precision) convolution with virtually no effect in the accuracy of the results. Based on marginal statistics of the input data, it is also shown how the throughput and distortion can be adjusted per input block of samples under constraints on the signal-to-noise ratio against the full-precision convolution.

[1]  Victor W. Lee,et al.  Image Processing on Multicore x86 Architectures , 2010, IEEE Signal Processing Magazine.

[2]  Susanto Rahardja,et al.  A statistics study of the MDCT coefficient distribution for audio , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[3]  Rudy Lauwereins,et al.  Robust stereo matching with fast Normalized Cross-Correlation over shape-adaptive regions , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[4]  Alexander Kadyrov,et al.  The "Invaders' Algorithm: Range of Values Modulation for Accelerated Correlation , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Bayya Yegnanarayana,et al.  Determining Number of Speakers From Multispeaker Speech Signals Using Excitation Source Information , 2007, IEEE Signal Processing Letters.

[6]  Octavio Nieto-Taladriz,et al.  Fast and accurate computation of the roundoff noise of linear time-invariant systems , 2008, IET Circuits Devices Syst..

[7]  Julien Pinquier,et al.  Distinguishing Monophonies From Polyphonies Using Weibull Bivariate Distributions , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  Neri Merhav,et al.  Approximate convolution using DCT coefficient multipliers , 1998, IEEE Trans. Circuits Syst. Video Technol..

[9]  Stewart Taylor,et al.  Optimizing Applications for Multi-Core Processors, Using the Intel® Integrated Performance Primitives, Second Edition , 2007 .

[10]  Anantha P. Chandrakasan,et al.  Low-power digital filtering using approximate processing , 1996 .

[11]  Franz Franchetti,et al.  Discrete fourier transform on multicore , 2009, IEEE Signal Processing Magazine.

[12]  Luigi di Stefano,et al.  Fast template matching using bounded partial correlation , 2003, Machine Vision and Applications.

[13]  Daniel P. W. Ellis,et al.  Cross-correlation of beat-synchronous representations for music similarity , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[14]  Stewart Taylor Optimizing Applications for Multi-Core Processors , 2007 .

[15]  Stephen A. Dyer,et al.  Digital signal processing , 2018, 8th International Multitopic Conference, 2004. Proceedings of INMIC 2004..

[16]  Yiannis Andreopoulos,et al.  Linear Image Processing Operations With Operational Tight Packing , 2010, IEEE Signal Processing Letters.