Fast color image processing using quantized color instruction set

This paper describes the Quantized Color Pack eXtension (QCPX) instruction set, which accelerates color image processing applications. QCPX employs heterogeneous-subword-parallel instructions, which utilize the microprocessor's 16-bit wide datapaths to process a packed, quantized 16-bit color data type in YCbCr format (Y: luminance; Cb and Cr: chrominance) in parallel. Unlike typical multimedia instruction set extensions (e.g., MMX, SSE, AltiVec), QCPX obtains performance and code density improvements through explicit support for color pixel processing rather than depending solely upon generic subword parallelism. Five time-critical color image processing algorithms are coded with and without QCPX to measure its impact upon programming techniques. Simulation results indicate that the applications using QCPX achieve significant speedups in execution time over their non-QCPX counterparts. In addition, QCPX results in higher system utilization (in excess of 94% in all cases) due to a significant reduction in conditional instructions.
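To make the idea of a packed, quantized 16-bit YCbCr pixel more concrete, the following Python sketch emulates in software what a heterogeneous-subword operation might do. It is an illustration only, not the QCPX instruction semantics: the 6/5/5 bit split between Y, Cb, and Cr, the field order, and the function names `pack`, `unpack`, and `average` are all assumptions, since the abstract does not specify them.

```python
# Assumed field widths (not stated in the abstract): 6 bits Y, 5 bits Cb, 5 bits Cr.
Y_BITS, CB_BITS, CR_BITS = 6, 5, 5


def pack(y, cb, cr):
    """Quantize 8-bit Y, Cb, Cr components and pack them into one 16-bit word."""
    yq = y >> (8 - Y_BITS)      # keep the top 6 bits of luminance
    cbq = cb >> (8 - CB_BITS)   # keep the top 5 bits of each chrominance
    crq = cr >> (8 - CR_BITS)
    return (yq << (CB_BITS + CR_BITS)) | (cbq << CR_BITS) | crq


def unpack(word):
    """Split a packed 16-bit word back into its quantized (Y, Cb, Cr) fields."""
    crq = word & ((1 << CR_BITS) - 1)
    cbq = (word >> CR_BITS) & ((1 << CB_BITS) - 1)
    yq = (word >> (CB_BITS + CR_BITS)) & ((1 << Y_BITS) - 1)
    return yq, cbq, crq


def average(a, b):
    """Average two packed pixels field by field.

    Each field is handled separately, so no carry crosses a field
    boundary; a heterogeneous-subword instruction would do this for
    all three unequal-width fields in a single operation.
    """
    ya, cba, cra = unpack(a)
    yb, cbb, crb = unpack(b)
    y, cb, cr = (ya + yb) >> 1, (cba + cbb) >> 1, (cra + crb) >> 1
    return (y << (CB_BITS + CR_BITS)) | (cb << CR_BITS) | cr
```

In scalar code without such support, the masking and shifting in `unpack` and the re-packing step would each cost separate instructions per pixel, which is the kind of overhead that direct support for a color pixel data type is meant to remove.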
