Multi-media extensions in super-pipelined micro-architectures. A new case for SIMD processing?

General-purpose microprocessors have long been considered a computing platform unsuited to image processing and vision tasks. The so-called von Neumann paradigm and its associated memory bottleneck have motivated research into various forms of parallel processing and into special-purpose processors for vision. The SIMD approach, adopted in massively parallel processors, has been introduced in minimal form in the multimedia extensions to the instruction set architectures of standard microprocessors. This paper examines the characteristics of SIMD processing that have been mapped into these extensions.
