Reducing Data-Memory Footprint of Multimedia Applications by Delay Redistribution

It is now common for multimedia applications to be partitioned and mapped onto multiple processing elements of a system-on-chip architecture. An important design constraint in such architectures is that the FIFO buffers connecting the processing elements (in a pipelined fashion) should not overflow and the playout buffer should never underflow. To meet these constraints, an usual design practice is to increase the initial playout delay after which the output device starts reading from the playout buffer. Although implementing this technique is straightforward and involves only the the computation of an appropriate playout delay, it suffers from the downside of a large playout buffer being required. In this paper, instead of associating the playout delay solely with the output device, we propose to redistribute this delay among all the processing elements running the various tasks of the multimedia application. We show that this delay redistribution technique can signficantly reduce (up to 70%) the total on-chip memory required.

[1]  Erik Brockmeyer,et al.  Data and memory optimization techniques for embedded systems , 2001, TODE.

[2]  Ahmed Amine Jerraya,et al.  Buffer memory optimization for video codec application modeled in Simulink , 2006, 2006 43rd ACM/IEEE Design Automation Conference.

[3]  Hoeseok Yang,et al.  Buffer Minimization In RTL Synthesis From Coarse-grained Dataflow Specification , 2006 .

[4]  Dong-Ik Ko,et al.  Modeling and optimization of buffering trade-offs for hardware implementation of image processing applications , 2005, IEEE Workshop on Signal Processing Systems Design and Implementation, 2005..

[5]  Om Prakash Gangwal,et al.  Eclipse: heterogeneous multiprocessor architecture for flexible media processing , 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.

[6]  Praveen K. Murthy,et al.  Buffer merging—a powerful technique for reducing memory requirements of synchronous dataflow specifications , 2004, TODE.

[7]  Henning Schulzrinne,et al.  Adaptive playout mechanisms for packetized audio applications in wide-area networks , 1994, Proceedings of INFOCOM '94 Conference on Computer Communications.

[8]  Sander Stuijk,et al.  Exploring trade-offs in buffer requirements and throughput constraints for synchronous dataflow graphs , 2006, 2006 43rd ACM/IEEE Design Automation Conference.

[9]  Xiaolin Wu,et al.  Buffer size reduction through buffer sharing for streaming applications , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).