A Fast Implementation for Recurrent DSP Scheduling Using Final Matrix

The scheduling theory of Heemstra de Groot et al. is supplemented by extending the Final Matrix’s usefulness beyond finding iteration bounds, critical loops and subcritical loops of recursive data flow graphs (RDFGs) to scheduling. DFG is a special case of Petri nets (PN). Hence we apply the cycle time theory of PN to the scheduling of DFG. Contributions include: (1) development of explicit formulas for slack time, scheduling ranges and update, and static rate-optimal time scheduling based on entries of the final matrix; (2) development of a fastest processor assignment algorithm based on the rate-optimal static scheduling without unfolding while considering abnormal cases in which iteration bounds are fractional or smaller than some node execution times; (3) discovery of a new anomaly in addition to the above two cases; (4) development of a user-friendly Final-matrix based Integration Tool (FIT) to view critical and subcritical loops, iteration bounds, scheduling ranges, and level and processor assignment diagrams based on a single tool “final matrix” rather than other tools; (5) elimination of redundant steps such as the construction of inequality graphs; (6) development of a proof showing that the ALAP and ASAP fixed-time schedulings satisfy the firing rule; and (7) thousand-fold faster (linear time complexity) processing compared to others and use of fewer processors in the case of large DFGs.

[1]  S. M. Heemstra de Groot,et al.  A polynomial time algorithm for the computation of the iteration-period bound in recursive data flow graphs , 1992 .

[2]  Keshab K. Parhi,et al.  Dedicated DSP architecture synthesis using the MARS design system , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[3]  Sabih H. Gerez,et al.  Range-chart-guided iterative data-flow graph scheduling , 1992 .

[4]  Walter H. Kohler,et al.  A Preliminary Evaluation of the Critical Path Method for Scheduling Tasks on Multiprocessor Systems , 1975, IEEE Transactions on Computers.

[5]  Liang-Gee Chen,et al.  Rate-optimal DSP synthesis by pipeline and minimum unfolding , 1994, IEEE Trans. Very Large Scale Integr. Syst..

[6]  M. Biot,et al.  QUARTERLY OF APPLIED MATHEMATICS , 1972 .

[7]  C. V. Ramamoorthy,et al.  Extensions on performance evaluation techniques for concurrent systems , 1988, Proceedings COMPSAC 88: The Twelfth Annual International Computer Software & Applications Conference.

[8]  Richard Bellman,et al.  ON A ROUTING PROBLEM , 1958 .

[9]  C. V. Ramamoorthy,et al.  Performance Evaluation of Asynchronous Concurrent Systems Using Petri Nets , 1980, IEEE Transactions on Software Engineering.

[10]  Thomas P. Barnwell,et al.  Cyclo-static multiprocessor scheduling for the optimal realization of shift-invariant flow graphs , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[11]  D. T. Wang,et al.  Iteration bounds of single-rate data flow graphs for concurrent processing , 1993 .

[12]  Keshab K. Parhi,et al.  Algorithm transformation techniques for concurrent processors , 1989, Proc. IEEE.

[13]  J. Blazewicz,et al.  Selected Topics in Scheduling Theory , 1987 .

[14]  Markku Renfors,et al.  The maximum sampling rate of digital filters under hardware speed constraints , 1981 .

[15]  Selmer M. Johnson,et al.  A Tournament Problem , 1959 .

[16]  Thomas P. Barnwell,et al.  Optimal automatic periodic multiprocessor scheduler for fully specified flow graphs , 1993, IEEE Trans. Signal Process..

[17]  Daniel Y. Chao Performance of Multi-Rate Data Flow Graphs for Concurrent Processing , 1997, J. Inf. Sci. Eng..

[18]  Yu Hen Hu,et al.  Multiprocessor implementation of real-time DSP algorithms , 1995, IEEE Trans. Very Large Scale Integr. Syst..

[19]  Markku Renfors,et al.  Fast multiprocessor realizations of digital filters , 1980, ICASSP.

[20]  David Aaron Schwartz,et al.  Synchronous multiprocessor realizations of shift-invariant flow graphs , 1985 .