Paper Information - Chapter 7 – Exploring Trade-Offs in Performance and Programmability of Processing Element Topologies for Network Processors

Publisher Summary: This chapter explores trade-offs among processing element topologies for network processors, using an analytical framework and an IPv4 benchmark. A different partitioning of the benchmark is implemented on the IXP1200. The analytical performance modeling employed is well suited to exploring the design space for network processors: it is quite accurate compared to full system simulation, and it removes the need for detailed executable models and packet traces. In terms of the performance metrics, pipelined topologies are found to consume more local data memory and to cause greater end-to-end delay, although they require only point-to-point connections. For forwarding applications, the pool topology is the most flexible from a programmability and scalability point of view, and it also performs best on the chosen metrics. The accuracy and usefulness of the analytical framework are verified against a detailed simulation of the system.