High-performance TCP/IP and UDP/IP networking in DEC OSF/1 for Alpha AXP

Network speeds have been increasing rapidly. The higher bandwidth communication links available now, ranging from 100 Mbits/sec to Gbits/sec, present considerable potential for distributed applications. Processor speeds have also increased relentlessly. However, until now the ultimate throughput delivered to the user application has not increased as rapidly. The paper describes the implementation of the TCP/IP and UDP/IP protocol suite on Digital's Alpha AXP systems with the DEC OSF/1 operating system over FDDI. User applications are able to achieve almost the full FDDI bandwidth of 100 Mbits/sec, thus essentially eliminating the end-system as a bottleneck for network I/O bottleneck. Included in the TCP/IP implementation are extensions to TCP recently adopted by the IETF such as support for large transport windows for higher performance. This is particularly desirable for high speed networks and/or large delay networks. Incremental work for data movement and checksums are often the most expensive operations of protocol processing. These have been optimized to take advantage of the Alpha XP workstation architecture including 64 bit support, wider cache lines and the coherence of cache blocks with DMA. We show, via measurement results that TCP achieves a throughput of 95 Mbits/sec. We also show that UDP performance is comparable. In addition, and unlike typical BSD-derived systems, the UDP receive throughput to user applications is also maintained at high load.<<ETX>>

[1]  William E. Lorensen,et al.  Object-Oriented Modeling and Design , 1991, TOOLS.

[2]  David Hutchison,et al.  Integrated quality of service for multimedia communications , 1993, IEEE INFOCOM '93 The Conference on Computer Communications, Proceedings.

[3]  A. Retrospective,et al.  The UNIX Time-sharing System , 1977 .

[4]  Michael D. Smith,et al.  The Impact of Operating System Structure on Personal Computer Performance , 1995 .

[5]  Eoin Hyden,et al.  Operating system support for quality of service , 1994 .

[6]  David A. Mosher,et al.  User-Process Communication Performance in Networks of Computers , 1988, IEEE Trans. Software Eng..

[7]  W. Richard Stevens,et al.  TCP/IP Illustrated, Volume 2: The Implementation , 1995 .

[8]  J. Lumley,et al.  A high-throughput network interface to a RISC workstation , 1992, IEEE Workshop on the Architecture and Implementation of High Performance Communication Subsystems.

[9]  Larry L. Peterson,et al.  Increasing network throughput by integrating protocol layers , 1993, TNET.

[10]  Srinivasan Keshav,et al.  On the Efficient Implementation of Fair Queueing , 1991 .

[11]  K. K. Ramakrishnan,et al.  Performance Considerations in Designing Network Interfaces , 1993, IEEE J. Sel. Areas Commun..

[12]  Andrew J. T. Colin,et al.  The Implementation , 1972, Softw. Pract. Exp..

[13]  P. Steenkiste,et al.  Analysis of the Nectar Communication Processor , 1992, IEEE Workshop on the Architecture and Implementation of High Performance Communication Subsystems.

[14]  David D. Clark,et al.  Architectural considerations for a new generation of protocols , 1990, SIGCOMM '90.

[15]  P. Druschel,et al.  Network subsystem design , 1993, IEEE Network.

[16]  Craig Partridge,et al.  A faster UDP , 1993, TNET.

[17]  Jon Postel,et al.  User Datagram Protocol , 1980, RFC.

[18]  David E. Culler,et al.  A case for NOW (networks of workstation) , 1995, PODC '95.

[19]  Keith Bostic,et al.  The design and implementa-tion of the 4.4BSD operating system , 1996 .

[20]  Jeffrey C. Mogul,et al.  The packer filter: an efficient mechanism for user-level network code , 1987, SOSP '87.

[21]  F. Bitz,et al.  Host interface design for ATM LANs , 1991, [1991] Proceedings 16th Conference on Local Computer Networks.

[22]  Bruce S. Davie A host-network interface architecture for ATM , 1991, SIGCOMM 1991.

[23]  Richard L. Sites,et al.  Alpha Architecture Reference Manual , 1995 .

[24]  Proceedings of the 13th Conference on Local Computer Networks, LCN 1988, October 10-12, 1988, Minneapolis, Minnesota, USA , 1988, LCN.

[25]  Scott Shenker,et al.  Observations on the dynamics of a congestion control algorithm: the effects of two-way traffic , 1991, SIGCOMM 1991.

[26]  David Banks,et al.  A High-Performance Network Architecture for a PA-RISC Workstation , 1993, IEEE J. Sel. Areas Commun..

[27]  Jean-Bernard Stefani,et al.  Open Microkernel Technology, Key to Evolving Telecommunication Systems and Networks * , 1995 .

[28]  Scott Shenker,et al.  Analysis and simulation of a fair queueing algorithm , 1989, SIGCOMM '89.

[29]  David A. Patterson,et al.  Computer Architecture: A Quantitative Approach , 1969 .

[30]  John K. Ousterhout,et al.  Why Aren't Operating Systems Getting Faster As Fast as Hardware? , 1990, USENIX Summer.

[31]  Raj Jain Performance analysis of FDDI token ring networks: effect of parameters and guidelines for setting TTRT , 1990, SIGCOMM '90.

[32]  Dave Katz,et al.  Transmission of IP and ARP over FDDI Networks , 1993, RFC.

[33]  J. Bradley Chen,et al.  Kernel instrumentation tools and techniques , 1995 .

[34]  David D. Clark,et al.  Modularity and efficiency in protocol implementation , 1982, RFC.

[35]  Robert T. Braden,et al.  Requirements for Internet Hosts - Application and Support , 1989, RFC.

[36]  Brian N. Bershad,et al.  The interaction of architecture and operating system design , 1991, ASPLOS IV.

[37]  George Varghese,et al.  Efficient fair queueing using deficit round-robin , 1996, TNET.

[38]  Larry L. Peterson,et al.  Making paths explicit in the Scout operating system , 1996, OSDI '96.

[39]  Samuel J. Leffler,et al.  The design and implementation of the 4.3 BSD Unix operating system , 1991, Addison-Wesley series in computer science.

[40]  Ragunathan Rajkumar,et al.  On Predictable Operating System Protocol Processing , 1994 .

[41]  Srinivasan Keshav,et al.  Comparison of rate-based service disciplines , 1991, SIGCOMM '91.

[42]  Jochen Liedtke,et al.  Toward real microkernels , 1996, CACM.

[43]  Hui Zhang,et al.  Service disciplines for guaranteed performance service in packet-switching networks , 1995, Proc. IEEE.

[44]  Gopalakrishnan Ramamurthy,et al.  An analytical model for UNIX® systems , 1988, AT&T Technical Journal.

[45]  Jonathan M. Smith,et al.  A High-Performance Host Interface for ATM Networks , 1991, SIGCOMM.

[46]  Robert T. Braden,et al.  Requirements for Internet Hosts - Communication Layers , 1989, RFC.

[47]  Andrew T. Campbell,et al.  A quality of service architecture , 1994, CCRV.

[48]  Guy Pujolle,et al.  Introduction to queueing networks , 1987 .

[49]  Gordon S. Blair,et al.  Architectural principles and techniques for distributed multimedia application support in operating systems , 1995, OPSR.

[50]  Tom Shanley,et al.  PCI System Architecture , 1993 .

[51]  Joseph Pasquale,et al.  Profiling and reducing processing overheads in TCP/IP , 1996, TNET.

[52]  Gordon S. Blair,et al.  Supporting continuous media applications in a micro-kernel environment , 1993, Architecture and Protocols for High-Speed Networks.

[53]  Peter Druschel,et al.  Operating system support for high-speed communication , 1996, CACM.

[54]  K. K. Ramakrishnan,et al.  Scheduling issues for interfacing to high speed networks , 1992, [Conference Record] GLOBECOM '92 - Communications for Global Users: IEEE.

[55]  Guru M. Parulkar,et al.  Experimental evaluation of SUNOS IPC and TCP/IP protocol implementation , 1993, TNET.

[56]  Van Jacobson,et al.  TCP Extensions for High Performance , 1992, RFC.

[57]  Greg Chesson,et al.  XTP/PE overview , 1988, Proceedings [1988] 13th Conference on Local Computer Networks.

[58]  Larry L. Peterson,et al.  Fbufs: a high-bandwidth cross-domain transfer facility , 1994, SOSP '93.

[59]  Jon Postel,et al.  Internet Protocol , 1981, RFC.

[60]  Soha Hassoun,et al.  A 200-MHz 64-bit Dual-Issue CMOS Microprocessor , 1992, Digit. Tech. J..

[61]  David R. Cheriton,et al.  The VMP network adapter board (NAB): high-performance network communication for multiprocessors , 1988, SIGCOMM 1988.

[62]  W. Richard Stevens,et al.  TCP/IP Illustrated, Volume 1: The Protocols , 1994 .

[63]  Larry L. Peterson,et al.  The x-Kernel: An Architecture for Implementing Network Protocols , 1991, IEEE Trans. Software Eng..

[64]  Van Jacobson,et al.  Congestion avoidance and control , 1988, SIGCOMM '88.

[65]  R. L. Stewart,et al.  The Design of the DEC 3000 AXP Systems, Two High-performance Workstations , 1992, Digit. Tech. J..

[66]  Peter Druschel,et al.  Lazy receiver processing (LRP): a network subsystem architecture for server systems , 1996, OSDI '96.