Design and Analysis of Load-Balancing Switch with Finite Buffers and Variable Size Packets

As traffic volume on the Internet grows exponentially, so does the demand for fast packet switching between asynchronous high-speed routers. Although optical fiber can provide extremely high capacity, Internet switches remain the main traffic bottleneck. The packet switching time may be on the order of nanoseconds in routers with thousands of ports, each operating at 10 GB/s. Even modern, extremely fast processing units cannot satisfy these needs. It is well known that switching such a high volume of traffic from input to output requires large buffers and fast processors to perform header processing, complex scheduling and forwarding. Although a large number of switching architectures are available on the market, a considerable part of them either do not scale or reach their limits in power consumption and complexity. Therefore, novel and highly scalable switching systems need to be investigated. The load-balancing switching approach is simple and may therefore be capable of switching and forwarding from all inputs to all outputs simultaneously with low complexity and high scalability. Because this approach has a distributed topology (each component of the switch is controlled by an individual chip) and does not require fast switch control units, primarily because each stage is independent and makes its own distributed calculations, it is a strong candidate for future practical deployment. The load-balancing switching architecture considered in this thesis has been shown to have high potential to scale up while maintaining good throughput and other performance characteristics. Additionally, the architecture can effectively resolve the important problem of packet mis-ordering, which can appear due to the distributed structure of the system.
Unfortunately, in previous research some of these characteristics were obtained under a set of strong assumptions. In particular, it was assumed that all packets transmitted through the system have equal length, that traffic is admissible and that central stage buffers are infinite. Moreover, because of the distributed control, the switch cannot regulate the amount of traffic transmitted from stage to stage inside the switch. This Ph.D. thesis analyzes the behavior of the load-balancing (LB) switch equipped with finite central stage buffers. With finite buffers, the LB switch can always drop a packet because of an overflow. In this work we first analyze the packet loss probability in the central stage buffers while considering packets of the same length (data cells). The analysis is performed for both admissible and inadmissible traffic matrices. The results show that packet loss can significantly influence the overall LB switch performance if the inputs of the switch are overloaded. To present a more realistic scenario, the packet loss analysis was then performed for a switch with variable size packets. Most Internet switches are considered to operate at the cell level (to increase buffer utilization), which means that arriving variable size packets are segmented at the inputs and reassembled at the outputs. The possible loss of a cell, and correspondingly of a packet, inside the switch can cause significant subsequent problems for the load-balancing switch reassembly unit. To evaluate packet loss we assumed Markovian behavior, so that numerically efficient algorithms can be used to solve the model. The mathematical model characterizing inhomogeneous input traffic presented in the thesis gives the most precise evaluation of the packet loss probability.
Unfortunately, the high complexity of this model results in Markov chains that are intractable even for very small switches. Consequently, as a next step, we performed the analysis with fast solution procedures under the restrictive assumption of identical stochastic processes at all inputs. The final results allowed us to conclude that a single cell drop at the central stage buffers causes the removal of the whole packet, and that the packet loss probability inside the system can be extremely high in comparison with the corresponding cell loss probability. Another important observation is that the packet loss probability depends on the path the traffic traverses, e.g. the sequential numbers of the input, the central stage buffer and the output of the switch. This property complicates the evaluation of loss probabilities for large switch sizes. Last but not least, the analysis revealed instability, congestion and large delays at the output re-sequencing and reassembly unit due to central stage packet loss. To cope with this behavior, we proposed novel algorithms that efficiently minimize or avoid packet loss at the central stage buffers of the switch. For instance, the novel minimization protocol introduces an artificial buffering threshold at the central stage buffers, such that packets are dropped at the input stage when the actual central stage buffer occupancy is above the threshold. The results show that, thanks to this possible packet removal at the input stage of the switch, the overall packet loss probability is significantly reduced. Similarly to the loss minimization service protocol, the novel NoLoss load-balancing switch uses information from both the inputs and the central stage buffers, and allows a packet to be transmitted through the switch only if the central stage buffers have enough space to accept it during the current and the following time slots.
To minimize communication overhead, the algorithm was implemented by means of a centralized controller. Finally, this kind of management helped us to reach the lower bound on the overall packet loss probability and to resolve some other important issues of the switch, such as the congestion problem of the output reassembly unit.
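
The two ideas summarized above, the amplification of cell loss into packet loss and the admission rules applied at the input stage, can be illustrated with a minimal Python sketch. It is not the model or the protocols of the thesis: all function and parameter names are hypothetical, and the loss formula assumes independent cell drops, whereas the thesis relies on Markovian models in which drops are generally correlated. The NoLoss check is likewise reduced to a single free-space comparison.

```python
def packet_loss_from_cell_loss(cell_loss: float, cells_per_packet: int) -> float:
    """Probability that a packet loses at least one of its cells.

    ASSUMPTION (not from the thesis): cells are dropped independently,
    each with probability `cell_loss`. One dropped cell removes the packet.
    """
    return 1.0 - (1.0 - cell_loss) ** cells_per_packet


def threshold_admit(occupancy: int, threshold: int) -> bool:
    """Loss-minimization rule (sketch): a packet is dropped at the input
    stage when the central stage buffer occupancy is above the threshold."""
    return occupancy <= threshold


def noloss_admit(occupancy: int, capacity: int, cells_per_packet: int) -> bool:
    """NoLoss-style rule (simplified sketch): admit a packet only if the
    central stage buffer has room for all of its cells."""
    return capacity - occupancy >= cells_per_packet


if __name__ == "__main__":
    # A 1% cell loss becomes a much larger packet loss for a 10-cell packet.
    print(packet_loss_from_cell_loss(0.01, 10))
    print(threshold_admit(occupancy=6, threshold=5))
    print(noloss_admit(occupancy=6, capacity=8, cells_per_packet=3))
```

Under the independence assumption, a cell loss probability of 0.01 and packets of 10 cells give a packet loss probability of roughly 0.096, which illustrates why packet loss can be far higher than the corresponding cell loss.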
