Network troubleshooting: Survey, Taxonomy and Challenges

With the robust improvement of the Internet, there are lots of network issues existing in the network systems. Many research works proposed different approaches to resolve the network issues in order to enhance the performance of network management and guarantee the user’s QoE (Quality of Experience). However, it is not easy to have the global views related to the network issues and the troubleshooting approaches. This paper presents a survey on troubleshooting with a special concentration on network issues. The main contribution of this survey is a detailed analysis of state-of-the-art related to network issues troubleshooting which evaluates their benefits and drawbacks. Moreover, these research works are classified into some categories to provide the useful guideline for troubleshooting research. Open issues and challenges in this field are also discussed. To the best of our knowledge, this is the first survey on troubleshooting of network issues.

[1]  Nick McKeown,et al.  Where is the debugger for my software-defined network? , 2012, HotSDN '12.

[2]  Randy H. Katz,et al.  X-Trace: A Pervasive Network Tracing Framework , 2007, NSDI.

[3]  Helen J. Wang,et al.  Automatic Misconfiguration Troubleshooting with PeerPressure , 2004, OSDI.

[4]  Srikanth Kandula,et al.  Shrink: a tool for failure diagnosis in IP networks , 2005, MineNet '05.

[5]  Victor O. K. Li,et al.  An Overview of Packet Reordering in Transmission Control Protocol (TCP): Problems, Solutions, and Challenges , 2007, IEEE Transactions on Parallel and Distributed Systems.

[6]  George Varghese,et al.  Automatic Test Packet Generation , 2012, IEEE/ACM Transactions on Networking.

[7]  Dinkar Sitaram,et al.  Software Troubleshooting Using Machine Learning , 2017, 2017 IEEE 24th International Conference on High Performance Computing Workshops (HiPCW).

[8]  Hai Anh Tran,et al.  A LSTM based framework for handling multiclass imbalance in DGA botnet detection , 2018, Neurocomputing.

[9]  Helen J. Wang,et al.  Server-based inference of Internet link lossiness , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[10]  Patrick Thiran,et al.  Using End-to-End Data to Infer Lossy Links in Sensor Networks , 2006, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications.

[11]  Paul Barford,et al.  Improving accuracy in end-to-end packet loss measurement , 2005, SIGCOMM '05.

[12]  Ming Zhu,et al.  Malware traffic classification using convolutional neural network for representation learning , 2017, 2017 International Conference on Information Networking (ICOIN).

[13]  Ivan Beschastnikh,et al.  NetCheck: Network Diagnoses from Blackbox Traces , 2014, NSDI.

[14]  Donald F. Towsley,et al.  Multicast-based inference of network-internal characteristics: accuracy of packet loss estimation , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[15]  David Wetherall,et al.  Forwarding without loops in Icarus , 2002, 2002 IEEE Open Architectures and Network Programming Proceedings. OPENARCH 2002 (Cat. No.02EX571).

[16]  M. Anand Kumar,et al.  Troubleshooting Networks Using Internet Control Message Protocol , 2009 .

[17]  Franck Le,et al.  Detecting Network-Wide and Router-Specific Misconfigurations Through Data Mining , 2009, IEEE/ACM Transactions on Networking.

[18]  Jean C. Walrand,et al.  Knowledge-Defined Networking: Modelització de la xarxa a través de l’aprenentatge automàtic i la inferència , 2016 .

[19]  Aditya Akella,et al.  NetReplay: a new network primitive , 2010, PERV.

[20]  C. Siva Ram Murthy,et al.  Loss classification in optical burst switching networks using machine learning techniques: improving the performance of TCP , 2008, IEEE Journal on Selected Areas in Communications.

[21]  Nick G. Duffield,et al.  Simple network performance tomography , 2003, IMC '03.

[22]  Geoffrey M. Voelker,et al.  NetPrints: Diagnosing Home Network Misconfigurations Using Shared Knowledge , 2009, NSDI.

[23]  R. Chandra,et al.  WiFiProfiler: cooperative diagnosis in wireless LANs , 2006, MobiSys '06.

[24]  Nick Feamster,et al.  Detecting BGP configuration faults with static analysis , 2005 .

[25]  Marco Chiesa,et al.  Prelude: Ensuring Inter-Domain Loop-Freedom in SDN-Enabled Networks , 2018, APNet '18.

[26]  C. Valliyammai,et al.  A survey on link failures in software defined networks , 2015, 2015 Seventh International Conference on Advanced Computing (ICoAC).

[27]  Nick McKeown,et al.  I Know What Your Packet Did Last Hop: Using Packet Histories to Troubleshoot Networks , 2014, NSDI.

[28]  Yin Zhang,et al.  Troubleshooting chronic conditions in large IP networks , 2008, CoNEXT '08.

[29]  Albert G. Greenberg,et al.  IP fault localization via risk modeling , 2005, NSDI.

[30]  Ronald W. Wolff,et al.  Poisson Arrivals See Time Averages , 1982, Oper. Res..

[31]  Ahmad Faraahi,et al.  An Anomaly-Based Method for DDoS Attacks Detection using RBF Neural Networks , 2011 .

[32]  Michael I. Jordan,et al.  Failure diagnosis using decision trees , 2004 .

[33]  Maria Rita Palattella,et al.  SDN-RADAR: Network troubleshooting combining user experience and SDN capabilities , 2015, Proceedings of the 2015 1st IEEE Conference on Network Softwarization (NetSoft).

[34]  Nick G. Duffield,et al.  Network Tomography of Binary Network Performance Characteristics , 2006, IEEE Transactions on Information Theory.

[35]  Tianyin Xu,et al.  EnCore: exploiting system environment and correlation information for misconfiguration detection , 2014, ASPLOS.

[36]  Marco Mellia,et al.  Exploiting hybrid measurements for network troubleshooting , 2014, 2014 16th International Telecommunications Network Strategy and Planning Symposium (Networks).

[37]  Franck Le,et al.  Detecting network-wide and router-specific misconfigurations through data mining , 2009, TNET.

[38]  Renata Teixeira,et al.  NetDiagnoser: troubleshooting network unreachabilities using end-to-end probes and routing data , 2007, CoNEXT '07.

[39]  Jason Flinn,et al.  Automatically Generating Predicates and Solutions for Configuration Troubleshooting , 2009, USENIX Annual Technical Conference.