Adaptive network diagram constructions for representing big data event streams on monitoring dashboards

Critical systems that produce big data streams can require human operators to monitor these event streams for changes of interest. Automated systems which oversee many tasks can still have a need for the ‘human-in-the-loop’ operator to evaluate whether an intervention is required due to a lack of suitable training data initially offered to the system which would allow a correct course of actions to be taken. In order for an operator to be capable of reacting to real-time events, the visual depiction of the event data must be in a form which captures essential associations and is readily understood by visual inspection. A similar requirement can be found during inspections on activity protocols in a large organization where a code of correct conduct is prescribed and there is a need to oversee whether the activity traces match the expectations, with minimal delay. The methodology presented here addresses these concerns by providing an adaptive window sizing measurement for subsetting the data, and subsequently produces a set of network diagrams based upon event label co-occurrence networks. With an intuitive method of network construction the amount of time required for operators to learn how to monitor complex event streams of big datasets can be reduced.

[1]  Jarke J. van Wijk,et al.  Multivariate Network Exploration and Presentation: From Detail to Overview via Selections and Aggregations , 2014, IEEE Transactions on Visualization and Computer Graphics.

[2]  Barbora Micenková,et al.  Clustering attributed graphs: Models, measures and methods , 2015, Network Science.

[3]  Anthony Faiola,et al.  Improving Patient Safety: Integrating Data Visualization and Communication Into Icu Workflow to Reduce Cognitive Load , 2015 .

[4]  R. Kitchin,et al.  The real-time city? Big data and smart urbanism , 2013, GeoJournal.

[5]  Xiang Li,et al.  Meta Structure: Computing Relevance in Large Heterogeneous Information Networks , 2016, KDD.

[6]  Mark E. J. Newman,et al.  Structure and inference in annotated networks , 2015, Nature Communications.

[7]  Aboul Ella Hassanien,et al.  Dimensionality reduction of medical big data using neural-fuzzy classifier , 2014, Soft Computing.

[8]  Shi-Kuo Chang,et al.  Answering How-to-Reach Query in Big Attributed Graph Databases , 2017, 2017 IEEE Third International Conference on Big Data Computing Service and Applications (BigDataService).

[9]  Marco Sacco,et al.  A survey of RDF store solutions , 2014, 2014 International Conference on Engineering, Technology and Innovation (ICE).

[10]  Marina Meila,et al.  Comparing Clusterings by the Variation of Information , 2003, COLT.

[11]  Alexander V. Mantzaris,et al.  Mantzaris, Alexander , 2019 .

[12]  R. Kitchin,et al.  Urban informatics, governmentality and the logics of urban control , 2017 .

[13]  John Murphy,et al.  Reducing emergency services response time in smart cities: An advanced adaptive and fuzzy approach , 2015, 2015 IEEE First International Smart Cities Conference (ISC2).

[14]  Alan Edelman,et al.  Julia: A Fresh Approach to Numerical Computing , 2014, SIAM Rev..

[15]  Frank van Harmelen,et al.  Stream reasoning: A survey and outlook , 2017, Data Sci..

[16]  Christopher Andrews,et al.  The human is the loop: new directions for visual analytics , 2014, Journal of Intelligent Information Systems.

[17]  Seon-Young Kim,et al.  Use of Graph Database for the Integration of Heterogeneous Biological Data , 2017, Genomics & informatics.

[18]  Leo Katz,et al.  A new status index derived from sociometric analysis , 1953 .

[19]  Emden R. Gansner,et al.  Graphviz - Open Source Graph Drawing Tools , 2001, GD.

[20]  Anna Lisa Gentile,et al.  Multi-lingual Concept Extraction with Linked Data and Human-in-the-Loop , 2017, K-CAP.

[21]  Peter Haase,et al.  Optique: Zooming in on Big Data , 2015, Computer.

[22]  Chris Weaver,et al.  Multidimensional data dissection using attribute relationship graphs , 2010, 2010 IEEE Symposium on Visual Analytics Science and Technology.

[23]  Danai Koutra,et al.  Graph Summarization Methods and Applications , 2016, ACM Comput. Surv..

[24]  M. Rehman,et al.  Big data and visual analytics in anaesthesia and health care. , 2015, British journal of anaesthesia.

[25]  John T. Stasko,et al.  Network-based visual analysis of tabular data , 2011, 2011 IEEE Conference on Visual Analytics Science and Technology (VAST).

[26]  Nitesh V. Chawla,et al.  metapath2vec: Scalable Representation Learning for Heterogeneous Networks , 2017, KDD.

[27]  Sally A. McKee,et al.  Characterizing and subsetting big data workloads , 2014, 2014 IEEE International Symposium on Workload Characterization (IISWC).

[28]  Ye Zhao,et al.  TrajGraph: A Graph-Based Visual Analytics Approach to Studying Urban Network Centralities Using Taxi Trajectory Data , 2016, IEEE Transactions on Visualization and Computer Graphics.

[29]  Pascal Vasseur,et al.  Introduction to Multisensor Data Fusion , 2005, The Industrial Information Technology Handbook.

[30]  François Le Gall,et al.  Powers of tensors and fast matrix multiplication , 2014, ISSAC.

[31]  Yizhou Sun,et al.  Personalized entity recommendation: a heterogeneous information network approach , 2014, WSDM.

[32]  John T. Stasko,et al.  Ploceus: Modeling, visualizing, and analyzing tabular data as networks , 2014, Inf. Vis..

[33]  Andreas Holzinger,et al.  Interactive machine learning for health informatics: when do we need the human-in-the-loop? , 2016, Brain Informatics.

[34]  Ralf Tönjes,et al.  CityPulse: Large Scale Data Analytics Framework for Smart Cities , 2016, IEEE Access.

[35]  A. Davie,et al.  Improved bound for complexity of matrix multiplication , 2013, Proceedings of the Royal Society of Edinburgh: Section A Mathematics.

[36]  Rahul C. Basole,et al.  ecoxight , 2018, ACM Trans. Manag. Inf. Syst..

[37]  Mohammed Essaaidi,et al.  Performance evaluation of linked stream data processing engines for situational awareness applications , 2018, Concurr. Comput. Pract. Exp..

[38]  Kent A. Spackman,et al.  Using co-occurrence network structure to extract synonymous gene and protein names from MEDLINE abstracts , 2005, BMC Bioinformatics.

[39]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.