Hybrid Sankey diagrams: Visual analysis of multidimensional data for understanding resource use

Abstract Sankey diagrams are used to visualise flows of materials and energy in many applications, to aid understanding of losses and inefficiencies, to map out production processes, and to give a sense of scale across a system. As available data and models become increasingly complex and detailed, new types of visualisation may be needed. For example, when looking for opportunities to reduce steel scrap through supply chain integration, it is not enough to consider simply flows of “steel” — the alloy, thickness, coating and forming history of the metal can be critical. This paper combines data-visualisation techniques with the traditional Sankey diagram to propose a new type of “hybrid” Sankey diagram, which is better able to visualise these different aspects of flows. There is more than one way to visualise a dataset as a Sankey diagram, and different ways are appropriate in different situations. To facilitate this, a systematic method is presented for generating different hybrid Sankey diagrams from a dataset, with an accompanying open-source Python implementation. A common data structure for flow data is defined, through which this method can be used to generate Sankey diagrams from different data sources such as material flow analysis, life-cycle inventories, or directly measured data. The approach is introduced with a series of visual examples, and applied to a real database of global steel flows.

[1]  Andreas Möller,et al.  Foundations and applications of computer based material flow networks for environmental management , 2001 .

[2]  Christian S. Jensen,et al.  A foundation for capturing and querying complex multidimensional data , 2001, Inf. Syst..

[3]  Giovanni Lozza,et al.  Thermodynamic analysis of air-blown gasification for IGCC applications , 2011 .

[4]  Anders Grimvall,et al.  Data Cubes and Matrix Formulae for Convenient Handling of Physical Flow Data , 2006 .

[5]  Jonathan M Cullen,et al.  Mapping the global flow of aluminum: from liquid aluminum to end-use goods. , 2013, Environmental science & technology.

[6]  Helwig Hauser,et al.  Parallel Sets: interactive exploration and visual analysis of categorical data , 2006, IEEE Transactions on Visualization and Computer Graphics.

[7]  Julian M. Allwood,et al.  Research data supporting “Hybrid Sankey diagrams: visual analysis of multidimensional data for understanding resource use” , 2016 .

[8]  Ralph Kimball,et al.  The Data Warehouse Toolkit: Practical Techniques for Building Dimensional Data Warehouses , 1996 .

[9]  Cornelius Jordache,et al.  The Importance of Data Reconciliation and Gross Error Detection , 1999 .

[10]  Julian M. Allwood,et al.  Designing Climate Change Mitigation Plans That Add Up , 2013, Environmental science & technology.

[11]  P. Riehmann,et al.  Interactive Sankey diagrams , 2005, IEEE Symposium on Information Visualization, 2005. INFOVIS 2005..

[12]  Bin Su,et al.  Sankey diagram framework for energy and exergy flows , 2014 .

[13]  Gerald Kalt,et al.  Biomass streams in Austria: Drawing a complete picture of biogenic material flows within the national economy , 2015 .

[14]  Mario Schmidt,et al.  The Sankey Diagram in Energy and Material Flow Management , 2008 .

[15]  Faramarz F. Samavati,et al.  EnergyViz: an interactive system for visualization of energy systems , 2015, The Visual Computer.

[16]  Helmut Rechberger,et al.  Practical handbook of material flow analysis , 2003 .

[17]  Klaas Nuttbohm,et al.  Visualising sustainability communication with Sankey diagrams - a viable approach? , 2009, EnviroInfo.

[18]  Julian M. Allwood,et al.  Incremental Material Flow Analysis with Bayesian Inference , 2018 .

[19]  Mitsuhiko Toda,et al.  Methods for Visual Understanding of Hierarchical System Structures , 1981, IEEE Transactions on Systems, Man, and Cybernetics.

[20]  Julian M. Allwood,et al.  The efficient use of energy: Tracing the global flow of energy from fuel to service , 2010 .

[21]  Julian M. Allwood,et al.  Visualising a Stochastic Model of Californian Water Resources Using Sankey Diagrams , 2013, Water Resources Management.

[22]  Julian M. Allwood,et al.  Mapping the Global Flow of Tungsten to Identify Key Material Efficiency and Supply Security Opportunities , 2015 .

[23]  Derek L. Diener,et al.  Scrapping steel components for recycling—Isn’t that good enough? Seeking improvements in automotive component end-of-life , 2016 .

[24]  Philip S. Yu,et al.  Graph OLAP: a multi-dimensional framework for graph data analysis , 2009, Knowledge and Information Systems.

[25]  Surajit Chaudhuri,et al.  An overview of data warehousing and OLAP technology , 1997, SGMD.

[26]  Emden R. Gansner,et al.  A Technique for Drawing Directed Graphs , 1993, IEEE Trans. Software Eng..

[27]  Jonathan M Cullen,et al.  Mapping the global flow of steel: from steelmaking to end-use goods. , 2012, Environmental science & technology.

[28]  J. Allwood,et al.  Material Stock Demographics: Cars in Great Britain. , 2016, Environmental science & technology.

[29]  Roland Clift,et al.  Time-dependent material flow analysis of iron and steel in the UK: Part 1: Production and consumption trends 1970-2000 , 2007 .

[30]  Grant M. Kopec,et al.  Land use implications of future energy system trajectories—The case of the UK 2050 Carbon Plan , 2015 .