Visual analytics for text-based railway incident reports

The GB railways collect about 150,000 text-based records each year on potentially dangerous events and the numbers are on the increase in the Close Call System. The huge volume of text requires considerable human effort to its interpretation. This work focuses on visual text analysis techniques of Close Call records to extract safety lessons more quickly and efficiently. This paper treats basic steps for visual text analysis based on an evaluation test using a pre-constructed test set of 150 Close Call records for "Trespass", "Slip/Trip hazards on site" and "Level crossing". The results demonstrate that visual text analysis can be used to identify the risks in a small-scale test set but differences in language use by different cohorts of people interferes with straightforward risk identification in larger sets. This work paves the way to machine-assisted interpretation of text-based safety records which can speed up risk identification in a large corpus of text. It also demonstrates how new possibilities open up to develop interactive visualisations tools that allow data analysts to use text analysis techniques for risk analysis. Language: en

[1]  D W R Marsh,et al.  Generalizing event trees using Bayesian networks , 2008 .

[2]  Jana Diesner From Texts to Networks: Detecting and Managing the Impact of Methodological Choices for Extracting Network Data from Text Data , 2012, KI - Künstliche Intelligenz.

[3]  Patrick Thomas Hudson Integrating organisational culture into incident analyses: Extending the Bow Tie model , 2010 .

[4]  Roel Popping,et al.  Knowledge Graphs and Network Text Analysis , 2003 .

[5]  L. Freeman Centrality in social networks conceptual clarification , 1978 .

[6]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[7]  Maria Grazia Gnoni,et al.  Near-miss management systems: A methodological comparison , 2012 .

[8]  Miguel Figueres-Esteban,et al.  Learning from text-based close call data , 2016 .

[9]  Ted G. Lewis,et al.  Network Science: Theory and Applications , 2009 .

[10]  M. Pottier,et al.  Multidimensional visualization and browsing for intelligence analysis , 1994 .

[11]  Ewan Klein,et al.  Natural Language Processing with Python , 2009 .

[12]  Daniel A. Keim,et al.  Visual Analytics: Definition, Process, and Challenges , 2008, Information Visualization.

[13]  James P. Bliss,et al.  What are close calls? A proposed taxonomy to inform risk communication research , 2014 .

[14]  Dmitry Paranyushkin,et al.  Identifying the Pathways for Meaning Circulation using Text Network Analysis , 2011 .

[15]  Kristin A. Cook,et al.  Illuminating the Path: The Research and Development Agenda for Visual Analytics , 2005 .

[16]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[17]  Daniel A. Keim,et al.  Mastering the Information Age - Solving Problems with Visual Analytics , 2010 .

[18]  Jean-Loup Guillaume,et al.  Fast unfolding of community hierarchies in large networks , 2008, ArXiv.

[19]  Paul E. Keel Collaborative Visual Analytics: Inferring from the Spatial Organization and Collaborative Use of Information , 2006, 2006 IEEE Symposium On Visual Analytics Science And Technology.

[20]  Roel Popping,et al.  Computer-assisted text analysis , 2000 .

[21]  Philipp Drieger,et al.  Semantic Network Analysis as a Method for Visual Text Analytics , 2013 .

[22]  Carl Macrae,et al.  Close Calls: Managing Risk and Resilience in Airline Flight Safety , 2014 .