Iterative generation of insight from text collections through mutually reinforcing visualizations and fuzzy cognitive maps

Abstract Developing a comprehensive explanation of complex social phenomena is a difficult task that analysts often have to perform using vast collections of text documents. On the one hand, solutions exist to assist analysts in creating causal maps from text documents, but these can only articulate the relationships at work in a problem. On the other hand, Fuzzy Cognitive Maps (FCMs) can articulate these relationships and perform simulations, but no environment exists to help analysts in iteratively developing FCMs from text. In this paper, we detail the design and implementation of the first tool that allows analysts to develop FCMs from text collections, using interactive visualizations. We make three contributions: (i) we combine text mining and FCMs, (ii) we implement the first visual analytics environment built on FCMs, and (iii) we promote a strong feedback loop between interactive data exploration and model building. We provide two case studies exemplifying how to create a model from the ground-up or improve an existing one. Limitations include the increase in display complexity when working with large collection of files, and the reliance on KL-divergence for ad-hoc retrieval. Several improvements are discussed to further support analysts in creating high-quality models through interactive visualizations.

[1]  R. A. Leibler,et al.  On Information and Sufficiency , 1951 .

[2]  ChengXiang Zhai,et al.  Statistical Language Models for Information Retrieval: A Critical Review , 2008, Found. Trends Inf. Retr..

[3]  Hausi A. Müller,et al.  Customizing a Fisheye View Algorithm to Preserve the Mental Map , 1999, J. Vis. Lang. Comput..

[4]  Martin Wattenberg,et al.  The Word Tree, an Interactive Visual Concordance , 2008, IEEE Transactions on Visualization and Computer Graphics.

[5]  Elizabeth R. Jessup,et al.  Matrices, Vector Spaces, and Information Retrieval , 1999, SIAM Rev..

[6]  Steven A. Gray,et al.  Using mental-modelling to explore how irrigators in the Murray–Darling Basin make water-use decisions , 2016 .

[7]  Richard May,et al.  A virtual environment for multimedia intelligence data analysis , 1996, IEEE Computer Graphics and Applications.

[8]  Philippe J. Giabbanelli,et al.  Should we simulate mental models to assess whether they agree? , 2018, SpringSim.

[9]  Vijay Kumar Mago,et al.  Analyzing the impact of social factors on homelessness: a Fuzzy Cognitive Map approach , 2013, BMC Medical Informatics and Decision Making.

[10]  Daniel Gonçalves,et al.  EduVis: visualizing educational information , 2014, NordiCHI.

[11]  Donglan Zhang,et al.  An Agent-Based Model of Healthy Eating with Applications to Hypertension , 2018 .

[12]  Philippe J. Giabbanelli,et al.  Supporting a systems approach to healthy weight interventions in British Columbia by modeling weight and well-being , 2016, SpringSim.

[13]  Wessel Kraaij,et al.  Language Models for Topic Tracking , 2003 .

[14]  P. Sacher,et al.  The MEND Programme: tackling childhood obesity , 2007 .

[15]  Barnabás Bede,et al.  Mathematics of Fuzzy Sets and Fuzzy Logic , 2012, Studies in Fuzziness and Soft Computing.

[16]  Luis F. Luna-Reyes,et al.  Collecting and analyzing qualitative data for system dynamics: methods and models , 2003 .

[17]  M. Sheelagh T. Carpendale,et al.  DocuBurst: Visualizing Document Content using Language Structure , 2009, Comput. Graph. Forum.

[18]  Voula C. Georgopoulos,et al.  Fuzzy cognitive map architectures for medical decision support systems , 2008, Appl. Soft Comput..

[19]  Thomas Torsney-Weir,et al.  A fuzzy cognitive map of the psychosocial determinants of obesity , 2012, Appl. Soft Comput..

[20]  R. Axelrod The Dissemination of Culture , 1997 .

[21]  John T. Stasko,et al.  Visualizing Social Media Content with SentenTree , 2017, IEEE Transactions on Visualization and Computer Graphics.

[22]  Padhraic Smyth,et al.  TopicNets: Visual Analysis of Large Text Corpora with Topic Modeling , 2012, TIST.

[23]  Bart Kosko,et al.  Fuzzy Cognitive Maps , 1986, Int. J. Man Mach. Stud..

[24]  Philippe J. Giabbanelli,et al.  Modelling the spatial and social dynamics of insurgency , 2014, Security Informatics.

[25]  Andy Cockburn,et al.  Evaluating the effectiveness of spatial memory in 2D and 3D physical and virtual environments , 2002, CHI.

[26]  John D. Lafferty,et al.  Document Language Models, Query Models, and Risk Minimization for Information Retrieval , 2001, SIGIR Forum.

[27]  Philippe J. Giabbanelli,et al.  Combining fuzzy cognitive maps with agent-based modeling: Frameworks and pitfalls of a powerful hybrid modeling approach to understand human-environment interactions , 2017, Environ. Model. Softw..

[28]  Athanasios K. Tsadiras,et al.  Comparing the inference capabilities of binary, trivalent and sigmoid fuzzy cognitive maps , 2008, Inf. Sci..

[29]  Kasper Kok,et al.  Fuzzy Cognitive Maps for futures studies—A methodological assessment of concepts and methods , 2014 .

[30]  James R. Eagan,et al.  SchemeLens: A Content-Aware Vector-Based Fisheye Technique for Navigating Large Systems Diagrams , 2016, IEEE Transactions on Visualization and Computer Graphics.

[31]  Philippe J. Giabbanelli,et al.  Using Visual Analytics to Support the Integration of Expert Knowledge in the Design of Medical Models and Simulations , 2015, ICCS.

[32]  Paul Vickers,et al.  A survey of two-dimensional graph layout techniques for information visualisation , 2013, Inf. Vis..

[33]  E. H. Mamdani,et al.  Application of Fuzzy Logic to Approximate Reasoning Using Linguistic Synthesis , 1976, IEEE Transactions on Computers.

[34]  Stephen E. Robertson,et al.  Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval , 1994, SIGIR '94.

[35]  ChengXiang Zhai,et al.  Statistical Language Models for Information Retrieval , 2008, NAACL.

[36]  Yongtae Park,et al.  Futuristic data-driven scenario building: Incorporating text mining and fuzzy association rule mining into fuzzy cognitive map , 2016, Expert Syst. Appl..

[37]  George J. Klir,et al.  Fuzzy sets and fuzzy logic - theory and applications , 1995 .

[38]  Panagiota Spyridonos,et al.  Brain tumor characterization using the soft computing technique of fuzzy cognitive maps , 2008, Appl. Soft Comput..

[39]  Philippe J. Giabbanelli,et al.  Feasibility and Framing of Interventions Based on Public Support: Leveraging Text Analytics for Policymakers , 2016, HCI.

[40]  Bruce Edmonds,et al.  Computational modelling for decision-making: where, why, what, who and how , 2018, Royal Society Open Science.

[41]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[42]  Philippe J. Giabbanelli,et al.  A Novel Visualization Environment to Support Modelers in Analyzing Data Generated by Cellular Automata , 2016, HCI.

[43]  Chi Fai Cheung,et al.  Mining knowledge from natural language texts using fuzzy associated concept mapping , 2008, Inf. Process. Manag..

[44]  Lyn Bartram,et al.  The continuous zoom: a constrained fisheye technique for viewing and navigating large information spaces , 1995, UIST '95.

[45]  Michael Glykas Fuzzy Cognitive Maps: Advances in Theory, Methodologies, Tools and Applications , 2010 .

[46]  Jose L. Salmeron,et al.  A Review of Fuzzy Cognitive Maps Research During the Last Decade , 2013, IEEE Transactions on Fuzzy Systems.

[47]  Rafael A. Calvo,et al.  Concept Maps as Cognitive Visualizations of Writing Assignments , 2011, J. Educ. Technol. Soc..

[48]  Diana Wright,et al.  Thinking in systems: a primer , 2012 .

[49]  Krzysztof Z. Gajos,et al.  Evaluation of Artery Visualizations for Heart Disease Diagnosis , 2011, IEEE Transactions on Visualization and Computer Graphics.

[50]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[51]  Antonie J. Jetter,et al.  Building scenarios with Fuzzy Cognitive Maps: An exploratory study of solar energy , 2011 .

[52]  Philippe J. Giabbanelli,et al.  Exploring the Interactions Between Physical Well-Being, and Obesity , 2015 .

[53]  John D. Lafferty,et al.  A Study of Smoothing Methods for Language Models Applied to Ad Hoc Information Retrieval , 2017, SIGF.

[54]  John Stasko,et al.  Jigsaw: supporting investigative analysis through interactive visualization , 2008 .

[55]  David M. Blei,et al.  Supervised Topic Models , 2007, NIPS.

[56]  Z. S. Xu,et al.  An overview of operators for aggregating information , 2003, Int. J. Intell. Syst..

[57]  Jean-Daniel Fekete,et al.  MatrixExplorer: a Dual-Representation System to Explore Social Networks , 2006, IEEE Transactions on Visualization and Computer Graphics.

[58]  Chrysostomos D. Stylios,et al.  Modelling supervisory control systems using fuzzy cognitive maps , 2000 .

[59]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[60]  Frank S. C. Tseng,et al.  An integration of fuzzy association rules and WordNet for document clustering , 2010, Knowledge and Information Systems.

[61]  Yuen-Hsien Tseng,et al.  Mining concept maps from news stories for measuring civic scientific literacy in media , 2010, Comput. Educ..

[62]  C Scott Findlay,et al.  Integrating conventional science and aboriginal perspectives on diabetes using fuzzy cognitive maps. , 2007, Social science & medicine.

[63]  Philippe J. Giabbanelli,et al.  Modelling the Joint Effect of Social Determinants and Peers on Obesity Among Canadian Adults , 2014, Theories and Simulations of Complex Social Systems.

[64]  Rebecca Jordan,et al.  Using fuzzy cognitive mapping as a participatory approach to analyze change, preferred states, and perceived resilience of social-ecological systems , 2015 .

[65]  Ben Shneiderman,et al.  Why Not Make Interfaces Better than 3D Reality? , 2003, IEEE Computer Graphics and Applications.

[66]  Robert R. Hoffman,et al.  Applied Concept Mapping: Capturing, Analyzing, and Organizing Knowledge , 2011 .

[67]  Rik Crutzen,et al.  Creating groups with similar expected behavioural response in randomized controlled trials: a fuzzy cognitive map approach , 2014, BMC Medical Research Methodology.