Semantic Network Analysis as a Method for Visual Text Analytics

Abstract This paper proposes an approach on a method for visual text analytics to support knowledge building, analytical reasoning and explorative analysis. For this purpose we use semantic network models that are automatically retrieved from unstructured text data using a parametric k -next-neighborhood model. Semantic networks are analyzed with methods of network analysis to gain quantitative and qualitative insights. Quantitative metrics can support the qualitative analysis and exploration of semantic structures. We discuss theoretical presuppositions regarding the text modeling with semantic networks to provide a basis for subsequent semantic network analysis. By presenting a systematic overview of basic network elements and their qualitative meaning for semantic network analysis, we describe exploration strategies that can support analysts to make sense of a given network. As a proof of concept, we illustrate the proposed method by an exemplary analysis of a wikipedia article using a visual text analytics system that leverages semantic network visualization for exploration and analysis.

[1]  George Kingsley Zipf,et al.  Human behavior and the principle of least effort , 1949 .

[2]  Partha Dasgupta,et al.  Topology of the conceptual network of language. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[3]  Reinhard Köhler,et al.  Patterns in syntactic dependency networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[4]  Ramon Ferrer i Cancho,et al.  The small world of human language , 2001, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[5]  Ulrik Brandes,et al.  Network Analysis: Methodological Foundations (Lecture Notes in Computer Science) , 2005 .

[6]  Ulrik Brandes,et al.  Visual Unrolling of Network Evolution and the Analysis of Dynamic Discourse† , 2003, Inf. Vis..

[7]  Sherry Koshman,et al.  Information Visualization: Human-Centered Issues and Perspectives , 2009, J. Assoc. Inf. Sci. Technol..

[8]  John R. Anderson Cognitive Psychology and Its Implications , 1980 .

[9]  F. Guattari,et al.  A Thousand Plateaus: Capitalism and Schizophrenia , 1980 .

[10]  Ronen Feldman,et al.  Book Reviews: The Text Mining Handbook: Advanced Approaches to Analyzing Unstructured Data by Ronen Feldman and James Sanger , 2008, CL.

[11]  Arjan Kuijper,et al.  Visual Analysis of Large Graphs , 2010, Eurographics.

[12]  Martin Wattenberg,et al.  Arc diagrams: visualizing structure in strings , 2002, IEEE Symposium on Information Visualization, 2002. INFOVIS 2002..

[13]  James J. Thomas,et al.  Visualizing the non-visual: spatial analysis and interaction with information from text documents , 1995, Proceedings of Visualization 1995 Conference.

[14]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[15]  Miriam R. L. Petruck FRAME SEMANTICS , 1996 .

[16]  Catherine Plaisant,et al.  TreePlus: Interactive Exploration of Networks with Enhanced Tree Layouts , 2006, IEEE Transactions on Visualization and Computer Graphics.

[17]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[18]  D. Busse Frame-Semantik : ein Kompendium , 2012 .

[19]  I. Jolliffe Principal Component Analysis , 2002 .

[20]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[21]  Anne Kao,et al.  Text Visualization for Visual Text Analytics , 2008, Visual Data Mining.

[22]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[23]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[24]  35th Annual Conference of the European Association for Computer Graphics, Eurographics 2014 - State of the Art Reports, Strasbourg, France, April 7-11, 2014 , 2014, Eurographics.

[25]  Michael W. Berry,et al.  Text mining : applications and theory , 2010 .

[26]  Stuart C. Shapiro SNePS: a logic for natural language understanding and commonsense reasoning , 2000 .

[27]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[28]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[29]  Daniel A. Keim,et al.  Visual Analytics: Definition, Process, and Challenges , 2008, Information Visualization.

[30]  G. J. Rodgers,et al.  Network properties of written human language. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[31]  Marvin Minsky,et al.  Semantic Information Processing , 1968 .

[32]  W. Bradford Paley,et al.  TextArc: Showing Word Frequency and Distribution in Text , 2002 .

[33]  Martin Wattenberg,et al.  The Word Tree, an Interactive Visual Concordance , 2008, IEEE Transactions on Visualization and Computer Graphics.

[34]  John F. Sowa,et al.  A Dynamic Theory of Ontology , 2006, FOIS.

[35]  Umberto Eco,et al.  Semiotics and the philosophy of language , 1985, Advances in semiotics.

[36]  Ferdinand de Saussure Course in General Linguistics , 1916 .

[37]  Katrin Erk,et al.  The SALSA Corpus: a German Corpus Resource for Lexical Semantics , 2006, LREC.

[38]  Philipp Drieger,et al.  Visual Text Analytics using Semantic Networks and Interactive 3D Visualization , 2012, EuroVA@EuroVis.

[39]  S. R. Hiltz,et al.  The International Network for Social Network Analysis , 1984 .

[40]  Samuel Kaski,et al.  Self organization of a massive document collection , 2000, IEEE Trans. Neural Networks Learn. Syst..

[41]  Daniel A. Keim,et al.  Visual Analytics: Scope and Challenges , 2008, Visual Data Mining.

[42]  M. Sheelagh T. Carpendale,et al.  DocuBurst: Visualizing Document Content using Language Structure , 2009, Comput. Graph. Forum.

[43]  Peter Mika Ontologies Are Us: A Unified Model of Social Networks and Semantics , 2005, International Semantic Web Conference.

[44]  Reinhard Diestel,et al.  Graph Theory , 1997 .

[45]  Martin Wattenberg,et al.  Mapping Text with Phrase Nets , 2009, IEEE Transactions on Visualization and Computer Graphics.

[46]  Melvin J. Voigt,et al.  Progress in Communication Sciences , 1982 .

[47]  Vladimir Batagelj,et al.  Network analysis of texts , 2002 .

[48]  Mark E. J. Newman,et al.  Structure and Dynamics of Networks , 2009 .

[49]  Albert-László Barabási,et al.  Hierarchical organization in complex networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[50]  Ulrik Brandes,et al.  Network Analysis: Methodological Foundations , 2010 .