On Close and Distant Reading in Digital Humanities: A Survey and Future Challenges

We present an overview of the last ten years of research on visualizations that support close and distant reading of textual data in the digital humanities. We look at various works published within both the visualization and digital humanities communities. We provide a taxonomy of applied methods for close and distant reading, and illustrate approaches that combine both reading techniques to provide a multifaceted view of the data. Furthermore, we list toolkits and potentially beneficial visualization approaches for research in the digital humanities. Finally, we summarize collaboration experiences when developing visualizations for close and distant reading, and give an outlook on future challenges in that research area.

[1]  Alison Booth,et al.  Documentary Social Networks: Collective Biographies of Women , 2013, DH.

[2]  Steven Skiena,et al.  Spatial Analysis of News Sources , 2006, IEEE Transactions on Visualization and Computer Graphics.

[3]  Chris Weaver,et al.  Multidimensional visual analysis using cross-filtered views , 2008, 2008 IEEE Symposium on Visual Analytics Science and Technology.

[4]  David Beavan DiaView: Visualise Cultural Change in Diachronic Corpora , 2012, DH.

[5]  Martin Wattenberg,et al.  Mapping Text with Phrase Nets , 2009, IEEE Transactions on Visualization and Computer Graphics.

[6]  Diansheng Guo,et al.  Flow Mapping and Multivariate Visualization of Large Spatial Interaction Data , 2009, IEEE Transactions on Visualization and Computer Graphics.

[7]  D. Beavan,et al.  Glimpses through the clouds: collocates in a new light , 2008 .

[8]  Daniel A. Keim,et al.  Literature Fingerprinting: A New Method for Visual Literary Analysis , 2007, 2007 IEEE Symposium on Visual Analytics Science and Technology.

[9]  Edward Finn The Social Lives of Books: Mapping the Ideational Networks of Toni Morrison , 2010, DH.

[10]  Robert S. Laramee,et al.  ShakerVis: Visual analysis of segment variation of German translations of Shakespeare’s Othello , 2015, Inf. Vis..

[11]  Stefan Jänicke,et al.  Visualizing Uncertainty: How to Use the Fuzzy Data of 550 Medieval Texts? , 2013, DH.

[12]  W. Bradford Paley,et al.  TextArc: Showing Word Frequency and Distribution in Text , 2002 .

[13]  M. Sheelagh T. Carpendale,et al.  SparkClouds: Visualizing Trends in Tag Clouds , 2010, IEEE Transactions on Visualization and Computer Graphics.

[14]  Ernesto Peña,et al.  On Metaphor in Text Visualization Prototypes , 2014, DH.

[15]  Miriah D. Meyer,et al.  Empowering Play, Experimenting with Poems: Disciplinary Values and Visualization Development , 2014, DH.

[16]  Henning Lobin,et al.  Uncertain about Uncertainty: Different ways of processing fuzziness in digital humanities data , 2014, DH.

[17]  Susan Hockey,et al.  The History of Humanities Computing , 2007 .

[18]  Michael Gleicher,et al.  Sequence Surveyor: Leveraging Overview for Scalable Genomic Alignment Visualization , 2011, IEEE Transactions on Visualization and Computer Graphics.

[19]  Richard Furuta,et al.  Ambiances: A Framework to Write and Visualize Poetry , 2013, DH.

[20]  Xin Tong,et al.  TextFlow: Towards Better Understanding of Evolving Topics in Text , 2011, IEEE Transactions on Visualization and Computer Graphics.

[21]  M. Sheelagh T. Carpendale,et al.  VisGets: Coordinated Visualizations for Web-based Information Exploration and Discovery , 2008, IEEE Transactions on Visualization and Computer Graphics.

[22]  Ofer Arazy,et al.  Mapping the Information Science Domain , 2012, DH.

[23]  Martin Wattenberg,et al.  ManyEyes: a Site for Visualization at Internet Scale , 2007, IEEE Transactions on Visualization and Computer Graphics.

[24]  Jonathan Goodwin,et al.  Reading graphs, maps, trees : responses to Franco Moretti , 2011 .

[25]  Fotis Jannidis,et al.  Validating Computational Stylistics in Literary Interpretation , 2014, DH.

[26]  Martin Wattenberg,et al.  The Word Tree, an Interactive Visual Concordance , 2008, IEEE Transactions on Visualization and Computer Graphics.

[27]  Thomas Ertl,et al.  Two-stage framework for a topology-based projection and visualization of classified document collections , 2010, 2010 IEEE Symposium on Visual Analytics Science and Technology.

[28]  Mengchen Liu,et al.  StoryFlow: Tracking the Evolution of Stories , 2013, IEEE Transactions on Visualization and Computer Graphics.

[29]  Mark Gahegan,et al.  Visual Semiotics & Uncertainty Visualization: An Empirical Study , 2012, IEEE Transactions on Visualization and Computer Graphics.

[30]  Taro Tezuka,et al.  Visualization of relationships among historical persons from Japanese historical documents , 2013, Lit. Linguistic Comput..

[31]  Tamara Munzner,et al.  A Nested Model for Visualization Design and Validation , 2009, IEEE Transactions on Visualization and Computer Graphics.

[32]  Ian N. Gregory,et al.  Visual GISting: bringing together corpus linguistics and Geographical Information Systems , 2011, Lit. Linguistic Comput..

[33]  Luis Meneses Exploring the biography and artworks of Picasso with interactive calendars and timelines , 2009 .

[34]  Shin Ohno,et al.  A Platform for Cultural Information Visualization Using Schematic Expressions of Cube , 2010, DH.

[35]  Jean-Daniel Fekete,et al.  Exploring the Placement and Design of Word-Scale Visualizations , 2014, IEEE Transactions on Visualization and Computer Graphics.

[36]  Anne E. Trefethen,et al.  Rule‐based Visual Mappings – with a Case Study on Poetry Visualization , 2013, Comput. Graph. Forum.

[37]  Helen Armstrong,et al.  Myopia: A Visualization Tool in Support of Close Reading , 2012, DH.

[38]  Jeremy Hawthorn,et al.  A glossary of contemporary literary theory , 1992 .

[39]  Danah Boyd,et al.  Vizster: visualizing online social networks , 2005, IEEE Symposium on Information Visualization, 2005. INFOVIS 2005..

[40]  Alan Galey Approaching the Coasts of Utopia: Visualization Strategies for Mapping Early Modern Paratexts , 2011, DH.

[41]  John A. Walsh,et al.  Computational Discovery and Visualization of the Underlying Semantic Structure of Complicated Historical and Literary Corpora , 2011, DH.

[42]  Michael Gleicher,et al.  Exploring Collections of Tagged Text for Literary Scholarship , 2011, Comput. Graph. Forum.

[43]  Jacob Eisenstein,et al.  Exploratory Thematic Analysis for Historical Newspaper Archives , 2014, DH.

[44]  Martin Wattenberg,et al.  Parallel Tag Clouds to explore and analyze faceted text corpora , 2009, 2009 IEEE Symposium on Visual Analytics Science and Technology.

[45]  Catherine Plaisant,et al.  The Story of One: Humanity scholarship with visualization and text analysis , 2008 .

[46]  David Beavan ComPair: Compare and Visualise the Usage of Language , 2011, DH.

[47]  Marco Büchler,et al.  Visualizations for Text Re-use , 2015, 2014 International Conference on Information Visualization Theory and Applications (IVAPP).

[48]  Nils Reiter,et al.  An NLP-based cross-document approach to narrative structure discovery , 2014, Lit. Linguistic Comput..

[49]  Alex Christie,et al.  Z-Axis Scholarship: Modeling How Modernists Write the City , 2014, DH.

[50]  Leif Isaksen,et al.  Mapping the World of an Ancient Greek Historian: The HESTIA Project , 2010, DH.

[51]  Tanya E. Clement,et al.  Interactive Exploration of Versions across Multiple Documents , 2008 .

[52]  Pierre Dragicevic,et al.  GeneaQuilts: A System for Exploring Large Genealogies , 2010, IEEE Transactions on Visualization and Computer Graphics.

[53]  Stéfan Sinclair,et al.  Visualizing Theatrical Text: From Watching the Script to the Simulated Environment for Theatre (SET) , 2013, Digit. Humanit. Q..

[54]  Gábor Mihály Tóth,et al.  The computer-assisted analysis of a medieval commonplace book and diary (MS Zibaldone Quaresimale by Giovanni Rucellai) , 2013, Lit. Linguistic Comput..

[55]  Maciej Eder,et al.  Stylometry, network analysis, and Latin literature , 2014, DH.

[56]  Walter J. Scheirer,et al.  Visualizing Sound as Functional N-Grams in Homeric Greek Poetry , 2011, DH.

[57]  Marcus Bingenheimer,et al.  Social network visualization from TEI data , 2011, Lit. Linguistic Comput..

[58]  Amir Zeldes,et al.  ANNIS3: A new architecture for generic corpus query and visualization , 2016, Digit. Scholarsh. Humanit..

[59]  J. M. Binder,et al.  Visibility and meaning in topic models and 18th-century subject indexes , 2014, Lit. Linguistic Comput..

[60]  Dana Wheeles,et al.  Juxta Commons , 2013, DH.

[61]  Marti A. Hearst,et al.  Supporting exploratory text analysis in literature study , 2013, Lit. Linguistic Comput..

[62]  Uta Hinrichs,et al.  Trading Consequences: A Case Study of Combining Text Mining and Visualization to Facilitate Document Exploration , 2015, Digit. Scholarsh. Humanit..

[63]  Jean-Daniel Fekete The InfoVis Toolkit , 2004 .

[64]  Tamara Munzner,et al.  MizBee: A Multiscale Synteny Browser , 2009, IEEE Transactions on Visualization and Computer Graphics.

[65]  Courtney Evans,et al.  Mapping Homer's Catalogue of Ships , 2013, Lit. Linguistic Comput..

[66]  Jason Dykes,et al.  Exploring Uncertainty in Geodemographics with Interactive Graphics , 2011, IEEE Transactions on Visualization and Computer Graphics.

[67]  Matthew L. Jockers Computing and Visualizing the 19th-Century Literary Genome , 2012, DH.

[68]  Paul Spence,et al.  Expressing complex associations in medieval historical documents: the Henry III Fine Rolls Project , 2008, Lit. Linguistic Comput..

[69]  Daniel A. Keim,et al.  CloudLines: Compact Display of Event Episodes in Multiple Time-Series , 2011, IEEE Transactions on Visualization and Computer Graphics.

[70]  Séamus Lawless,et al.  The Problem of Time and Space: The Difficulties in Visualising Spatiotemporal Change in Historical Data , 2014, DH.

[71]  Dana R. Solomon Theorizing Data Visualization: A Comparative Case-Study Approach , 2013, DH.

[72]  Jeffrey Heer,et al.  Protovis: A Graphical Toolkit for Visualization , 2009, IEEE Transactions on Visualization and Computer Graphics.

[73]  Worthy Martin,et al.  Digital Yoknapatawpha: Interpreting a Palimpsest of Place , 2014, DH.

[74]  Bethany Nowviskie,et al.  Geo-Temporal Interpretation of Archival Collections with Neatline , 2013, Lit. Linguistic Comput..

[75]  Jeremy Boggs,et al.  Crowdsourcing individual interpretations: Between microtasking and macrotasking , 2014, Lit. Linguistic Comput..

[76]  Loreen Powell,et al.  Interface design , 1983, The Bell System Technical Journal.

[77]  Peter Fankhauser,et al.  Combining Macro- and Microanalysis for Exploring the Construal of Scientific Disciplinarity , 2014, DH.

[78]  Lihua Chen,et al.  A glimpse of the change of worldview between 7th and 10th century China through two leishu , 2014, DH.

[79]  Ben Shneiderman,et al.  The eyes have it: a task by data type taxonomy for information visualizations , 1996, Proceedings 1996 IEEE Symposium on Visual Languages.

[80]  Jeffrey Heer,et al.  prefuse: a toolkit for interactive information visualization , 2005, CHI.

[81]  William Ribarsky,et al.  LeadLine: Interactive visual analysis of text data through event identification and exploration , 2012, 2012 IEEE Conference on Visual Analytics Science and Technology (VAST).

[82]  John T. Stasko,et al.  Dust & Magnet: Multivariate Information Visualization Using a Magnet Metaphor , 2005, Inf. Vis..

[83]  Aditi S. Muralidharan A Visual Interface for Exploring Language Use in Slave Narratives , 2011, DH.

[84]  Wendell Piez,et al.  Towards Hermeneutic Markup: An architectural outline , 2010, DH.

[85]  Jian Zhao,et al.  Facilitating Discourse Analysis with Interactive Visualization , 2012, IEEE Transactions on Visualization and Computer Graphics.

[86]  Kevin Ponto,et al.  Visualizing and Analyzing the Hollywood Screenplay with ScripThreads , 2014, Digit. Humanit. Q..

[87]  Karthikeyan Umapathy,et al.  Digging into Human Rights Violations: phrase mining and trigram visualization , 2013, DH.

[88]  Noah Peterson Visualization As a Bridge to Close Reading: The Audience in The Castle of Perseverance , 2014, DH.

[89]  Marco Büchler,et al.  Design Rules for Visualizing Text Variant Graphs , 2014 .

[90]  Andrew Kehoe,et al.  eMargin: A Collaborative Textual Annotation Tool , 2013 .

[91]  Franco Moretti Graphs, Maps, Trees: Abstract Models for a Literary History , 2005 .

[92]  Weijia Xu,et al.  Finding stories in the archive through paragraph alignment , 2010, Lit. Linguistic Comput..

[93]  Gerik Scheuermann,et al.  Comparative Visualization of Geospatial-temporal Data , 2018, GRAPP/IVAPP.

[94]  Christian Biemann,et al.  Networks of Names: Visual Exploration and Semi‐Automatic Tagging of Social Networks from Newspaper Articles , 2014, Comput. Graph. Forum.

[95]  Daniel A. Keim,et al.  Towards visualizing linguistic patterns of deliberation: a case study of the S21 arbitration , 2014, DH.

[96]  Ian N. Gregory,et al.  Digital approaches to understanding the geographies in literary and historical texts , 2014, DH.

[97]  Catherine Plaisant,et al.  What's being said near “Martha”? Exploring name entities in literary text collections , 2009, 2009 IEEE Symposium on Visual Analytics Science and Technology.

[98]  Martin Wattenberg,et al.  Participatory Visualization with Wordle , 2009, IEEE Transactions on Visualization and Computer Graphics.

[99]  Beom-mo Kang,et al.  Trends 21 Corpus: A Large Annotated Korean Newspaper Corpus for Linguistic and Cultural Studies , 2011, DH.

[100]  Allison Woodruff,et al.  Guidelines for using multiple views in information visualization , 2000, AVI '00.

[101]  Ulrik Brandes,et al.  Interactive Level-of-Detail Rendering of Large Graphs , 2012, IEEE Transactions on Visualization and Computer Graphics.

[102]  Weiwei Cui,et al.  How Hierarchical Topics Evolve in Large Text Corpora , 2014, IEEE Transactions on Visualization and Computer Graphics.

[103]  John G. Keating,et al.  A Digital Humanities Approach to Narrative Voice in The Secret Scripture: Proposing a New Research Method , 2014, Digit. Humanit. Q..

[104]  Ryan Cordell Taken Possession of: The Reprinting and Reauthorship of Hawthorne's "Celestial Railroad" in the Antebellum Religious Press , 2013, Digit. Humanit. Q..

[105]  Benno Stein,et al.  WORDGRAPH: Keyword-in-Context Visualization for NETSPEAK's Wildcard Search , 2012, IEEE Transactions on Visualization and Computer Graphics.

[106]  Hong Zhou,et al.  Visual Analysis of Set Relations in a Graph , 2013, Comput. Graph. Forum.

[107]  M. Sheelagh T. Carpendale,et al.  EMDialog: Bringing Information Visualization into the Museum , 2008, IEEE Transactions on Visualization and Computer Graphics.

[108]  Drayton C. Benner,et al.  'The Sounds of the Psalter: Computational Analysis of Soundplay' , 2014, Lit. Linguistic Comput..

[109]  Ben Shneiderman,et al.  Balancing Systematic and Flexible Exploration of Social Networks , 2006, IEEE Transactions on Visualization and Computer Graphics.

[110]  Florentina Armaselu The Layered Text. From Textual Zoom, Text Network Analysis and Text Summarisation to a Layered Interpretation of Meaning , 2014, DH.

[111]  Ben Shneiderman,et al.  Readings in information visualization - using vision to think , 1999 .

[112]  Mark Wolff Surveying a Corpus with Alignment Visualization and Topic Modeling , 2013, DH.

[113]  Tim Dwyer,et al.  Untangling Euler Diagrams , 2010, IEEE Transactions on Visualization and Computer Graphics.

[114]  Adam James Bradley Violence and the Digital Humanities Text as Pharmakon , 2012, DH.

[115]  Scott B. Weingart,et al.  Computational analysis of the body in European fairy tales , 2013, Lit. Linguistic Comput..

[116]  Tamara Munzner,et al.  Overview: The Design, Adoption, and Analysis of a Visual Document Mining Tool for Investigative Journalists , 2014, IEEE Transactions on Visualization and Computer Graphics.

[117]  Thomas Ertl,et al.  VarifocalReader — In-Depth Visual Analysis of Large Text Documents , 2014, IEEE Transactions on Visualization and Computer Graphics.

[118]  Mathieu Bastian,et al.  Gephi: An Open Source Software for Exploring and Manipulating Networks , 2009, ICWSM.

[119]  Bettina Speckmann,et al.  Flow Map Layout via Spiral Trees , 2011, IEEE Transactions on Visualization and Computer Graphics.

[120]  Daniel A. Keim,et al.  Fingerprint Matrices: Uncovering the dynamics of social networks in prose literature , 2013, Comput. Graph. Forum.

[121]  Gerik Scheuermann,et al.  GeoTemCo: Comparative Visualization of Geospatial-Temporal Data with Clutter Removal Based on Dynamic Delaunay Triangulations , 2012, VISIGRAPP.

[122]  M. Sheelagh T. Carpendale,et al.  DocuBurst: Visualizing Document Content using Language Structure , 2009, Comput. Graph. Forum.

[123]  Fred Gibbs,et al.  Building Better Digital Humanities Tools: Toward broader audiences and user-centered designs , 2012, Digit. Humanit. Q..

[124]  Lauren F. Klein Social Network Analysis and Visualization in 'The Papers of Thomas Jefferson' , 2012, DH.

[125]  Martin Wattenberg,et al.  Stacked Graphs – Geometry & Aesthetics , 2008, IEEE Transactions on Visualization and Computer Graphics.

[126]  Julie Gonnering Lein,et al.  Solitary Mind, Collaborative Mind: Close Reading and Interdisciplinary Research , 2013, DH.

[127]  Tanya E. Clement,et al.  Distant Listening to Gertrude Stein's 'Melanctha': Using Similarity Analysis in a Discovery Paradigm to Analyze Prosody and Author Influence , 2013, Lit. Linguistic Comput..

[128]  Thomas Eckart,et al.  Detection of Citations and Textual Reuse on Ancient Greek Texts and its Applications in the Classical Studies: eAQUA Project , 2010, DH.

[129]  Michael Gleicher,et al.  Serendip: Topic model-driven visual exploration of text corpora , 2014, 2014 IEEE Conference on Visual Analytics Science and Technology (VAST).