Many large text collections exhibit graph structures, either inherent to the content itself or encoded in the metadata of the individual documents. Example graphs extracted from document collections are co-author networks, citation networks, or named-entity-cooccurrence networks. Furthermore, social networks can be extracted from email corpora, tweets, or social media. When it comes to visualising these large corpora, traditionally either the textual content or the network graph are used. We propose to incorporate both, text and graph, to not only visualise the semantic information encoded in the documents' content but also the relationships expressed by the inherent network structure in a two-dimensional landscape. We illustrate the effectiveness of our approach with an exploration interface for different real world datasets.
[1]
Mark Coddington.
Clarifying Journalism’s Quantitative Turn
,
2015
.
[2]
Arputharaj Kannan,et al.
An Intelligent System for Semantic Information Retrieval Information from Textual Web Documents
,
2008,
IWCF.
[3]
Marie-Anne Chabin.
PANAMA PAPERS: A CASE STUDY FOR RECORDS MANAGEMENT? :: Brapci 2.0
,
2018
.
[4]
Bret Jackson,et al.
Cartograph: Unlocking Spatial Visualization Through Semantic Enhancement
,
2017,
IUI.
[5]
Sargur N. Srihari,et al.
Computational Forensics: Towards Hybrid-Intelligent Crime Investigation
,
2007,
Third International Symposium on Information Assurance and Security.