Compus: visualization and analysis of structured documents for understanding social life in the 16th century

This article describes Compus, a visualization system that assists in the exploration and analysis of structured document corpora encoded in XML. Compus was developed for and applied to a corpus of 100 French manuscript letters from the 16th century, transcribed and encoded for scholarly analysis following the recommendations of the Text Encoding Initiative. By providing a synoptic visualization of a corpus and allowing for dynamic queries and structural transformations, Compus helps researchers find regularities or discrepancies, leading to a higher-level analysis of historical sources. Compus can also be used with other richly encoded text corpora.
