Don't Search, Just Show Me What I Did: Visualizing Provenance of Documents and Applications

Because computer documents have evolved over time, this article first presents the results of a survey that redefines what a “document” is. The survey found that people consider a document to be something that holds information; is manipulated by people (not the system); and can be seen, heard, or touched. Second, based on the survey results, we present a novel, real-time visualization tool that shows the results of nonintrusive tracking of documents and applications over a 6-month period. The tool focuses on document provenance, that is, the history or genealogy of a document. It shows every document and application used, as well as what happened to those documents (e.g., whether they were moved, renamed, and/or deleted). Evaluations of the visualization tool are promising: it helped with refinding documents, finding behavior workflow patterns, gaining insight into general document usage, and performing forensic-type activities.
