The life and times of files and information: a study of desktop provenance

In the field of Human-Computer Interaction, provenance refers to the history and genealogy of a document or file. Provenance helps us understand the evolution and relationships of files: how and when different versions of a document were created, or how different documents in a collection build on each other through copy-paste events. Although methods for tracking provenance and for using the resulting metadata have been proposed and developed into tools, no studies have documented the types and frequency of provenance events in typical computer use. This knowledge is essential for designing efficient query methods and information displays. We conducted a longitudinal study of knowledge workers at Intel Corporation, tracking provenance events in their computer use. We also interviewed knowledge workers to determine how effective provenance cues are for document recall. Our data show that provenance relationships are common and that provenance cues aid recall.
