Journey to the past: proposal of a framework for past web browser

While the Internet community recognized early on the need to store and preserve past content of the Web for future use, the tools developed so far for retrieving information from Web archives are still difficult to use and far less efficient than those developed for the "live Web." We expect that future information retrieval systems will utilize both the "live" and "past Web" and have thus developed a general framework for a past Web browser. A browser built using this framework would be a client-side system that downloads, in real time, past page versions from Web archives for their customized presentation. It would use passive browsing, change detection and change animation to provide a smooth and satisfactory browsing experience. We propose a meta-archive approach for increasing the coverage of past Web pages and for providing a unified interface to the past Web. Finally, we introduce query-based and localized approaches for filtered browsing that enhance and speed up browsing and information retrieval from Web archives.

[1]  Sharma Chakravarthy,et al.  WebVigil: An approach to Just-In-Time Information Propagation In Large Network-Centric Environments , 2002, WebDyn@WWW.

[2]  F. Grandi An Annotated Bibliography on Temporal and Evolution Aspects in the World Wide Web , 2003 .

[3]  Fred Douglis,et al.  The AT&T Internet Difference Engine: Tracking and viewing changes on the web , 1998, World Wide Web.

[4]  Brian D. Davison A Web Caching Primer , 2001, IEEE Internet Comput..

[5]  Hector Garcia-Molina,et al.  The Evolution of the Web and Implications for an Incremental Crawler , 2000, VLDB.

[6]  Lauren Wood 技術解説 IEEE Internet Computing , 1999 .

[7]  George Cybenko,et al.  How dynamic is the Web? , 2000, Comput. Networks.

[8]  Marc Najork,et al.  A large‐scale study of the evolution of Web pages , 2003, WWW '03.

[9]  Masaru Kitsuregawa,et al.  Extracting evolution of web communities from a series of web archives , 2003, HYPERTEXT '03.

[10]  Saul Greenberg,et al.  How people revisit web pages: empirical findings and implications for the design of history systems , 1997, Int. J. Hum. Comput. Stud..

[11]  Allan Arvidson,et al.  The Kulturarw3 Project--The Royal Swedish Web Archiw3e--An Example of "Complete" Collection of Web Pages. , 2000 .

[12]  Jock D. Mackinlay,et al.  Visualizing the evolution of Web ecologies , 1998, CHI.

[13]  Curtis E. Dyreson,et al.  Towards a temporal World-Wide Web: a transaction-time server , 2001, Proceedings 12th Australasian Database Conference. ADC 2001.

[14]  Brian D. Davison Predicting web actions from HTML content , 2002, HYPERTEXT '02.

[15]  Andy Cockburn,et al.  What do web users do? An empirical analysis of web use , 2001, Int. J. Hum. Comput. Stud..

[16]  Natalie S. Glance,et al.  ChangeDetector™: a site-level monitoring tool for the WWW , 2002, WWW '02.

[17]  Masaru Kitsuregawa,et al.  A system for visualizing and analyzing the evolution of the web with a time series of graphs , 2005, HYPERTEXT '05.

[18]  Curtis E. Dyreson,et al.  Managing versions of web documents in a transaction-time web server , 2004, WWW '04.