Format technology lifecycle analysis

The lifecycles of format technology have been a defining concern for digital stewardship research and practice. However, little evidence exists to provide robust methods for assessing the state of any given format technology and describing its evolution over time. This article introduces relevant models from diffusion theory and market research and presents a replicable analysis method to compute models of technology evolution. Data cleansing and the combination of multiple data sources enable the application of nonlinear regression to estimate the parameters of the Bass diffusion model on format technology market lifecycles. Through its application to a longitudinal data set from the UK Web Archive, we demonstrate that the method produces reliable results and show that the Bass model can be used to describe format lifecycles. By analyzing adoption patterns across market segments, new insights are inferred about how the diffusion of formats and products such as applications occurs over time. The analysis provides a stepping stone to a more robust and evidence‐based approach to model technology evolution.

[1]  Ccsds Secretariat,et al.  Reference Model for an Open Archival Information System (OAIS) , 1999 .

[2]  Brian Lavoie,et al.  Identifying Threats to Successful Digital Preservation: the SPOT Model for Risk Assessment , 2012, D Lib Mag..

[3]  David L. Giaretta,et al.  Managing Risks in the Preservation of Research Data with Preservation Networks , 2012, Int. J. Digit. Curation.

[4]  Eric W. Ford,et al.  Predicting the adoption of electronic health records by physicians: when will health care be paperless? , 2006, Journal of the American Medical Informatics Association : JAMIA.

[5]  David S. H. Rosenthal,et al.  Format obsolescence: assessing the threat and the defenses , 2010, Libr. Hi Tech.

[6]  Sébastien Peyrard,et al.  Describing and Preserving Digital Object Environments , 2013 .

[7]  D. North Competing Technologies , Increasing Returns , and Lock-In by Historical Events , 1994 .

[8]  Colin Webb,et al.  Defining File Format Obsolescence: A Risky Journey , 2008, Int. J. Digit. Curation.

[9]  Sergiu Gordea,et al.  Human and Machine-based File Format Endangerment Notification and Recommender Systems Development , 2015, iPRES.

[10]  José Carlos Ramalho,et al.  Preservation Watch: What to monitor and how , 2012, iPRES.

[11]  Kresimir Duretec,et al.  BenchmarkDP - Format technology lifecycle analysis based on the UK Web Archive Format Profile dataset , 2016 .

[12]  Christoph Becker,et al.  Content Profiling for Preservation: Improving Scale, Depth and Quality , 2014, ICADL.

[13]  C. Kemerer,et al.  Network Externalities in Microcomputer Software: An Econometric Analysis of the Spreadsheet Market , 2014 .

[14]  Vijay Mahajan,et al.  New Product Diffusion Models in Marketing: A Review and Directions for Research: , 1990 .

[15]  Frank M. Bass,et al.  A New Product Growth for Model Consumer Durables , 2004, Manag. Sci..

[16]  Bradley Turner,et al.  Predicting the Diffusion Pattern of Internet-Based Communication Applications Using Bass Model Parameter Estimates for Email , 2011 .

[17]  Michael L. Nelson,et al.  How much of the web is archived? , 2011, JCDL '11.

[18]  Steve Knight,et al.  Reimagining the Format Model: Introducing the Work of the NSLA Digital Preservation Technical Registry , 2014 .

[19]  Ian Milligan,et al.  Mining the ‘Internet Graveyard’: Rethinking the Historians’ Toolkit , 2012 .

[20]  R. Kalaba,et al.  Nonlinear Least Squares , 1986 .

[21]  Philip Calvert,et al.  Preserving digital information: Report of the task force on archiving of digital information , 2014 .

[22]  Robert J. Kauffman,et al.  Making Sense of Technology Trends in the Information Technology Landscape: A Design Science Approach , 2008, MIS Q..

[23]  Donald R. Lehmann,et al.  A Meta-Analysis of Applications of Diffusion Models , 1990 .

[24]  V. Mahajan,et al.  Innovation diffusion and new product growth models: A critical review and research directions , 2010 .

[25]  Heather M. Ryan Occam's Razor and File Format Endangerment Factors , 2014, iPRES.

[26]  Andrew N. Jackson Formats over Time: Exploring UK Web History , 2012, iPRES.

[27]  F. Bass A new product growth model for consumer durables , 1976 .

[28]  Herbert Van de Sompel,et al.  Profiling web archive coverage for top-level domain and content language , 2013, International Journal on Digital Libraries.

[29]  Kresimir Duretec,et al.  Scalable decision support for digital preservation , 2014, OCLC Syst. Serv..

[30]  W. Arthur,et al.  Increasing Returns and Path Dependence in the Economy , 1996 .

[31]  F. Schmidt Meta-Analysis , 2008 .

[32]  John R. Garrett,et al.  Avoiding Technological Quicksand : Finding a Viable Technical Foundation for Digital Preservation , 2009 .

[33]  Charlotte H. Mason,et al.  Technical Note---Nonlinear Least Squares Estimation of New Product Diffusion Models , 1986 .

[34]  John Garrett,et al.  Preserving Digital Information. Report of the Task Force on Archiving of Digital Information. , 1996 .

[35]  Christophe Van den Bulte,et al.  Technical Report: Want to know how diffusion speed varies across countries and products? Try using a Bass model , 2005 .

[36]  Jane Hunter,et al.  AONS—An obsolescence detection and notification service for Web archives and digital repositories , 2007, New Rev. Hypermedia Multim..

[37]  Kresimir Duretec,et al.  Scalable decision support for digital preservation: an assessment , 2015, OCLC Syst. Serv..

[38]  Henry M. Gladney,et al.  Preserving digital information , 2007 .

[39]  Jeff Rothenberg,et al.  Avoiding Technological Quicksand: Finding a Viable Technical Foundation for Digital Preservation. A Report to the Council on Library and Information Resources. , 1999 .

[40]  William H. Walters,et al.  Risk Management of Digital Information: A File Format Investigation. , 2000 .

[41]  R. Heeler,et al.  Problems in Predicting New Product Growth for Consumer Durables , 1980 .