Long term file migration: development and evaluation of algorithms

The steady increase in the power and complexity of modern computer systems has encouraged the implementation of automatic file migration systems, which move files dynamically between mass storage devices and disk in response to user reference patterns. Using data describing 13 months of user references to disk data sets (files), we develop and evaluate replacement algorithms for selecting the files to be moved from disk to mass storage. Our approach illustrates a general methodology for this type of problem. We find that algorithms based on both the file size and the time since the file was last used work well. The best realizable algorithms tested condition on the empirical distribution of the times between file references. Acceptable results are also obtained by selecting for replacement the file for which the product of size and time since the most recent reference is maximal. Comparisons are made with a number of standard algorithms developed for paging, such as Working Set, VMIN, and GOPT. Sufficient information (parameter values, fitted equations) is provided so that our algorithms may be easily implemented on other systems.
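The size-times-time criterion mentioned above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the paper's implementation: the names `FileInfo`, `select_victim`, and `migrate_until_fits` are hypothetical, the units of size and time are arbitrary, and the loop that migrates files until the disk fits within a capacity is an assumed policy that the abstract does not specify.

```python
from dataclasses import dataclass


@dataclass
class FileInfo:
    # Hypothetical record; field names are assumptions for illustration.
    name: str
    size: int        # file size, in any consistent unit
    last_ref: float  # time of the most recent reference, in any consistent unit


def select_victim(files, now):
    """Return the file whose size * (time since last reference) is largest."""
    return max(files, key=lambda f: f.size * (now - f.last_ref))


def migrate_until_fits(files, now, capacity):
    """Repeatedly migrate the highest-scoring file until the resident
    files fit within `capacity` (an assumed policy). Returns the pair
    (resident, migrated)."""
    resident = list(files)
    migrated = []
    used = sum(f.size for f in resident)
    while used > capacity and resident:
        victim = select_victim(resident, now)
        resident.remove(victim)
        migrated.append(victim)
        used -= victim.size
    return resident, migrated
```

For example, with three files of sizes 100, 10, and 50 last referenced at times 9, 2, and 5, and the current time 10, the products are 100, 80, and 250, so the third file is selected first. The product favors moving files that are both large and long unused, which is consistent with the finding that size and time since last use are both informative.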
