The problem of file reorganization which we consider involves altering the placement of records on pages of a secondary storage device. In addition, we want this reorganization to be done in-place, i.e., using the file's original storage space for the newly reorganized file. The motivation for such a physical change is to improve the database system's performance. For example, by placing frequently and jointly accessed records on the same page or pages, we can try to minimize the number of page accesses made in answering a set of queries. The optimal assignment (or reassignment) of records to clusters is exactly what record clustering algorithms [1,2,4,9] attempt to do. However, record clustering algorithms usually do not solve the entire problem, i.e., they do not specify how to efficiently reorganize the file to reflect the clustering assignment which they determine. Our algorithm is a companion to general record clustering algorithms since it actually transforms the file. The problem of optimal file reorganization is NP-hard [3]. Consequently, our reorganization algorithm is based on heuristics for which we prove three important observations.
[1]
Toby J. Teorey,et al.
Design of Database Structures
,
1982
.
[2]
Aris M. Ouksel,et al.
Multidimensional B-trees for associative searching in database systems
,
1982,
Inf. Syst..
[3]
Edward Omiecinski.
Incremental File Reorganization Schemes
,
1985,
VLDB.
[4]
Clement T. Yu,et al.
Adaptive information system design: one query at a time
,
1985,
SIGMOD '85.
[5]
Matti Jakobsson,et al.
Reducing block accesses in inverted files by partial clustering
,
1980,
Inf. Syst..
[6]
Clement T. Yu,et al.
Adaptive record clustering
,
1985,
TODS.
[7]
Peter Scheuermann,et al.
A Global Approach to Record Clustering and File Reorganization
,
1984,
SIGIR.
[8]
Gary H. Sockut,et al.
Database Reorganization—Principles and Practice
,
1979,
CSUR.
[9]
S. Bing Yao,et al.
Multi-dimensional clustering for data base organizations
,
1977,
Inf. Syst..