Index Coding: A Compression Technique for Large Statistical Databases

Index encoding is a compression technique that substitutes numeric codes for data values. Current methods of index encoding are suited only to attributes whose underlying domains are small or static. In this paper, general methods for encoding dynamic domains are proposed and analyzed, and a practical methodology for their application is presented. We also compare and contrast our methods with another that is now being used in a commercial file management system.
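
As a minimal illustration of the basic idea only (not of the methods proposed in the paper), the sketch below replaces each distinct value of an attribute with a small integer code and assigns a new code whenever an unseen value arrives. The class name `IndexEncoder`, its methods, and the sample column are hypothetical, chosen for this example; the append-only code assignment is an assumption and is only one simple way a growing domain might be handled.

```python
class IndexEncoder:
    """Hypothetical helper: maps each distinct value to an integer code."""

    def __init__(self):
        self.code_of = {}    # value -> code
        self.value_of = []   # code -> value (position in the list is the code)

    def encode(self, value):
        # Reuse the existing code, or assign the next unused integer
        # when the value has not been seen before (domain grows).
        code = self.code_of.get(value)
        if code is None:
            code = len(self.value_of)
            self.code_of[value] = code
            self.value_of.append(value)
        return code

    def decode(self, code):
        # Recover the original value from its code.
        return self.value_of[code]


encoder = IndexEncoder()
column = ["California", "Texas", "California", "Ohio", "Texas"]
codes = [encoder.encode(v) for v in column]      # stored form of the attribute
decoded = [encoder.decode(c) for c in codes]     # round-trips to the original
```

Here the stored column holds only small integers, while each distinct string is kept once in the code table. Any saving depends on how often values repeat and on how compactly the codes are stored.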