Encoding Databases Satisfying a Given Set of Dependencies

Consider a relation schema with a set of dependency constraints. A fundamental question is what is the minimum space where the possible instances of the schema can be "stored". We study the following model. Encode the instances by giving a function which maps the set of possible instances into the set of words of a given length over the binary alphabet in a decodable way. The problem is to find the minimum length needed. This minimum is called the information content of the database. We investigate several cases where the set of dependency constraints consist of relatively simple sets of functional or multivalued dependencies. We also consider the following natural extension. Is it possible to encode the instances such a way that small changes in the instance cause a small change in the code.

[1]  Béla Bollobás,et al.  Random Graphs , 1985 .

[2]  Walid G. Aref,et al.  Supporting views in data stream management systems , 2010, TODS.

[3]  Victor Vianu,et al.  Tractable query languages for complex object databases , 1991, ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems.

[4]  Henning Köhler Global Database Design based on Storage Space and Update Time Minimization , 2009, J. Univers. Comput. Sci..

[5]  Solmaz Kolahi,et al.  Dependency-preserving normalization of relational and XML data , 2007, J. Comput. Syst. Sci..

[6]  Serge Abiteboul,et al.  Foundations of Databases , 1994 .

[7]  Solmaz Kolahi,et al.  On redundancy vs dependency preservation in normalization: an information-theoretic study of 3NF , 2006, PODS '06.

[8]  Stéphane Grumbach,et al.  In Search of the Lost Schema , 1999, ICDT.

[9]  Solmaz Kolahi,et al.  An information-theoretic analysis of worst-case redundancy in database design , 2008, TODS.

[10]  Alan M. Frieze,et al.  Random graphs , 2006, SODA '06.

[11]  András A. Benczúr Information Measurement in Relational Databases , 1987, MFDBS.

[12]  Marcelo Arenas,et al.  An information-theoretic approach to normal forms for relational and XML data , 2003, PODS.

[13]  A. Benczúr The evolution of human communication and the information revolution - A mathematical perspective , 2003 .

[14]  Malcolm P. Atkinson,et al.  Issues Raised by Three Years of Developing PJama: An Orthogonally Persistent Platform for Java , 1999, ICDT.