How to Achieve Minimax Expected Kullback-Leibler Distance from an Unknown Finite Distribution

We consider a problem that is related to the "Universal Encoding Problem" from information theory. The basic goal is to find rules that map "partial information" about a distribution X over an m-letter alphabet into a guess X̂ for X such that the Kullback-Leibler divergence between X and X̂ is as small as possible. The cost associated with a rule is the maximal expected Kullback-Leibler divergence between X and X̂. First, we show that the cost associated with the well-known add-one rule equals ln(1 + (m-1)/(n+1)), thereby extending a result of Forster and Warmuth [3,2] to m ≥ 3. Second, we derive an absolute (as opposed to asymptotic) lower bound on the smallest possible cost. Technically, this is done by determining (almost exactly) the Bayes error of the add-one rule under a uniform prior (where only the asymptotics for n → ∞ was known before). Third, we point to tools from approximation theory and support the conjecture that there exists a rule whose cost asymptotically matches the theoretical barrier established by the lower bound.
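The following is a minimal numerical sketch, not code from the paper, illustrating the first claim under the assumption that the "partial information" is an i.i.d. sample of size n drawn from X. For small m and n it computes the exact expected Kullback-Leibler divergence of the add-one (Laplace) rule by enumerating all count vectors, and compares it with ln(1 + (m-1)/(n+1)); the helper names (add_one_estimate, expected_kl) and the choice of candidate distributions are illustrative, not notation from the paper.

```python
import math
from itertools import combinations_with_replacement

def add_one_estimate(counts, n, m):
    """Add-one (Laplace) rule: add 1 to every letter count, then normalize."""
    return [(c + 1) / (n + m) for c in counts]

def expected_kl(x, n):
    """Exact expected KL(X || X_hat) of the add-one rule under distribution x,
    averaging over all samples of size n drawn i.i.d. from x."""
    m = len(x)
    total = 0.0
    # Enumerate all multisets of n letters, i.e. all possible count vectors.
    for sample in combinations_with_replacement(range(m), n):
        counts = [0] * m
        for letter in sample:
            counts[letter] += 1
        # Multinomial probability of observing this count vector under x.
        prob = math.factorial(n)
        for i in range(m):
            prob *= x[i] ** counts[i] / math.factorial(counts[i])
        if prob == 0.0:
            continue
        x_hat = add_one_estimate(counts, n, m)
        kl = sum(p * math.log(p / q) for p, q in zip(x, x_hat) if p > 0)
        total += prob * kl
    return total

if __name__ == "__main__":
    m, n = 3, 4
    bound = math.log(1 + (m - 1) / (n + 1))
    candidates = [
        [1.0, 0.0, 0.0],        # point mass
        [0.5, 0.3, 0.2],
        [1 / 3, 1 / 3, 1 / 3],  # uniform
    ]
    for x in candidates:
        print(x, expected_kl(x, n), "<=", bound)
```

A point mass makes the bound tight in this sketch: the sample then consists of n copies of a single letter, the add-one estimate assigns that letter probability (n+1)/(n+m), and the resulting divergence is ln((n+m)/(n+1)) = ln(1 + (m-1)/(n+1)).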