A universal prediction lemma and applications to universal data compression and prediction

A universal prediction lemma is derived for the class of prediction algorithms that only make inferences about the conditional distribution of an unknown random process based on what has been observed in the training data. The lemma is then used to derive lower bounds on the efficiency of a number of universal prediction and data compression algorithms. These bounds are nonasymptotic in the sense that they express the effect of limited training data on the efficiency of universal prediction and universal data compression.