Asymptotic Recurrence and Waiting Times for Stationary Processes

AbstractLet $${\text{X = }}\left\{ {X_n ;n \in \mathbb{Z}} \right\}$$ be a discrete-valued stationary ergodic process distributed according to P and let x=(..., x−1, x0, x1,...) denote a realization from X. We investigate the asymptotic behavior of the recurrence time Rn defined as the first time that the initial n-block $$x_1^n = (x_1 ,x_2 , \ldots ,x_n )$$ reappears in the past of x. We identify an associated random walk, $$ - \log P(X_1^n )$$ on the same probability space as X, and we prove a strong approximation theorem between log Rn and $$ - \log P(X_1^n )$$ . From this we deduce an almost sure invariance principle for log Rn. As a byproduct of our analysis we get unified proofs for several recent results that were previously established using methods from ergodic theory, the theory of Poisson approximation and the analysis of random trees. Similar results are proved for the waiting time Wn defined as the first time until the initial n-block from one realization first appears in an independent realization generated by the same (or by a different) process.

[1]  W. Philipp,et al.  Almost sure invariance principles for partial sums of weakly dependent random variables , 1975 .

[2]  B. Pittel Asymptotical Growth of a Class of Random Trees , 1985 .

[3]  Benjamin Weiss,et al.  Entropy and data compression schemes , 1993, IEEE Trans. Inf. Theory.

[4]  L. Breiman The Individual Ergodic Theorem of Information Theory , 1957 .

[5]  Leonidas J. Guibas,et al.  Periods in Strings , 1981, J. Comb. Theory, Ser. A.

[6]  Paul C. Shields,et al.  Waiting times: Positive and negative results on the Wyner-Ziv problem , 1993 .

[7]  D. Aldous,et al.  A diffusion limit for a class of randomly-growing binary trees , 1988 .

[8]  A. Barron THE STRONG ERGODIC THEOREM FOR DENSITIES: GENERALIZED SHANNON-MCMILLAN-BREIMAN THEOREM' , 1985 .

[9]  Aaron D. Wyner,et al.  Some asymptotic properties of the entropy of a stationary ergodic data source with applications to data compression , 1989, IEEE Trans. Inf. Theory.

[10]  Philippe Jacquet,et al.  Autocorrelation on Words and Its Applications - Analysis of Suffix Trees by String-Ruler Approach , 1994, J. Comb. Theory A.

[11]  A. Nobel,et al.  A recurrence theorem for dependent processes with applications to data compression , 1992, IEEE Trans. Inf. Theory.

[12]  M. Kac On the notion of recurrence in discrete stochastic processes , 1947 .

[13]  P. Pevzner,et al.  Linguistics of nucleotide sequences. I: The significance of deviations from mean statistical characteristics and prediction of the frequencies of occurrence of words. , 1989, Journal of biomolecular structure & dynamics.

[14]  Frans M. J. Willems,et al.  Universal data compression and repetition times , 1989, IEEE Trans. Inf. Theory.

[15]  I. Ibragimov,et al.  Some Limit Theorems for Stationary Processes , 1962 .

[16]  T. Cover,et al.  A sandwich proof of the Shannon-McMillan-Breiman theorem , 1988 .

[17]  Paul C. Shields,et al.  Almost-sure waiting time results for weak and very weak Bernoulli processes , 1995, Ergodic Theory and Dynamical Systems.

[18]  Wojciech Szpankowski,et al.  Asymptotic properties of data compression and suffix trees , 1993, IEEE Trans. Inf. Theory.