Success run statistics defined on an urn model

Statistics denoting the numbers of success runs of length exactly equal and at least equal to a fixed length, as well as the sum of the lengths of success runs of length greater than or equal to a specific length, are considered. They are defined on both linearly and circularly ordered binary sequences, derived according to the Pólya-Eggenberger urn model. A waiting time associated with the sum of lengths statistic in linear sequences is also examined. Exact marginal and joint probability distribution functions are obtained in terms of binomial coefficients by a simple unified combinatorial approach. Mean values are also derived in closed form. Computationally tractable formulae for conditional distributions, given the number of successes in the sequence, useful in nonparametric tests of randomness, are provided. The distribution of the length of the longest success run and the reliability of certain consecutive systems are deduced using specific probabilities of the studied statistics. Numerical examples are given to illustrate the theoretical results.

[1]  A. Mood The Distribution Theory of Runs , 1940 .

[2]  Marco Muselli,et al.  Simple expressions for success run distributions in bernoulli trials , 1996 .

[3]  Andreas N. Philippou,et al.  Binomial Distributions of Order K on the Circle , 1994 .

[4]  Andreas N. Philippou Distributions and Fibonacci Polynomials of Order k, Longest Runs, and Reliability of Consecutive-k-out-of-n : F Systems , 1986 .

[5]  Eugene F. Schuster,et al.  Distribution theory of runs via exchangeable random variables , 1991 .

[6]  Frederick Mosteller,et al.  Note on an Application of Runs to Quality Control Charts , 1941 .

[7]  Małgorzata Roos,et al.  Runs and Scans With Applications , 2001 .

[8]  Charles M. Grinstead,et al.  Introduction to probability , 1999, Statistics for the Behavioural Sciences.

[9]  W. Y. Wendy Lou,et al.  Distribution Theory of Runs and Patterns and Its Applications: A Finite Markov Chain Imbedding Approach , 2003 .

[10]  William G. Cochran An extension of gold's method of examining the apparent persistence of one type of weather , 2007 .

[11]  T. F. Móri On the waiting time till each of some given patterns occurs as a run , 1990, Canadian Journal of Mathematics.

[12]  Z. Bai,et al.  The Exact and Limiting Distributions for the Number of Successes in Success Runs Within a Sequence of Markov-Dependent Two-State Trials , 2002 .

[13]  D. E. Barton,et al.  Runs in a ring , 1958 .

[14]  M. Koutras,et al.  WAITING TIMES ASSOCIATED WITH THE SUM OF SUCCESS RUN LENGTHS , 2003 .

[15]  J. Panaretos,et al.  On Some Distributions Arising from Certain Generalized Sampling Schemes , 1986 .

[16]  Kanwar Sen,et al.  Lengths of runs and waiting time distributions by using Pólya-Eggenberger sampling scheme , 2002 .

[17]  Markos V. Koutras,et al.  Non-parametric randomness tests based on success runs of fixed length , 1997 .

[18]  Norman L. Johnson,et al.  Urn models and their application , 1977 .

[19]  Michael R. Chernick,et al.  Runs and Scans With Applications , 2002, Technometrics.

[20]  Markos V. Koutras,et al.  Distribution Theory of Runs: A Markov Chain Approach , 1994 .

[21]  Andreas N. Philippou,et al.  Polya, Inverse Polya, and Circular Polya Distributions of Order k for l-Overlapping Success Runs , 2007 .

[22]  W. Y. Wendy Lou,et al.  The exact distribution of the k-tuple statistic for sequence homology , 2003 .

[23]  P. A. P. Moran,et al.  An introduction to probability theory , 1968 .

[24]  Anant P. Godbole On hypergeometric and related distributions of order k , 1990 .

[25]  J. Wolfowitz,et al.  On the Theory of Runs with some Applications to Quality Control , 1943 .

[26]  John Riordan,et al.  Introduction to Combinatorial Analysis , 1959 .

[27]  Charles M. Goldie,et al.  Distribution Theory of Runs and Patterns and Its Applications: A Finite Markov Chain Imbedding Approach , 2005 .

[28]  Demetrios L. Antzoulakos,et al.  On the distribution of the total number of run lengths , 2003 .

[29]  A new multivariate inverse polya distribution of order k , 1997 .

[30]  E. J. Burr,et al.  Longest run of consecutive observations having a specified attribute , 1961 .

[31]  Andreas N. Philippou,et al.  Shortest and longest length of success runs in binary sequences , 2007 .

[32]  L. Goldstein,et al.  Poisson approximation and dna sequence matching , 1990 .

[33]  William Feller,et al.  An Introduction to Probability Theory and Its Applications , 1951 .

[34]  Andreas N. Philippou,et al.  A generalized geometric distribution and some of its properties , 1983 .

[35]  Charalambos A. Charalambides,et al.  Enumerative combinatorics , 2018, SIGA.

[36]  Serkan Eryilmaz,et al.  Success runs in a sequence of exchangeable binary trials , 2007 .

[37]  D. Bowman,et al.  A full likelihood procedure for analysing exchangeable binary data. , 1995, Biometrics.

[38]  Markos V. Koutras,et al.  Runs on a circle , 1995, Journal of Applied Probability.

[39]  Steven J. Schwager,et al.  Run Probabilities in Sequences of Markov-Dependent Trials , 1983 .

[40]  L. Dworsky An Introduction to Probability , 2008 .