Algorithms for operations on probability distributions in a computer algebra system

In mathematics and statistics, the desire to eliminate mathematical tedium and facilitate exploration has lead to computer algebra systems. These computer algebra systems allow students and researchers to perform more of their work at a conceptual level. The design of generic algorithms for tedious computations allows modelers to push current modeling boundaries outward more quickly. Probability theory, with its many theorems and symbolic manipulations of random variables is a discipline in which automation of certain processes is highly practical, functional, and efficient. There are many existing statistical software packages, such as SPSS, SAS, and S-Plus, that have numeric tools for statistical applications. There is a potential for a probability package analogous to these statistical packages for manipulation of random variables. The software package being developed as part of this dissertation, referred to as “A Probability Programming Language” (APPL) is a random variable manipulator and is proposed to fill a technology gap that exists in probability theory. My research involves developing algorithms for the manipulation of discrete random variables. By defining data structures for random variables and writing algorithms for implementing common operations, more interesting and mathematically intractable probability problems can be solved, including those not attempted in undergraduate statistics courses because they were deemed too mechanically arduous. Algorithms for calculating the probability density function of order statistics, transformations, convolutions, products, and minimums/maximums of independent discrete random variables are included in this dissertation.

[1]  E. Reingold,et al.  Combinatorial Algorithms: Theory and Practice , 1977 .

[2]  Herbert S. Winokur,et al.  Exact Moments of the Order Statistics of the Geometric Distribution and Their Relation to Inverse Sampling and Reliability of Redundant Systems , 1967 .

[3]  G. Casella,et al.  Statistical Inference , 2003, Encyclopedia of Social Network Analysis and Mining.

[4]  D. Barr,et al.  Mean and Variance of Truncated Normal Distributions , 1999 .

[5]  Andrew G. Glen,et al.  Computing the cumulative distribution function of the Kolmogorov-Smirnov statistic , 2000 .

[6]  Andrew G. Glen,et al.  APPL: A Probability Programming Language , 2001 .

[7]  Jason Brown,et al.  Reliability: Probabilistic Models and Statistical Methods , 1996 .

[8]  E. Gehan A GENERALIZED WILCOXON TEST FOR COMPARING ARBITRARILY SINGLY-CENSORED SAMPLES. , 1965, Biometrika.

[9]  Frank M. Carrano,et al.  Data abstraction and problem solving with C++ - walls and mirrors , 1995 .

[10]  Robert J. Lopez,et al.  Advanced Engineering Mathematics , 2000 .

[11]  E. Ley On the Peculiar Distribution of the U.S. Stock Indexes' Digits , 1996 .

[12]  Gordon Johnston,et al.  Statistical Models and Methods for Lifetime Data , 2003, Technometrics.

[13]  Peter Thompson,et al.  Getting Normal Probability Approximations without Using Normal Tables , 2000 .

[14]  Theodore P. Hill The First Digit Phenomenon , 1998 .

[15]  Albert Nijenhuis,et al.  Combinatorial Algorithms for Computers and Calculators , 1978 .

[16]  Kevin J. Hastings Introduction to Probability with Mathematica , 2000 .

[17]  A. Sveshnikov,et al.  Problems in Probability Theory, Mathematical Statistics and Theory of Random Functions , 1979 .

[18]  A. Cohen An Introduction to Probability Theory and Mathematical Statistics , 1979 .

[19]  Lawrence Leemis,et al.  A Generalized Univariate Change-of-Variable Transformation Technique , 1997, INFORMS J. Comput..

[20]  R. Larsen,et al.  An introduction to mathematical statistics and its applications (2nd edition) , by R. J. Larsen and M. L. Marx. Pp 630. £17·95. 1987. ISBN 13-487166-9 (Prentice-Hall) , 1987, The Mathematical Gazette.

[21]  Charles M. Grinstead,et al.  Introduction to probability , 1999, Statistics for the Behavioural Sciences.

[22]  Franco Vivaldi Experimental Mathematics with Maple , 2001 .

[23]  Pieter C. Allaart AN INVARIANT-SUM CHARACTERIZATION OF BENFORD'S LAW , 1997 .

[24]  S. K. Park,et al.  Random number generators: good ones are hard to find , 1988, CACM.

[25]  L. J. Bain,et al.  Introduction to Probability and Mathematical Statistics , 1987 .

[26]  Robert Tibshirani,et al.  An Introduction to the Bootstrap , 1994 .

[27]  Wilfrid S. Kendall,et al.  Computer algebra in probability and statistics , 1993 .

[28]  T. Hill A Statistical Derivation of the Significant-Digit Law , 1995 .

[29]  Mark Allen Weiss,et al.  Data structures and algorithm analysis in C , 1991 .

[30]  Emanuel Parzen,et al.  Modern Probability Theory And Its Applications , 1960 .