Adding Constraints to Situations When, In Addition to Intervals,We Also Have Partial Information about Probabilities

In many practical situations, we need to combine probabilistic and interval uncertainty. For example, we need to 1 A compute statistics like population mean E = 1/n.nSigmai=1xi or population variance V = 1/nnSigmai=1(xi-E)2 in situations when we only know intervals xi of possible value of xi. In this case, it is desirable to compute the range of the corresponding characteristic. Some range computation problems are NP-hard; for these problems, in general, only an enclosure is possible. For other problems, there are efficient algorithms. In many practical situations, we have additional information that can be used as constraints on possible cumulative distribution functions (cdfs). For example, we may know that the actual (unknown) cdf is Gaussian. In this paper, we show that such constraints enable us to drastically narrow down the resulting ranges - and sometimes, transform the originally intractable (NP-hard) computational problem of computing the exact range into an efficiently solvable one. This possibility is illustrated on the simplest example of an NP-problem from interval statistics: the problem of computing the range V of the variance V. We also describe how we can estimate the amount of information under such combined intervals-and-constraints uncertainty.

[1]  Gang Xiang,et al.  How the concept of information as average number of "yes"-"no" questions (bits) can be extended to intervals, p-boxes, and more general uncertainty , 2005, NAFIPS 2005 - 2005 Annual Meeting of the North American Fuzzy Information Processing Society.

[2]  Vladik Kreinovich,et al.  Towards Combining Probabilistic and Interval Uncertainty in Engineering Calculations , 2004 .

[3]  Vladik Kreinovich,et al.  A Feasible Algorithm for Locating Concave and Convex Zones of Interval Data and Its Use in Statistics-Based Clustering , 2004, Numerical Algorithms.

[4]  G. Klir,et al.  Estimating Information Amount under Interval Uncertainty : Algorithmic Solvability and Computational Complexity , 2006 .

[5]  P. Walley Statistical Reasoning with Imprecise Probabilities , 1990 .

[6]  Vladik Kreinovich,et al.  Computing Population Variance and Entropy under Interval Uncertainty: Linear-Time Algorithms , 2007, Reliab. Comput..

[7]  Vladik Kreinovich,et al.  Outlier Detection under Interval Uncertainty: Algorithmic Solvability and Computational Complexity , 2005, Reliab. Comput..

[8]  Vladik Kreinovich,et al.  Detecting Outliers under Interval Uncertainty: A New Algorithm Based on Constraint Satisfaction , 2006 .

[9]  Harrison M. Wadsworth Handbook of Statistical Methods for Engineers and Scientists , 1990 .

[10]  Alex M. Andrew,et al.  Uncertainty and Information: Foundations of Generalized Information Theory , 2006 .

[11]  Luc Jaulin,et al.  Applied Interval Analysis , 2001, Springer London.

[12]  Vladik Kreinovich,et al.  Interval versions of statistical techniques with applications to environmental analysis, bioinformatics, and privacy in statistical databases , 2007 .