Constrained Maximum-Entropy Sampling

A fundamental experimental design problem is to select a most informative subset, having prespecified size, from a set of correlated random variables. Instances of this problem arise in many applied domains such as meteorology, environmental statistics, and statistical geology. In these applications, observations can be collected at different locations and, possibly, at different times. Information is measured by "entropy." Practical situations have further restrictions on the design space. For example, budgetary limits, geographical considerations, as well as legislative and political considerations may restrict the design space in a complicated manner. Using techniques of linear algebra, combinatorial optimization, and convex optimization, we develop upper and lower bounds on the optimal value for the Gaussian case. We describe how these bounds can be integrated into a branch-and-bound algorithm for the exact solution of these design problems. Finally, we describe how we have implemented this algorithm, and we present computational results for estimated covariance matrices corresponding to sets of environmental monitoring stations in the Ohio Valley of the United States.

[1]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[2]  Claude E. Shannon,et al.  The Mathematical Theory of Communication , 1950 .

[3]  D. Blackwell Comparison of Experiments , 1951 .

[4]  D. Lindley On a Measure of the Information Provided by an Experiment , 1956 .

[5]  E. Jaynes Information Theory and Statistical Mechanics , 1957 .

[6]  David Lindley,et al.  BINOMIAL SAMPLING SCHEMES AND THE CONCEPT OF INFORMATION , 1957 .

[7]  Tosio Kato Perturbation theory for linear operators , 1966 .

[8]  A. Hoffman,et al.  Lower bounds for the partitioning of graphs , 1973 .

[9]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[10]  Ellis L. Johnson,et al.  Solving Large-Scale Zero-One Linear Programming Problems , 1983, Oper. Res..

[11]  W. F. Caselton,et al.  Optimal monitoring network designs , 1984 .

[12]  F. R. Gantmakher The Theory of Matrices , 1984 .

[13]  P. Lancaster,et al.  The theory of matrices : with applications , 1985 .

[14]  William H. Press,et al.  Numerical recipes in C. The art of scientific computing , 1987 .

[15]  Ravi B. Boppana,et al.  Eigenvalues and graph bisection: An average-case analysis , 1987, 28th Annual Symposium on Foundations of Computer Science (sfcs 1987).

[16]  B. Gollan EIGENVALUE PERTURBATIONS AND NONLINEAR PARAMETRIC OPTIMIZATION , 1987 .

[17]  Henry P. Wynn,et al.  Maximum entropy sampling , 1987 .

[18]  William H. Press,et al.  Numerical Recipes in FORTRAN - The Art of Scientific Computing, 2nd Edition , 1987 .

[19]  Laurence A. Wolsey,et al.  Integer and Combinatorial Optimization , 1988, Wiley interscience series in discrete mathematics and optimization.

[20]  Laurence A. Wolsey,et al.  Integer and Combinatorial Optimization , 1988 .

[21]  M. Overton On minimizing the maximum eigenvalue of a symmetric matrix , 1988 .

[22]  V. Fedorov,et al.  Comparison of two approaches in the optimal design of an observation network , 1989 .

[23]  Franz Rendl,et al.  Applications of parametric programming and eigenvalue maximization to the quadratic assignment problem , 1992, Math. Program..

[24]  J. Zidek,et al.  An entropy-based analysis of data from selected NADP/NTN network sites for 1983–1986 , 1992 .

[25]  Michael L. Overton,et al.  Optimality conditions and duality theory for minimizing sums of the largest eigenvalues of symmetric matrices , 2015, Math. Program..

[26]  B. Mohar,et al.  Eigenvalues in Combinatorial Optimization , 1993 .

[27]  V. Klee,et al.  Combinatorial and graph-theoretical problems in linear algebra , 1993 .

[28]  Charles Delorme,et al.  Laplacian eigenvalues and the maximum cut problem , 1993, Math. Program..

[29]  E. Verriest,et al.  On analyticity of functions involving eigenvalues , 1994 .

[30]  W. Press,et al.  Numerical Recipes in Fortran: The Art of Scientific Computing.@@@Numerical Recipes in C: The Art of Scientific Computing. , 1994 .

[31]  Maurice Queyranne,et al.  An Exact Algorithm for Maximum Entropy Sampling , 1995, Oper. Res..

[32]  Jorge Nocedal,et al.  Algorithm 778: L-BFGS-B: Fortran subroutines for large-scale bound-constrained optimization , 1997, TOMS.

[33]  Henry Wolkowicz,et al.  Handbook of Semidefinite Programming , 2000 .