Decomposition methods for large scale stochastic and robust optimization problems

We propose new decomposition methods for use on broad families of stochastic and robust optimization problemis in order to yield tractable approaches for large-scale real world application. We introduce a new type of a Markov decision problem named the Generalized Rest less Bandits Problem that encompasses a broad generalization of the restless bandit problem. For this class of stochastic optimization problems, we develop a nested policy heuristic which iteratively solves a series of sub-problems operating on smaller bandit systems. We also develop linear-optimization based bounds for the Generalized Restless Bandit problem and demonstrate promising computational performance of the nested policy heuristic on a large-scale real world application of search term selection for sponsored search advertising. We further study the distributionally robust optimization problem with known mean, covariance and support. These optimization models are attractive in their real world applications as they require the model consumer to only rely on those statistics of uncertainty that are known with relative confidence rather than making arbitrary assumptions about the exact dynamics of the underlying distribution of uncertainty. Known to be AP hard, current approaches invoke tractable but often weak relaxations for real-world applications. We develop a decomposition method for this family of problems which recursively derives sub-policies along projected dimensions of uncertainty and provides a sequence of bounds on the value of the derived policy. In the development of this method, we prove that non-convex quadratic optimization in n-dimensions over a box in two-dimensions is efficiently solvable. We also show that this same decomposition method yields a promising heuristic for the MAXCUT problem. We then provide promising computational results in the context of a real world fixed income portfolio optimization problem. The decomposition methods developed in this thesis recursively derive sub-policies on projected dimensions of the master )roblem. These sub-policies are optimal on relaxations which admit "tight" projections of the master problem; that is, the projection of the feasible region for the relaxation is equivalent to the projection of that of master problem along the dimensions of the sub-policy. Additionally, these decomposition strategies provide a hierarchical solution structure that aids in solving large-scale problems. Thesis Supervisor: Dimitris Bertsimas Title: Boeing Leaders for Global Operations Professor Co-Director, Operations Research Center

[1]  Peng Sun,et al.  A Robust Optimization Perspective on Stochastic Programming , 2007, Oper. Res..

[2]  Salil P. Vadhan,et al.  Computational Complexity , 2005, Encyclopedia of Cryptography and Security.

[3]  Dimitris Bertsimas,et al.  Conservation Laws, Extended Polymatroids and Multiarmed Bandit Problems; A Polyhedral Approach to Indexable Systems , 1996, Math. Oper. Res..

[4]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[5]  Herbert E. Scarf,et al.  A Min-Max Solution of an Inventory Problem , 1957 .

[6]  Yinyu Ye,et al.  Distributionally Robust Optimization Under Moment Uncertainty with Application to Data-Driven Problems , 2010, Oper. Res..

[7]  P. Whittle Restless Bandits: Activity Allocation in a Changing World , 1988 .

[8]  Gary L. Miller,et al.  Approaching Optimality for Solving SDD Linear Systems , 2010, 2010 IEEE 51st Annual Symposium on Foundations of Computer Science.

[9]  Lawrence M. Wein,et al.  Scheduling a Make-To-Stock Queue: Index Policies and Hedging Points , 1996, Oper. Res..

[10]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[11]  Copula Marginal Expected Tail Loss Efficient Frontiers for CDOs of Bespoke Portfolios , .

[12]  Alexandru Niculescu-Mizil Multi-Armed Bandits with Betting , 2009 .

[13]  X. V. Doan,et al.  Bounds on some contingent claims with non-convex payoff based on multiple assets , 2007 .

[14]  Demosthenis Teneketzis,et al.  Multi-armed bandits with switching penalties , 1996, IEEE Trans. Autom. Control..

[15]  Stefan Schmieta,et al.  The DIMACS library of semidefinite-quadratic-linear programs , 1999 .

[16]  Kridsda Nimmanunta A Comparative analysis of CDO pricing models , 2006 .

[17]  Christian M. Ernst,et al.  Multi-armed Bandit Allocation Indices , 1989 .

[18]  Ioana Popescu,et al.  Robust Mean-Covariance Solutions for Stochastic Optimization , 2007, Oper. Res..

[19]  Sean P. Meyn,et al.  An information-theoretic framework to aggregate a Markov chain , 2009, 2009 American Control Conference.

[20]  Gábor Pataki,et al.  On the Rank of Extreme Matrices in Semidefinite Programs and the Multiplicity of Optimal Eigenvalues , 1998, Math. Oper. Res..

[21]  Warren P. Adams,et al.  A Reformulation-Linearization Technique for Solving Discrete and Continuous Nonconvex Problems , 1998 .

[22]  J. Bather,et al.  Multi‐Armed Bandit Allocation Indices , 1990 .

[23]  Dimitris Bertsimas,et al.  Restless Bandits, Linear Programming Relaxations, and a Primal-Dual Index Heuristic , 2000, Oper. Res..

[24]  David P. Williamson,et al.  An adaptive algorithm for selecting profitable keywords for search-based advertising services , 2006, EC '06.

[25]  Paul Glasserman,et al.  LARGE DEVIATIONS IN MULTIFACTOR PORTFOLIO CREDIT RISK , 2007 .

[26]  Melvyn Sim,et al.  Distributionally Robust Optimization and Its Tractable Approximations , 2010, Oper. Res..

[27]  John N. Tsitsiklis,et al.  The complexity of optimal queueing network control , 1994, Proceedings of IEEE 9th Annual Conference on Structure in Complexity Theory.

[28]  Laurent El Ghaoui,et al.  Worst-Case Value-At-Risk and Robust Portfolio Optimization: A Conic Programming Approach , 2003, Oper. Res..

[29]  P. Whittle Multi‐Armed Bandits and the Gittins Index , 1980 .

[30]  David P. Williamson,et al.  Improved approximation algorithms for maximum cut and satisfiability problems using semidefinite programming , 1995, JACM.

[31]  James E. Smith,et al.  Generalized Chebychev Inequalities: Theory and Applications in Decision Analysis , 1995, Oper. Res..

[32]  R. Weber,et al.  On an index policy for restless bandits , 1990, Journal of Applied Probability.

[33]  K. Glazebrook,et al.  Some indexable families of restless bandit problems , 2006, Advances in Applied Probability.

[34]  Diego Ruiz-Hernández,et al.  Indexable Restless Bandits: Index Policies for Some Families of Stochastic Scheduling and Dynamic Allocation Problems , 2008 .

[35]  K. Glazebrook,et al.  Index policies for a class of discounted restless bandits , 2002, Advances in Applied Probability.

[36]  Melvyn Sim,et al.  TRACTABLE ROBUST EXPECTED UTILITY AND RISK MODELS FOR PORTFOLIO OPTIMIZATION , 2009 .

[37]  Xuan Vinh Doan,et al.  Models for Minimax Stochastic Linear Optimization Problems with Risk Aversion , 2010, Math. Oper. Res..

[38]  Paul Glasserman,et al.  Importance Sampling for Portfolio Credit Risk , 2005, Manag. Sci..

[39]  José Niño-Mora,et al.  Multitarget tracking via restless bandit marginal productivity indices and Kalman filter in discrete time , 2009, Proceedings of the 48h IEEE Conference on Decision and Control (CDC) held jointly with 2009 28th Chinese Control Conference.

[40]  David Pisinger A minimal algorithm for the Multiple-choice Knapsack Problem , 1995 .

[41]  Ioana Popescu,et al.  Optimal Inequalities in Probability Theory: A Convex Optimization Approach , 2005, SIAM J. Optim..

[42]  Dimitris Bertsimas,et al.  An Approximate Dynamic Programming Approach to Multidimensional Knapsack Problems , 2002, Manag. Sci..

[43]  Robert B. Washburn,et al.  Optimal Policies for a Class of Restless Multiarmed Bandit Scheduling Problems with Applications to Sensor Management , 2008, J. Adv. Inf. Fusion.