A Dual Active-Set Algorithm for Regularized Monotonic Regression

Monotonic (isotonic) regression is a powerful tool used for solving a wide range of important applied problems. One of its features, which poses a limitation on its use in some areas, is that it produces a piecewise constant fitted response. For smoothing the fitted response, we introduce a regularization term in the monotonic regression, formulated as a least distance problem with monotonicity constraints. The resulting smoothed monotonic regression is a convex quadratic optimization problem. We focus on the case, where the set of observations is completely (linearly) ordered. Our smoothed pool-adjacent-violators algorithm is designed for solving the regularized problem. It belongs to the class of dual active-set algorithms. We prove that it converges to the optimal solution in a finite number of iterations that does not exceed the problem size. One of its advantages is that the active set is progressively enlarging by including one or, typically, more constraints per iteration. This resulted in solving large-scale test problems in a few iterations, whereas the size of that problems was prohibitively too large for the conventional quadratic optimization solvers. Although the complexity of our algorithm grows quadratically with the problem size, we found its running time to grow almost linearly in our computational experiments.

[1]  William L. Maxwell,et al.  Establishing Consistent and Realistic Reorder Intervals in Production-Distribution Systems , 1985, Oper. Res..

[2]  Quentin F. Stout,et al.  Isotonic Regression via Partitioning , 2013, Algorithmica.

[3]  Oleg Burdakov,et al.  DATA PREORDERING IN GENERALIZED PAV ALGORITHM FOR MONOTONIC REGRESSION , 2006 .

[4]  Oleg Burdakov,et al.  A smoothed monotonic regression via L2 regularization , 2018, Knowledge and Information Systems.

[5]  Anne Greenbaum,et al.  Iterative methods for solving linear systems , 1997, Frontiers in applied mathematics.

[6]  Oleg Burdakov,et al.  Monotonic regression for the detection of temporal trends in environmental quality data , 2005 .

[7]  Stig W. Omholt,et al.  Monotonicity is a key feature of genotype-phenotype maps , 2013, Front. Genet..

[8]  J. Kruskal Nonmetric multidimensional scaling: A numerical method , 1964 .

[9]  H. Wan,et al.  Least Squares Isotonic Regression in Two Dimensions , 2003 .

[10]  Oleg Burdakov,et al.  A generalised PAV algorithm for monotonic regression in several variables , 2004 .

[11]  Stephen J. Wright,et al.  Numerical Optimization , 2018, Fundamental Statistical Inference.

[12]  Panos M. Pardalos,et al.  Algorithms for a Class of Isotonic Regression Problems , 1999, Algorithmica.

[13]  Zheng Han,et al.  Primal-Dual Active-Set Methods for Isotonic Regression and Trend Filtering , 2015, ArXiv.

[14]  Daniel P. Robinson,et al.  A globally convergent primal-dual active-set framework for large-scale convex quadratic optimization , 2015, Comput. Optim. Appl..

[15]  J. M. Bremner,et al.  Statistical Inference under Restrictions , 1973 .

[16]  Oleg Burdakov,et al.  An O(n2) algorithm for isotonic regression problems , 2006 .

[17]  H. D. Brunk,et al.  AN EMPIRICAL DISTRIBUTION FUNCTION FOR SAMPLING WITH INCOMPLETE INFORMATION , 1955 .

[18]  Pedro Antonio Gutiérrez,et al.  Ordinal Regression Methods: Survey and Experimental Study , 2016, IEEE Transactions on Knowledge and Data Engineering.

[19]  Varghese S. Jacob,et al.  Isotonic Separation , 2005, INFORMS J. Comput..

[20]  Ivan Kapyrin,et al.  Monotonicity recovering and accuracy preserving optimization methods for postprocessing finite element solutions , 2012, J. Comput. Phys..

[21]  F. T. Wright,et al.  Order restricted statistical inference , 1988 .

[22]  Marina Velikova Monotone Prediction Models in Data Mining , 2008 .

[23]  R. E. Miles THE COMPLETE AMALGAMATION INTO BLOCKS, BY WEIGHTED MEANS, OF A FINITE SET OF REAL NUMBERS , 1959 .

[24]  G. Jongbloed,et al.  Trends in Moderate Rainfall Extremes: A Regional Monotone Regression Approach , 2015 .

[25]  Oleg Burdakov,et al.  A segmentation-based algorithm for large-scale partially ordered monotonic regression , 2011, Comput. Stat. Data Anal..

[26]  Michael J. Best,et al.  Active set algorithms for isotonic regression; A unifying framework , 1990, Math. Program..

[27]  J. Leeuw,et al.  Isotone Optimization in R: Pool-Adjacent-Violators Algorithm (PAVA) and Active Set Methods , 2009 .

[28]  H. D. Brunk,et al.  Statistical inference under order restrictions : the theory and application of isotonic regression , 1973 .

[29]  M. Kalish,et al.  A statistical test of the equality of latent orders , 2016 .

[30]  Quentin F. Stout,et al.  Isotonic Regression for Multiple Independent Variables , 2015, Algorithmica.