A comparison of optimization solvers for log binomial regression including conic programming

Relative risks are estimated to assess associations and effects due to their ease of interpretability, e.g., in epidemiological studies. Fitting log-binomial regression models allows to use the estimated regression coefficients to directly infer the relative risks. The estimation of these models, however, is complicated because of the constraints which have to be imposed on the parameter space. In this paper we systematically compare different optimization algorithms to obtain the maximum likelihood estimates for the regression coefficients in log-binomial regression. We first establish under which conditions the maximum likelihood estimates are guaranteed to be finite and unique, which allows to identify and exclude problematic cases. In simulation studies using artificial data we compare the performance of different optimizers including solvers based on the augmented Lagrangian method, interior-point methods including a conic optimizer, majorize-minimize algorithms, iteratively reweighted least squares and expectation-maximization algorithm variants. We demonstrate that conic optimizers emerge as the preferred choice due to their reliability, lack of requirement to tune hyperparameters and speed.

[1]  Anthony V. Fiacco,et al.  Nonlinear programming;: Sequential unconstrained minimization techniques , 1968 .

[2]  James M. Ortega,et al.  Iterative solution of nonlinear equations in several variables , 2014, Computer science and applied mathematics.

[3]  M. Silvapulle On the Existence of Maximum Likelihood Estimators for the Binomial Response Models , 1981 .

[4]  A. Albert,et al.  On the existence of maximum likelihood estimates in logistic regression models , 1984 .

[5]  Narendra Karmarkar,et al.  A new polynomial-time algorithm for linear programming , 1984, Comb..

[6]  Charles R. Johnson,et al.  Matrix analysis , 1985, Statistical Inference for Engineers and Data Scientists.

[7]  H. Kaufmann,et al.  On existence and uniqueness of maximum likelihood estimates in quantal and ordinal response models , 1988 .

[8]  K. Lange An adaptive barrier method for convex programming , 1994 .

[9]  Yurii Nesterov,et al.  Interior-point polynomial algorithms in convex programming , 1994, Siam studies in applied mathematics.

[10]  H. Davies,et al.  When can odds ratios mislead? , 1998, BMJ.

[11]  Stephen J. Wright,et al.  Numerical Optimization , 2018, Fundamental Statistical Inference.

[12]  D. A. Luke,et al.  An Odd Measure of Risk: Use and Misuse of the Odds Ratio , 2001, Obstetrics and gynecology.

[13]  Kaj Madsen,et al.  Optimization with Constraints, 2nd ed. , 2004 .

[14]  D. Hunter,et al.  A Tutorial on MM Algorithms , 2004 .

[15]  A. Nemirovski Advances in convex optimization : conic programming , 2005 .

[16]  Lorenz T. Biegler,et al.  On the implementation of an interior-point filter line-search algorithm for large-scale nonlinear programming , 2006, Math. Program..

[17]  D W Hosmer,et al.  Parameter Estimation and Goodness‐of‐Fit in Log Binomial Regression , 2006, Biometrical journal. Biometrische Zeitschrift.

[18]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[19]  Shuangge Ma,et al.  Relative Risk Regression in Medical Research: Models, Contrasts, Estimators, and Algorithms , 2006 .

[20]  K. Konis,et al.  Linear programming algorithms for detecting separated data in binary logistic regression models , 2007 .

[21]  K. Fokianos,et al.  Safe density ratio modeling , 2009 .

[22]  Andreas Wächter,et al.  Short Tutorial: Getting Started With Ipopt in 90 Minutes , 2009, Combinatorial Scientific Computing.

[23]  Robert Chares Cones and interior-point algorithms for structured convex optimization involving powers andexponentials , 2009 .

[24]  Y. Yasui,et al.  Estimation of relative risk and prevalence ratio , 2010, Statistics in medicine.

[25]  Ian C. Marschner,et al.  glm2: Fitting Generalized Linear Models with Convergence Problems , 2011, R J..

[26]  Charles R. Johnson,et al.  Matrix Analysis, 2nd Ed , 2012 .

[27]  Stephen P. Boyd,et al.  ECOS: An SOCP solver for embedded systems , 2013, 2013 European Control Conference (ECC).

[28]  Alexander Domahidi Methods and tools for embedded optimization and control , 2013 .

[29]  Misha Eliasziw,et al.  Log-binomial models: exploring failed convergence , 2013, Emerging Themes in Epidemiology.

[30]  Ji Luo,et al.  Estimation of relative risk using a log-binomial model with constraints , 2014, Comput. Stat..

[31]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[32]  I. Marschner Relative Risk Regression for Binary Outcomes: Methods and Recommendations , 2015 .

[33]  Kenneth Lange,et al.  MM optimization algorithms , 2016 .

[34]  Stephen P. Boyd,et al.  Conic Optimization via Operator Splitting and Homogeneous Self-Dual Embedding , 2013, Journal of Optimization Theory and Applications.

[35]  Kurt Hornik,et al.  ROI: An Extensible R Optimization Infrastructure , 2020, Journal of Statistical Software.

[36]  B.B. de Andrade,et al.  Some results for maximum likelihood estimation of adjusted relative risks , 2018 .

[37]  I. Marschner,et al.  logbin: An R Package for Relative Risk Regression Using the Log-Binomial Model , 2018 .

[38]  Kurt Hornik,et al.  R Optimization Infrastructure [R package ROI version 1.0-0] , 2020 .