Best Subset Selection: Statistical Computing Meets Quantum Computing.

With the rapid development of quantum computers, quantum algorithms have been studied extensively. However, quantum algorithms tackling statistical problems are still lacking. In this paper, we propose a novel non-oracular quantum adaptive search (QAS) method for the best subset selection problems. QAS performs almost identically to the naive best subset selection method but reduces its computational complexity from $O(D)$ to $O(\sqrt{D}\log_2D)$, where $D=2^p$ is the total number of subsets over $p$ covariates. Unlike existing quantum search algorithms, QAS does not require the oracle information of the true solution state and hence is applicable to various statistical learning problems with random observations. Theoretically, we prove QAS attains any arbitrary success probability $q \in (0.5, 1)$ within $O(\log_2D)$ iterations. When the underlying regression model is linear, we propose a quantum linear prediction method that is faster than its classical counterpart. We further introduce a hybrid quantum-classical strategy to avoid the capacity bottleneck of existing quantum computing systems and boost the success probability of QAS by majority voting. The effectiveness of this strategy is justified by both theoretical analysis and extensive empirical experiments on quantum and classical computers.

[1]  L. Breiman Heuristics of instability and stabilization in model selection , 1996 .

[2]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[3]  Tamara B Harris,et al.  Comparisons of percentage body fat, body mass index, waist circumference, and waist-stature ratio in adults. , 2009, The American journal of clinical nutrition.

[4]  D. Donoho,et al.  Basis pursuit , 1994, Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers.

[5]  Seth Lloyd,et al.  Quantum algorithm for data fitting. , 2012, Physical review letters.

[6]  Hussein Hazimeh,et al.  Fast Best Subset Selection: Coordinate Descent and Local Combinatorial Optimization Algorithms , 2018, Oper. Res..

[7]  Anmer Daskin Quantum Principal Component Analysis , 2015 .

[8]  Peter W. Shor,et al.  Algorithms for quantum computation: discrete logarithms and factoring , 1994, Proceedings 35th Annual Symposium on Foundations of Computer Science.

[9]  Lorenza Viola,et al.  Non-Gaussian noise spectroscopy with a superconducting qubit sensor , 2019, Nature Communications.

[10]  Terence Tao,et al.  The Dantzig selector: Statistical estimation when P is much larger than n , 2005, math/0506081.

[11]  Lov K. Grover Quantum Mechanics Helps in Searching for a Needle in a Haystack , 1997, quant-ph/9706033.

[12]  N. Tajima,et al.  Relationship of body mass index to percent body fat and waist circumference among schoolchildren in Japan - the influence of gender and obesity: a population-based cross-sectional study , 2010, BMC public health.

[13]  W. Welch Algorithmic complexity: three NP- hard problems in computational statistics , 1982 .

[14]  Peng Zhao,et al.  On Model Selection Consistency of Lasso , 2006, J. Mach. Learn. Res..

[15]  R. R. Hocking,et al.  Selection of the Best Subset in Regression Analysis , 1967 .

[16]  Gilles Brassard,et al.  Tight bounds on quantum searching , 1996, quant-ph/9605034.

[17]  Maria Schuld,et al.  Supervised Learning with Quantum Computers , 2018 .

[18]  Cun-Hui Zhang Nearly unbiased variable selection under minimax concave penalty , 2010, 1002.4734.

[19]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[20]  Balas K. Natarajan,et al.  Sparse Approximate Solutions to Linear Systems , 1995, SIAM J. Comput..

[21]  Tim Byrnes,et al.  Simulating lattice gauge theories on a quantum computer (熱場の量子論とその応用) , 2006 .

[22]  P. Wocjan,et al.  Quantum algorithm for approximating partition functions , 2008, 0811.0596.

[23]  Igor E. Shparlinski,et al.  Classical and Quantum Algorithms for Exponential Congruences , 2008, TQC.

[24]  Guoming Wang Quantum Algorithm for Linear Regression , 2017 .

[25]  H. Zou The Adaptive Lasso and Its Oracle Properties , 2006 .

[26]  Jianqing Fan,et al.  Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties , 2001 .

[27]  S. Jordan Fast quantum algorithm for numerical gradient estimation. , 2004, Physical review letters.

[28]  Gilles Brassard,et al.  Strengths and Weaknesses of Quantum Computing , 1997, SIAM J. Comput..

[29]  M. Szegedy,et al.  Quantum Walk Based Search Algorithms , 2008, TAMC.

[30]  Yazhen Wang Quantum Monte Carlo simulation , 2011 .

[31]  Xiaotong Shen,et al.  Journal of the American Statistical Association Likelihood-based Selection and Sharp Parameter Estimation Likelihood-based Selection and Sharp Parameter Estimation , 2022 .

[32]  M. Kendall,et al.  The discarding of variables in multivariate analysis. , 1967, Biometrika.

[33]  D. Bertsimas,et al.  Best Subset Selection via a Modern Optimization Lens , 2015, 1507.03133.

[34]  Jianqing Fan,et al.  When is best subset selection the "best"? , 2020 .

[35]  Peter W. Shor,et al.  Polynomial-Time Algorithms for Prime Factorization and Discrete Logarithms on a Quantum Computer , 1995, SIAM Rev..

[36]  Joseph Fitzsimons,et al.  Quantum assisted Gaussian process regression , 2015, Physical Review A.

[37]  Trevor Hastie,et al.  Best Subset, Forward Stepwise or Lasso? Analysis and Recommendations Based on Extensive Comparisons , 2020 .

[38]  Roger W. Johnson Fitting Percentage of Body Fat to Simple Body Measurements: College Women , 1996, Journal of Statistics and Data Science Education.

[39]  Yazhen Wang,et al.  Quantum Annealing via Path-Integral Monte Carlo With Data Augmentation , 2020, J. Comput. Graph. Stat..

[40]  I. Kassal,et al.  Polynomial-time quantum algorithm for the simulation of chemical dynamics , 2008, Proceedings of the National Academy of Sciences.

[41]  Kristan Temme,et al.  Error mitigation extends the computational reach of a noisy quantum processor , 2019, Nature.

[42]  Michael Short,et al.  Improved Inequalities for the Poisson and Binomial Distribution and Upper Tail Quantile Functions , 2013 .

[43]  D. Abrams,et al.  Simulation of Many-Body Fermi Systems on a Universal Quantum Computer , 1997, quant-ph/9703054.

[44]  H. Zou,et al.  STRONG ORACLE OPTIMALITY OF FOLDED CONCAVE PENALIZED ESTIMATION. , 2012, Annals of statistics.

[45]  M. Yuan,et al.  On the non‐negative garrotte estimator , 2007 .

[46]  M. Schuld,et al.  Prediction by linear regression on a quantum computer , 2016, 1601.07823.

[47]  M. Ruskai,et al.  Bounds for the adiabatic approximation with applications to quantum computation , 2006, quant-ph/0603175.

[48]  A. Harrow,et al.  Quantum algorithm for linear systems of equations. , 2008, Physical review letters.