Quantum algorithms for nearest-neighbor methods for supervised and unsupervised learning

We present quantum algorithms for performing nearest-neighbor learning and k-means clustering. At the core of our algorithms are fast and coherent quantum methods for computing the Euclidean distance both directly and via the inner product which we couple with methods for performing amplitude estimation that do not require measurement. We prove upper bounds on the number of queries to the input data required to compute such distances and find the nearest vector to a given test example. In the worst case, our quantum algorithms lead to polynomial reductions in query complexity relative to Monte Carlo algorithms. We also study the performance of our quantum nearest-neighbor algorithms on several real-world binary classification tasks and find that the classification accuracy is competitive with classical methods.

[1]  William Nick Street,et al.  Breast Cancer Diagnosis and Prognosis Via Linear Programming , 1995, Oper. Res..

[2]  Yoshua. Bengio,et al.  Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..

[3]  Keinosuke Fukunaga,et al.  A Branch and Bound Algorithm for Computing k-Nearest Neighbors , 1975, IEEE Transactions on Computers.

[4]  Tzyh Jong Tarn,et al.  Quantum Reinforcement Learning , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[5]  Daniel A. Spielman,et al.  Exponential algorithmic speedup by a quantum walk , 2002, STOC '03.

[6]  Lov K. Grover,et al.  Creating superpositions that correspond to efficiently integrable probability distributions , 2002, quant-ph/0208112.

[7]  Masoud Mohseni,et al.  Quantum support vector machine for big feature and big data classification , 2013, Physical review letters.

[8]  Canada.,et al.  Data Mining and Machine Learning in Astronomy , 2009, 0906.2173.

[9]  Christoph Dürr,et al.  A Quantum Algorithm for Finding the Minimum , 1996, ArXiv.

[10]  Barry C. Sanders,et al.  Simulating quantum dynamics on a quantum computer , 2010, 1011.3489.

[11]  Gilles Brassard,et al.  Machine Learning in a Quantum World , 2006, Canadian AI.

[12]  Felix Wu,et al.  The quantum query complexity of approximating the median and related statistics , 1998, STOC '99.

[13]  Robert J. Brunner,et al.  Robust Machine Learning Applied to Astronomical Data Sets. III. Probabilistic Photometric Redshifts for Galaxies and Quasars in the SDSS and GALEX , 2008, 0804.3413.

[14]  Amnon Ta-Shma,et al.  Adiabatic quantum state generation and statistical zero knowledge , 2003, STOC '03.

[15]  S. Lloyd,et al.  Quantum algorithms for supervised and unsupervised machine learning , 2013, 1307.0411.

[16]  Paul Compton,et al.  Inductive knowledge acquisition: a case study , 1987 .

[17]  Nathan Wiebe,et al.  Hamiltonian simulation using linear combinations of unitary operations , 2012, Quantum Inf. Comput..

[18]  J. Whitfield,et al.  Simulation of electronic structure Hamiltonians using quantum computers , 2010, 1001.3855.

[19]  Ilse C. F. Ipsen,et al.  Importance Sampling for a Monte Carlo Matrix Multiplication Algorithm, with Application to Information Retrieval , 2011, SIAM J. Sci. Comput..

[20]  Dénes Petz,et al.  On asymptotics of large Haar distributed unitary matrices , 2004, Period. Math. Hung..

[21]  Seth Lloyd,et al.  Quantum random access memory. , 2007, Physical review letters.

[22]  M. Hastings,et al.  Can quantum chemistry be performed on a small quantum computer , 2013 .

[23]  Ilya Levner,et al.  Feature selection and nearest centroid classification for protein mass spectrometry , 2005, BMC Bioinformatics.

[24]  Anil K. Jain Data clustering: 50 years beyond K-means , 2008, Pattern Recognit. Lett..

[25]  N. Soonthornphisaj,et al.  Anti-spam filtering: a centroid-based classification approach , 2002, 6th International Conference on Signal Processing, 2002..

[26]  G. Brassard,et al.  Quantum Amplitude Amplification and Estimation , 2000, quant-ph/0005055.

[27]  R. Tibshirani,et al.  Diagnosis of multiple cancer types by shrunken centroids of gene expression , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[28]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[29]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[30]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[31]  Steven T. Flammia,et al.  Quantum tomography via compressed sensing: error bounds, sample complexity and efficient estimators , 2012, 1205.2300.

[32]  Oliver Kramer,et al.  Detecting Quasars in Large-Scale Astronomical Surveys , 2010, 2010 Ninth International Conference on Machine Learning and Applications.

[33]  R. Cleve,et al.  Quantum fingerprinting. , 2001, Physical review letters.

[34]  D. Cory,et al.  Hamiltonian learning and certification using quantum resources. , 2013, Physical review letters.