BILGO: Bilateral greedy optimization for large scale semidefinite programming

Many machine learning tasks (e.g., metric and manifold learning) can be formulated as convex semidefinite programs. For such tasks to be applied at a large scale, a practical semidefinite programming algorithm must be scalable and computationally efficient. In this paper, we theoretically analyze a new bilateral greedy optimization (BILGO) strategy for solving general semidefinite programs on large-scale datasets. In contrast to existing methods, BILGO employs a bilateral search strategy in each optimization iteration: the current semidefinite matrix solution is updated as a bilateral linear combination of the previous solution and a suitable rank-1 matrix, which can be computed efficiently from the leading eigenvector of the descent direction at that iteration. By optimizing the coefficients of this bilateral combination, BILGO reduces the cost function at every iteration until the KKT conditions are fully satisfied, thus converging to a global optimum. In fact, we prove that BILGO converges to the globally optimal solution at a rate of O(1/k), where k is the iteration counter. The algorithm thus combines the efficiency of conventional rank-1 update algorithms with the effectiveness of gradient descent. Moreover, BILGO is easily extended to handle low-rank constraints. To validate its effectiveness and efficiency, we apply BILGO to two important machine learning tasks, namely Mahalanobis metric learning and maximum variance unfolding. Extensive experimental results clearly demonstrate that BILGO can solve large-scale semidefinite programs efficiently.
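To make the update described in the abstract concrete, the following is a minimal sketch of one BILGO-style iteration for a generic differentiable objective f minimized over the positive semidefinite cone. It is an illustrative sketch, not the authors' implementation: the names (f, grad_f, bilgo_step), the use of a generic bounded solver for the two combination coefficients, and the assumption that the gradient is a symmetric matrix are all assumptions made here.

```python
import numpy as np
from scipy.sparse.linalg import eigsh
from scipy.optimize import minimize


def bilgo_step(X, f, grad_f):
    """One bilateral greedy update: X <- alpha * X + beta * v v^T."""
    G = grad_f(X)                      # gradient at the current iterate (assumed symmetric)
    # Rank-1 atom: leading eigenvector of the descent direction -G.
    _, v = eigsh(-G, k=1, which='LA')
    v = v[:, 0]
    V = np.outer(v, v)                 # rank-1 PSD matrix v v^T

    # Bilateral step: jointly optimize the two nonnegative coefficients
    # (alpha, beta) of the combination alpha * X + beta * v v^T.
    def obj(ab):
        a, b = ab
        return f(a * X + b * V)

    res = minimize(obj, x0=np.array([1.0, 0.0]),
                   bounds=[(0.0, None), (0.0, None)])
    alpha, beta = res.x
    return alpha * X + beta * V


# Toy usage: minimize f(X) = ||X - C||_F^2 over the PSD cone for a fixed symmetric C.
if __name__ == "__main__":
    rng = np.random.default_rng(0)
    A = rng.standard_normal((20, 20))
    C = (A + A.T) / 2
    f = lambda X: np.linalg.norm(X - C, 'fro') ** 2
    grad_f = lambda X: 2.0 * (X - C)
    X = np.zeros((20, 20))
    for _ in range(50):
        X = bilgo_step(X, f, grad_f)
    print("objective:", f(X))
```

Since the starting point and the rank-1 atom are both positive semidefinite and the coefficients are constrained to be nonnegative, every iterate stays in the PSD cone. For specific objectives (e.g., quadratics), the two-variable coefficient subproblem typically admits a cheap or closed-form solution, so the generic solver above is only for illustration.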
