Working Set Selection Using Second Order Information for Training Svm, " Complexity-reduced Scheme for Feature Extraction with Linear Discriminant Analysis

1 2 (w · w) + γ 2 (w * · w *) + C n i=1 ξ * i s.t. y i ((w · z i) + b) ≥ 1 − ξ The dual optimization problem of (29) is minimize α − n i=1 α i + 1 2 n i, j =1 α i α j y i y j (z i · z j) + 1 2γ n i, j =1 (α i + β i − C)(α j + β j − C)(z * i · z * j) s.t. ACKNOWLEDGMENT The authors would like to thank the anonymous reviewers for their helpful technical feedback. REFERENCES [1] B. Bakker and T. Heskes, " Task clustering and gating for Bayesian multitask learning, " A framework for learning predictive structures from multiple tasks and unlabeled data, " [20] J. Platt, " Using sparseness and analytic QP to speed training of support vector machines, " in Multi-task learning for classification with Dirichlet process priors, " Abstract— Owing to the singularity of the within-class scatter, linear discriminant analysis (LDA) becomes ill-posed for small sample size (SSS) problems. Null-space-based LDA (NLDA), which is an extension of LDA, provides good discriminant performances for SSS problems. Yet, as the original scheme for the feature extractor (FE) of NLDA suffers from a complexity burden, a few modified schemes have since been proposed for complexity reduction. In this brief, by transforming the problem of finding the FE of NLDA into a linear equation problem, a novel scheme is derived, offering a further reduction of the complexity.

[1]  C. Geary Leukemia , 1984, British Journal of Cancer.

[2]  Yong-Qing Cheng,et al.  An Efficient Algorithm for Foley-Sammon Optimal Set of Discriminant Vectors by Algebraic Method , 1992, Int. J. Pattern Recognit. Artif. Intell..

[3]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[4]  László Györfi,et al.  Lower Bounds for Bayes Error Estimation , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Ja-Chen Lin,et al.  A new LDA-based face recognition system which can solve the small sample size problem , 1998, Pattern Recognit..

[6]  Hua Yu,et al.  A direct LDA algorithm for high-dimensional data - with application to face recognition , 2001, Pattern Recognit..

[7]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[8]  S. Sathiya Keerthi,et al.  Improvements to Platt's SMO Algorithm for SVM Classifier Design , 2001, Neural Computation.

[9]  Hanqing Lu,et al.  Solving the small sample size problem of LDA , 2002, Object recognition supported by user interaction for service robots.

[10]  Tom Heskes,et al.  Task Clustering and Gating for Bayesian Multitask Learning , 2003, J. Mach. Learn. Res..

[11]  Neil D. Lawrence,et al.  Learning to learn with the informative vector machine , 2004, ICML.

[12]  Ryan M. Rifkin,et al.  In Defense of One-Vs-All Classification , 2004, J. Mach. Learn. Res..

[13]  Haesun Park,et al.  Generalizing discriminant analysis using the generalized singular value decomposition , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Xuejun Liao,et al.  Radial Basis Function Network for Multitask Learning , 2005 .

[15]  Helmut Alt,et al.  Square rooting is as difficult as multiplication , 1979, Computing.

[16]  Ivor W. Tsang,et al.  Core Vector Machines: Fast SVM Training on Very Large Data Sets , 2005, J. Mach. Learn. Res..

[17]  Tong Zhang,et al.  A Framework for Learning Predictive Structures from Multiple Tasks and Unlabeled Data , 2005, J. Mach. Learn. Res..

[18]  Rajat Raina,et al.  Constructing informative priors using transfer learning , 2006, ICML.

[19]  Jieping Ye,et al.  Computational and Theoretical Analysis of Null Space and Orthogonal Linear Discriminant Analysis , 2006, J. Mach. Learn. Res..

[20]  Jianlin Wang,et al.  Solving the small sample size problem in face recognition using generalized discriminant analysis , 2006, Pattern Recognit..

[21]  Lipo Wang,et al.  Data Mining With Computational Intelligence , 2006, IEEE Transactions on Neural Networks.

[22]  Lawrence Carin,et al.  Multi-Task Learning for Classification with Dirichlet Process Priors , 2007, J. Mach. Learn. Res..

[23]  Jiawei Han,et al.  SRDA: An Efficient Algorithm for Large-Scale Discriminant Analysis , 2008, IEEE Transactions on Knowledge and Data Engineering.

[24]  Vladimir Cherkassky,et al.  Connection between SVM+ and multi-task learning , 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).

[25]  Wenming Zheng Heteroscedastic Feature Extraction for Texture Classification , 2009, IEEE Signal Processing Letters.

[26]  D. Lee,et al.  Linear Discriminant Analysis for Signatures , 2010, IEEE Transactions on Neural Networks.

[27]  Delin Chu,et al.  A new and fast implementation for null space based linear discriminant analysis , 2010, Pattern Recognit..

[28]  David Zhang,et al.  Local Linear Discriminant Analysis Framework Using Sample Neighbors , 2011, IEEE Transactions on Neural Networks.

[29]  Soo-Young Lee,et al.  Discriminant Independent Component Analysis , 2011, IEEE Transactions on Neural Networks.

[30]  Soo-Young Lee,et al.  Discriminant Independent Component Analysis , 2011, IEEE Trans. Neural Networks.

[31]  S. Anitha,et al.  SEMI-SUPERVISED BIASED MAXIMUM MARGIN ANALYSIS FOR INTERACTIVE IMAGE RETRIEVAL , 2013 .

[32]  Ji Haibo,et al.  Multi-task feature learning by using trace norm regularization , 2017 .