Inhibition in Multiclass Classification

The role of inhibition is investigated in a multiclass support vector machine formalism inspired by the brain structure of insects. The so-called mushroom bodies have a set of output neurons, or classification functions, that compete with each other to encode a particular input. Strongly active output neurons depress or inhibit the remaining outputs without knowing which is correct or incorrect. Accordingly, we propose to use a classification function that embodies unselective inhibition and train it in the large margin classifier framework. Inhibition leads to more robust classifiers in the sense that they perform better on larger areas of appropriate hyperparameters when assessed with leave-one-out strategies. We also show that the classifier with inhibition is a tight bound to probabilistic exponential models and is Bayes consistent for 3-class problems. These properties make this approach useful for data sets with a limited number of labeled examples. For larger data sets, there is no significant comparative advantage to other multiclass SVM approaches.

[1]  Ambuj Tewari,et al.  On the Consistency of Multiclass Classification Methods , 2007, J. Mach. Learn. Res..

[2]  E. S. Pearson,et al.  On the Problem of the Most Efficient Tests of Statistical Hypotheses , 1933 .

[3]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[4]  Francisco B. Rodríguez,et al.  Techniques for temporal detection of neural sensitivity to external stimulation , 2009, Biological Cybernetics.

[5]  Nikolai F. Rulkov,et al.  On the performance of gas sensor arrays in open sampling systems using Inhibitory Support Vector Machines , 2013 .

[6]  John C. Platt Using Analytic QP and Sparseness to Speed Training of Support Vector Machines , 1998, NIPS.

[7]  T. R. Tobin,et al.  Conditional withholding of proboscis extension in honeybees (Apis mellifera) during discriminative punishment. , 1991, Journal of comparative psychology.

[8]  Olivier Chapelle,et al.  Training a Support Vector Machine in the Primal , 2007, Neural Computation.

[9]  Gilles Laurent,et al.  Olfactory network dynamics and the coding of multidimensional signals , 2002, Nature Reviews Neuroscience.

[10]  Chih-Jen Lin,et al.  Training and Testing Low-degree Polynomial Data Mappings via Linear SVM , 2010, J. Mach. Learn. Res..

[11]  Alexander J. Smola,et al.  Kernel methods and the exponential family , 2006, ESANN.

[12]  Karel Svoboda,et al.  Locally dynamic synaptic learning rules in pyramidal neuron dendrites , 2007, Nature.

[13]  Joachim M. Buhmann,et al.  Entropy and Margin Maximization for Structured Output Learning , 2010, ECML/PKDD.

[14]  L. Abbott,et al.  Synaptic computation , 2004, Nature.

[15]  M. Heisenberg Mushroom body memoir: from maps to models , 2003, Nature Reviews Neuroscience.

[16]  Ryan M. Rifkin,et al.  In Defense of One-Vs-All Classification , 2004, J. Mach. Learn. Res..

[17]  G. Lugosi,et al.  On the Bayes-risk consistency of regularized boosting methods , 2003 .

[18]  Arthur P. Dempster,et al.  The direct use of likelihood for significance testing , 1997, Stat. Comput..

[19]  S. Sathiya Keerthi,et al.  Improvements to Platt's SMO Algorithm for SVM Classifier Design , 2001, Neural Computation.

[20]  E. S. Pearson,et al.  On the Problem of the Most Efficient Tests of Statistical Hypotheses , 1933 .

[21]  G. Lugosi,et al.  Complexity regularization via localized random penalties , 2004, math/0410091.

[22]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[23]  Alexander J. Smola,et al.  Online learning with kernels , 2001, IEEE Transactions on Signal Processing.

[24]  Yoav Freund,et al.  Large Margin Classification Using the Perceptron Algorithm , 1998, COLT' 98.

[25]  B. Smith,et al.  Learning-based recognition and discrimination of floral odors , 2005 .

[26]  Thomas G. Dietterich,et al.  Solving Multiclass Learning Problems via Error-Correcting Output Codes , 1994, J. Artif. Intell. Res..

[27]  Ramón Huerta,et al.  Learning Classification in the Olfactory System of Insects , 2004, Neural Computation.

[28]  Fu Jie Huang,et al.  A Tutorial on Energy-Based Learning , 2006 .

[29]  Yoram Singer,et al.  Pegasos: primal estimated sub-gradient solver for SVM , 2011, Math. Program..

[30]  Koby Crammer,et al.  On the Algorithmic Implementation of Multiclass Kernel-based Vector Machines , 2002, J. Mach. Learn. Res..

[31]  Yi Lin Multicategory Support Vector Machines, Theory, and Application to the Classification of . . . , 2003 .

[32]  H. Seung,et al.  Learning in Spiking Neural Networks by Reinforcement of Stochastic Synaptic Transmission , 2003, Neuron.

[33]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[34]  Jason Weston,et al.  Support vector machines for multi-class pattern recognition , 1999, ESANN.

[35]  Randall C. O'Reilly,et al.  Generalization in Interactive Networks: The Benefits of Inhibitory Competition and Hebbian Learning , 2001, Neural Computation.

[36]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[37]  Alexander Vergara,et al.  On the calibration of sensor arrays for pattern recognition using the minimal number of experiments , 2014 .

[38]  Ramón Huerta,et al.  Self-organization in the olfactory system: one shot odor recognition in insects , 2005, Biological Cybernetics.

[39]  Tong Zhang,et al.  Solving large scale linear prediction problems using stochastic gradient descent algorithms , 2004, ICML.

[40]  Yoram Singer,et al.  Reducing Multiclass to Binary: A Unifying Approach for Margin Classifiers , 2000, J. Mach. Learn. Res..

[41]  Thomas Hofmann,et al.  Large Margin Methods for Structured and Interdependent Output Variables , 2005, J. Mach. Learn. Res..

[42]  G. Laurent,et al.  Conditional modulation of spike-timing-dependent plasticity for olfactory learning , 2012, Nature.

[43]  Alexander Vergara,et al.  Algorithmic mitigation of sensor failure: is sensor replacement really necessary? , 2013 .

[44]  Ramón Huerta,et al.  Fast and Robust Learning by Reinforcement Signals: Explorations in the Insect Brain , 2009, Neural Computation.

[45]  Clint J. Perry,et al.  Invertebrate learning and cognition: relating phenomena to neural substrate. , 2013, Wiley interdisciplinary reviews. Cognitive science.

[46]  Gunnar Rätsch,et al.  An introduction to kernel-based learning algorithms , 2001, IEEE Trans. Neural Networks.

[47]  Léon Bottou,et al.  The Tradeoffs of Large Scale Learning , 2007, NIPS.

[48]  Clint J. Perry,et al.  Honey bees selectively avoid difficult choices , 2013, Proceedings of the National Academy of Sciences.

[49]  Yoram Singer,et al.  Pegasos: primal estimated sub-gradient solver for SVM , 2007, ICML '07.