A re-weighting strategy for improving margins

We present a simple, general scheme for improving classification margins, inspired by well-known results from margin theory. The scheme is based on a sample re-weighting strategy: replicas of training samples that are not classified with a sufficient margin are added to the training set. As a case study, we present a new algorithm, TVQ, an instance of the proposed scheme that uses a tangent-distance-based 1-NN classifier and implements a form of quantization of the tangent-distance prototypes. The tangent-distance models created in this way show significantly better generalization than standard tangent models, and they outperform other state-of-the-art algorithms, such as SVMs, on an OCR task.
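The core re-weighting idea can be sketched in a few lines. The snippet below is a minimal illustration, not the paper's TVQ implementation: it uses plain Euclidean distance as a stand-in for tangent distance, a 1-NN margin defined as the gap between the nearest other-class prototype and the nearest same-class prototype, and a hypothetical margin threshold `theta`.

```python
import numpy as np

def margin_1nn(proto_X, proto_y, x, y):
    """1-NN margin of sample (x, y) w.r.t. a prototype set:
    distance to the nearest prototype of a different class minus
    distance to the nearest prototype of the same class.
    (Euclidean distance stands in for tangent distance here.)"""
    d = np.linalg.norm(proto_X - x, axis=1)
    return d[proto_y != y].min() - d[proto_y == y].min()

def reweight_by_replication(X, y, proto_X, proto_y, theta=1.0):
    """Re-weighting by replication: append one replica of every
    training sample whose margin falls below the threshold theta."""
    extra_X, extra_y = [], []
    for xi, yi in zip(X, y):
        if margin_1nn(proto_X, proto_y, xi, yi) < theta:
            extra_X.append(xi)
            extra_y.append(yi)
    if extra_X:
        X = np.vstack([X, np.array(extra_X)])
        y = np.concatenate([y, extra_y])
    return X, y
```

Retraining the prototypes on the re-weighted set effectively increases the loss contribution of low-margin samples, which is the mechanism the scheme exploits; in TVQ this loop would be driven by tangent distance rather than the Euclidean stand-in used above.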
