In many scientific and engineering applications, detecting and understanding differences between two groups of examples can be reduced to the classical problem of training a classifier that labels new examples while making as few mistakes as possible. In the traditional classification setting, the resulting classifier is rarely analyzed in terms of the properties of the input data that the discriminative model captures. However, such analysis is crucial if we want to understand and visualize the detected differences. We propose an approach to interpreting the statistical model in the original feature space that allows us to reason about the model in terms of the relevant changes to the input vectors. For each point in the input space, we define the discriminative direction to be the direction that moves the point towards the other class while introducing as little irrelevant change as possible with respect to the classifier function. We derive the discriminative direction for kernel-based classifiers, demonstrate the technique on several examples, and briefly discuss its use in statistical shape analysis, the application that originally motivated this work.
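To make the idea of moving a point towards the other class concrete, the following is a minimal sketch and not the derivation referred to in the abstract. It assumes a Gaussian RBF kernel classifier of the form f(x) = sum_i a_i K(x, x_i) + b and uses the normalized gradient of f as a stand-in for the discriminative direction. The function names, the two kernel centers, and the coefficient values are hypothetical and chosen only for illustration.

```python
import numpy as np


def rbf_decision(x, centers, coeffs, bias, sigma):
    """Evaluate f(x) = sum_i coeffs[i] * exp(-||x - centers[i]||^2 / (2 sigma^2)) + bias."""
    sq_dist = np.sum((centers - x) ** 2, axis=1)
    return float(coeffs @ np.exp(-sq_dist / (2.0 * sigma ** 2)) + bias)


def rbf_gradient(x, centers, coeffs, sigma):
    """Gradient of the decision function with respect to the input point x."""
    diff = centers - x  # rows are (x_i - x)
    k = np.exp(-np.sum(diff ** 2, axis=1) / (2.0 * sigma ** 2))
    return (coeffs * k) @ diff / sigma ** 2


def direction_toward_other_class(x, centers, coeffs, bias, sigma):
    """Unit vector along which f changes fastest towards the opposite sign.

    Assumption: the gradient is used here as a simple proxy for the
    discriminative direction; it is not the derivation from the paper.
    """
    grad = rbf_gradient(x, centers, coeffs, sigma)
    norm = np.linalg.norm(grad)
    if norm == 0.0:
        return np.zeros_like(grad)
    # Points with f(x) > 0 move against the gradient, the rest move along it.
    sign = -1.0 if rbf_decision(x, centers, coeffs, bias, sigma) > 0 else 1.0
    return sign * grad / norm


# Hypothetical classifier: one kernel center per class, symmetric about the origin.
centers = np.array([[1.0, 0.0], [-1.0, 0.0]])
coeffs = np.array([1.0, -1.0])
x = np.array([0.5, 0.3])

d = direction_toward_other_class(x, centers, coeffs, bias=0.0, sigma=1.0)
x_moved = x + 0.1 * d  # a small step towards the negative class
```

In this toy setup the point starts on the positive side of the boundary, so a small step along `d` lowers the value of the classifier function. With a trained classifier, the centers and coefficients would come from the fitted model instead of being set by hand.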