Learning Matching Score Dependencies for Classifier Combination

The integration of recognition algorithms into a single document processing system might involve different available modules suitable for a single task. For example, we might possess few character or word recognition algorithms which all can be used in the system. One possible approach is to test these algorithms and to choose the one with the best performance. But practice shows that better approach is to try to use all available algorithms and to combine their outputs in order to achieve a better performance than any single algorithm. The combination problem consists in learning the behavior of given algorithms and deriving best possible combination function. We assume that both the combined algorithms and the result of combination are classifiers. Thus a finite number of classes are distinguished in the problem, and the task is to find a class, which corresponds most to the input. As examples, classes might be a character set, a word lexicon, a person list, etc. Usually classifiers output the numeric matching scores corresponding to each class, and we will assume that these scores are available for combination. The combination algorithm is a function producing a final combined score for each class, and the final classifier selects class with the best combined score. The purpose of this chapter is to investigate the different scenarios of combining classifiers, to show the difficulties in finding the optimal combination algorithms, and to present few possible approaches to combination problems. Generally, the classifier combination problem can be viewed as a construction of postprocessing classifier operating on the matching scores of combined classifiers. For many classifier combination problems, though, the number of classes or the number of classifiers and, consequently, the number of matching scores is too big, and applying generic pattern classification algorithms is difficult. Thus some scores are usually discarded from combination algorithm, or simplifying assumptions on score distributions are made and used in the combination algorithm. Though the dependency between classifiers is usually learned by the combination algorithms, the dependency between scores assigned to different classes by the same classifier is discarded. In this work we will show that accounting for score dependencies is essential for proper com

[1]  Roland Auckenthaler,et al.  Score Normalization for Text-Independent Speaker Verification Systems , 2000, Digit. Signal Process..

[2]  Roberto Brunelli,et al.  Person identification using multiple cues , 1995, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  P. J. Green,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[4]  Aaron E. Rosenberg,et al.  Speaker background models for connected digit password speaker verification , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[5]  S. Tulyakov,et al.  Identification Model for Classifier Combinations , 2006, 2006 Biometrics Symposium: Special Session on Research at the Biometric Consortium Conference.

[6]  Arun Ross,et al.  Score normalization in multimodal biometric systems , 2005, Pattern Recognit..

[7]  Venu Govindaraju,et al.  Using Independence Assumption to Improve Multimodal Biometric Fusion , 2005, Multiple Classifier Systems.

[8]  B. Ripley,et al.  Pattern Recognition , 1968, Nature.

[9]  Mübeccel Demirekler,et al.  A statistical unified framework for rank-based multiple classifier decision combination , 2001, Pattern Recognit..

[10]  Gyeonghwan Kim,et al.  A Lexicon Driven Approach to Handwritten Word Recognition for Real-Time Applications , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Venu Govindaraju,et al.  Classifier Combination Types for Biometric Applications , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[12]  Patrick Grother,et al.  Face Recognition Vendor Test 2002 Supplemental Report NISTIR 7083 , 2004 .

[13]  J.P. Campbell,et al.  Allowing good impostors to test , 1997, Conference Record of the Thirty-First Asilomar Conference on Signals, Systems and Computers (Cat. No.97CB36136).

[14]  Gyeonghwan Kim,et al.  Bankcheck Recognition Using Cross Validation Between Legal and Courtesy Amounts , 1997, Int. J. Pattern Recognit. Artif. Intell..

[15]  Sargur N. Srihari,et al.  Decision Combination in Multiple Classifier Systems , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Samy Bengio,et al.  A unified framework for score normalization techniques applied to text-independent speaker verification , 2005, IEEE Signal Processing Letters.

[17]  Jiri Matas,et al.  On Combining Classifiers , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Sharath Pankanti,et al.  Guide to Biometrics , 2003, Springer Professional Computing.