Multi-View Maximum Entropy Discrimination

Maximum entropy discrimination (MED) is a general framework for discriminative estimation based on the well-known maximum entropy principle: it embodies the Bayesian integration of prior information with large-margin constraints on observations. It combines maximum entropy learning with maximum margin learning, and subsumes support vector machines (SVMs) as a special case. In this paper, we present a multi-view maximum entropy discrimination framework that extends MED to learning with multiple feature sets. Unlike existing approaches to exploiting multiple views, such as co-training-style and co-regularization-style algorithms, our method links the distinct views by requiring the classification margins from these views to be identical. We give the general form of the solution to multi-view maximum entropy discrimination, and provide an instantiation under a specific prior formulation that is analogous to a multi-view version of SVMs. Experimental results on real-world data sets show the effectiveness of the proposed multi-view maximum entropy discrimination approach.
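To make the margin-coupling idea concrete, the following is a minimal, hypothetical sketch (not the paper's actual MED formulation): two linear classifiers, one per view, are trained with hinge losses while a quadratic penalty pushes their per-example scores toward agreement. The paper enforces *identical* margins inside a maximum-entropy framework; here that hard constraint is relaxed to a soft penalty optimized by subgradient descent, and all function names, data, and hyperparameters are illustrative assumptions.

```python
# Hypothetical sketch of margin coupling across two views.
# Objective (per epoch, full batch):
#   sum_i [hinge(y_i f1(x1_i)) + hinge(y_i f2(x2_i))]
#   + lam * (||w1||^2 + ||w2||^2)
#   + gamma * sum_i (f1(x1_i) - f2(x2_i))^2   <- soft margin-agreement term

def dot(w, x):
    return sum(wi * xi for wi, xi in zip(w, x))

def train(X1, X2, y, lam=0.01, gamma=0.5, lr=0.05, epochs=200):
    """Train one linear model per view; the gamma term couples the views
    by penalizing disagreement between their per-example margins."""
    w1 = [0.0] * len(X1[0])
    w2 = [0.0] * len(X2[0])
    for _ in range(epochs):
        g1 = [0.0] * len(w1)
        g2 = [0.0] * len(w2)
        for x1, x2, yi in zip(X1, X2, y):
            f1, f2 = dot(w1, x1), dot(w2, x2)
            if yi * f1 < 1:  # hinge subgradient, view 1
                g1 = [g - yi * xi for g, xi in zip(g1, x1)]
            if yi * f2 < 1:  # hinge subgradient, view 2
                g2 = [g - yi * xi for g, xi in zip(g2, x2)]
            diff = f1 - f2   # margin disagreement between the views
            g1 = [g + 2 * gamma * diff * xi for g, xi in zip(g1, x1)]
            g2 = [g - 2 * gamma * diff * xi for g, xi in zip(g2, x2)]
        w1 = [wi - lr * (gi + 2 * lam * wi) for wi, gi in zip(w1, g1)]
        w2 = [wi - lr * (gi + 2 * lam * wi) for wi, gi in zip(w2, g2)]
    return w1, w2

def predict(w1, w2, x1, x2):
    # Combine the two views by averaging their scores.
    s = 0.5 * (dot(w1, x1) + dot(w2, x2))
    return 1 if s >= 0 else -1
```

Because the views are forced toward equal margins, a confident prediction must be supported by both feature sets, which is the intuition the framework formalizes within MED.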
