MaER: A New Ensemble Based Multiclass Classifier for Binding Activity Prediction of HLA Class II Proteins

Human Leukocyte Antigen class II (HLA II) proteins are crucial for the activation of adaptive immune response. In HLA class II molecules, high rate of polymorphisms has been observed. Hence, the accurate prediction of HLA II-peptide interactions is a challenging task that can both improve the understanding of immunological processes and facilitate decision-making in vaccine design. In this regard, during the last decade various computational tools have been developed, which were mainly focused on the binding activity prediction of different HLA II isotypes (such as DP, DQ and DR) separately. This fact motivated us to make a humble contribution towards the prediction of isotypes binding propensity as a multiclass classification task. In this regard, we have analysed a binding affinity dataset, which contains the interactions of 27 HLA II proteins with 636 variable length peptides, in order to prepare new multiclass datasets for strong and weak binding peptides. Thereafter, a new ensemble based multiclass classifier, called Meta EnsembleR (MaER) is proposed to predict the activity of weak/unknown binding peptides, by integrating the results of various heterogeneous classifiers. It pre-processes the training and testing datasets by making feature subsets, bootstrap samples and creates diverse datasets using principle component analysis, which are then used to train and test the MaER. The performance of MaER with respect to other existing state-of-the-art classifiers, has been estimated using validity measures, ROC curves and gain value analysis. Finally, a statistical test called Friedman test has been conducted to judge the statistical significance of the results produced by MaER.

[1]  Diogo M. Camacho,et al.  Wisdom of crowds for robust gene network inference , 2012, Nature Methods.

[2]  T. Hanai,et al.  Hidden Markov model-based prediction of antigenic peptides that interact with MHC class II molecules. , 2002, Journal of bioscience and bioengineering.

[3]  Subhadip Basu,et al.  AMS 4.0: consensus prediction of post-translational modifications in protein sequences , 2012, Amino Acids.

[4]  Ujjwal Maulik,et al.  Fuzzy clustering of physicochemical and biochemical properties of amino Acids , 2011, Amino Acids.

[5]  M. Friedman A Comparison of Alternative Tests of Significance for the Problem of $m$ Rankings , 1940 .

[6]  Meena Kishore Sakharkar,et al.  Types of inter-atomic interactions at the MHC-peptide interface: Identifying commonality from accumulated data , 2002, BMC Structural Biology.

[7]  D R Flower,et al.  MHC Class II Binding Prediction by Molecular Docking , 2011, Molecular informatics.

[8]  Peicho Petkov,et al.  HLA‐DP2 binding prediction by molecular dynamics simulations , 2011, Protein science : a publication of the Protein Society.

[9]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[10]  Ujjwal Maulik,et al.  Application of RotaSVM for HLA Class II Protein-Peptide Interaction Prediction , 2014, BIOINFORMATICS.

[11]  Ji Wan,et al.  SVRMHC prediction server for MHC-binding peptides , 2006, BMC Bioinformatics.

[12]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[13]  Ujjwal Maulik,et al.  Improved differential evolution for microarray analysis , 2012, Int. J. Data Min. Bioinform..

[14]  A. Rudensky,et al.  Studies on naturally processed peptides associated with MHC class II molecules. , 1993, Chemical immunology.

[15]  Yanqing Niu,et al.  Quantitative prediction of MHC-II binding affinity using particle swarm optimization , 2010, Artif. Intell. Medicine.

[16]  H. Grey,et al.  Prediction of major histocompatibility complex binding regions of protein antigens by sequence pattern analysis. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Z. Nagy,et al.  Precise prediction of major histocompatibility complex class II-peptide interaction based on peptide side chain scanning , 1994, The Journal of experimental medicine.

[18]  U. Şahin,et al.  Generation of tissue-specific and promiscuous HLA ligand databases using DNA microarrays and virtual HLA class II matrices , 1999, Nature Biotechnology.

[19]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[20]  Ujjwal Maulik,et al.  Improvement of new automatic differential fuzzy clustering using SVM classifier for microarray analysis , 2011, Expert Syst. Appl..

[21]  Michelangelo Ceci,et al.  Integrating microRNA target predictions for the discovery of gene regulatory networks: a semi-supervised ensemble learning approach , 2014, BMC Bioinformatics.

[22]  R. R. Mallios,et al.  Predicting class II MHC/peptide multi-level binding with an iterative stepwise discriminant analysis meta-algorithm , 2001, Bioinform..

[23]  Dariusz Plewczynski,et al.  Consensus classification of human leukocyte antigen class II proteins , 2012, Immunogenetics.

[24]  Darren R. Flower,et al.  Predicting Class II MHC-Peptide binding: a kernel based approach using similarity scores , 2006, BMC Bioinformatics.

[25]  Ujjwal Maulik,et al.  Ensemble learning prediction of protein-protein interactions using proteins functional annotations. , 2014, Molecular bioSystems.

[26]  Debotosh Bhattacharjee,et al.  RotaSVM: A New Ensemble Classifier , 2013 .

[27]  D. Flower,et al.  Peptide binding to the HLA-DRB1 supertype: a proteochemometrics analysis. , 2010, European journal of medicinal chemistry.

[28]  Pat Langley,et al.  Estimating Continuous Distributions in Bayesian Classifiers , 1995, UAI.

[29]  Jianming Shi,et al.  Prediction of MHC class II binders using the ant colony search strategy , 2005, Artif. Intell. Medicine.

[30]  C. Janeway Immunobiology: The Immune System in Health and Disease , 1996 .

[31]  Bjoern Peters,et al.  Functional classification of class II human leukocyte antigen (HLA) molecules reveals seven different supertypes and a surprising degree of repertoire sharing across supertypes , 2011, Immunogenetics.

[32]  Vladimir Brusic,et al.  Prediction of MHC class II-binding peptides using an evolutionary algorithm and artificial neural network , 1998, Bioinform..