HLA-DR4Pred2: an improved method for predicting HLA-DRB1*04:01 binders

HLA-DRB1*04:01 is associated with many disease that include sclerosis, arthritis, diabetes and Covid19. Thus, it is important to scan binders of HLA-DRB1*04:01 in an antigen to develop immunotherapy, vaccine and protection against these diseases. One of the major limitations of existing methods for predicting with HLA-DRB1*04:01 binders is that these methods trained on small datasets. This study present a method HLA-DR4Pred2 developed on a large dataset contain 12676 binders and equal number of non-binders. It is an improved version of HLA-DR4Pred, which was trained on a small dataset contain only 576 binders and equal number of binders. All models in this study were trained, optimized and tested on 80% of data called training datasets using five-fold cross-validation; final models were evaluated on 20% of data called validation/independent dataset. A wide range of machine learning techniques have been employed to develop prediction models and achieved maximum AUC of 0.90 and 0.87 on validation dataset using composition and binary profile features respectively. The performance of our composition based model increased from 0.90 to 0.93 when combined with BLAST search. In addition, we also developed our models on alternate or realistic dataset that contain 12676 binders and 86300 non-binders and achieved maximum AUC 0.99. Our method perform better than existing methods when we compare the performance of our best model with performance of existing methods on validation dataset. Finally, we developed standalone and online version of HLA-DR4Pred2 for predicting, designing and virtual scanning of HLA- DRB1*04:01(https://webs.iiitd.edu.in/raghava/hladr4pred2/; https://github.com/raghavagps/hladr4pred2). Key Points HLADR4Pred2.0 is an update of HLADR4Pred Predict the binding or non-binding peptides for MHC-Class II allele HLA- DRB1*04:01 Used alignment free and alignment based hybrid approach Motifs which are highly specific to HLA-DRB1*04:01 binders Benchmark the performance of the other existing methods with HLADR4Pred2.0 Author’s Biography Sumeet Patiyal is currently working as Ph.D. in Computational biology from Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India Anjali Dhall is currently working as Ph.D. in Computational Biology from Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India. Nishant Kumar is currently working as Ph.D. in Computational biology from Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India Gajendra P. S. Raghava is currently working as Professor and Head of Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India.

[1]  P. Agrawal,et al.  Pfeature: A Tool for Computing Wide Range of Protein Features and Building Prediction Models , 2022, J. Comput. Biol..

[2]  G. Solgi,et al.  HLA-DRB1*04 may predict the severity of disease in a group of Iranian COVID-19 patients , 2021, Human Immunology.

[3]  B. Lie,et al.  The influence of HLA genotype on the severity of COVID‐19 infection , 2021, HLA.

[4]  A. Hosseini,et al.  HLA, Immune Response, and Susceptibility to COVID-19 , 2021, Frontiers in Immunology.

[5]  S. El-Kafrawy,et al.  Mortality in COVID-19 disease patients: Correlating the association of major histocompatibility complex (MHC) with severe acute respiratory syndrome 2 (SARS-CoV-2) variants , 2020, International Journal of Infectious Diseases.

[6]  S. Rowland-Jones,et al.  Tracking Changes in SARS-CoV-2 Spike: Evidence that D614G Increases Infectivity of the COVID-19 Virus , 2020, Cell.

[7]  L. García,et al.  Immune Response, Inflammation, and the Clinical Spectrum of COVID-19 , 2020, Frontiers in Immunology.

[8]  E. Elinav,et al.  Interaction between microbiota and immunity in health and disease , 2020, Cell Research.

[9]  A. Vogrig,et al.  Associations between HLA and autoimmune neurological diseases with autoantibodies , 2020, Autoimmunity Highlights.

[10]  Paul Flicek,et al.  IPD-IMGT/HLA Database , 2019, Nucleic Acids Res..

[11]  Hari Prasad Devkota,et al.  Polyphenols in the treatment of autoimmune diseases. , 2019, Autoimmunity reviews.

[12]  J. L. Sánchez-Trincado,et al.  Fundamentals and Methods for T- and B-Cell Epitope Prediction , 2017, Journal of immunology research.

[13]  Wei Jiang,et al.  The Other Function: Class II-Restricted Antigen Presentation by B Cells , 2017, Front. Immunol..

[14]  Frank Noé,et al.  Major Histocompatibility Complex (MHC) Class I and MHC Class II Proteins: Conformational Plasticity in Antigen Presentation , 2017, Front. Immunol..

[15]  K. Rock,et al.  Present Yourself! By MHC Class I and MHC Class II Molecules. , 2016, Trends in immunology.

[16]  Y. Mosaad,et al.  Clinical Role of Human Leukocyte Antigen in Health and Disease , 2015, Scandinavian journal of immunology.

[17]  H. Chen,et al.  Association of HLA-DR4/HLA-DRB1*04 with Vogt-Koyanagi-Harada disease: A Systematic Review and Meta-analysis , 2014, Scientific Reports.

[18]  M. Claesson,et al.  Classification of Human Leukocyte Antigen (HLA) Supertypes , 2014, Methods in molecular biology.

[19]  F. Dammacco,et al.  MHC class I antigen processing and presenting machinery: organization, function, and defects in tumor cells. , 2013, Journal of the National Cancer Institute.

[20]  D. Cole,et al.  Re-Directing CD4+ T Cell Responses with the Flanking Residues of MHC Class II-Bound Peptides: The Core is Not Enough , 2013, Front. Immunol..

[21]  I. Lucenko,et al.  Associations of HLA DR and DQ molecules with Lyme borreliosis in Latvian patients , 2012, BMC Research Notes.

[22]  Celine Vens,et al.  Identifying discriminative classification-based motifs in biological sequences , 2011, Bioinform..

[23]  Morten Nielsen,et al.  Major histocompatibility complex class I binding predictions as a tool in epitope discovery , 2010, Immunology.

[24]  Morten Nielsen,et al.  MHC Class II epitope predictive algorithms , 2010, Immunology.

[25]  David Burgner,et al.  HLA and Infectious Diseases , 2009, Clinical Microbiology Reviews.

[26]  Morten Nielsen,et al.  Quantitative Predictions of Peptide Binding to Any HLA-DR Molecule of Known Sequence: NetMHCIIpan , 2008, PLoS Comput. Biol..

[27]  Morten Nielsen,et al.  Prediction of MHC class II binding affinity using SMM-align, a novel stabilization matrix alignment method , 2007, BMC Bioinformatics.

[28]  S. Choo The HLA System: Genetics, Immunology, Clinical Testing, and Clinical Implications , 2007, Yonsei medical journal.

[29]  W. L. Benedict,et al.  Multiple Sclerosis , 2007, Journal - Michigan State Medical Society.

[30]  K. Gillespie Type 1 diabetes: pathogenesis and prevention , 2006, Canadian Medical Association Journal.

[31]  Jianming Shi,et al.  Prediction of MHC class II binders using the ant colony search strategy , 2005, Artif. Intell. Medicine.

[32]  S. Hauser,et al.  The HLA locus and multiple sclerosis in Sicily , 2005, Neurology.

[33]  C. Larsen,et al.  The genetics of HLA-associated disease. , 2004, Current opinion in immunology.

[34]  Thomas L. Madden,et al.  BLAST: at the core of a powerful and diverse set of sequence analysis tools , 2004, Nucleic Acids Res..

[35]  G. Crooks,et al.  WebLogo: a sequence logo generator. , 2004, Genome research.

[36]  JL Newton,et al.  A review of the MHC genetics of rheumatoid arthritis , 2004, Genes and Immunity.

[37]  Gajendra P. S. Raghava,et al.  SVM based method for predicting HLA-DRB1*0401 binding peptides in an antigen sequence , 2004, Bioinform..

[38]  J. Taurog HLA-DR4 and the spondyloarthropathies , 2002, Annals of the rheumatic diseases.

[39]  Gajendra P. S. Raghava,et al.  ProPred: prediction of HLA-DR binding sites , 2001, Bioinform..

[40]  T. Pincus,et al.  HLA-DR4 and other genetic markers in rheumatoid arthritis. , 1988, British journal of rheumatology.

[41]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 , 2000, Nucleic Acids Res..

[42]  G. Dunston,et al.  Vitiligo is associated with HLA-DR4 in black patients. A preliminary report. , 1990, Archives of dermatology.