Big Data Analytics in Immunology: A Knowledge-Based Approach

With the vast amount of immunological data now available, immunology research is entering the big data era. These data vary in granularity, quality, and complexity and are stored in various formats, including publications, technical reports, and databases. The challenge is to move from data to actionable knowledge and wisdom, bridging both the knowledge gap and the application gap. We report a knowledge-based approach built on a framework called KB-builder, which facilitates data mining by enabling fast development and deployment of web-accessible immunological data knowledge warehouses. Immunological knowledge discovery relies heavily on the availability of accurate, up-to-date, and well-organized data and on proper analytics tools. We propose developing knowledgebases that combine well-annotated data with specialized analytical tools and integrating them into analytical workflows. A set of well-defined workflow types with rich summarization and visualization capacity facilitates the transformation from data to critical information and knowledge. Using KB-builder, we streamlined the normally time-consuming process of database development. Knowledgebases built with KB-builder will speed up rational vaccine design by providing accurate, well-annotated data coupled with tailored computational analysis tools and workflows.
