A Computational Framework for Influenza Antigenic Cartography

Influenza viruses have been responsible for large losses of lives around the world and continue to present a great public health challenge. Antigenic characterization based on hemagglutination inhibition (HI) assay is one of the routine procedures for influenza vaccine strain selection. However, HI assay is only a crude experiment reflecting the antigenic correlations among testing antigens (viruses) and reference antisera (antibodies). Moreover, antigenic characterization is usually based on more than one HI dataset. The combination of multiple datasets results in an incomplete HI matrix with many unobserved entries. This paper proposes a new computational framework for constructing an influenza antigenic cartography from this incomplete matrix, which we refer to as Matrix Completion-Multidimensional Scaling (MC-MDS). In this approach, we first reconstruct the HI matrices with viruses and antibodies using low-rank matrix completion, and then generate the two-dimensional antigenic cartography using multidimensional scaling. Moreover, for influenza HI tables with herd immunity effect (such as those from Human influenza viruses), we propose a temporal model to reduce the inherent temporal bias of HI tables caused by herd immunity. By applying our method in HI datasets containing H3N2 influenza A viruses isolated from 1968 to 2003, we identified eleven clusters of antigenic variants, representing all major antigenic drift events in these 36 years. Our results showed that both the completed HI matrix and the antigenic cartography obtained via MC-MDS are useful in identifying influenza antigenic variants and thus can be used to facilitate influenza vaccine strain selection. The webserver is available at http://sysbio.cvm.msstate.edu/AntigenMap.

[1]  Emmanuel J. Candès,et al.  Exact Matrix Completion via Convex Optimization , 2008, Found. Comput. Math..

[2]  Henryk Minc,et al.  Eigenvalues of matrices with prescribed entries , 1972 .

[3]  Keiji Fukuda,et al.  Mortality associated with influenza and respiratory syncytial virus in the United States. , 2003, JAMA.

[4]  Andrea Montanari,et al.  Matrix completion from a few entries , 2009, 2009 IEEE International Symposium on Information Theory.

[5]  Albert D. M. E. Osterhaus,et al.  Characterization of a Novel Influenza A Virus Hemagglutinin Subtype (H16) Obtained from Black-Headed Gulls , 2005, Journal of Virology.

[6]  W. Fitch,et al.  Predicting the evolution of human influenza A. , 1999, Science.

[7]  Inderjit S. Dhillon,et al.  Rank minimization via online learning , 2008, ICML '08.

[8]  Inderjit S. Dhillon,et al.  Matrix Completion from Power-Law Distributed Samples , 2009, NIPS.

[9]  L. Finelli,et al.  Emergence of a novel swine-origin influenza A (H1N1) virus in humans. , 2009, The New England journal of medicine.

[10]  Pablo A. Parrilo,et al.  Guaranteed Minimum-Rank Solutions of Linear Matrix Equations via Nuclear Norm Minimization , 2007, SIAM Rev..

[11]  P. Palese,et al.  Influenza: old and new threats , 2004, Nature Medicine.

[12]  R Farber,et al.  The geometry of shape space: application to influenza. , 2001, Journal of theoretical biology.

[13]  S O Mast REPLY TO HOLMES'S CRITICISM OF "LIGHT AND THE BEHAVIOR OF ORGANISMS". , 1912, Science.

[14]  G. N. de Oliveira,et al.  Matrices with prescribed entries and eigenvalues. III , 1975 .

[15]  Emmanuel J. Candès,et al.  The Power of Convex Relaxation: Near-Optimal Matrix Completion , 2009, IEEE Transactions on Information Theory.

[16]  L. Simonsen,et al.  The impact of influenza epidemics on hospitalizations. , 2000, The Journal of infectious diseases.

[17]  Libo Dong,et al.  Cross-reactive antibody responses to the 2009 pandemic H1N1 influenza virus. , 2009, The New England journal of medicine.

[18]  R. Tibshirani,et al.  Regularization methods for learning incomplete matrices , 2009, 0906.2034.

[19]  G. N. de Oliveira,et al.  Matrices with prescribed entries and eigenvalues. I , 1973 .

[20]  Olgica Milenkovic,et al.  SET: An algorithm for consistent matrix completion , 2009, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[21]  Daniel Hershkowitz,et al.  Existence of matrices with prescribed eigenvalues and entries , 1983 .

[22]  L. Mirsky,et al.  Matrices with Prescribed Characteristic Roots and Diagonal Elements , 1958 .

[23]  G. N. De Oliveira,et al.  Matrices with Prescribed Entries and Eigenvalues. II , 1973 .

[24]  Dima Grigoriev,et al.  Complexity of Quantifier Elimination in the Theory of Algebraically Closed Fields , 1984, MFCS.

[25]  Yoshihiro Kawaoka,et al.  The origins of new pandemic viruses: the acquisition of new host ranges by canine parvovirus and influenza A viruses. , 2005, Annual review of microbiology.

[26]  A. Lapedes,et al.  Mapping the Antigenic and Genetic Evolution of Influenza Virus , 2004, Science.

[27]  Shmuel Friedland,et al.  Matrices with prescribed off-diagonal elements , 1972 .

[28]  W. Ledermann,et al.  Matrices with Prescribed Characteristic Polynomials , 1959, Proceedings of the Edinburgh Mathematical Society.

[29]  Emmanuel J. Candès,et al.  A Singular Value Thresholding Algorithm for Matrix Completion , 2008, SIAM J. Optim..