Generalized unmixing model for multispectral flow cytometry utilizing nonsquare compensation matrices

Multispectral and hyperspectral flow cytometry (FC) instruments allow measurement of fluorescence or Raman spectra from single cells in flow. As with conventional FC, spectral overlap results in the measured signal in any given detector being a mixture of signals from multiple labels present in the analyzed cells. In contrast to traditional polychromatic FC, these devices utilize a number of detectors (or channels in multispectral detector arrays) that is larger than the number of labels, and no particular detector is a priori dedicated to the measurement of any particular label. This data‐acquisition modality requires a rigorous study and understanding of signal formation as well as unmixing procedures that are employed to estimate labels abundance. The simplest extension of the traditional compensation procedure to multispectral data sets is equivalent to an ordinary least‐square (LS) solution for estimating abundance of labels in individual cells. This process is identical to the technique employed for unmixing spectral data in various imaging fields. The present study shows that multispectral FC data violate key assumptions of the LS process, and use of the LS method may lead to unmixing artifacts, such as population distortion (spreading) and the presence of negative values in biomarker abundances. Various alternative unmixing techniques were investigated, including relative‐error minimization and variance‐stabilization transformations. The most promising results were obtained by performing unmixing using Poisson regression with an identity‐link function within a generalized linear model framework. This formulation accounts for the presence of Poisson noise in the model of signal formation and subsequently leads to superior unmixing results, particularly for dim fluorescent populations. The proposed Poisson unmixing technique is demonstrated using simulated 8‐channel, 2‐fluorochrome data and real 32‐channel, 6‐fluorochrome data. The quality of unmixing is assessed by computing absolute and relative errors, as well as by calculating the symmetrized Kullback–Leibler divergence between known and approximated populations. These results are applicable to any flow‐based system with more detectors than labels where Poisson noise is the dominant contributor to the overall system noise and highlight the fact that explicit incorporation of appropriate noise models is the key to accurately estimating the true label abundance on the cells. © 2013 International Society for Advancement of Cytometry

[1]  T. Zimmermann Spectral imaging and linear unmixing in light microscopy. , 2005, Advances in biochemical engineering/biotechnology.

[2]  Eric R. Ziegel,et al.  Generalized Linear Models , 2002, Technometrics.

[3]  Jeff J. Settle,et al.  On the relationship between spectral unmixing and subspace projection , 1996, IEEE Trans. Geosci. Remote. Sens..

[4]  J. Paul Robinson,et al.  Hyperspectral cytometry at the single‐cell level using a 32‐channel photodetector , 2012, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[5]  Shaul K. Bar-Lev,et al.  On the classical choice of variance stabilizing transformations and an application for a Poisson variate , 1988 .

[6]  F. J. Anscombe,et al.  THE TRANSFORMATION OF POISSON, BINOMIAL AND NEGATIVE-BINOMIAL DATA , 1948 .

[7]  Martin Vingron,et al.  Variance stabilization applied to microarray data calibration and to the quantification of differential expression , 2002, ISMB.

[8]  Walter Krämer,et al.  Review of Modern applied statistics with S, 4th ed. by W.N. Venables and B.D. Ripley. Springer-Verlag 2002 , 2003 .

[9]  Chein-I. Chang Spectral information divergence for hyperspectral image analysis , 1999, IEEE 1999 International Geoscience and Remote Sensing Symposium. IGARSS'99 (Cat. No.99CH36293).

[10]  John P Nolan,et al.  Surface-enhanced Raman scattering (SERS) cytometry. , 2011, Methods in cell biology.

[11]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[12]  R. Bro,et al.  A fast non‐negativity‐constrained least squares algorithm , 1997 .

[13]  A. Lanterman,et al.  Chemical mixture estimation under a Poisson Raman spectroscopy model , 2010 .

[14]  P. B. Coates Photomultiplier noise statistics , 1972 .

[15]  R. Olea,et al.  Dealing with Zeros , 2011 .

[16]  Huaiyu Zhu On Information and Sufficiency , 1997 .

[17]  J. R. Prescott,et al.  A statistical model for photomultiplier single-electron statistics , 1966 .

[18]  Mark A. Naivar,et al.  Single particle high resolution spectral analysis flow cytometry , 2006, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[19]  Mari-Ann Akerjord,et al.  Assessing mixing models within a common framework , 1996 .

[20]  C. Tofallis Least Squares Percentage Regression , 2008 .

[21]  P. Ubezio,et al.  Linearity and noise sources in flow cytometry. , 1985, Cytometry.

[22]  Christopher Snow,et al.  Flow cytometer electronics , 2004, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[23]  J. Fox,et al.  Applied Regression Analysis and Generalized Linear Models , 2008 .

[24]  Ian C. Marschner,et al.  Stable Computation of Maximum Likelihood Estimates in Identity Link Poisson Regression , 2010 .

[25]  Trevor Hastie,et al.  A Closer Look at the Deviance , 1987 .

[26]  Charles L. Lawson,et al.  Solving least squares problems , 1976, Classics in applied mathematics.

[27]  G. Lachs The statistics for the detection of light by nonideal photomultipliers , 1974 .

[28]  J. Settle,et al.  Linear mixing and the estimation of ground cover proportions , 1993 .

[29]  Mario Roederer,et al.  Compensation in Flow Cytometry , 2002, Current protocols in cytometry.

[30]  William N. Venables,et al.  Modern Applied Statistics with S , 2010 .

[31]  Leif O. Brown,et al.  A flow cytometer for the measurement of Raman spectra , 2008, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[32]  Werner Alfred Stahel,et al.  Linear unmixing of multivariate observations: a structural model , 2002 .

[33]  M Roederer,et al.  Spectral compensation for flow cytometry: visualization artifacts, limitations, and caveats. , 2001, Cytometry.

[34]  C B Bagwell,et al.  Fluorescence Spectral Overlap Compensation for Any Number of Flow Cytometry Parameters , 1993, Annals of the New York Academy of Sciences.

[35]  Alfonso Fernández-Manso,et al.  Spectral unmixing , 2012 .

[36]  John Fox,et al.  Applied Regression Analysis and Generalized Linear Models , 2008 .

[37]  J. H. Schuenemeyer,et al.  Generalized Linear Models (2nd ed.) , 1992 .

[38]  W. Huber,et al.  Model-based variance-stabilizing transformation for Illumina microarray data , 2008, Nucleic acids research.

[39]  Brian D. Ripley,et al.  Modern applied statistics with S, 4th Edition , 2002, Statistics and computing.

[40]  P. McCullagh,et al.  Generalized Linear Models, 2nd Edn. , 1990 .