Pattern recognition in flow cytometry.

BACKGROUND: Analytical flow cytometry (AFC) quantifies sometimes more than 10 optical parameters per cell at rates of approximately 10^3 cells/s. It therefore rapidly generates vast quantities of multidimensional data, which poses a considerable challenge for data analysis. We review the application of multivariate data analysis and pattern recognition techniques to flow cytometry.

METHODS: Approaches were divided into two broad types, depending on whether the aim was identification or clustering. The first category comprised multivariate statistical approaches and supervised artificial neural networks (ANNs), together with the problems of overlapping character distributions, unbounded data sets, missing parameters, scaling up, and estimating the proportions of different types of cells. The second category comprised classic clustering methods, fuzzy clustering, and unsupervised ANNs. We demonstrate the state of the art by using AFC data on marine phytoplankton populations.

RESULTS AND CONCLUSIONS: The information held within the large quantities of data generated by AFC was tractable using ANNs. For field studies, however, the problem of obtaining suitable training data needs to be resolved, and coping with an almost infinite number of cell categories needs further research.
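The distinction between identification and clustering can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the "cells" are synthetic Gaussian points with three made-up parameters, a nearest-centroid rule stands in for the supervised methods, and plain k-means with a farthest-point start stands in for the clustering methods. It is not the ANN-based approach reviewed here, and the function names are hypothetical.

```python
import math
import random


def make_cells(rng, centre, n, spread=0.3):
    """Draw n synthetic 'cells' as Gaussian points around a centre (made-up data)."""
    return [[rng.gauss(c, spread) for c in centre] for _ in range(n)]


def mean(points):
    """Component-wise mean of a list of equal-length points."""
    return [sum(col) / len(points) for col in zip(*points)]


def nearest(point, centres):
    """Index of the centre closest to point (Euclidean distance)."""
    return min(range(len(centres)), key=lambda k: math.dist(point, centres[k]))


def fit_centroids(points, labels):
    """Identification: learn one centroid per known class from labelled data."""
    classes = sorted(set(labels))
    centroids = [mean([p for p, l in zip(points, labels) if l == c]) for c in classes]
    return classes, centroids


def farthest_point_init(points, k):
    """Deterministic start: greedily pick points far from the centres chosen so far."""
    centres = [points[0]]
    while len(centres) < k:
        centres.append(max(points, key=lambda p: min(math.dist(p, c) for c in centres)))
    return centres


def kmeans(points, k, iters=20):
    """Clustering: group unlabelled points into k clusters (plain k-means)."""
    centres = farthest_point_init(points, k)
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(p, centres)].append(p)
        # Keep the old centre if a cluster happens to be empty.
        centres = [mean(g) if g else c for g, c in zip(groups, centres)]
    return [nearest(p, centres) for p in points]


rng = random.Random(0)
cells = make_cells(rng, [1.0, 1.0, 1.0], 200) + make_cells(rng, [3.0, 3.0, 3.0], 200)
labels = ["A"] * 200 + ["B"] * 200  # hypothetical species labels

# Identification: class labels are known in advance and used for training.
classes, centroids = fit_centroids(cells, labels)
predicted = [classes[nearest(p, centroids)] for p in cells]
accuracy = sum(p == l for p, l in zip(predicted, labels)) / len(labels)

# Clustering: the labels are never seen; only the number of groups is supplied.
clusters = kmeans(cells, 2)
print(f"identification accuracy on training data: {accuracy:.2f}")
print(f"cluster sizes: {[clusters.count(k) for k in range(2)]}")
```

The sketch also hints at the field-study difficulty noted in the conclusions: the identification step needs labelled training data for every category, whereas the clustering step needs none but must be told, or must estimate, how many groups to look for.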
