A hybrid machine-learning approach for segmentation of protein localization data

MOTIVATION Subcellular protein localization data are critical to the quantitative understanding of cellular function and regulation. Such data are acquired via observation and quantitative analysis of fluorescently labeled proteins in living cells. Differentiation of labeled protein from cellular artifacts remains an obstacle to accurate quantification. We have developed a novel hybrid machine-learning-based method to differentiate signal from artifact in membrane protein localization data by deriving positional information via surface fitting and combining this with fluorescence-intensity-based data to generate input for a support vector machine. RESULTS We have employed this classifier to analyze signaling protein localization in T-cell activation. Our classifier displayed increased performance over previously available techniques, exhibiting both flexibility and adaptability: training on heterogeneous data yielded a general classifier with good overall performance; training on more specific data yielded an extremely high-performance specific classifier. We also demonstrate accurate automated learning utilizing additional experimental data.

[1]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  B. Nichols,et al.  Lipid raft proteins have a random distribution during localized activation of the T-cell receptor , 2004, Nature Cell Biology.

[3]  S. Bromley,et al.  The immunological synapse: a molecular machine controlling T cell activation. , 1999, Science.

[4]  Matthew F. Krummel,et al.  Quantifying signaling-induced reorientation of T cell receptors during immunological synapse formation , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[5]  Tobias Meyer,et al.  Receptor-induced transient reduction in plasma membrane PtdIns(4,5)P2 concentration monitored in living cells , 1998, Current Biology.

[6]  Mark M Davis,et al.  Dynamics of cell surface molecules during T cell recognition. , 2003, Annual review of biochemistry.

[7]  Mark M Davis,et al.  Quantitative imaging of lymphocyte membrane protein reorganization and signaling. , 2005, Biophysical journal.

[8]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[9]  Kinji Ohno,et al.  Rapsyn mutations in humans cause endplate acetylcholine-receptor deficiency and myasthenic syndrome. , 2002, American journal of human genetics.

[10]  Y Huan,et al.  Polycystin-1, the PKD1 gene product, is in a complex containing E-cadherin and the catenins. , 1999, The Journal of clinical investigation.

[11]  Massimiliano Pontil,et al.  Support Vector Machines for 3D Object Recognition , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Chris H. Q. Ding,et al.  Multi-class protein fold recognition using support vector machines and neural networks , 2001, Bioinform..

[13]  David Haussler,et al.  Classifying G-protein coupled receptors with support vector machines , 2002, Bioinform..

[14]  M. Davis,et al.  Differential clustering of CD4 and CD3zeta during T cell recognition. , 2000, Science.

[15]  Vannary Meas-Yedid,et al.  Segmentation and tracking of migrating cells in videomicroscopy with parametric active contours: a tool for cell-based drug testing , 2002, IEEE Transactions on Medical Imaging.

[16]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[17]  Péter Várnai,et al.  Visualization of Phosphoinositides That Bind Pleckstrin Homology Domains: Calcium- and Agonist-induced Dynamic Changes and Relationship to Myo-[3H]inositol-labeled Phosphoinositide Pools , 1998, The Journal of cell biology.

[19]  Angela Wandinger-Ness,et al.  A polycystin-1 multiprotein complex is disrupted in polycystic kidney disease cells. , 2003, Molecular biology of the cell.

[20]  David A. Gough,et al.  Predicting protein-protein interactions from primary structure , 2001, Bioinform..

[21]  J. Sanes,et al.  Defective Neuromuscular Synaptogenesis in Agrin-Deficient Mutant Mice , 1996, Cell.

[22]  M. Davis,et al.  Visualizing the dynamics of T cell activation: intracellular adhesion molecule 1 migrates rapidly to the T cell/B cell interface and acts to sustain calcium levels. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Mark Turmaine,et al.  Formation of Neuronal Intranuclear Inclusions Underlies the Neurological Dysfunction in Mice Transgenic for the HD Mutation , 1997, Cell.

[24]  Mark M Davis,et al.  Dynamics of p56lck translocation to the T cell immunological synapse following agonist and antagonist stimulation. , 2002, Immunity.

[25]  Colin R. F. Monks,et al.  Three-dimensional segregation of supramolecular activation clusters in T cells , 1998, Nature.

[26]  J. Ellenberg,et al.  Four-dimensional imaging and quantitative reconstruction to analyse complex spatiotemporal processes in live cells , 2001, Nature Cell Biology.

[27]  R. Eils,et al.  Quantitative motion analysis and visualization of cellular structures. , 2003, Methods.

[28]  Guillermo Sapiro,et al.  Geodesic Active Contours , 1995, International Journal of Computer Vision.

[29]  P. Danielsson Euclidean distance mapping , 1980 .

[30]  Mark M. Davis,et al.  T-cell-antigen recognition and the immunological synapse , 2003, Nature Reviews Immunology.

[31]  Bo Zhang,et al.  Tracking of multiple fluorescent biological objects in three dimensional video microscopy , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[32]  Jiwei Yang,et al.  Telomerized human microvasculature is functional in vivo , 2001, Nature Biotechnology.

[33]  Tian Xu,et al.  A Genetic Screen in Drosophila for Metastatic Behavior , 2003, Science.

[34]  Mark M Davis,et al.  Continuous T cell receptor signaling required for synapse maintenance and full effector potential , 2003, Nature Immunology.

[35]  K. Chou,et al.  Using Functional Domain Composition and Support Vector Machines for Prediction of Protein Subcellular Location* , 2002, The Journal of Biological Chemistry.

[36]  Arup K Chakraborty,et al.  The Immunological Synapse Balances T Cell Receptor Signaling and Degradation , 2003, Science.

[37]  Michael Loran Dustin,et al.  T Cell Receptor Signaling Precedes Immunological Synapse Formation , 2002, Science.

[38]  M. Loda,et al.  Loss or altered subcellular localization of p27 in Barrett's associated adenocarcinoma. , 1998, Cancer research.

[39]  Yong Liao,et al.  Phosphorylation/Cytoplasmic Localization of p21Cip1/WAF1 Is Associated with HER2/neu Overexpression and Provides a Novel Combination Predictor for Poor Prognosis in Breast Cancer Patients , 2004, Clinical Cancer Research.

[40]  Steven Finkbeiner,et al.  Huntingtin Acts in the Nucleus to Induce Apoptosis but Death Does Not Correlate with the Formation of Intranuclear Inclusions , 1998, Cell.

[41]  P. Fink,et al.  Correlations between T-cell specificity and the structure of the antigen receptor , 1986, Nature.