Applying Deep Neural Network Analysis to High-Content Image-Based Assays

The etiological underpinnings of many CNS disorders are not well understood. This is likely due to the fact that individual diseases aggregate numerous pathological subtypes, each associated with a complex landscape of genetic risk factors. To overcome these challenges, researchers are integrating novel data types from numerous patients, including imaging studies capturing broadly applicable features from patient-derived materials. These datasets, when combined with machine learning, potentially hold the power to elucidate the subtle patterns that stratify patients by shared pathology. In this study, we interrogated whether high-content imaging of primary skin fibroblasts, using the Cell Painting method, could reveal disease-relevant information among patients. First, we showed that technical features such as batch/plate type, plate, and location within a plate lead to detectable nuisance signals, as revealed by a pre-trained deep neural network and analysis with deep image embeddings. Using a plate design and image acquisition strategy that accounts for these variables, we performed a pilot study with 12 healthy controls and 12 subjects affected by the severe genetic neurological disorder spinal muscular atrophy (SMA), and evaluated whether a convolutional neural network (CNN) generated using a subset of the cells could distinguish disease states on cells from the remaining unseen control–SMA pair. Our results indicate that these two populations could effectively be differentiated from one another and that model selectivity is insensitive to batch/plate type. One caveat is that the samples were also largely separated by source. These findings lay a foundation for how to conduct future studies exploring diseases with more complex genetic contributions and unknown subtypes.

[1]  J. Weissenbach,et al.  Identification and characterization of a spinal muscular atrophy-determining gene , 1995, Cell.

[2]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[3]  Mark B Bromberg,et al.  Natural history of denervation in SMA: Relation to age, SMN2 copy number, and function , 2005, Annals of neurology.

[4]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[5]  Nick C Fox,et al.  Letter abstract - Genome-wide association study identifies variants at CLU and PICALM associated with Alzheimer's Disease , 2009 .

[6]  Gemma C. Garriga,et al.  Permutation Tests for Studying Classifier Performance , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[7]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[8]  L Shamir,et al.  Assessing the efficacy of low‐level image content descriptors for computer‐based fluorescence microscopy image analysis , 2011, Journal of microscopy.

[9]  J. Steen,et al.  A screen for regulators of survival of motor neuron protein levels. , 2011, Nature chemical biology.

[10]  Erratum: Personal medicine[mdash]the new banking crisis , 2012 .

[11]  W. Chung,et al.  Prospective cohort study of spinal muscular atrophy types 2 and 3 , 2012, Neurology.

[12]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[13]  G. Hamilton,et al.  Spinal muscular atrophy: going beyond the motor neuron. , 2013, Trends in molecular medicine.

[14]  Marc'Aurelio Ranzato,et al.  DeViSE: A Deep Visual-Semantic Embedding Model , 2013, NIPS.

[15]  S. Horvath DNA methylation age of human tissues and cell types , 2013, Genome Biology.

[16]  Anirvan Ghosh,et al.  SMN2 splicing modifiers improve motor function and longevity in mice with spinal muscular atrophy , 2014, Science.

[17]  Madeline A. Lancaster,et al.  Creating Patient-Specific Neural Cells for the In Vitro Study of Brain Disorders , 2015, Stem cell reports.

[18]  David G Hendrickson,et al.  Genome-wide RNA-Seq of Human Motor Neurons Implicates Selective ER Stress Activation in Spinal Muscular Atrophy. , 2015, Cell stem cell.

[19]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[20]  J. Gargus,et al.  Shared functional defect in IP3R-mediated calcium signaling in diverse monogenic autism syndromes , 2015, Translational Psychiatry.

[21]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[22]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Anne E Carpenter,et al.  Cell Painting, a high-content image-based assay for morphological profiling using multiplexed fluorescent dyes , 2016, Nature Protocols.

[24]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[25]  D. Panchision,et al.  Concise Review: Progress and Challenges in Using Human Stem Cells for Biological and Therapeutics Discovery: Neuropsychiatric Disorders , 2016, Stem cells.

[26]  Subhashini Venugopalan,et al.  Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs. , 2016, JAMA.

[27]  M. Kirber,et al.  Impairment of PARK14-dependent Ca2+ signalling is a novel determinant of Parkinson's disease , 2016, Nature Communications.

[28]  Marc Berndl,et al.  Improving Phenotypic Measurements in High-Content Imaging Screens , 2017, bioRxiv.

[29]  Aleksey Boyko,et al.  Detecting Cancer Metastases on Gigapixel Pathology Images , 2017, ArXiv.

[30]  W. Fujibuchi,et al.  Report of the International Stem Cell Banking Initiative Workshop Activity: Current Hurdles and Progress in Seed‐Stock Banking of Human Pluripotent Stem Cells , 2017, Stem cells translational medicine.

[31]  Anders Larsson,et al.  Global, regional, and national burden of neurological disorders during 1990–2015: a systematic analysis for the Global Burden of Disease Study 2015 , 2017, The Lancet Neurology.

[32]  Rachel Nguyen,et al.  High-throughput screen detects calcium signaling dysfunction in typical sporadic autism spectrum disorder , 2017, Scientific Reports.

[33]  Sebastian Thrun,et al.  Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[34]  Chadwick M. Hales,et al.  Fibroblast bioenergetics to classify amyotrophic lateral sclerosis patients , 2017, Molecular Neurodegeneration.

[35]  Jakob Grove,et al.  Common schizophrenia alleles are enriched in mutation-intolerant genes and in regions under strong background selection , 2018, Nature Genetics.

[36]  Sonja W. Scholz,et al.  Parkinson’s disease genetics: identifying novel risk loci, providing causal insights and improving estimates of heritable risk , 2018, bioRxiv.

[37]  Stephan Hoyer,et al.  Assessing microscope image focus quality with deep learning , 2018, BMC Bioinformatics.

[38]  George R. Thoma,et al.  Pre-trained convolutional neural networks as feature extractors toward improved malaria parasite detection in thin blood smear images , 2018, PeerJ.

[39]  L. Rubin,et al.  Toward Precision Medicine for Neurological and Neuropsychiatric Disorders. , 2018, Cell stem cell.

[40]  S. Sherman,et al.  Parkinson's Disease Skin Fibroblasts Display Signature Alterations in Growth, Redox Homeostasis, Mitochondrial Function, and Autophagy , 2018, Front. Neurosci..

[41]  Abhishek Das,et al.  Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).