MAPS: machine-assisted phenotype scoring enables rapid functional assessment of genetic variants by high-content microscopy

Background Genetic testing is widely used in evaluating a patient’s predisposition to hereditary diseases. In the case of cancer, when a functionally impactful mutation (i.e. genetic variant) is identified in a disease-relevant gene, the patient is at elevated risk of developing a lesion in their lifetime. Unfortunately, as the rate and coverage of genetic testing has accelerated, our ability to assess the functional status of new variants has fallen behind. Therefore, there is an urgent need for more practical, streamlined and cost-effective methods for classifying variants. Results To directly address this issue, we designed a new approach that uses alterations in protein subcellular localization as a key indicator of loss of function. Thus, new variants can be rapidly functionalized using high-content microscopy (HCM). To facilitate the analysis of the large amounts of imaging data, we developed a new software toolkit, named MAPS for machine-assisted phenotype scoring, that utilizes deep learning to extract and classify cell-level features. MAPS helps users leverage cloud-based deep learning services that are easy to train and deploy to fit their specific experimental conditions. Model training is code-free and can be done with limited training images. Thus, MAPS allows cell biologists to easily incorporate deep learning into their image analysis pipeline. We demonstrated an effective variant functionalization workflow that integrates HCM and MAPS to assess missense variants of PTEN , a tumor suppressor that is frequently mutated in hereditary and somatic cancers. Conclusions This paper presents a new way to rapidly assess variant function using cloud deep learning. Since most tumor suppressors have well-defined subcellular localizations, our approach could be widely applied to functionalize variants of uncertain significance and help improve the utility of genetic testing.

[1]  Jonathan S. Berg,et al.  Comparative analysis of functional assay evidence use by ClinGen Variant Curation Expert Panels , 2019, Genome Medicine.

[2]  E. Mylona,et al.  Immunohistochemical study of PTEN and phosphorylated mTOR proteins in familial and sporadic invasive breast carcinomas , 2010, Histopathology.

[3]  Daniel J. Park,et al.  Variant effect prediction tools assessed using independent, functional assay-based datasets: implications for discovery and diagnostics , 2017, Human Genomics.

[4]  C. Downes,et al.  PTEN function: how normal cells control it and tumour cells lose it. , 2004, The Biochemical journal.

[5]  I T Young,et al.  A comparison of different focus functions for use in autofocus algorithms. , 1985, Cytometry.

[6]  P. Ng,et al.  SIFT missense predictions for genomes , 2015, Nature Protocols.

[7]  P. Devreotes,et al.  Mechanism of Human PTEN Localization Revealed by Heterologous Expression in Dictyostelium , 2013, Oncogene.

[8]  C. Eng,et al.  The nuclear affairs of PTEN , 2008, Journal of Cell Science.

[9]  Gregory M. Cooper,et al.  CADD: predicting the deleteriousness of variants throughout the human genome , 2018, Nucleic Acids Res..

[10]  Leopold Parts,et al.  Accurate Classification of Protein Subcellular Localization from High-Throughput Microscopy Images Using Deep Learning , 2016, G3: Genes, Genomes, Genetics.

[11]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[12]  Jayanta Debnath,et al.  Morphogenesis and oncogenesis of MCF-10A mammary epithelial acini grown in three-dimensional basement membrane cultures. , 2003, Methods.

[13]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[14]  Chikashi Ishioka,et al.  Identification of breast tumor mutations in BRCA1 that abolish its function in homologous DNA recombination. , 2010, Cancer research.

[15]  Sergio L. Netto,et al.  A Survey on Performance Metrics for Object-Detection Algorithms , 2020, 2020 International Conference on Systems, Signals and Image Processing (IWSSIP).

[16]  Oren Z. Kraus,et al.  Machine learning and computer vision approaches for phenotypic profiling , 2017, The Journal of cell biology.

[17]  Yoon‐Kyoung Cho,et al.  AI-powered transmitted light microscopy for functional analysis of live cells , 2019, Scientific Reports.

[18]  Joseph D. Janizek,et al.  Accurate classification of BRCA1 variants with saturation genome editing , 2018, Nature.

[19]  Leland McInnes,et al.  UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction , 2018, ArXiv.

[20]  P. Giannakakou,et al.  The importance of p53 location: nuclear or cytoplasmic zip code? , 2003, Drug resistance updates : reviews and commentaries in antimicrobial and anticancer chemotherapy.

[21]  Shulin Li,et al.  Protein mislocalization: mechanisms, functions and clinical applications in cancer. , 2014, Biochimica et biophysica acta.

[22]  Michael C. Ostrowski,et al.  Allele-specific tumor spectrum in Pten knockin mice , 2010, Proceedings of the National Academy of Sciences.

[23]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[24]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[25]  P. Bork,et al.  A method and server for predicting damaging missense mutations , 2010, Nature Methods.

[26]  P. Liberali,et al.  Single-cell and multivariate approaches in genetic perturbation screens , 2014, Nature Reviews Genetics.

[27]  Marius Otesteanu,et al.  A Review on Image Segmentation Techniques and Performance Measures , 2018 .

[28]  A. Klippel,et al.  Membrane localization of phosphatidylinositol 3-kinase is sufficient to activate multiple signal-transducing kinase pathways , 1996, Molecular and cellular biology.

[29]  C. Eng,et al.  Germline and somatic cancer-associated mutations in the ATP-binding motifs of PTEN influence its subcellular localization and tumor suppressive function , 2009, Human molecular genetics.

[30]  Tom L. Blundell,et al.  SDM: a server for predicting effects of mutations on protein stability , 2017, Nucleic Acids Res..

[31]  Anne E Carpenter,et al.  Workflow and Metrics for Image Quality Control in Large-Scale High-Content Screens , 2012, Journal of biomolecular screening.

[32]  Jianxu Chen,et al.  The Allen Cell and Structure Segmenter: a new open source toolkit for segmenting 3D intracellular structures in fluorescence microscopy images , 2018, bioRxiv.

[33]  J. Yeh,et al.  Differential nuclear and cytoplasmic expression of PTEN in normal thyroid tissue, and benign and malignant epithelial thyroid tumors. , 2000, The American journal of pathology.

[34]  Karsten M. Borgwardt,et al.  The Evaluation of Tools Used to Predict the Impact of Missense Variants Is Hindered by Two Types of Circularity , 2015, Human mutation.

[35]  M. Vihinen,et al.  Pathogenic or not? And if so, then how? Studying the effects of missense mutations using bioinformatics methods , 2009, Human mutation.

[36]  Yolanda T. Chong,et al.  Automated analysis of high‐content microscopy data with deep learning , 2017, Molecular systems biology.

[37]  R. Wollman,et al.  High throughput microscopy: from raw images to discoveries , 2007, Journal of Cell Science.

[38]  Yumay Chen,et al.  The Nuclear Localization Sequences of the BRCA1 Protein Interact with the Importin-α Subunit of the Nuclear Transport Signal Receptor* , 1996, The Journal of Biological Chemistry.

[39]  J. Rodriguez,et al.  Cytoplasmic mislocalization of BRCA1 caused by cancer-associated mutations in the BRCT domain. , 2004, Experimental cell research.

[40]  P Komminoth,et al.  Mutation and expression analyses reveal differential subcellular compartmentalization of PTEN in endocrine pancreatic tumors compared to normal islet cells. , 2000, The American journal of pathology.

[41]  Monte Westerfield,et al.  Bedside Back to Bench: Building Bridges between Basic and Clinical Genomic Research , 2017, Cell.

[42]  Giuseppe Tradigo,et al.  A framework for the decomposition and features extraction from lung DICOM images , 2018, IDEAS.

[43]  B. Rost,et al.  Better prediction of functional effects for sequence variants , 2015, BMC Genomics.

[44]  Eran Segal,et al.  A Systematic p53 Mutation Library Links Differential Functional Impact to Cancer Mutation Pattern and Evolutionary Conservation. , 2018, Molecular cell.

[45]  W. Sellers,et al.  Tumor suppressor PTEN acts through dynamic interaction with the plasma membrane. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[46]  D. Rimm,et al.  Frequent nuclear/cytoplasmic localization of beta-catenin without exon 3 mutations in malignant melanoma. , 1999, The American journal of pathology.

[47]  Leland McInnes,et al.  UMAP: Uniform Manifold Approximation and Projection , 2018, J. Open Source Softw..

[48]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[49]  Nuclear β-catenin accumulation is associated with increased expression of Nanog protein and predicts poor prognosis of non-small cell lung cancer , 2013, Journal of Translational Medicine.

[50]  Kenji Suzuki,et al.  A method of high-throughput functional evaluation of EGFR gene variants of unknown significance in cancer , 2017, Science Translational Medicine.

[51]  Kenneth A. Matreyek,et al.  A Premalignant Cell-Based Model for Functionalization and Classification of PTEN Variants , 2020, Cancer Research.

[52]  R. Weiss,et al.  CRM1 blockade by selective inhibitors of nuclear export attenuates kidney cancer growth. , 2013, The Journal of urology.

[53]  Wilhelm Burger,et al.  Digital Image Processing - An Algorithmic Introduction using Java , 2008, Texts in Computer Science.

[54]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[55]  Lani F. Wu,et al.  Image-based multivariate profiling of drug responses from single cells , 2007, Nature Methods.

[56]  Joseph D. Janizek,et al.  Accurate functional classification of thousands of BRCA1 variants with saturation genome editing , 2018, bioRxiv.

[57]  Anne E Carpenter,et al.  CellProfiler 3.0: Next-generation image processing for biology , 2018, PLoS biology.

[58]  D. Bojanic,et al.  Impact of high-throughput screening in biomedical research , 2011, Nature Reviews Drug Discovery.

[59]  Vanessa E. Gray,et al.  Multiplex Assessment of Protein Variant Abundance by Massively Parallel Sequencing , 2018, Nature Genetics.

[60]  S. Serra,et al.  Nuclear E-cadherin Immunoexpression: From Biology to Potential Applications in Diagnostic Pathology , 2008, Advances in anatomic pathology.

[61]  Anne E Carpenter,et al.  A Chemical Screen Probing the Relationship between Mitochondrial Content and Cell Size , 2012, PloS one.

[62]  Taghi M. Khoshgoftaar,et al.  A survey of transfer learning , 2016, Journal of Big Data.

[63]  Beate Sick,et al.  Single-Cell Phenotype Classification Using Deep Convolutional Neural Networks , 2016, Journal of biomolecular screening.

[64]  Domenec Puig,et al.  Analysis of focus measure operators for shape-from-focus , 2013, Pattern Recognit..

[65]  B. Keon,et al.  An infrastructure for high-throughput microscopy: instrumentation, informatics, and integration. , 2006, Methods in enzymology.

[66]  Angela N. Brooks,et al.  High-throughput Phenotyping of Lung Cancer Somatic Mutations. , 2016, Cancer cell.