EGFR and HER 3 C-terminal tails are intrinsically disordered Biophysical Evidence for Intrinsic Disorder in the C-terminal Tails of the EGFR and HER 3 Receptor Tyrosine Kinases

The EGFR/ErbB family of receptor tyrosine kinases includes oncogenes important in the progression of breast and other cancers and they are targets for many drug development strategies. Each member of the ErbB family possesses a unique, structurally uncharacterized Cterminal tail that plays an important role in autophosphorylation and signal propagation. To determine if these C-terminal tails are intrinsically disordered regions, we conducted a battery of biophysical experiments on the EGFR and HER3 tails. Using hydrogen/deuterium exchange mass spectrometry, we measured the conformational dynamics of intracellular half constructs and compared the tails to the ordered kinase domains. The C-terminal tails demonstrate more rapid deuterium exchange behavior when compared to the kinase domains. Next, we expressed and purified EGFR and HER3 tail-only constructs. Results from circular dichroism spectroscopy, size-exclusion chromatography with multi-angle light scattering, dynamic light scattering, analytical ultracentrifugation, and small angle Xray scattering each provide evidence that the EGFR and HER3 C-terminal tails are intrinsically disordered with extended, non-globular structure in solution. The intrinsic disorder and extended conformation of these tails may be important for their function by increasing the capture radius and reducing the thermodynamic barriers for binding of downstream signaling proteins. INTRODUCTION The Epidermal Growth Factor Receptor (EGFR/ErbB) family of receptor tyrosine kinases (RTK’s) contains four member proteins: EGFR/ErbB1, HER2/ErbB2/neu, HER3/ErbB3, and HER4/ErbB4. These RTK’s carry out important signaling functions via the sequential process of ligand-binding by the extracellular domain, homoor heterodimerization, activation of their intracellular kinase domain, and recruitment of downstream signaling proteins. These RTK’s are also important oncogenic drivers in many breast, lung, and other human cancers (1). Several structural biology studies on the ErbB family have been published and this has helped advance drug development for HER2 positive breast cancer (2). Protein crystallography studies published in 2004 showed the structure of pertuzumab bound to the extracellular domain of HER2 and lapatinib bound to the kinase domain of EGFR (3,4). Since that time, growing structural biology based understanding of how EGFR, 1 http://www.jbc.org/cgi/doi/10.1074/jbc.M116.747485 The latest version is at JBC Papers in Press. Published on November 21, 2016 as Manuscript M116.747485 Copyright 2016 by The American Society for Biochemistry and Molecular Biology, Inc. by gest on N ovem er 5, 2016 hp://w w w .jb.org/ D ow nladed from EGFR and HER3 C-terminal tails are intrinsically disordered HER2, and HER3 functions at the atomic level have dramatically re-shaped our understanding of RTK’s (1,2). Despite these advances, there is a domain in each EGFR/ErbB family protein for which little structural biology information is available; this domain is the C-terminal tail (CTT) domain. The CTTs contain numerous autophosphorylation sites that are essential for recruiting downstream signaling proteins and initiating intracellular signaling (5,6). The CTT can also contribute to autoinhibition of the RTK’s kinase domain (7). The lack of available crystallographic information on the CTT region of EGFR/ErbB family RTK’s led us to examine if these proteins lack a stable secondary and/or tertiary structure. Intrinsically disordered regions (IDRs) represent an emerging area of interest in medicine. IDRs are regions within proteins that exhibit high flexibility and may lack a secondary or tertiary structure, but are still able to carry out important biological functions (8-15). Algorithm prediction methods indicate that around 25-30% of eukaryotic proteins can be categorized as having disordered regions (16). These disordered regions provide certain advantages in protein-protein interactions including a larger hydrodynamic radius (17,18), faster onand off-rates of binding (19), high binding specificity (20), and the ability to adopt different conformations depending on the binding partner (9,21). Correlation studies have revealed a high propensity for disordered regions to undergo post-translational modification, particularly phosphorylation (22). Because traditional methods, such as NMR and X-ray crystallography, were developed to study stable protein structures, IDRs have been more difficult to analyze due to difficulties in obtaining high concentrations or representative protein crystals (11,23). Correlation studies suggest a strong association between IDRs and human cancerassociated proteins (24) and, therefore, identifying and analyzing IDRs in cancer-related proteins is vital in understanding how they function. Protein kinases demonstrate a high degree of specificity in facilitating phosphorylation, yet many are able to perform such interactions with multiple substrate partners (25). An analysis of the human kinome shows that as many as 83% of kinase genes contain IDRs, which could facilitate these multiple interactions; RTK’s being involved in more protein-protein interactions than any other kinase group, thus potentially pointing to the involvement of IDRs in their recognition mechanisms (26). Multi-sequence alignment (27) of EGFR/ErbB family RTK’s shows that the kinase domain sequences are highly conserved between all four family members. However, the CTT regions are highly divergent between each EGFR/ErbB family member. One previous study using circular dichroism (CD) spectroscopy indicated that the EGFR CTT is rich in α-helical and β-sheet content (6), but a later study used coarse-grained modeling to show many possible conformations of the EGFR CTT based on the assumption that it is naturally disordered (28). A high degree of flexibility in the CTT would provide distinct advantages to EGFR interaction with downstream Src homology-2 (SH2) and phosphotyrosine-binding (PTB) domains during signaling functions (6). In this work, we examine the biophysical properties of the CTT of EGFR and HER3. The motivation of this study is to gather structural information on the CTT region to determine if the tails are highly dynamic, disordered regions. First, we measured the conformational dynamics of intracellular half (ICH) constructs of EGFR/ErbB family members using amide hydrogen/deuterium exchange mass spectrometry (HDX-MS). This information is used to compare the CTTs to the kinase domains. We also expressed and purified EGFR and HER3 CTT only constructs and demonstrated that they are functional because they are recognized and phosphorylated by EGFR family kinases and once phosphorylated, can be bound by the Grb2 SH2 domain. Using these CTT constructs, we performed multiple biophysical analyses, including CD spectroscopy, sizeexclusion chromatography with multi-angle light scattering (SEC-MALS), dynamic light scattering (DLS), analytical ultracentrifugation (AUC), and small angle X-ray scattering (SAXS). The results of these methods support the hypothesis that the EGFR and HER3 CTTs are IDRs with extended, non-globular structure in solution. RESULTS Disorder Predictions on EGFR, HER2 and HER3 We used computation algorithms to provide an initial survey of disordered regions in the EGFR/ErbB family of RTK’s. Predictor of 2 by gest on N ovem er 5, 2016 hp://w w w .jb.org/ D ow nladed from EGFR and HER3 C-terminal tails are intrinsically disordered Natural Disordered Regions (PONDR) is a collection of algorithms that use an amino acid sequence to predict native disorder (29,30). The VL-XT algorithm assigns a disorder propensity score for disorder on a residue by residue basis (29-31). The VL-XT prediction for EGFR, HER2 and HER3 can be seen in figure 1. PONDR scores greater than 0.5 indicate predicted disorder and scores less than 0.5 indicate predicted order. Kinase domains of the three EGFR/ErbB family members show mostly predicted order, whereas the CTT in each show large regions of predicted disorder. The amino acid composition of the CTT and kinase domains of both EGFR and HER3 is shown in table 1. When comparing the CTTs to the kinase domains in both proteins, we observe that the CTTs are more enriched in polar, uncharged residues and prolines, while relatively depleted in hydrophobic residues. For EGFR, polar, uncharged residues make up 30.1% of the CTT versus 14.9% of the kinase domain. Prolines make up 10.6% of the EGFR CTT versus 5.4% in the kinase domain and hydrophobic residues (excluding tyrosine which can become phosphorylated) are 27.4% of the EGFR CTT versus 38.9% of the kinase domain (Table 1). Similar values are seen with HER3 CTT (Table 1), despite its primary sequence divergence from EGFR CTT (22% sequence identity between EGFR and HER3 CTT, calculated with Clustal Omega). The lowered hydrophobicity provides a simple explanation as to why the tails would not form a hydrophobic core and therefore be disordered in an aqueous environment. These predictions are a useful tool for gaining a general view of disordered regions in a protein sequence, but direct empirical evidence would better support the hypothesis of the CTTs as being IDRs. Hydrogen/Deuterium Exchange Mass Spectrometry of EGFR/ErbB Family RTK’s HDX-MS is a highly useful technique to identify and study IDRs (32-36). HDX-MS provides kinetic information about the exchange rate of amide hydrogen atoms along the backbone of a protein for solvent deuterons. This rate of H/D exchange is affected by factors such as the presence of strong hydrogen bonds, secondary structure, and tertiary structure (37-40). IDRs show very rapid H/D exchange rates, on the order of milliseconds to seconds (32-36). HDX-MS provides local information about different regions of the protein and readily distinguishes folded, globular regions from IDRs. We performed HDXMS experiments by exposing the protein to deuterium oxide (D2O) over two ranges of time points, 108 m

[1]  D. Svergun,et al.  A practical guide to small angle X‐ray scattering (SAXS) of flexible and intrinsically disordered proteins , 2015, FEBS letters.

[2]  D. Weis,et al.  Mapping Residual Structure in Intrinsically Disordered Proteins at Residue Resolution Using Millisecond Hydrogen/Deuterium Exchange and Residue Averaging , 2015, Journal of The American Society for Mass Spectrometry.

[3]  V. Uversky,et al.  Presence and utility of intrinsically disordered regions in kinases. , 2014, Molecular bioSystems.

[4]  P. Tompa,et al.  Introducing protein intrinsic disorder. , 2014, Chemical reviews.

[5]  J. Schlessinger,et al.  The EGFR family: not so prototypical receptor tyrosine kinases. , 2014, Cold Spring Harbor perspectives in biology.

[6]  John G. Koland,et al.  Coarse-Grained Molecular Simulation of Epidermal Growth Factor Receptor Protein Tyrosine Kinase Multi-Site Self-Phosphorylation , 2014, PLoS Comput. Biol..

[7]  Christopher Bystroff,et al.  The designability of protein switches by chemical rescue of structure: mechanisms of inactivation and reactivation. , 2013, Journal of the American Chemical Society.

[8]  M. J. Chalmers,et al.  Time Window Expansion for HDX Analysis of an Intrinsically Disordered Protein , 2013, Journal of The American Society for Mass Spectrometry.

[9]  D. Sept,et al.  Carboxyl Group Footprinting Mass Spectrometry and Molecular Dynamics Identify Key Interactions in the HER2-HER3 Receptor Tyrosine Kinase Interface* ♦ , 2013, The Journal of Biological Chemistry.

[10]  K. Gajiwala EGFR: Tale of the C‐terminal tail , 2013, Protein science : a publication of the Protein Society.

[11]  Vladimir N Uversky,et al.  The most important thing is the tail: Multitudinous functionalities of intrinsically disordered protein termini , 2013, FEBS letters.

[12]  D. Weis,et al.  Analysis of disordered proteins using a simple apparatus for millisecond quench-flow H/D exchange. , 2013, Analytical chemistry.

[13]  Huan‐Xiang Zhou,et al.  Rate constants and mechanisms of intrinsically disordered proteins binding to structured targets. , 2012, Physical chemistry chemical physics : PCCP.

[14]  Derek J. Wilson,et al.  Measuring dynamics in weakly structured regions of proteins using microfluidics-enabled subsecond H/D exchange mass spectrometry. , 2012, Analytical chemistry.

[15]  P. Tompa,et al.  Intrinsic disorder in cell signaling and gene transcription , 2012, Molecular and Cellular Endocrinology.

[16]  David D Weis,et al.  Mapping unstructured regions and synergistic folding in intrinsically disordered proteins with amide H/D exchange mass spectrometry. , 2011, Biochemistry.

[17]  Yoshikazu Ohta,et al.  Structural Analysis of the Mechanism of Inhibition and Allosteric Activation of the Kinase Domain of HER2 Protein , 2011, The Journal of Biological Chemistry.

[18]  Lars Konermann,et al.  Hydrogen exchange mass spectrometry for studying protein structure and dynamics. , 2011, Chemical Society reviews.

[19]  C. Ebel,et al.  Analytical Ultracentrifugation, a Useful Tool to Probe Intrinsically Disordered Proteins , 2010 .

[20]  F. White,et al.  EGFRvIV: a previously uncharacterized oncogenic mutant reveals a kinase autoinhibitory mechanism , 2010, Oncogene.

[21]  John Kuriyan,et al.  Structural analysis of the catalytically inactive kinase domain of the human EGF receptor 3 , 2009, Proceedings of the National Academy of Sciences.

[22]  R. Bose,et al.  Her4 and Her2/neu Tyrosine Kinase Domains Dimerize and Activate in a Reconstituted in Vitro System* , 2009, The Journal of Biological Chemistry.

[23]  James R Faeder,et al.  Toward a quantitative theory of intrinsically disordered proteins and their function , 2009, Proceedings of the National Academy of Sciences.

[24]  Yongqi Huang,et al.  Kinetic advantage of intrinsically disordered proteins in coupled folding-binding process: a critical assessment of the "fly-casting" mechanism. , 2009, Journal of molecular biology.

[25]  John A. Tainer,et al.  Robust, high-throughput solution structural analyses by small angle X-ray scattering (SAXS) , 2009, Nature Methods.

[26]  H. Dyson,et al.  Prediction of the rotational tumbling time for proteins with disordered segments. , 2009, Journal of the American Chemical Society.

[27]  M. Bolognesi,et al.  Function and Structure of Inherently Disordered Proteins This Review Comes from a Themed Issue on Proteins Edited Prediction of Non-folding Proteins and Regions Frequency of Disordered Regions Protein Evolution Partitioning Unstructured Proteins and Regions into Groups Involvement of Inherently Diso , 2022 .

[28]  Christopher J. Oldfield,et al.  Intrinsically disordered proteins in human diseases: introducing the D2 concept. , 2008, Annual review of biophysics.

[29]  B. Wallace,et al.  Protein secondary structure analyses from circular dichroism spectroscopy: methods and reference databases. , 2008, Biopolymers.

[30]  Christopher J. Oldfield,et al.  Flexible nets: disorder and induced fit in the associations of p53 and 14-3-3 with their partners , 2008, BMC Genomics.

[31]  J. Ferrell,et al.  Mechanisms of specificity in protein phosphorylation , 2007, Nature Reviews Molecular Cell Biology.

[32]  Peter V. Konarev,et al.  ATSAS 2.1 – towards automated and web-supported small-angle scattering data analysis , 2007 .

[33]  Christopher J. Oldfield,et al.  Functional anthology of intrinsic disorder. 3. Ligands, post-translational modifications, and diseases associated with intrinsically disordered proteins. , 2007, Journal of proteome research.

[34]  John Kuriyan,et al.  An Allosteric Mechanism for Activation of the Kinase Domain of Epidermal Growth Factor Receptor , 2006, Cell.

[35]  Sonia Longhi,et al.  Assessing protein disorder and induced folding , 2005, Proteins.

[36]  Christopher J. Oldfield,et al.  Showing your ID: intrinsic disorder as an ID for recognition, regulation and cell signaling , 2005, Journal of molecular recognition : JMR.

[37]  Krystal J Alligood,et al.  A Unique Structure for Epidermal Growth Factor Receptor Bound to GW572016 (Lapatinib) , 2004, Cancer Research.

[38]  C. Redfield,et al.  Using nuclear magnetic resonance spectroscopy to study molten globule states of proteins. , 2004, Methods.

[39]  Lee Whitmore,et al.  DICHROWEB, an online server for protein secondary structure analyses from circular dichroism spectroscopic data , 2004, Nucleic Acids Res..

[40]  M. Sliwkowski,et al.  Insights into ErbB signaling from the structure of the ErbB2-pertuzumab complex. , 2004, Cancer cell.

[41]  L. Iakoucheva,et al.  Intrinsic disorder in cell-signaling and cancer-associated proteins. , 2002, Journal of molecular biology.

[42]  V. Uversky,et al.  Denatured collapsed states in protein folding: Example of apomyoglobin , 2001, Proteins.

[43]  J. Schlessinger Cell Signaling by Receptor Tyrosine Kinases , 2000, Cell.

[44]  Benjamin A. Shoemaker,et al.  Speeding molecular recognition by using the folding funnel: the fly-casting mechanism. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[45]  P. Schuck,et al.  Size-distribution analysis of macromolecules by sedimentation velocity ultracentrifugation and lamm equation modeling. , 2000, Biophysical journal.

[46]  M. Eisenstein,et al.  Analysis of lissencephaly-causing LIS1 mutations. , 1999, European journal of biochemistry.

[47]  M. Moran,et al.  Distinct tyrosine autophosphorylation sites negatively and positively modulate neu-mediated transformation , 1997, Molecular and cellular biology.

[48]  O. Ptitsyn,et al.  Further evidence on the equilibrium "pre-molten globule state": four-state guanidinium chloride-induced unfolding of carbonic anhydrase B at low temperature. , 1996, Journal of molecular biology.

[49]  D. Svergun,et al.  CRYSOL : a program to evaluate X-ray solution scattering of biological macromolecules from atomic coordinates , 1995 .

[50]  O. Ptitsyn,et al.  "Partly folded" state, a new equilibrium state of protein molecules: four-state guanidinium chloride-induced unfolding of beta-lactamase at low temperature. , 1994, Biochemistry.

[51]  Zhongqi Zhang,et al.  Determination of amide hydrogen exchange by mass spectrometry: A new tool for protein structure elucidation , 1993, Protein science : a publication of the Protein Society.

[52]  P. Wyatt Light scattering and the absolute characterization of macromolecules , 1993 .

[53]  W C Johnson,et al.  Variable selection method improves the prediction of protein secondary structure from circular dichroism spectra. , 1987, Analytical biochemistry.

[54]  W C Johnson,et al.  Analysis of protein circular dichroism spectra for secondary structure using a simple matrix multiplication. , 1986, Analytical biochemistry.

[55]  N. Kallenbach,et al.  Hydrogen exchange and structural dynamics of proteins and nucleic acids , 1983, Quarterly Reviews of Biophysics.

[56]  C. Cantor,et al.  Biophysical Chemistry: Part II: Techniques for the Study of Biological Structure and Function , 1980 .

[57]  L Mayne,et al.  Primary structure effects on peptide group hydrogen exchange. , 1972, Proteins.

[58]  D. Svergun,et al.  Structural analysis of intrinsically disordered proteins by small-angle X-ray scattering. , 2012, Molecular bioSystems.

[59]  Vladimir N Uversky,et al.  What does it mean to be natively unfolded? , 2002, European journal of biochemistry.

[60]  P. Romero,et al.  Sequence complexity of disordered protein , 2001, Proteins.

[61]  N. Sreerama,et al.  Estimation of protein secondary structure from circular dichroism spectra: comparison of CONTIN, SELCON, and CDSSTR methods with an expanded reference set. , 2000, Analytical biochemistry.

[62]  Obradovic,et al.  Predicting Protein Disorder for N-, C-, and Internal Regions. , 1999, Genome informatics. Workshop on Genome Informatics.

[63]  T. Laue,et al.  Modern applications of analytical ultracentrifugation. , 1999, Annual review of biophysics and biomolecular structure.

[64]  Romero,et al.  Sequence Data Analysis for Long Disordered Regions Prediction in the Calcineurin Family. , 1997, Genome informatics. Workshop on Genome Informatics.

[65]  For introductions and reviews , 1985 .

[66]  A. Hvidt,et al.  Hydrogen exchange in proteins. , 1966, Advances in protein chemistry.