New Research In
Physical Sciences
Social Sciences
Featured Portals
Articles by Topic
Biological Sciences
Featured Portals
Articles by Topic
- Agricultural Sciences
- Anthropology
- Applied Biological Sciences
- Biochemistry
- Biophysics and Computational Biology
- Cell Biology
- Developmental Biology
- Ecology
- Environmental Sciences
- Evolution
- Genetics
- Immunology and Inflammation
- Medical Sciences
- Microbiology
- Neuroscience
- Pharmacology
- Physiology
- Plant Biology
- Population Biology
- Psychological and Cognitive Sciences
- Sustainability Science
- Systems Biology
Altered conformational landscape and dimerization dependency underpins the activation of EGFR by αC–β4 loop insertion mutations
Edited by Yibing Shan, D. E. Shaw Research, New York, NY, and accepted by Editorial Board Member J. A. McCammon July 12, 2018 (received for review February 26, 2018)

Significance
The αC–β4 loop is a conserved structural motif present in all eukaryotic protein kinases. This study focuses on epidermal growth factor receptor (EGFR), a receptor tyrosine kinase that harbors many recurrent insertion mutations in the αC–β4 loop. We show that EGFR activity and drug sensitivity are strongly dependent on the nature, size, and location of mutations in the loop and that even subtle variations in the loop have a dramatic effect on kinase dynamics and free-energy landscape. These findings inform ongoing drug-discovery efforts on EGFR kinases and provide a structural framework for investigating the role of the understudied αC–β4 loop in kinase regulation, dynamics, and evolution.
Abstract
Mutational activation of epidermal growth factor receptor (EGFR) in human cancers involves both point mutations and complex mutations (insertions and deletions). In particular, short in-frame insertion mutations within a conserved αC–β4 loop in the EGFR kinase domain are frequently observed in tumor samples and patients harboring these mutations are insensitive to first-generation EGFR inhibitors. Despite the prevalence and clinical relevance of insertion mutations, the mechanisms by which these mutations regulate EGFR activity and contribute to drug sensitivity are poorly understood. Using cell-based mutation screening, we find that the precise location, length, and sequence of the inserted segment are critical for ligand-independent EGFR activation and downstream signaling. We identify three insertion mutations (N771_P772insN, D770_N771insG, and D770>GY) that activate EGFR in a unique way by relying more on the “acceptor” interface for kinase activation. Our drug inhibition studies indicate that these activating insertion mutations respond more favorably to osimertinib, a recently Food and Drug Administration-approved EGFR inhibitor for T790M-positive patients with lung cancer. Molecular dynamics simulations and umbrella sampling of WT and mutant EGFR suggest a model in which activating insertion mutations increase catalytic activity by relieving key autoinhibitory interactions associated with αC-helix movement and by lowering the transition free energy (
Many human cancers are caused by the accumulation of somatic mutations in oncogenes that confer selective growth advantage. The epidermal growth factor receptor (EGFR) kinase is one such oncogene that is mutationally activated in many human cancers, in particular lung cancer (1⇓⇓–4). Cancer genome sequencing studies have revealed numerous genomic alterations in EGFR, including point mutations, deletions, and insertions (5⇓–7). A detailed understanding of these mutations is essential for developing effective therapeutic strategies (8). While some frequently occurring point mutations and deletion mutations, such as L858R, G719S, T790M, and E746_A750del, have been well studied (9⇓⇓⇓–13), the structural and functional impact of many other recurrent mutations still remains largely unknown. In-frame insertions in exon 20 are a subcategory of understudied mutations that are frequently observed in patients with cancer (14⇓⇓–17). Most of the exon 20 insertions map to a short loop connecting the αC-helix and β4 strand in the protein kinase domain, i.e., the αC–β4 loop (15). Exon 20 insertions confer resistance to first-generation EGFR inhibitors and are associated with poor clinical outcomes (6, 18). Recent studies suggest that the drug response profile of exon 20 insertions is heterogeneous and depends on the nature of the inserted segment (15, 16, 19). However, an incomplete understanding of how exon 20 insertion mutations regulate EGFR activity has hindered the development of therapies for patients harboring these mutations.
Structural and biochemical studies on human EGFR have provided considerable insight into their mechanisms of action in physiological and pathological states. Under physiological conditions, ligand binding to the extracellular domain induces dimerization and higher-order multimerization of the intracellular kinase domains (9, 20, 21). Structurally, the active state of the intracellular kinase domain is achieved through a “head-to-tail” configuration where one molecule of EGFR (the “donor” kinase) docks to the N lobe of another EGFR molecule (the “acceptor” kinase) in an asymmetric manner (9). Multimerization is established by forming a chain of asymmetric dimers, in which each EGFR protomer functions both as a donor and an acceptor kinase (21). The dimerized or multimerized EGFR complex is catalytically active due to the stabilization of key catalytic motifs that prime the kinase domain for autophosphorylation and downstream signaling (9, 21). Many activating oncogenic mutations, such as L858R, E746_A750del, G719S, G724S, R776H, and M766T, strictly rely on the asymmetric dimer for their activity (22⇓⇓⇓–26). Intriguingly, some other complex mutations, including L747_A750>P and D770_N771insNPG, are reported to be dimerization independent (23), although a low-resolution crystal structure of the D770_N771insNPG mutant [Protein Data Bank (PDB) ID: 4LRM] reveals an asymmetric dimer organization (15). The EGFR dimerization interface has been targeted by monoclonal antibody-based inhibitors such as cetuximab (27). Recently, several optimized stapled peptides that mimic the donor kinase interface were shown to be effective for EGFR inhibition (28). Thus, understanding the dimerization dependency and the mechanism of action of complex insertion mutations can inform ongoing drug-discovery efforts on EGFR kinases.
Computational structural modeling and molecular dynamics (MD) simulations have provided new atomic-level insights into EGFR regulation and activation. For example, long-timescale MD simulations of EGFR showed that the N lobe of the acceptor kinase is intrinsically disordered (11, 29) and oncogenic mutations, such as L858R, facilitate EGFR dimerization by quenching the intrinsic disorder (11). MD studies on other recurring mutations such as R776H in the αC–β4 loop have identified novel autoinhibitory interactions associated with kinase activation (25) and protonation-dependent changes in mutant EGFR functions (30). In addition, MD-based free-energy methods (26, 31), molecular mechanics calculations (32), and molecular modeling studies (33) have provided new insights into mutation-induced drug resistance mechanisms and the conformational transitions connecting active and inactive states (34⇓–36).
In this study, we systematically analyzed exon 20 insertion mutations in EGFR, using a combination of experimental and computational approaches. We identified the αC–β4 loop, which is involved in kinase regulation (25, 37, 38), evolution (39), and chaperone-assisted folding (40, 41), as a hotspot for insertion mutations. Using a cell-based screen of 12 most commonly occurring insertions in the αC–β4 loop, we identified three activating mutations (N771_P772insN, D770_N771insG, and D770>GY) that activate EGFR in a dimerization-dependent manner. These activating mutations rely more on the canonical acceptor interface than the donor interface for kinase activation. Drug response studies indicate that the three insertion mutations are insensitive toward reversible EGFR inhibitors (gefitinib, erlotinib, and lapatinib), but display a heterogeneous response toward irreversible inhibitors. In particular, all three insertion mutations respond to osimertinib more favorably. Likewise, D770>GY is more sensitive to dacomitinib treatment compared with WT EGFR. To investigate the structural basis for mutational activation, we generated representative structural models for each of the insertion mutations and performed a total of 35 μs MD simulations in various conformational states. MD simulations suggest that insertion mutations restrict the αC-helix conformational freedom in the active state by hindering the formation of a critical autoinhibitory capping interaction associated with αC-helix movement. Steered MD and umbrella sampling suggest that activating insertion mutations not only lower the free-energy difference between active and inactive states (
Results
The αC–β4 Loop Is a Hotspot for Insertion Mutations in EGFR.
To comprehensively identify EGFR insertion mutations, we mined the Catalog of Somatic Mutations in Cancer (COSMIC v83) database and the Protein Kinase Ontology (ProKinO v2.0) (43). We identified a total of 97 distinct insertion mutations mapping to the EGFR kinase domain (Fig. 1A and SI Appendix, Dataset S1). The identified insertion mutations are heterogeneous in that they are variable in length and map to different regions of the protein. In particular, the αC–β4 loop, which connects the αC-helix and β4 strand in the N-terminal ATP-binding lobe of the kinase domain, is an insertion hotspot accounting for 86.9% of unique insertion mutations in EGFR (Fig. 1A and SI Appendix, Dataset S1). Some of the frequently occurring insertion mutations in this loop include V769_D770insASV, D770_N771insSVD, H773_V774insNPH, H773_V774insH, D770_N771insG, D770>GY, V774_C775insHV, H773_V774insPH, P772_H773insPR, N771_P772insN, H773_V774insAH, and N771_P772>SVDNR (see Fig. 1 legend for insertion mutation nomenclature). These mutations are also referred to as exon 20 insertion mutations in the literature (6, 15). In addition to the αC–β4 loop, insertion mutations are also identified in the β3 strand (also referred to as exon 19 insertion mutations), the juxtamembrane segment, and the N terminus of the αC-helix (Fig. 1A) (17, 44). These mutations, however, are not as frequent as the αC–β4 loop insertion mutations.
EGFR insertion mutations. (A) Structural location of insertion mutations in the EGFR kinase domain crystal structure (PDB ID: 2GS6). The size of the magenta sphere is proportional to the log of the number of patient samples containing the insertion mutation at the particular residue position. (B) Sequence location of insertion mutations within the EGFR αC–β4 loop. The nomenclature of insertion mutations follows the guidelines established previously (42). For example, V769_D770insASV indicates an Ala–Ser–Val insertion between residues V769 and D770, and D770>GY indicates a complex insertion mutation in which D770 is replaced by a Gly–Tyr sequence. Insertion mutations characterized in this study are underlined. The complete list of patient-derived insertion mutations in EGFR can be found in SI Appendix, Dataset S1.
Identification of Activating Insertion Mutations.
We next investigated the impact of αC–β4 loop insertion mutations on EGFR autophosphorylation activity and downstream signaling, using cell-based kinase assays (Materials and Methods). The insertion mutations exhibit varying levels of C-terminal tail tyrosine (Y1197) phosphorylation, activation loop (Y869) phosphorylation, and downstream protein phosphorylation in comparison with WT (Fig. 2 A and B). For most mutations, including the frequently occurring V769_D770insASV, D770_N771insSVD, and H773_V774insNPH mutations, the activity of the mutant form is comparable, or slightly reduced, relative to WT (Fig. 2 A and B). N771_P772insN, D770_N771insG, D770_N771insSVD, V769_D770insASV, D770>GY, and N771_P772>SVDNR exhibit similar Y1197 phosphorylation in the presence and absence of EGF ligand, suggesting that these insertion mutations are insensitive to EGF stimulation (Fig. 2A). This is different from the patient-derived point mutation (D770N) in the αC–β4 loop, which still shows ligand-dependent activation (Fig. 2A). Interestingly, three mutations (D770_N771insG, D770>GY, and N771_P772insN) display increased autophosphorylation activity both in the presence and the absence of EGF (Fig. 2 A and B). Notably, downstream STAT3 phosphorylation is also enhanced for these mutants relative to WT (Fig. 2A), suggesting that N771_P772insN, D770_N771insG, and D770>GY are activating gain-of-function mutations.
Cell-based screening of EGFR insertion mutations. (A) Western blot of 13 different EGFR mutations. The – and + indicate the absence and presence of EGF ligand. Autophosphorylation activity is indicated by Y1197 and Y869 phosphorylation. GFP signal indicates the total expression level of EGFR. Downstream signaling of STAT3 and Akt is also shown. (B) Densitometry analysis of the relative phosphorylation activity of various EGFR insertion mutations compared with WT. The ratio between Y1197 and GFP signal is used as a measure of activity. The score for each insertion mutation is normalized to WT EGFR. Both ligand-independent activity (Top) and ligand-dependent activity (Bottom) are shown. SE bars represent three to five independent experiments.
Differential Dimerization Dependency of Activating αC–β4 Loop Insertion Mutations.
Previous studies have shown that activating point mutations in EGFR such as L858R/T790M and R776H enhance the acceptor activity in the asymmetric dimer (25, 45). To investigate the dimerization dependency of the activating αC–β4 loop insertion mutations (D770_N771insG, D770>GY, and N771_P772insN), we introduced a C-lobe dimerization-deficient mutation (V948R or M952R) or an N-lobe dimerization-deficient mutation (L760R or L704N) in the insertion mutation plasmids. We selected two mutations in each interface to rule out the possibility that the mutations might alter the kinase domain itself rather than the dimer interaction. Western blot analysis indicates that insertion mutations are still active in the presence of C-lobe dimerization-deficient mutation (V948R or M952R) (Fig. 3A, lanes 11–12, 17–18, 23–24, 39–40, 45–46, and 51–52 and SI Appendix, Fig. S2). In contrast, the presence of the N-lobe dimerization-deficient mutation (L760R or L704N) results in significantly reduced Y1197 phosphorylation (Fig. 3A, lanes 13–14, 19–20, 25–26, 37–38, 43–44, and 49–50 and SI Appendix, Fig. S2). As a control, Y1197 phosphorylation activity is negligible for WT EGFR when dimerization-deficient mutations are present (Fig. 3A, lanes 3–6 and 29–33 and SI Appendix, Fig. S2); however, Y1197 phosphorylation can be rescued by cotransfecting the two mutants (V948R+L760R or M952R+L704N) (Fig. 3A, lanes 7–8 and 33–34 and SI Appendix, Fig. S1). These data indicate that activating αC–β4 loop insertion mutations rely more on the N-lobe interface than on the C-lobe interface for EGF-dependent and EGF-independent activation. In addition, we also tested the effect of introducing both V948R and M952R mutations in the insertion mutation background and observed comparable Y1197 autophosphorylation (SI Appendix, Figs. S2 and S3), suggesting that the αC–β4 loop insertion mutations could bypass the canonical C-lobe asymmetric dimer interface to achieve kinase activation.
Dimerization dependency of EGFR insertion mutations. (A) Western blot analysis of the N-lobe dimerization-deficient mutation (L760R or L704N) and the C-lobe dimerization-deficient mutation (V948R or M952R) on Y1197 autophosphorylation for WT and mutant (D770_N771insG, N771_P772insN, and D770<GY) EGFR. All of the constructs used are tagged with GFP except for L760R and L704N, which are tagged with FLAG. Cotransfection of V948R and L760R is indicated by V948R+L760R. Co-transfection of M952R and L704N is indicated by M952R+L704N. Eight percent SDS/PAGE is used to resolve the cell lysate sample. Y1197 phosphorylation of V948R+L760R and M952R+L704N is also analyzed by 5% SDS/PAGE to separate GFP- and FLAG-tagged constructs (SI Appendix, Fig. S1 A and B). Densitometry analysis of three to five independent experiments is shown in SI Appendix, Fig. S2). (B) A schematic plot summarizing the observed differences in WT and mutant EGFR in the different dimerization-deficient mutant backgrounds.
Drug Sensitivity of Activating αC–β4 Loop Insertion Mutations to Reversible and Irreversible EGFR Inhibitors.
Exon 20 insertion mutations are generally associated with poor clinical response to first-generation EGFR inhibitors, such as gefitinib, erlotinib, and lapatinib (6, 14). We tested the dose-dependent response of the three activating αC–β4 loop insertion mutations against these inhibitors. Our data indicate that the activating insertion mutants (D770_N771insG, D770>GY, and N771_P772insN) are insensitive to these inhibitors as the inhibition profile is either unaltered or resistant with increasing drug concentration (Fig. 4A), suggesting that patients harboring these mutations might not benefit from first-generation EGFR inhibitors. Recently, second-generation nonreversible covalent bond inhibitors, such as osimertinib, dacomitinib, and afatinib, have been developed to overcome drug resistance conferred by the gatekeeper (T790M) mutation (46⇓–48). We tested the responsiveness of activating αC–β4 loop insertion mutations to these nonreversible EGFR inhibitors (osimertinib and dacomitinib). Interestingly, we find that the three insertion mutations respond to osimertinib more favorably, as the Y1197 autophosphorylation is inhibited to various degrees at 0.1 μM drug concentration for insertion mutations but not for WT EGFR (Fig. 4 A and B). In addition, we find that the D770>GY mutant displays a sensitive profile toward dacomitinib (Fig. 4 A and B). In conclusion, our drug sensitivity assays demonstrate a heterogeneous response to irreversible EGFR inhibitors by activating αC–β4 loop insertion mutations.
Drug inhibition assay of five EGFR inhibitors (gefitinib, lapatinib, erlotinib, osimertinib, and dacomitinib). (A) The amount of Y1197 phosphorylation at drug concentration (0, 0.001 μM, 0.01 μM, 0.1 μM, 1.0 μM, and 10.0 μM) is tested for WT and three activating insertion mutations. (B) Densitometry quantification of the drug treatment experiments with three to five replicates. SE bars represent three to five independent experiments.
Activating Insertion Mutations Restrict Kinase Conformational Freedom.
To investigate the structural basis for kinase domain activation by αC–β4 loop insertion mutations, we used homology modeling and loop refinement strategies to generate an ensemble of structural models for each activating insertion mutation (Materials and Methods). The generated models were clustered based on the root-mean-square deviation (rmsd) values. Representative models from each cluster were used for MD simulations. The number of clusters identified for each insertion mutation and the Rosetta energy distribution for each cluster are shown in SI Appendix, Figs. S4 and S5. Comparisons of the MD trajectories indicate that the overall thermal fluctuations of WT and mutant EGFR are comparable during the course of the simulations (SI Appendix, Figs. S6–S8). However, the intrinsic dynamics of the kinase domain, particularly for regions involved in kinase regulation, are significantly different, as described below.
Restricted conformational sampling by activating αC–β4 loop insertions in the active state.
We used the rmsd of the regulatory αC-helix and the K745–E762 salt bridge distance as proxies (variables) to compare the conformational freedom of WT and mutant EGFR in the active state. A joint probability distribution plot of these two variables in energy units is shown in Fig. 5A, where each point represents the log-transformed frequency of MD snapshots adopting the corresponding αC-helix rmsd and K745–E762 distance. Based on the plot, distinct clusters/states can be identified for WT and mutant EGFR. In general, WT is more flexible and samples four distinct states/clusters (indicated by the arrows in Fig. 5A), while the three activating insertion mutations explore only a subset of these states. State 1 corresponds to a native-like “active” conformation shared by WT and mutant EGFR with an intact αC-helix and a well-established K745–E762 salt bridge (Fig. 5 A and B). State 2 is a meta-state, also shared by WT EGFR and all three insertion mutations (Fig. 5A). The K745–E762 distance in state 2 centers around 5 Å and the αC-helix is in an “inward” conformation (Fig. 5B). State 3 represents a cluster of conformations in which the N terminus of the αC-helix is unfolded/disordered (Fig. 5B). State 3 is sampled by N771_P772insN, but not by D770>GY and D770_N771insG. In state 3, a truncated αC-helix is formed only after residue K757, which makes the αC-helix four residues shorter (Fig. 5B). Notably, such helix truncation is also observed in some of the recently solved EGFR crystal structures (PDB IDs: 4WRG and 4ZAU) (49), indicating that our simulations are sampling biologically relevant conformations. State 4 corresponds to a highly destabilized αC-helix and a broken K745–E762 salt bridge, as observed in previous microsecond timescale MD simulations (11). In state 4, E762 in the C-helix makes a charged interaction with K860 from the activation loop, an interaction seen only in the inactive crystal structure of EGFR (Fig. 5B). None of the mutants sample state 4, suggesting that the conformational freedom of the activating mutants is restricted to the active state (state 1) or meta-states close to the active state (states 2–3), consistent with our cell-based assay in which these three insertion mutations are more active than WT (Figs. 2 and 5A). Interestingly, the conformation heterogeneity of the mutants in the active state is also correlated with kinase activity in that the conformationally flexible N771_P772insN mutant is less active compared with the conformationally restricted D770>GY and D770_N771insG mutants (Figs. 2B and 5A).
MD simulation of EGFR and three activating insertion mutations in the active state. (A) The joint distribution of the K745/NZ–E762/(OE1,OE2) distance and the αC-helix rmsd is shown in logarithmic scale. The plot is constructed using 100 different bins on both axes. The probability of each bin is estimated using the frequency of the 3-μs MD snapshots that fall into each bin. The distribution plot is transformed into an energy unit by
Activating insertion mutations occlude autoinhibitory interactions associated with the αC-helix movement.
We previously reported that stabilization of the αC-helix in the inactive state is correlated with an autoinhibitory capping interaction formed between R776/(NE,NH1,NH2) in the αC–β4 loop and A767/O in the αC-helix (25). We hypothesize that activating insertion mutations in the αC–β4 loop could also modulate the αC-helix movement by altering the αC-helix capping interaction. To test this hypothesis, we analyzed our MD trajectories to correlate changes in the R776/(NE,NH1,NH2)–A767/O distance with the K745–E762 salt bridge, which serves as a proxy for kinase conformational transition (Fig. 6A). The Boltzmann weighted probability distribution plot of these two distance variables indicates an energy well at a K745–E762 distance of 5 Å and an R776–A767 distance of 2.5 Å for WT (Fig. 6A). However, for the mutants, the energy well is distributed across various R776–A767 distance values in the vertical axis (Fig. 6A). For the D770>GY mutant, the energy well is maintained at 10 Å whereas for the D770_N771insG mutant, it is mostly populated at 7.5 Å (Fig. 6A), indicating that the capping interaction is not frequently sampled by these mutants. By visualizing the MD trajectories of WT and mutant EGFR, we identify key atomic interactions that provide plausible explanations for the observed differences in the conformational ensemble (Fig. 6B). In the D770>GY mutant, for example, the inserted tyrosine residue orients in a way that prevents R776 from interacting with A767 (Fig. 6B). Likewise, in the D770_N771insG mutant, the inserted segment introduces an additional turn in the αC-helix that prevents R776 from forming the canonical autoinhibitory capping interaction with A767 (Fig. 6B). Although no clear interactions are identified for N771_P772insN, the flat energy distribution observed for this mutant suggests that the R776–A767 interaction is destabilized relative to WT EGFR (Fig. 6A).
Modulation of autoinhibitory capping interactions [R776/(NE,NH1,NH2)–A767/O hydrogen bonds] by activating insertion mutations. (A) The joint distribution of K745–E762 distance and R776–A767 distance is shown in logarithmic scale. The same binning and log-transformation procedure as used in Fig. 5A was followed to generate the plot. (B) Representative snapshots from the energy wells (indicated by arrows in A).
Stabilized inactive state simulation.
We also performed MD simulations of WT and mutant forms in the inactive state. Unlike the active state, the overall thermal fluctuations of WT and mutant forms are similar, with only minor differences in the C terminus of the αC-helix (SI Appendix, Fig. S7). This may be due to the overall global stabilization of the inactive state relative to the active state, as suggested by our free-energy landscape analysis described below.
Insertion Mutations Alter the Free-Energy Landscape of EGFR Conformational Transitions.
We next wanted to investigate how insertion mutations in the αC–β4 loop alter the free-energy landscape between active and inactive states. To this end, we used an umbrella sampling technique, which allows efficient sampling of the entire transition event by restraining the simulation at various intermediate states. We used two collective variables (CVs) to describe the transition between active and inactive states (SI Appendix, Fig. S9A). CV1 is the pseudodihedral angle formed by the Cα atom of the DFG motif and the DFG+1 residue (D855, F856, G857, L858). The pseudodihedral angle describes the conformation of the activation loop and serves as an important conformational feature for distinguishing kinase active and inactive conformations (50). CV2 is the difference in the distance between two salt bridge-forming pairs, i.e., K745–E762 (
Free-energy landscape of EGFR and insertion mutations from umbrella sampling. (A) The 2D PMF of CV1 and CV2. The LFEP connecting the active and inactive states is highlighted. Red dashed frames show the CV space that is stabilized by insertion mutations. (B) The
Conclusion and Discussion
Using a combination of computational and experimental approaches we find that patient-derived insertion mutations in the EGFR αC–β4 loop impact kinase autophosphorylation and drug sensitivity to varying degrees. We identify three gain-of-function mutations (D770_N771insG, D770>GY, and N771_P772insN) that activate EGFR in a ligand-independent manner. Interestingly, these activating mutations rely more on the N-lobe asymmetric dimer interface than the C-lobe interface for kinase activation. Consistent with our findings, a previous study focusing on the dimerization dependence of the D770_N771insNPG mutant found that the N-lobe dimerization-deficient mutation (L704N) had a more dramatic effect on the transforming activity of EGFR compared with the C-lobe dimerization-deficient mutation (I941R) (23). These findings, however, cannot be reconciled based on the only available low-resolution crystal structure of an exon 20 insertion mutation (D770_N771insNPG mutation; PDB ID: 4LRM) in which the C lobe of the donor kinase docks to the N lobe of the acceptor in a manner similar to WT EGFR (15). Thus, based on the asymmetric dimer model, it is unclear how activating insertion mutations in the αC–β4 loop can maintain EGFR activity in the presence of the C-lobe dimerization-deficient mutant. Based on our cell-based mutagenesis experiment, we conclude that the asymmetric dimer is important for the full activation of EGFR insertion mutations. However, with a compromised C-lobe dimer interface, the mutants can still achieve activation through alternative configurations (Fig. 3B).
The αC–β4 loop is a conserved regulatory segment present in all eukaryotic protein kinases (ePKs) (39). Through quantitative comparisons of diverse eukaryotic protein kinases and distantly related small molecule kinases, we previously proposed that the HxN motif in the αC–β4 loop (NPH motif in EGFR) serves as a hinge point for αC-helix and interlobe movement (39). While the length of the αC–β4 loop is typically seven to eight residues long in most protein kinases, some kinases such as CK2α, VRK1, VRK2, and SRPK1 (52⇓–54) naturally contain an extended αC–β4 loop due to sequence insertions. Interestingly, the αC-helix in these kinases is stabilized in an active conformation (52⇓–54), although the precise mechanisms by which such stabilization occurs are unclear. Our studies suggest that modulation of key autoinhibitory interactions associated with the αC-helix movement, such as the R776–A767 capping interaction in EGFR, is one possible mechanism by which sequence and conformational variation in the αC–β4 loop may impact αC-helix conformational freedom and, consequently, kinase activity. The variable impact on EGFR autophosphorylation activity based on the nature, location, and size of the inserted segment further supports the hypothesis that the αC–β4 loop is a rheostat that can be fine-tuned to regulate kinase activity (55). For example, the two insertion mutations (N771_P772insN and N771_P772insH) that differ only by a single residue at the NPH-Asn position have a dramatically different effect on kinase autophosphorylation. N771_P772insH abolishes Y1197 autophosphorylation, whereas N771_P772insN is constitutively active (Fig. 2). Thus, subtle sequence and conformational variations in the αC–β4 loop may have long-range allosteric effects on ATP binding and activation loop conformation as proposed previously for other tyrosine kinases (37, 38).
Our drug response assays indicate that activating insertion mutations are more sensitive to osimertinib compared with WT EGFR. This is consistent with a recently established high-throughput functional evaluation assay in which the relative viability of Ba/F3 cells expressing N771_P772insN is reported to be sensitive to osimertinib treatment (19). Another recently published study showed that D770>GY is partially responsive to dacomitinib compared with other insertion mutations in the αC–β4 loop (16). This is in agreement with our experimental result in that the other two activation insertion mutations (D771_N772insG and N771_P772insN) show a similar inhibition profile compared with WT EGFR. Our finding also suggests that nonreversible EGFR inhibitors, originally designed to overcome the drug-resistant T790M mutant, may also be repurposed to treat patients with activating αC–β4 loop insertion mutations. Moreover, establishing a comprehensive drug response profile for αC–β4 loop insertion mutations to irreversible EGFR inhibitors is critical to guide therapeutic strategies for patient treatment.
The αC–β4 loop has recently been appreciated as an important region for Hsp90 association and cochaperone cdc37 binding (40). While EGFR is not a typical client protein for Hsp90, insertion mutations in the αC–β4 loop could alter this preference (56). A previous study has shown that D770_N771insNPG and D770_N771insNPH mutants are sensitive to Hsp90 inhibitors, opening another therapeutic window for clinical treatment (56). In addition, our umbrella sampling analysis of WT and mutant EGFR identified alternative inactive states that are uniquely sampled by αC–β4 loop insertion mutants (Fig. 7 and SI Appendix, Fig. S10). Thus, selectively targeting these inactive states can be beneficial for treating patients harboring αC–β4 loop insertion mutations. Finally, by mechanistically characterizing oncogenic mutations in the αC–β4 loop, our study provides a framework for investigating the underappreciated role of the αC–β4 loop in kinase structure, function, regulation, and disease.
Materials and Methods
Materials and Reagents.
Q5 site-directed mutagenesis kits for generating mutant EGFR and DH5α competent cells were purchased from New England Biolabs. Qiaprep spin miniprep/midiprep kits were obtained from Qiagen. The 1× Dulbecco’s modified Eagle’s medium (DMEM), 1× Ham’s F12, and PBS were purchased from Mediatech. Lipofectamine-2000 was obtained from Invitrogen. Anti–pY1197-EGFR, anti–pY869-EGFR, anti-GFP, anti-STAT3, anti–pY705-STAT3, anti-Akt, anti–pS473-Akt, and horseradish peroxidase (HRP)-conjugated mouse monoclonal and rabbit polyclonal antibodies were obtained from Cell Signaling. Anti-FLAG human recombinant EGF and dacomitinib were bought from Sigma-Aldrich. BSA was purchased from Rockland Immunochemicals. Pierce ECL Western blotting substrate was bought from Thermo Scientific. Gefitinib, erlotinib, and lapatinib were kind gifts from Kojo Mensa-Wilmot, University of Georgia Department of Cellular Biology. Osimertinib was purchased from LC Laboratories.
The pEGFP-N1-EGFR plasmid from our previous studies was used for mutagenesis (25, 26). The PCR primers for site-directed mutagenesis are designed using the NEBaseChanger v1.2.6 web server from New England Biolabs. All mutated plasmids were confirmed via Sanger sequencing through Eurofins Genomics.
Cell Culture, Transfection, and Starvation.
Chinese hamster ovary (CHO) cells were cultured in DMEM with 10% FBS. DNA transfection was performed using Lipofectamine-2000 by following the manufacturer’s protocol. After 12–16 h, the transfection efficacy was inspected under the microscope for GFP signal. Transfected CHO cells were washed with PBS and starved in Ham’s F12 media for more than 12 h before EGF stimulation/drug treatment.
EGF Stimulation, Cell Lysis, and Immunoblotting.
EGF stimulation was carried out by incubating serum-starved CHO cells in 50
Drug Inhibition Assay.
Cells were transfected and then incubated with drugs at five different concentrations (0.001 μM, 0.01 μM, 0.1 μM, 1.0 μM, and 10.0 μM) in Ham’s F12 media without serum. After 2 h drug treatment, the cells were processed as described above.
Image Digitization and Densitometry Analysis.
The exposed Western blotting films were scanned using an EPSON perfection 4490 photo scanner at 300 dpi resolution and saved in portable document format (PDF). The resulting images were converted to black and white using Inkscape software without any additional means of postprocessing. Densitometry analysis was performed using ImageJ software.
Structural Modeling.
The PDB structures 2GS6 (residues 696–1019) and 3W32 (residues 701–1017) were used to model the active and inactive states of the EGFR kinase domain, respectively (9, 51). All crystallographic ligands, ions, and water molecules were removed from the original structures. The two missing loops of 2GS6 (
MD Simulations.
Unbiased MD simulation.
Full-atom unbiased MD simulations were performed for each insertion mutation using GROMACS version 5.0.14 patched with plumed version 2.3.3 (59, 60). Representative structural models from each cluster center were used as the starting conformation for MD. All hydrogen atoms were converted to virtual sites to remove the fastest vibrational freedom. The protein was parameterized with the amber99sb-ildn force field and solvated with the TIP3P water model in a dodecahedron box that was 1 nm larger than the protein in all directions (61). Sodium and chloride ions were added to the system to neutralize the charge of the protein. The Verlet cutoff scheme was used to maintain the neighbor list for calculating nonbonded interactions (62). Long-range electrostatics were calculated using particle mesh Ewald (PME) algorithms. Energy minimization was performed with the steepest-descent algorithm followed by the conjugate-gradient algorithm until the
SMD simulation.
SMD is a technique to transform the system from an initial configuration to a target configuration by applying an external potential biased toward the target state. In our simulation, the external potential takes the form of a harmonic function of the instantaneous atom coordinates (
Umbrella sampling.
Umbrella sampling (US) is a widely used enhanced sampling technique to obtain the PMF of certain CVs. We used two CVs to describe the conformational transition of the EGFR kinase domain. CV1 is defined as the pseudodihedral angle formed by four atoms in the DFG motif, Cα(D855)–Cα(F856) –Cα(G857) –Cα(L858), and CV2 is defined as the distance between two salt bridge-forming pairs
Acknowledgments
We acknowledge the constructive feedback from Dr. Robert J. Woods, Dr. Eileen J. Kennedy, and Dr. Shaying Zhao. We thank members of the N.K. laboratory for helpful comments and suggestions on the manuscript. Funding for N.K. from National Institutes of Health Grant 5RO1GM114409 is acknowledged. Z.R. is the recipient of a 2017 Innovative and Interdisciplinary Research Grant for Doctoral Students at the University of Georgia. This study was supported in part by resources and technical expertise from the Georgia Advanced Computing Resource Center, a partnership between the University of Georgia’s Office of the Vice President for Research and Office of the Vice President for Information Technology.
Footnotes
- ↵1To whom correspondence should be addressed. Email: nkannan{at}uga.edu.
Author contributions: Z.R. and N.K. designed research; Z.R. performed research; Z.R. contributed new reagents/analytic tools; Z.R. analyzed data; and Z.R. and N.K. wrote the paper.
The authors declare no conflict of interest.
This article is a PNAS Direct Submission. Y.S. is a guest editor invited by the Editorial Board.
Data deposition: The MD snapshots, CV dataset obtained from US sampling, and the python code for generating the figures are available in github repository (https://github.com/esbg/PNAS_Ruan_2018).
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1803152115/-/DCSupplemental.
Published under the PNAS license.
References
- ↵
- ↵
- ↵
- Day KC, et al.
- ↵
- ↵
- ↵
- Arcila ME, et al.
- ↵
- Jackman DM, et al.
- ↵
- ↵
- ↵
- ↵
- ↵
- Yun CH, et al.
- ↵
- Foster SA, et al.
- ↵
- ↵
- Yasuda H, et al.
- ↵
- Kosaka T, et al.
- ↵
- He M, et al.
- ↵
- ↵
- Kohsaka S, et al.
- ↵
- Dawson JP, et al.
- ↵
- Huang Y, et al.
- ↵
- ↵
- Cho J, et al.
- ↵
- ↵
- Ruan Z,
- Kannan N
- ↵
- Ruan Z,
- Katiyar S,
- Kannan N
- ↵
- ↵
- Fulton MD, et al.
- ↵
- Shan Y,
- Arkhipov A,
- Kim ET,
- Pan AC,
- Shaw DE
- ↵
- White KA, et al.
- ↵
- ↵
- Ma L, et al.
- ↵
- Kannan S, et al.
- ↵
- Li Y,
- Li X,
- Ma W,
- Dong Z
- ↵
- Pan AC,
- Weinreich TM,
- Shan Y,
- Scarpazza DP,
- Shaw DE
- ↵
- Sutto L,
- Gervasio FL
- ↵
- ↵
- Klein T, et al.
- ↵
- ↵
- Verba KA, et al.
- ↵
- ↵
- ↵
- ↵
- Politi K,
- Lynch TJ
- ↵
- Red Brewer M, et al.
- ↵
- Cross DAE, et al.
- ↵
- Gonzales AJ, et al.
- ↵
- ↵
- Yosaatmadja Y, et al.
- ↵
- ↵
- Kawakita Y, et al.
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- Eswar N, et al.
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- PyMOL
- ↵
- ↵
- Scherer MK, et al.
- ↵
Citation Manager Formats
Sign up for Article Alerts
Article Classifications
- Biological Sciences
- Biophysics and Computational Biology