Binding kinetics and substrate selectivity in HIV-1 protease−Gag interactions probed at atomic resolution by chemical exchange NMR
See allHide authors and affiliations
Contributed by G. Marius Clore, September 28, 2017 (sent for review September 13, 2017; reviewed by Junji Iwahara and Michael F. Summers)

Significance
The formation of infectious HIV-1 virions is triggered by sequential proteolysis of the group-specific antigen (Gag) polyprotein by the viral protease. Using chemical exchange-based NMR spectroscopy, we explore, in the context of Gag polyprotein, the formation of productive complexes between protease and Gag cleavage sites located within the intrinsically disordered linkers that connect the globular domains of Gag. We show that the ability of protease to sequentially cleave distinct Gag cleavage sites, which display little to no sequence identity, primarily originates from conformational dynamics of the protease flaps that cover the active site. The protease flaps are actively involved in substrate recognition and regulate the lifetime of productive complexes, allowing protease to differentiate between cognate and noncognate sequences.
Abstract
The conversion of immature noninfectious HIV-1 particles to infectious virions is dependent upon the sequential cleavage of the precursor group-specific antigen (Gag) polyprotein by HIV-1 protease. The precise mechanism whereby protease recognizes distinct Gag cleavage sites, located in the intrinsically disordered linkers connecting the globular domains of Gag, remains unclear. Here, we probe the dynamics of the interaction of large fragments of Gag and various variants of protease (including a drug resistant construct) using Carr−Purcell−Meiboom−Gill relaxation dispersion and chemical exchange saturation transfer NMR experiments. We show that the conformational dynamics within the flaps of HIV-1 protease that form the lid over the catalytic cleft play a significant role in substrate specificity and ordered Gag processing. Rapid interconversion between closed and open protease flap conformations facilitates the formation of a transient, sparsely populated productive complex between protease and Gag substrates. Flap closure traps the Gag cleavage sites within the catalytic cleft of protease. Modulation of flap opening through protease−Gag interactions fine-tunes the lifetime of the productive complex and hence the likelihood of Gag proteolysis. A productive complex can also be formed in the presence of a noncognate substrate but is short-lived owing to lack of optimal complementarity between the active site cleft of protease and the substrate, resulting in rapid flap opening and substrate release, thereby allowing protease to differentiate between cognate and noncognate substrates.
- invisible states
- relaxation dispersion
- chemical exchange saturation transfer
- substrate selection
- conformational dynamics
Conversion of HIV-1, as well as other retroviruses, from immature viral particles to infectious virions is triggered by sequential proteolytic cleavage of the group-specific antigen (Gag) polyprotein catalyzed by a dimeric viral aspartyl protease (1, 2). Hence, HIV-1 protease inhibitors constitute an integral component of current anti-HIV combination therapies (3). HIV-1 protease (PR) is a 99-residue symmetric dimer comprising a β-strand−rich core, a catalytic site centered on the active site residue Asp25, and glycine-rich flexible flaps that form a lid over the catalytic site (1). The Gag polyprotein comprises three ordered domains, matrix (MA), capsid (CA), and nucleocapsid (NC), connected by linkers and organized as follows: MA–CA–spacer peptide 1 (SP1)–NC–spacer peptide 2 (SP2)–p6. The MA|CA linker, SP1, SP2, and p6 are intrinsically disordered in solution such that the ordered domains behave like beads on a string (4⇓⇓–7).
The five cleavage sites within Gag are hydrolyzed at distinct rates by PR, generating a characteristic Gag cleavage pattern whose rates are ordered as follows: SP1|NC > SP2|p6 ≈ MA|CA > CA|SP1 ≈ NC|SP2 (5, 8⇓⇓–11). The underlying mechanism governing ordered Gag cleavage by PR is unclear. Further, there is little to no sequence identity between the different Gag cleavage sites, making it difficult to assess the determinants of PR substrate specificity or to derive a generalized consensus sequence recognized by PR. The precise control of Gag proteolysis and the apparent promiscuous substrate specificity of PR are seemingly paradoxical. One current explanation lies in the “substrate envelope” hypothesis (12), based on crystal structures of inactive PR variants in complex with peptide substrates (12⇓–14), which posits that PR recognizes the 3D conformation of Gag cleavage sites rather than their primary sequences. As the Gag polyprotein is refractory to crystallization, all crystallographic work on PR−Gag interactions has been carried out using peptide analogs. Such peptides, however, are poor substitutes for the Gag polyprotein, since there are large differences in proteolysis rates of peptides and Gag under identical experimental conditions (6, 15). The latter observations imply the presence of additional factors governing PR−Gag interactions, as well as a role for the ordered domains of Gag. Indeed, paramagnetic relaxation enhancement (PRE) measurements have revealed the existence of transient (lifetimes ≤ 250 μs to 500 μs), sparsely populated encounter complexes between PR and the ordered domains of Gag that may serve to efficiently guide PR to the Gag cleavage junctions (6). Details, however, regarding the formation of productive complexes between Gag and PR have remained elusive.
Here we make use of Carr−Purcell−Meiboom−Gill (CPMG) relaxation dispersion and chemical exchange saturation transfer (CEST) experiments to probe the association of PR with Gag cleavage sites on the millisecond time scale in the context of large fragments of Gag comprising MA−CA−SP1−NC and CA−SP1−NC. We show that the time scale of opening and closing of the PR flaps is modulated by PR−Gag interactions, allowing PR to fine-tune the lifetime of the productive complex and hence differentiate between cognate and noncognate Gag cleavage sites.
Results
Recombinant Gag and PR Constructs.
The current work made use of three engineered monomeric Gag constructs, each carrying a double mutation at the CA dimerization interface (W316A/M317A): a larger Gag fragment consisting of the MA, CA, SP1, and NC domains (Fig. 1A), hereafter referred to as
Kinetic scheme for PR−Gag binding. (A) Schematic representation of
Cleavage reactions were carried out using protease from HIV-1 Group O denoted as PR, and a multidrug-resistant protease bearing 20 mutations from HIV-1 group M denoted as PR20 (16, 17). The NMR experiments made use of the corresponding inactive variants, PRD25N and PR20D25N, respectively, in which the active site Asp25 was substituted for Asn.
Overall Kinetic Scheme for PR−Gag Binding.
The most general overall scheme for the binding of PRD25N to Gag is shown in Fig. 1B (and Fig. S1A). The flaps of PR that cover the active site adopt a range of conformations in crystal structures: open, semiopen, and closed (1). In solution, however, NMR residual dipolar coupling data indicate that the flaps of PRD25N are predominantly closed (PRclosed) (6, 18) (see also Fig. S2). The exchange lifetime (τex = 1/kex) between the major closed (PRclosed) and minor open (PRopen) states is ≤80 μs at 20 °C as deduced from measurement of backbone 1H and 15N transverse relaxation rates as a function of effective rotating frame field (19). The active site of PR is not accessible to substrate in the closed flap conformation, and therefore the initial binding event of PRD25N to Gag cleavage sites must involve PRopen to generate an initial asymmetric PRopen*−Gag complex (where the asterisk denotes that the two subunits of PR are no longer symmetric with regard to the bound substrate). The flaps in crystal structures of PR−peptide complexes are in the closed state with numerous interactions between the bound peptide and the flaps, as expected for productive complexes (1). One can therefore surmise that the flaps are also closed in the end-state productive PR*−Gag complexes in solution.
No 15N CPMG dispersions are observed for free PRD25N at 30 °C, the conditions used here, indicating that the interconversion rate between the closed (major) and open (minor) states is faster than the limit of detection of CPMG experiments (∼100 μs). The generalized scheme in Fig. 1B can therefore be simplified to exchange between free and bound states, represented by pseudo three-state (Fig. 1C) or two-state (Fig. 1D) models when monitoring the binding of isotopically labeled PRD25N or Gag, respectively. [The pseudo three-state model in the context of PR arises because of asymmetry in the bound state such that the chemical shifts for the two PR subunits are different (20); see Supporting Information]. The binding kinetics in the simplified “two species” scheme are governed by an overall association rate constant
The PRD25N−Δ Gag M317A W316A Complex.
The kinetics of PRD25N binding to the cleavage sites within
Kinetic analysis of PR−Gag binding. Examples of 15N CEST profiles observed for 200 μM 2H/15N-labeled
Kinetic parameters for PR−Gag interactions derived from the fits to the CEST and CPMG relaxation dispersion data
The PRD25N−CA−SP1−NC M317A W316A Complex.
To probe the interaction from the perspective of both PR and Gag, we made use of the CA−SP1−
The catalytic efficiency of PR is known to increase with salt concentration (24). Proteolytic cleavage of CA−SP1−
Fig. 2C shows 15N and 13Cmethyl CEST and 15N CPMG relaxation dispersion profiles observed at the SP1|NC cleavage site for 200 μM 2H/15N/13Cmethyl-labeled CA−SP1−
Role of PR Flaps in Substrate Recognition.
To investigate the role of the PR flaps in substrate recognition, we made use of a monomeric Gag variant, CA−SP1−
Kinetic analysis of PRD25N−CA−SP1−
Impact of PR Drug-Resistant Mutations.
To gain further insight into the impact of PR flap conformational dynamics on substrate recognition, we carried out 15N CPMG relaxation dispersion experiments using a multidrug-resistant PR variant from HIV-1 group M bearing 20 mutations (in addition to D25N), known as PR20D25N (16). At 300 mM NaCl, PR20 cleaves ΔGag approximately fourfold slower than the corresponding wild-type PR (15); at 50 mM NaCl, the rates of cleavage of CA−SP1−
Kinetic analysis of PR20D25N− CA−SP1−
The average chemical shift differences, |15N Δϖave|, between free open and bound states of PR20D25N, with the exception of two residues, are reasonably well correlated to the |15N Δϖ| values between the open (major) and closed (minor) states of free PR20D25N, with a slope of ∼1 (Fig. 4D), providing good evidence that the flaps of PR20D25N are predominantly closed in the presence of substrate. The absence of correlation for Gly51 and Leu54 suggests additional large contributions from interactions with the bound substrate for these two residues which are not averaged out in |15N Δϖave|.
The 15N CPMG relaxation dispersion experiments recorded on 75 μM 2H/15N-labeled PR20D25N in the presence of the proteolysis-resistant mutant CA−SP1−
Concluding Remarks.
The current CEST and CPMG relaxation dispersion data, in combination with our previous intermolecular PRE experiments (6) and real-time NMR experiments used to follow proteolysis of Gag by PR (5), lead us to propose the following overall scheme for the various steps involved in PR−Gag association (Fig. 5). Transient encounter complexes with lifetimes ≤250 μs to 500 μs, detected by intermolecular PRE measurements, involve the globular domains of Gag and serve to guide PR toward the cleavage sites located in the linkers connecting the globular domains. The latter are clearly important, as PR cleaves intact Gag and peptides corresponding to Gag cleavage sites at very different rates (6, 15). The formation of productive PR−Gag complexes at the Gag cleavage sites occurs on a slower time scale (τex ≈ 3 ms to 30 ms), modulated by conformational exchange between open and closed flap conformations of PR, both in the free and bound states. Finally, proteolytic cleavage occurs on a much slower time scale of minutes to hours.
Schematic of the sequence of events involved in protease−Gag interactions. Sparsely populated encounter complexes, detected by intermolecular PRE measurements (6), are initially formed between Gag domains and PR, τex ≤ 250 μs to 500 μs. A productive complex is assembled through a series of interconnected steps (τex ≈ 3 ms to 30 ms) derived from CEST and CPMG relaxation dispersion measurements (current work); an expanded view is enclosed in a dashed square. Gag proteolysis occurs on the time scale of minutes to hours, amenable to real-time NMR (5). In the presence of Gag, the sparsely populated PROpen state forms an intermediate PRopen*−Gag complex on the path to the productive PRclosed*−Gag complex. With a noncognate substrate, a productive complex can still form, albeit with a significantly smaller population due to the lack of adequate complementarity between the PR and the substrate, resulting in rapid flap opening and, therefore, faster dissociation.
Exchange between the major closed and minor open flap conformations in free protease [PRD25N (6, 18), and Fig. S2A] is very rapid (τex ≤ 80 μs) (19) facilitating the formation of a productive Gag−PRclosed complex which can only occur via the initial formation of a Gag−PRopen complex. Flap opening in the bound state is substantially slower (τex ≈ 3 ms to 30 ms) due to a network of complementary intermolecular interactions between the flaps/catalytic cleft and the substrate. Lack of optimal complementarity, for example as a consequence of the introduction of a β-branched Ile side chain at the P1 position of Gag cleavage sites (as in the case of the CA−SP1−
Thus, rapid opening−closing of the flaps in the absence of substrate and modulation of flap conformational dynamics due to PR−substrate interactions are critical for both the promiscuity and precision of PR and other retroviral proteases. Fit complementarity and multiple intermolecular contacts between the substrate and the catalytic cleft and flaps of PR increase the lifetime of the productive complex, significantly increasing the likelihood of proteolysis; on the other hand, a reduction in these intermolecular interactions for a noncognate substrate leads to rapid substrate release, thereby allowing retroviral proteases to sample multiple binding partners in a short time span.
Materials and Methods
Protein Expression and Purification.
Full details of cloning, expression, site-directed mutagenesis, isotope labeling (2H/15N/13C, 2H/15N, and selective methyl labeling), and purification are provided in SI Materials and Methods. NMR samples were prepared in a buffer containing 20 mM sodium phosphate, pH 6.5, 0.1 mM ZnCl2, and 1 mM TCEP, with 300 and 50 mM NaCl for ΔGag and CA−SP1−NC constructs, respectively.
NMR Spectroscopy.
All heteronuclear NMR experiments were carried out at 30 °C on Bruker 600- and 800-MHz spectrometers equipped with z-gradient triple resonance cryoprobes. Full details of the CEST and CPMG experiments, NMR spectral processing, analysis, and data fits are provided in SI Materials and Methods.
Acknowledgments
We thank R. Ghirlando and A. Szabo for useful discussions; A. Aniana, J. Baber, and D. Garrett for technical assistance; and J. Lloyd of the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK) Advanced Mass Spectrometry Core for technical support. Darunavir used in this study was obtained through the AIDS Reagent Program, Division of AIDS, National Institute of Allergies and Infectious Diseases, NIH. This work was supported by the intramural program of NIDDK/NIH and the AIDS Targeted Antiviral Program of the Office of the NIH Director (G.M.C.).
Footnotes
- ↵1To whom correspondence should be addressed. Email: mariusc{at}mail.nih.gov.
Author contributions: L.D., V.T., and G.M.C. designed research; L.D., V.T., and J.M.L. performed research; L.D., V.T., and G.M.C. analyzed data; and L.D., V.T., and G.M.C. wrote the paper.
Reviewers: J.I., University of Texas Medical Branch; and M.F.S., Howard Hughes Medical Institute, University of Maryland, Baltimore County.
The authors declare no conflict of interest.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1716098114/-/DCSupplemental.
Published under the PNAS license.
References
- ↵
- Konvalinka J,
- Kräusslich HG,
- Müller B
- ↵
- ↵
- Arts EJ,
- Hazuda DJ
- ↵
- ↵
- Deshmukh L,
- Ghirlando R,
- Clore GM
- ↵
- Deshmukh L,
- Louis JM,
- Ghirlando R,
- Clore GM
- ↵
- ↵
- Kräusslich HG, et al.
- ↵
- ↵
- Pettit SC, et al.
- ↵
- Wiegers K, et al.
- ↵
- ↵
- ↵
- ↵
- Louis JM,
- Deshmukh L,
- Sayer JM,
- Aniana A,
- Clore GM
- ↵
- Dierynck I, et al.
- ↵
- Louis JM,
- Aniana A,
- Weber IT,
- Sayer JM
- ↵
- ↵
- ↵
- Cai M, et al.
- ↵
- ↵
- ↵
- ↵
- Tropea JE,
- Nashed NT,
- Louis JM,
- Sayer JM,
- Jerina DM
- ↵
- ↵
- Pettit SC, et al.
- ↵
- Libich DS,
- Fawzi NL,
- Ying J,
- Clore GM
- Schur FK, et al.
Citation Manager Formats
Article Classifications
- Biological Sciences
- Biophysics and Computational Biology