Spatial domain organization in the HIV-1 reverse transcriptase p66 homodimer precursor probed by double electron-electron resonance EPR
See allHide authors and affiliations
Contributed by G. Marius Clore, July 9, 2019 (sent for review June 28, 2019; reviewed by David S. Cafiso, David J. Weber, and Peter E. Wright)

Significance
HIV-1 reverse transcriptase, the enzyme that catalyzes the conversion of viral RNA into DNA, is initially released from the Gag-Pol polyprotein as a p66 homodimer precursor. Further cleavage by HIV-1 protease removes the RNase H domain of only a single subunit to yield the mature p66/p51 heterodimer. We have determined the spatial subunit organization within the p66 homodimer from a large number of distances between spin labels obtained by EPR. We show that the structural subunit asymmetry of the p66/p51 heterodimer is preserved in the p66 homodimer and that only one of the RNase H domains is accessible to HIV-1 protease.
Abstract
HIV type I (HIV-1) reverse transcriptase (RT) catalyzes the conversion of viral RNA into DNA, initiating the chain of events leading to integration of proviral DNA into the host genome. RT is expressed as a single polypeptide chain within the Gag-Pol polyprotein, and either prior to or following excision by HIV-1 protease forms a 66 kDa chain (p66) homodimer precursor. Further proteolytic attack by HIV-1 protease cleaves the ribonuclease H (RNase H) domain of a single subunit to yield the mature p66/p51 heterodimer. Here, we probe the spatial domain organization within the p66 homodimer using pulsed Q-band double electron-electron resonance (DEER) EPR spectroscopy to measure a large number of intra- and intersubunit distances between spin labels attached to surface-engineered cysteines. The DEER-derived distances are fully consistent with the structural subunit asymmetry found in the mature p66/p51 heterodimer in which catalytic activity resides in the p66 subunit, while the p51 subunit purely serves as a structural scaffold. Furthermore, the p66 homodimer precursor undergoes a conformational change involving the thumb, palm, and finger domains in one of the subunits (corresponding to the p66 subunit in the mature p66/p51 heterodimer) from a closed to a partially open state upon addition of a nonnucleoside inhibitor. The relative orientation of the domains was modeled by simulated annealing driven by the DEER-derived distances. Finally, the RNase H domain that is cleaved to generate p51 in the mature p66/p51 heterodimer is present in 2 major conformers. One conformer is fully solvent accessible thereby accounting for the observation that only a single subunit of the p66 homodimer precursor is susceptible to HIV-1 protease.
- HIV-1 reverse transcriptase
- p66 homodimer precursor
- asymmetric homodimer
- site-directed spin labeling
- DEER-derived distances
Reverse transcriptase (RT) plays a central role in the HIV-1 life cycle, catalyzing the conversion of virally encoded RNA into proviral DNA, a prerequisite for integration of viral DNA into the host genome (1). HIV-1 RT is initially expressed as a p66 within the Gag-Pol polyprotein and released following proteolytic cleavage by HIV-1 protease. The Gag-Pol polyprotein may be dimeric, and p66 is released either as a preformed dimer or dimerizes subsequently (2, 3). The resulting p66 homodimer constitutes the immature HIV-1 RT precursor (3, 4). Each p66 subunit comprises 5 domains: finger, palm, thumb, connection, and RNase H. Subsequent proteolytic cleavage by HIV-1 protease removes the RNase H domain from only a single subunit of the p66 homodimer to yield the mature p66/p51 heterodimer. Although p66 and p51 differ only in the presence or absence of an RNase H domain, respectively, the p66/p51 RT heterodimer is structurally asymmetric with the 2 subunits displaying completely different relative orientations of the domains to one another (Fig. 1 A and B) (5). Polymerase activity lies exclusively within the active site formed by the finger, palm, and thumb domains of the p66 subunit, while p51 serves purely as a structural scaffold stabilizing the orientation of the 5 domains of p66 relative to one another (6). Likewise, nonnucleoside RT inhibitors (NNRTIs) bind exclusively to the palm domain of the p66 subunit at a site located about 10 Å from the catalytic site, blocking polymerase activity in a noncompetitive manner. The p66/p51 heterodimer is resistant to proteolysis as the cleavage site at the N-terminal end of the first β-strand of the RNase H domain in the p66 subunit is buried and inaccessible to protease (5).
Spin labeling and structural organization of HIV-1 RT. (A) Crystal structure of the mature HIV-1 RT p66/p51 heterodimer complexed to a DNA/RNA hybrid and the NNRTI efavirenz [PDB 4B3O (29)]. The domains of p66 are color coded as follows: finger, blue; palm, green; thumb, cyan; connection, orange; and RNase H, lilac. The p51 subunit, which serves as a scaffold, is shown in gray, and the RNA/DNA hybrid is shown in mauve. (B) Ribbon diagram of the p66 (Left) and p51 (Right) subunits from the structure in A with the sites of nitroxide spin labeling indicated. The color coding of the domains is the same as that in A. The views of p66 and p51 are superimposed on the finger domains (blue) thereby illustrating how the relative positioning of the domains in the p66 and p51 subunits is entirely different, despite the fact that p66 and p51 differ only in the presence and absence of the RNase H domain, respectively. (C) Schematic of the three nitroxide spin-labeling schemes employed to measure inter- and intramolecular distances between spin labels (shown as circles) in the p66 (blue)/p66′ (red) homodimer.
While many crystal structures of the p66/p51 mature RT have been solved in various forms (6), including free and complexed to DNA, DNA/RNA, and/or NNRTIs (SI Appendix, Table S1), the p66 homodimer precursor has been refractory to crystallization to date, and other structural information on the p66 homodimer is sparse (7⇓⇓–10). Two major questions with regard to the p66 homodimer precursor remain unresolved: First, what is the spatial domain organization within the p66 homodimer, and is the homodimer structurally symmetric or asymmetric as in the mature p66/p51; second, why is the RNase H domain of only a single p66 subunit of the p66 homodimer cleaved by protease to yield the p51 subunit? Here, we address these issues using site-directed nitroxide spin labeling (11⇓–13) in conjunction with deuteration (14, 15) and Q-band pulsed DEER EPR spectroscopy (16) to probe, by means of different labeling schemes, both intra- and intersubunit long range distances between pairs of nitroxide spin labels attached to appropriately engineered surface cysteines.
Results and Discussion
Site-Directed Nitroxide Spin Labeling of p66.
Nitroxide spin labels (R1) were introduced by conjugation of S-(1-oxyl-2,2,5,5-tetramethyl-2,5-dihydro-1H-pyrrol-3-yl)methyl methanesulfonothioate to surface exposed cysteines via a disulfide linkage (11). Two cysteines, Cys38 and Cys280, are naturally occurring and were mutated to Ala in constructs in which R1 was not attached to these residues (17). Twenty-one sites, depicted on the structures of the p66 and p51 subunits of mature RT (Fig. 1B), were employed for spin labeling: 2 in the finger domain, 2 in the palm domain, 6 in the thumb domain, 7 in the connection domain, and 4 in the RNase H domain.
Three different spin-labeling schemes were used to obtain inter- and intramolecular distances (Fig. 1C). Intermolecular (intersubunit) distances between spin labels located on the same residue in both subunits of the dimer were obtained using single-site labeled p66 (Fig. 1 C, Top). Intramolecular (intrasubunit) distances between spin labels located on different residues were obtained using double-site labeling combined with spin dilution in which double spin-labeled p66 and unlabeled p66 were mixed in a 1:10 ratio (Fig. 1 C, Middle). If the dimer is symmetric, the latter labeling scheme will yield only a single distance, but if asymmetric, 2 distances will be observed. Finally, intermolecular distances between spin labels located on different residues in the 2 subunits of the dimer were obtained from samples comprising an equal mixture of 2 differently single-site labeled p66 chains. Such samples can nominally yield 4 distances, 2 between equivalent residues (known from the single-site labeled samples) and 2 between nonequivalent residues. It should be noted that, since any given p66 homodimer only contains a pair of spin labels, the current DEER data present no issues from potential ghost peaks in DEER-derived distance distributions that can arise in samples containing 3 or more spin labels (18).
Spin-labeled p66 homodimer is readily cleaved by HIV-1 protease to yield the p66/p51 heterodimer. This is illustrated in SI Appendix, Fig. S1 for 2 single-site spin labels within the RNase H domain. In the homodimer, 2 spin labels are present, one on the RNase H domain of each subunit (SI Appendix, Fig. S1A), which gives rise to a modulated DEER echo curve (SI Appendix, Fig. S1C). Upon cleavage of one of the RNase H domains, only a single spin label remains, resulting in a large decrease in modulation depth with any remaining modulation due to residual uncleaved p66 homodimer (SI Appendix, Fig. S1C).
Q-Band DEER.
Q-band pulsed DEER EPR data were acquired on p66 homodimer samples at 50 K. The samples were fully deuterated thereby optimizing spin-label phase memory relaxation times and enabling data to be acquired for long dipolar evolution times (19). This not only permits longer distances between spin labels to be accessed (14, 15), but also increases the accuracy of the distance probability distributions P(r), extracted from the data (16) either through Gaussian analysis (20) or through Tikhonov regularization (21). Examples of DEER echo curves and the corresponding P(r) distributions using the 3 labeling schemes described in the previous section (Fig. 1C) are shown in Figs. 2–4 (as well as SI Appendix, Figs. S2 and S3). A summary of the distances between spin labels is provided in Table 1.
Examples of experimental DEER data and intermolecular P(r) distance distributions obtained for single-site spin-labeled samples of the p66 homodimer RT precursor in the absence and presence of the NNRTI delavirdine. The labeling scheme is shown in Fig. 1 C, Top. (A) Examples of background-corrected DEER echo curves (recorded at Q band and 50 K) in the absence (blue) and presence (red) of the NNRTI. (B) Corresponding DEER-derived P(r) distance distributions (blue, free; red, with NNRTI) obtained by Gaussian fitting using the program DD (20). Raw and background-corrected DEER data for all single-site spin-labeled mutants together with P(r) distributions derived by both Gaussian fitting (20) and validated Tikhonov regulation (21) are provided in SI Appendix, Fig. S2.
Examples of experimental DEER data and intramolecular P(r) distance distributions obtained from double-site spin-labeled samples of the p66 homodimer RT precursor spin diluted 10-fold with unlabeled p66 in the absence and presence of the NNRTI delavirdine. The labeling scheme is shown in Fig. 1 C, Middle. The Left show examples of background-corrected DEER echo curves (recorded at Q band and 50 K) in the absence (blue) and presence (red) of the NNRTI. The corresponding DEER-derived P(r) distance distributions (blue, free; red, with NNRTI) obtained by Gaussian fitting using the program DD (20) are shown on the Right. The positions of the peaks corresponding to the distances in the p66 and p66′ subunits (which correspond to p66 and p51, respectively, in the mature RT heterodimer) for the ligand-free samples are indicated by the dashed lines. The latter are unaffected by the addition of the NNRTI. Raw and background-corrected DEER data for all double-site spin-labeled mutants together with P(r) distributions derived by both Gaussian fitting (20) and validated Tikhonov regularization (21) are provided in SI Appendix, Fig. S3.
Examples of experimental DEER data and intermolecular P(r) distance distributions obtained from mixed spin-labeled samples of the p66 homodimer RT precursor in the absence of a NNRTI. The mixed spin-labeling scheme is shown in Fig. 1 C, Bottom. The DEER data (recorded at Q band and 50 K) and corresponding P(r) distributions for the mixed spin-labeled samples are shown in the Middle row, and the results using the constituent single-site-labeled samples are shown in the Top and Bottom rows. The raw and background-corrected DEER echo curves are shown in red and green, respectively, with the best-fit background shown in black and the best-fit DEER echo curve curves obtained by DD analysis (20) (blue) superimposed on the raw experimental curves. The P(r) distributions obtained by Gaussian analysis using DD (20) and by validated Tikhonov regularization using DeerAnalysis 2016 (21) are displayed in red and gray, respectively. The dashed lines indicate the intermolecular distances arising between spin labels attached to the same residue in the p66 and p66′ subunits. The arrows indicate intermolecular distances arising between spin labels attached to different residues in the p66 and p66′ subunits. While 2 such distances are expected (see Fig. 1B), only a single distance is observed in each case corresponding to the distance between A304C-R1 of p66 and Q394C-R1 of p66′ and between E308C-R1 of p66 and T400C-R1 of p66′; in both cases, comparison with the crystal structure of the mature p66/p51 heterodimer suggest that the second distance (i.e., between Q394C-R1 of p66 and A304C-R1 of p66′ and between T400C-R1 of p66 and E308C-R1 of p66′) overlaps with the shorter distance in the single-site-labeled samples (Bottom). It should also be noted that the intensities of the peaks in the P(r) distributions of the mixed samples (Middle) deviate quite significantly from the expected statistical distribution, presumably reflecting variations in the equilibrium dimer dissociation constants for the 4 dimer species (see Fig. 1C) present in each mixed sample.
Distances in the p66/p66 RT precursor derived from Q-band DEER EPR measurements and site-directed spin-labeling*
Comparison of the intermolecular P(r) distributions for the single-site spin-labeled p66 homodimer indicates that only distances between the thumb domains and between the palm domains are affected upon addition of the NNRTI delavirdine (Fig. 3 and SI Appendix, Fig. S2). In the mature p66/p51 heterodimer, the addition of delavirdine effects a conformational change from a closed thumb/finger conformation within the p66 subunit to an open one but has no effect on domain orientations within the p51 subunit (6, 17).
Examination of the intramolecular P(r) distributions obtained for double-site nitroxide-labeled p66 homodimer with spin dilution (with labels located in the finger and thumb domains) reveals 2 sets of distances (Fig. 3 and SI Appendix, Fig. S3) indicating the presence of asymmetry in the p66 homodimer. Comparison of these distances (Table 1) to the corresponding calculated ones from crystal structures of mature p66/p51 RT (SI Appendix, Table S2) reveals that one of the distances corresponds to the p66 subunit and the other to the p51 subunit. Furthermore, only the distance corresponding to the p66 subunit in mature RT is affected by the addition of the NNRTI delavirdine, providing further proof of structural asymmetry in the p66/p66′ homodimer precursor. (Note the prime designation for the second subunit of the homodimer is used to indicate the subunit whose RNase H domain is cleaved by protease.)
Additional intermolecular distances between spin labels at different sites can be obtained by mixing 2 different single-site-labeled samples and comparing the P(r) distributions of the mixed spin-labeled sample with the 2 corresponding single-site spin-labeled samples (Fig. 4). Given that the DEER echo curve of the mixed spin-label samples may reflect up to 4 distances (2 intramolecular and 2 intermolecular; see Fig. 1C), it is important to note that the P(r) distributions obtained by model-dependent analysis fitting a sum of Gaussians (20) or model free analysis by Tikhonov regularization (21) yields essentially the same results (Fig. 4). In the 2 examples involving spin labels located in the thumb and connection domains, only a single intermolecular distance is observed (Fig. 4, Middle). The P(r) distributions, therefore, do not allow one to ascertain a priori whether the single intermolecular distance peak in the 2 P(r) distributions arises from a single spin pair or 2 spin pairs with similar distances. Comparison with crystal structures of the mature p66/p51 heterodimer, however, predicts that, in both instances, only a single intermolecular distance in the P(r) distribution would be observable as the other intermolecular distance overlaps with one of the intramolecular distances.
A comparison of distances involving the finger, palm, thumb, connection domain, as well as the RNase H domain corresponding to the p66 subunit in the mature p66/p51 heterodimer, reveals excellent agreement between the DEER-derived distances in the p66/p66′ homodimer and the calculated distances in 22 p66/p51 crystal structures (Table 1 and SI Appendix, Table S2, Fig. 5 and SI Appendix, Figs. S4 and S5). Moreover, the DEER-derived distances obtained for free p66/p66′ are much better correlated with the corresponding distances in the crystal structures of p66/p51 in the closed I (free) state, than in open (ligand-bound) states, while the reverse is true of p66/p66′ in the presence of the NNRTI delavirdine (Fig. 5 and SI Appendix, Figs. S4 and S5). The differences in fit are largely a consequence of differences in the intra- and intermolecular distances involving the thumb domain of the p66 subunit of the p66/p66′ homodimer (Fig. 5A and SI Appendix, Figs. S4 and S5).
Correlation between experimental DEER-derived distances between spin labels in the homodimeric p66/p66′ RT precursor in the absence and presence of the NNRTI delavirdine and the corresponding calculated distances from crystal structures of the mature p66/p51 RT heterodimer. (A) Correlation plots between the experimental DEER-derived distances and the corresponding calculated distances from closed [PDB 2IAJ (33)] and open [PDB 4B3O (29)] crystal structures. The color coding is as follows: red circles, intra- and intermolecular distances involving the thumb domain; green filled-in circles, intramolecular distances involving the p51 domain; blue circles, all remaining distances. (B) Plot of the R2 Pierson correlation coefficient between the experimental DEER-derived distances obtained in the absence (blue) and presence (red) of the NNRTI delavirdine and the calculated distances from 22 crystal structures of the mature p66/p51 heterodimer, providing examples of closed I, closed II, partially open, open I, and open II conformations of the finger/palm/thumb domains of the p66 subunit. The mean distances between spin labels from the crystal structures were obtained from P(r) distributions calculated using the program MMMv2013.2 (34). The numbering of the crystal structures corresponds to the numbering and PDB codes provided in SI Appendix, Table S1.
Modeling the p66 Subunit of the p66/p66′ Precursor from DEER-Derived Distances Based on Crystal Structures of p66/p51.
To model the spatial configuration of the domains within the p66 subunit of the p66/p66′ homodimer, we made use of conjoined rigid body/torsion angle simulated annealing (22) in Xplor-NIH (23, 24) driven by the DEER-derived distances between spin labels (see SI Appendix, SI Structural Modeling Section for details). The distance restraints were represented by square-well potentials which reflect the uncertainty in the mean distances obtained from the DEER-derived P(r) distributions (SI Appendix, Table S3). In these calculations, we excluded distances involving the RNase H domain of p66′ (the domain that is cleaved by HIV-1 protease), and the coordinates of p51 and the connection domain (residues 316–324) of p66 were held fixed, while the finger (residues 1–100 and 113–152), palm (residues 103–110 and 155–241), thumb (residues 244–313), and RNase H (residues 428–600) domains of p66 were allowed to move as rigid bodies with only a few linker residues connecting the domains given torsional degrees of freedom (specifically residues 101–102, 111–112, 153–154, 242–243, 314–315, and 425–427). The rationale for fixing the coordinates of p51 and the connection domain of p66 is that a comparison of the crystal structures of p66/p51 in various free and liganded states (including complexes with nucleic acids and/or NNRTIs) reveals that these regions are structurally invariant within the uncertainties of the present modeling calculations (SI Appendix, Fig. S6). Moreover, the DEER-derived intermolecular distances obtained from samples with a single-site spin label within the connection domain (Table 1) are in near-quantitative agreement with those calculated from crystal structures (SI Appendix, Table S2). In addition, based on the comparison of the DEER-derived experimental distances and the distances in the crystal structures presented in the previous section, upper limits of 20, 8, and 4 Å were placed on backbone rms displacements of the thumb, palm, and RNase H domains, respectively, to prevent undue large-scale domain rearrangements. The modeling calculations were carried out starting from representative crystal structure coordinates of the 5 conformational classes of p66/p51 RT: closed I [1DLO (25)], closed II [3DLK (26)], partially open [3KK3 (27)], open I [1IKW (28)], and open II [4B3O (29)].
The resulting 5 lowest energy structures (1 structure from each set of starting coordinates) obtained with the DEER-derived distances in the absence and presence of the NNRTI delavirdine are shown in Fig. 6 A and B, respectively, and the agreement with the experimental distance restraints is provided in SI Appendix, Table S4. The overall configuration of the domains within the p66 subunit of the p66/p66′ homodimer is the same as that in the mature p66/p51 heterodimer. The major difference between structures in the absence and presence of the NNRTI delavirdine lies in the position of the thumb domain which corresponds to the closed I conformation in the free state (Fig. 6A) and, surprisingly, the partially open conformation in the NNRTI-liganded state (Fig. 6B). Thus, the binding of a NNRTI to the p66/p66′ homodimer induces a smaller structural displacement of the thumb domain toward an open conformation than that observed in crystal structures of mature p66/p51 heterodimer complexed to NNRTIs which are in the open I conformation (SI Appendix, Fig. S7).
Modeling of the spatial disposition of the domains of p66 within the homodimeric p66/p66′ HIV-1 RT precursor in the absence and presence of the NNRTI delavirdine. (A) Unliganded closed state and (B) NNRTI-bound partially open state. Modeling was carried out by conjoined rigid body/torsion angle simulated annealing in Xplor-NIH (23) driven by DEER-derived intra- and intermolecular distances between spin labels starting from 5 different crystal structures, closed I [PDB 1DLO (25)], closed II [PDB 3DLK (26)], partially open [PDB code 3KK3 (27)], open I [PDB code 1IKW (28)], and open II [PDB 4B3O (29)]. The lowest energy structure from each starting coordinate is displayed; note that the backbone rms displacement between structures starting from the same initial coordinates is significantly smaller than that starting from different initial structures (displayed). This is due to small differences in the domain coordinates between the various crystal structures, especially evident in linkers and loops. These calculations do not include distances involving the RNase H domain of the p66′ subunit. The p66′ subunit without the RNase H domain (i.e., equivalent to p51) is displayed as a gray surface representation and held fixed together with the connection domain of p66 since these structural elements constitute a single, essentially invariant unit in the mature p66/p51 heterodimer (SI Appendix, Fig. S6). The domains of p66 (with the exception of the connection domain) were allowed to move as rigid bodies and 2 to 3 residues within the linkers (white) connecting them are given torsional degrees of freedom. The finger, palm, thumb, connection, and RNase H domains are shown as blue, green, cyan, orange, and lilac ribbons, respectively. The superpositions on the Left, displaying all of p66, are best fitted to the p51 portion of the p66′ subunit and the connection domain of the p66 subunit. The superpositions on the Right display the finger, palm, and thumb domains of the p66 subunit best fitted to the finger and palm domains. The red ribbons in the Left show the superposition with the closest crystal structures [1DLO (25) in A, 3KK3 (27) in B]; the equivalent superpositions with other crystal structures are shown in SI Appendix, Fig. S7.
Conformational Space Sampled by the RNase H Domain of the p66′ Subunit of the p66/p66′ Homodimer.
Examples of DEER echo curves, together with the corresponding P(r) distributions for double-site spin-labeled samples with spin dilution (intramolecular interactions) and single-site spin-labeled samples (intermolecular interactions) involving, at least, a single spin label within the RNase H domain are shown in Fig. 7 (as well as SI Appendix, Fig. S3). For all 3 single-site spin-labeled samples (A454C-R1, G504C-R1, and Q547C-R1, Fig. 7, Right), 2 distinct distance distributions are observed. Given that the position of the RNase H domain of the p66 subunit is well defined (Fig. 6), these data indicate that the RNase H domain of p66′ exists in 2 major conformers. Two sets of intramolecular distances for p66′ are also observed for one of the double-site spin-labeled samples (T49C-R1/A454C-R1, Fig. 7, Top Left).
Examples of experimental DEER data and corresponding intra- and intermolecular P(r) distance distributions involving the RNase H domains of the p66/p66′ homodimer RT precursor. The Left-hand panels show examples of double-site spin labeling with spin dilution (1 in 10) giving rise to intramolecular distances (see Fig. 1 C, Middle row); the Right-hand panels show examples of single-site spin labeling giving rise to intermolecular distances (see Fig. 1 C, Top row). The raw and background-corrected DEER echo curves (recorded at Q band and 50 K) are shown in red and green, respectively, with the best-fit background shown in black, and the best-fit DEER echo curve curves obtained by DD analysis (20) (blue) superimposed on the raw experimental curves. The P(r) distributions obtained by Gaussian analysis using DD (20) and by validated Tikhonov regularization using DeerAnalysis 2016 (21) are displayed in red and gray, respectively. In the case of intramolecular distances, the identity of the distance corresponding to the p66 subunit of the mature p66/p51 heterodimer is easily ascertained by comparison with the p66/p51 crystal structures (SI Appendix, Table S2). In one instance involving an intramolecular distance between T39C-R1 and A454C-R1 in the p66′ subunit, and in all 3 examples of intermolecular distances, 2 distances are observed indicating that the RNase H domain of the p66′ subunit exists in 2 conformer ensembles (each characterized by a distribution of states).
To structurally model the RNase H domain of the p66′ subunit, two-membered ensemble calculations were carried out in Xplor-NIH (23, 24) starting from the structural models of the closed conformation of p66/p66′ in the free state. The linker region (residues 426–440) between the connection domain and the RNase H domain of p66′ was given Cartesian degrees of freedom, the RNase H domain of p66′ was allowed to move as a rigid body, all sidechains with the exceptions of those with a spin label were allowed torsion angle motion, and all remaining portions of the p66/p66′ homodimer were held fixed. In the 4 instances where 2 distinct P(r) distributions were observed for the p66′ subunit (see above), a double-well potential was employed (SI Appendix, SI Structural Modeling Section and Eq. S1) which allows distance assignments to switch between ensemble members as it was not possible to associate the distances with the corresponding substates a priori. The results are illustrated in Fig. 8 and the agreement with the DEER-derived experimental distance restraints is given in SI Appendix, Table S4. The 2 conformational states of the p66′ RNase H domain are distinct from one another with the RNase H domain of state I associated with the shorter distances between spin labels, making contact with the p66 subunit (Fig. 8, Left), while that of state II, characterized by the longer distances between spin labels is fully solvent accessible (Fig. 8, Right). It is important to stress that the positions of the p66′ RNase H domain displayed in Fig. 8 represent mean positions and do not report on the extent of the conformational space sampled by the 2 states. This is because the distance restraints employed represent distances between the centroids of pairs of nitroxide spin labels. Clearly, the p66′ RNase H domain of state II must sample a large conelike conformational ensemble, given that the domain makes no contact with the rest of the protein.
Modeling the conformational states of the RNase H domain of the p66′ subunit within the p66/p66′ homodimer RT precursor. Modeling was carried out by simulated annealing driven by DEER-derived intra- and intermolecular distances between spin labels involving the RNase H domain of the p66′ subunit. Two views are displayed. A superposition of 5 structures (ribbons) is shown for each p66′ RNase H conformer with state I in blue (Left) and state II in red (Right); the linker connecting the RNase H domain to the connection domain is shown in white and was given Cartesian degrees of freedom while the RNase H domain was allowed to move as a rigid body. The remainder of the p66′ subunit and the p66 subunit are shown as surface representations in dark and light gray, respectively. Note that only the mean distance for each spin label/spin label interaction is employed and the spin labels are represented by single centroid positions (SI Appendix); hence, the two computed states of the p66′ RNase H domain represent the mean positions of the ensemble of conformers. While the mean positions of the p66′ RNase H domain of the 2 conformers are fairly well defined, the linker connecting the RNase H domain to the connection domain of p66′ is ill-defined as expected since the DEER-derived distance restraints provide no structural information on the linker other than end-to-end distance.
It seems likely that HIV-1 protease probably cleaves the p66′ RNase H domain preferentially in state II, given that the RNase H domain in that state is fully solvent accessible, but in all likelihood, states I and II rapidly interconvert in solution.
Concluding Remarks.
The current DEER-derived distances between spin labels attached to surface engineered cysteine residues demonstrate unambiguously that the p66/p66′ homodimer RT precursor is asymmetric and that the spatial arrangement and relative orientations of the domains within the 2 subunits (excluding the RNase H domain of p66′) is the same as that in the mature p66/p51 heterodimer with the p66′ subunit serving as a scaffold to stabilize the polymerase active site of the p66 subunit. The data also show that the spatial proximity of the thumb and finger domains of the p66 subunit in the homodimer precursor is modulated by interaction with a NNRTI: In the free state, the thumb and finger domains of the p66 subunit form a closed state that blocks access to double-stranded nucleic acid (Fig. 6A); upon binding of a NNRTI to the palm domain that constitutes the floor of the cleft enclosed by the finger and thumb domains (Fig. 1), a more open conformation is favored that is very similar to that of the partially open state of the mature p66/p51 heterodimer (Fig. 6B). This observation implies that both the p66/p66′ homodimer precursor and the mature p66/p51 heterodimer are targets for NNRTIs. While unusual for a homodimer, structural asymmetry of the p66/p66′ homodimer RT precursor is crucial on several fronts: First, it ensures that the precursor is enzymatically active as a RT (4); second, it ensures that only a single RNase H domain is cleaved by HIV-1 protease to generate the mature p66/p51 heterodimer; and third, no conformational rearrangements are required subsequent to protease cleavage. In addition, the structural asymmetry of the p66/p66′ homodimer precursor accounts for the observation that the p66 homodimer precursor is enzymatically active, albeit less so than the mature p66/p51 heterodimer (30) owing to reduced stability of the precursor homodimer relative to the mature heterodimer (31, 32).
In the p66/p66′ homodimer precursor, only the cleavage site located at the N terminus of the first β-strand of the p66′ RNase H domain is accessible to proteolytic attack, while the same site in the p66 subunit is buried and, hence, resistant to proteolysis. The RNase H domain of the p66′ subunit samples, at least, 2 major conformers (Fig. 8), one of which interacts with the p66 subunit (state I), while the other is fully solvent accessible (state II). We speculate that interaction with and cleavage by HIV-1 protease only involves state II and that, in solution, states I and II rapidly interconvert between one another. Moreover, it is also likely that the presence of an excited form of state I involving transient partial unfolding at the N-terminal end of the β1 strand of the p66′ RNase H domain is required to fully unmask the cleavage site and permit binding to HIV-1 protease. Such an excited form is likely to be favored in state II by the presence of the long disordered linker between the connection and the RNase H domains of p66′, as well as the absence of any stabilizing interactions between the RNase H domain of p66′ and the other domains in the p66/p66′ precursor.
Experiment
Details of protein expression and purification, deuteration, site-specific nitroxide spin labeling (R1), pulsed Q-band DEER EPR spectroscopy, analysis of DEER echo curves, generation of P(r) probability distributions from DEER data, and structural modeling based on the DEER-derived experimental distances are provided in the SI Appendix.
Acknowledgments
We thank Dusty Baber and Dan Garrett for technical support, Wei Yang for the gift of the HIV-1 RT p66 clone, John Louis for the gift of HIV-1 protease, and Steven Hughes for helpful discussions. This work was supported by the Intramural Program of the National Institute of Diabetes and Digestive and Kidney Diseases and by the Office of AIDS Research, NIH (to G.M.C.).
Footnotes
- ↵1To whom correspondence may be addressed. Email: mariusc{at}mail.nih.gov.
Author contributions: T.S., C.D.S., and G.M.C. designed research, performed research, analyzed data, and wrote the paper.
Reviewers: D.S.C., University of Virginia; D.J.W., University of Maryland, Baltimore; and P.E.W., The Scripps Research Institute.
The authors declare no conflict of interest.
See Commentary on page 17614.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1911086116/-/DCSupplemental.
Published under the PNAS license.
References
- ↵
- ↵
- ↵
- ↵
- R. A. Marko et al
- ↵
- L. A. Kohlstaedt,
- J. Wang,
- J. M. Friedman,
- P. A. Rice,
- T. A. Steitz
- ↵
- ↵
- ↵
- ↵
- R. E. London
- ↵
- ↵
- ↵
- L. M. Wingler et al
- ↵
- M. Stevens et al
- ↵
- H. El Mkami,
- R. Ward,
- A. Bowman,
- T. Owen-Hughes,
- D. G. Norman
- ↵
- ↵
- ↵
- T. Schmidt,
- L. Tian,
- G. M. Clore
- ↵
- ↵
- J. L. Baber,
- J. M. Louis,
- G. M. Clore
- ↵
- ↵
- ↵
- ↵
- ↵
- C. D. Schwieters,
- G. A. Bermejo,
- G. M. Clore
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- M. C. Starnes,
- W. Y. Gao,
- R. Y. C. Ting,
- Y. C. Cheng
- ↵
- T. Restle,
- B. Müller,
- R. S. Goody
- ↵
- T. Schmidt,
- R. Ghirlando,
- J. Baber,
- G. M. Clore
- ↵
- ↵
Citation Manager Formats
Article Classifications
- Biological Sciences
- Biophysics and Computational Biology
- Physical Sciences
- Biophysics and Computational Biology
See related content: