Highly polarized C-terminal transition state of the leucine-rich repeat domain of PP32 is governed by local stability
See allHide authors and affiliations
Edited* by S. Walter Englander, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, and approved March 11, 2015 (received for review June 27, 2014)

Significance
Proteins fold to their native structure in significantly shorter timescales than expected for a random search mechanism. Rapid folding, which is likely to be important to optimize yield and avoid misfolded, aggregated states, has been proposed to result from specific folding pathways. However, mapping such pathways and understanding the principles by which they are selected remain central problems in protein science. Repeat proteins, containing tandem repeated structural units, greatly simplify folding studies. Here, we exploit the linear architecture of a β-strand-containing leucine-rich repeat protein and discover that folding is highly polarized toward the C terminus. By measuring the energy distribution, we observe that polarization is guided by local stability.
Abstract
The leucine-rich repeat domain of PP32 is composed of five β-strand-containing repeats anchored by terminal caps. These repeats differ in sequence but are similar in structure, providing a means to connect topology, sequence, and folding pathway selection. Through kinetic studies of PP32, we find folding to be rate-limited by the formation of an on-pathway intermediate. Destabilizing core substitutions reveal a transition state ensemble that is highly polarized toward the C-terminal repeat and cap. To determine if this nucleus for folding corresponds to the most stable region of PP32, we monitored amide hydrogen exchange by NMR spectroscopy. Indeed, we find the highest protection to be biased toward the C terminus. Sequence manipulations that destabilize the C terminus spread out the transition state toward the middle of the protein. Consistent with results for helical ankyrin repeat proteins, these results suggest that local stabilities determine folding pathways.
Almost 50 y ago, Cyrus Levinthal suggested that protein folding is a guided process, rather than a random search (1). Since then, it has been proposed that rapid folding can be accomplished via a single preferred pathway (2⇓–4). However, contrasting models of funneled, nonspecific folding capture key aspects of the folding process, including specific intermediates and transient stable structures (5, 6). To what extent does protein folding follow a single pathway, and to the extent that a single pathway dominates, what determines that pathway?
For some globular proteins, specific kinetic intermediates have been identified with features of thermodynamically stable substructures of the native state. This observation suggests specific pathways that correspond to low energy folding routes (7⇓⇓⇓⇓–12). However, directly testing this thermodynamic control of folding pathway selections requires mapping of local stability. Hydrogen exchange methods have provided the most detailed energy maps of native proteins (13), but sequence-distant contacts and irregular tertiary structures of globular proteins tend to blur the boundaries of local stability.
In contrast to globular proteins, elongated repeat proteins comprise tandem repeated secondary structural units, giving rise to regular topology lacking sequence-distant contacts. As a result, the contributions made by different regions of a repeat protein to its folding thermodynamics and kinetics can be easily dissected and compared. If local stability dictates folding, a preferred pathway should be observed, because repeats differ in sequence. If topology drives folding, multiple parallel pathways would be observed, because the repeats are similar in structure. For designed consensus α-helical ankyrin repeat constructs, with repeats of nearly identical sequence, folding proceeds via parallel pathways (14). In contrast, naturally occurring ankyrin repeat proteins with repeats of high sequence variation fold through preferred pathways (15⇓⇓–18). Regions that initiate folding correspond to low-energy structures, based on energy landscapes determined at single repeat resolution, consistent with low-energy folding routes for α-helical repeat proteins (16, 19⇓–21).
Much less is known about the folding mechanisms of β-strand-containing repeat proteins, and particularly, leucine-rich repeat (LRR) proteins. To date, the folding pathways for only two LRR proteins have been elucidated. Like the ankyrin repeat domains, the LRR domain of Internalin B (InlB) also folds through a somewhat polarized pathway, with the transition state ensemble involving the N-terminal capping motif and first three repeats (22). The folding of the mRNA exporter protein TAP (TAPLRR) is more complicated, with a triangular mechanism and a diffuse transition state involving most of the protein (23). However, how these pathways are selected remains unknown.
To test if LRR proteins fold via low-energy pathways, we investigated the folding mechanism and stability distribution of the LRR domain of PP32. This domain is composed of a linear array of five repeats, each containing a β-strand followed by an extended structure, a 310-helix, or a short α-helix (Fig. 1) (24). The N and C termini of PP32 are capped by a helix–loop–helix motif and a β-hairpin, respectively. Deletion and mutational analysis showed that whereas the N-terminal cap contributes stability, the C-terminal cap is critical for stability: without the C-terminal cap, PP32 is entirely unfolded (25). Here we investigate the effects of destabilizing substitutions on folding kinetics and find that PP32 folds via an on-pathway intermediate with a highly polarized C-terminal pathway. We also map residue-specific stabilities throughout the LRR domain of PP32 using amide hydrogen exchange by NMR spectroscopy and find that the C-terminal repeat and cap have the highest protection factors. Thus, the region to first become structured during refolding corresponds to the most stable part of the protein. Destabilizing the C terminus of PP32 causes the highly polarized folding pathway to become much more diffuse. These results highlight the importance of local stabilities in determining the folding mechanisms of LRR proteins, as observed with α-helical repeat proteins.
Structure and sequence of PP32. (A) Ribbon representation of the crystal structure of the LRR domain of PP32 (24). The N-cap, repeats 1–5, and the C-cap are colored as in B. The residues substituted for Φ-value analysis are shown in sphere representation. (B) Sequence alignment of the five LRRs of PP32. The five C-terminal residues (150–154) included in this study but not present in the crystal structure are highlighted in yellow.
Results
Spectroscopic Changes upon PP32 Unfolding.
To study the folding of PP32, we monitored structural changes by circular dichroism (CD) at 220 nm and by fluorescence, which monitor β-sheet structure and the tryptophan residue at the C terminus, respectively (Fig. 2). Urea-induced equilibrium unfolding monitored by both techniques resulted in the same Cm and
Solution spectroscopy and equilibrium unfolding of PP32. (A) Far-UV CD spectrum showing characteristic β-strand signal with a minimum at 217 nm. (B) Tryptophan fluorescence emission spectra in buffer (continuous black line) and 6 M urea (broken red line). (C) Urea-induced denaturation monitored by CD at 220 nm (filled black circle, continuous black line) and fluorescence (open red circle, broken red line). Lines result from fitting a two-state unfolding model to the data. The CD data are adapted from ref. 25.
Refolding and unfolding kinetics of PP32. (A) Fluorescence-detected refolding (circles) from 4.6 to 0.42 M urea and (B) unfolding to 6 M urea. Red and black lines show single- and double-exponential fits, respectively. (Upper) Residuals. (C) Urea dependence of rate constants and (D) associated amplitudes for folding. Circles, major refolding and unfolding phases; triangles, minor refolding phase; diamonds, minor unfolding phase. The black lines result from fitting of (C) rate constants (excluding the minor proline-limited refolding phase) and (D) unfolding amplitudes using a sequential three-state model (Experimental Procedures). Straight lines in C show fitted folding (red) and unfolding (green) rate constants for conversion between D and I (continuous) and between I and N (dashed). (C and D, Upper) Residuals from the three-state model.
Chevron plots for variants of PP32. Urea dependence of fluorescence-monitored rate constants for the major refolding and unfolding phases (circles), the minor refolding phase (triangles), and the minor unfolding phase (diamonds) of WT PP32 (black chevron in each panel), and variants (colors are as in Fig. 4). Lines result from fitting a sequential three-state kinetic model to the data.
Urea Dependence of Folding and Unfolding Kinetics.
To gain insights into the folding pathway of PP32, we monitored fluorescence changes upon rapid dilution of native and denatured protein to various urea concentrations by stopped flow. Both refolding and unfolding reactions appear to be multiphasic. The data are poorly fitted by a single-exponential model (red lines in Fig. 3 A and B), resulting in large, nonrandom residuals (Fig. 3 A and B, Upper). In contrast, a double-exponential model fits both folding and unfolding transients quite well (black lines in Fig. 3 A and B), resulting in small, random residuals (Fig. 3 A and B, Upper). For refolding, a fast major phase is followed by a slow minor phase. Amplitudes for both refolding phases have the same sign, in the refolding direction. For unfolding, an early lag is followed by a slow major phase (Fig. 3B, Inset). Amplitudes for the unfolding phases have opposite signs: the amplitude of the major phase is in the unfolding direction, whereas that for the minor phase is in the refolding direction.
PP32 has five prolines, each in trans configuration (Fig. S1A). Cis–trans prolyl isomerization reactions are known to produce a slow, denaturant-insensitive phase during refolding, with a rate constant between 0.01 and 0.1 s−1 (26⇓⇓⇓⇓⇓⇓⇓⇓⇓⇓⇓–38). The magnitude and denaturant dependence of the rate constant for the slow refolding phase in PP32 (triangles in Fig. 3C) are consistent with prolyl isomerization. To further explore whether the slow refolding phase results from prolyl isomerization, we carried out interrupted unfolding double-jump experiments, in which the protein is unfolded, held in unfolding conditions for various delay times, and then refolded (27, 34, 35, 39). If this slow phase is due to prolyl isomerization, the associated amplitudes would be small at short delay times, because most of the prolines have yet to convert to the cis conformation upon unfolding. Indeed, for PP32, the ratio of the slow phase amplitude to the total amplitude increases as a function of delay time (Fig. S1B), further supporting the slow refolding phase as prolyl isomerization phase.
In contrast, the major (fast) phase for refolding has a significant linear denaturant dependence, likely corresponding to a major conformational step in folding. This refolding phase joins up to the major unfolding phase at the equilibrium unfolding midpoint to form a V-shaped chevron plot (circles in Fig. 3C). Unlike the refolding phases, both unfolding phases show significant denaturant dependencies. Moreover, the major phase of the unfolding arm shows a rollover at 4.4 M urea. This type of nonlinearity is commonly observed for proteins folding through an intermediate (40). For PP32, an intermediate in unfolding is supported by the observation of a second kinetic phase (squares in Fig. 3C). The rollover and minor phase in unfolding are consistent with a populated on-pathway kinetic intermediate (39). This type of unfolding intermediate has been observed in other linear repeat proteins, including the ankyrin domains of the Notch receptor (39) and IκBα (41), and in an ankyrin consensus series (14).
To obtain stepwise rate constants and their urea dependences, we globally fit a sequential three-state model (D⇌I⇌N; Experimental Procedures) to the major refolding rate constants, the major and minor unfolding rate constants, and the associated unfolding amplitudes (black lines in Fig. 3 C and D). These data are well-fitted by the model, as illustrated by the small, random residuals (Fig. 3 C and D, Upper). Moreover, both the kinetically deduced
Fitted parameters for two-state equilibrium unfolding and three-state kinetic refolding and unfolding PP32 variants
Structure Formation Along the PP32 Folding Pathway.
To determine the structure of PP32 along the folding pathway, we investigated the equilibrium and kinetic effects of point substitutions along the molecule, with the idea that substitutions slowing down folding kinetics are in regions that are structured in the transition states. We made substitutions in the N- and C-terminal caps, as well as the conserved leucines on the β-strand and convex side of the repeats (Fig. 1). Except for K137G, D146L, and Y131F, all substitutions are structurally conservative, removing only part of a hydrophobic side chain, and should not introduce new interactions. All variants are destabilized by at least 1 kcal⋅mol−1 (except for K137G), but remain largely folded (Fig. S2 and Table 1), allowing for accurate determination of Φ-values (42).
As with WT PP32, the refolding kinetics of all variants are biphasic (Fig. 4). The slow minor phase of each variant is similar to that of WT, consistent with it being a proline isomerization phase, rather than a folding reaction. The fast refolding phase is unaffected by substitutions from the N-terminal helical cap to repeat three (L93A), but is significantly slowed down by substitutions in the C terminus (repeats four and five, and capping motif).
In contrast to WT PP32, for most variants (except I7A, V19A, L60A, and L69A), only a single unfolding phase is observed (Fig. 4), suggesting that the on-pathway intermediate is no longer populated to high levels. However, rollovers at high urea concentrations persist in the chevron plots of all variants, suggesting that the intermediate continues to influence the kinetics of unfolding. Whereas the unfolding reactions of the N-terminal variants (I7A, L11A, V19A, L22A, L37A, L47A, L60A, and L69A) significantly speed up, those of variants in the central repeats (L83A, L93A, L109A, and L118A) are slightly faster, and those of the C-terminal variants (Y131A, V135G, K137G, L139A, L142A, L145A, and D146L) remain largely unaffected.
To quantitatively map out the folding pathway, we fit a sequential three-state model to the chevron plot of each variant (Table 1 and solid lines in Fig. 4). For variants that show two unfolding phases (I7A, V19A, and L69A), the rate constants and amplitudes are well-determined by the model. For the variants without an observed minor unfolding phase, there is not enough information to determine the full parameter set. To constrain these fits, we set mNI and mID to WT values. The fitted curves describe the data accurately and the resulting parameters are well-determined. Using equilibrium
Φ-values and the folding pathway of PP32. (A) Folding Φ-values for the first (rate-limiting) transition state (TS1), intermediate (I), and second transition state (TS2). Residues that are structured (Φ > 0.5), partially structured (0.25 < Φ < 0.5), and unstructured (Φ < 0.25) in TS1 are in cyan, magenta, and black, respectively. (B) A reaction coordinate for folding of PP32. The structural models of TS1, I, and TS2 are derived from the Φ-values in A. The location of each successive state (from D to N) is based on kinetic m-values (the sum of the m-values of preceding steps, as determined from the sequential three-state fit of the WT PP32 kinetic data; Table 1).
This Φ-value-based map of the folding pathway is in good agreement with kinetic m-values from the sequential three-state analysis of the WT PP32 domain (Table 1 and Fig. 5). The kinetic m-value for forming the rate-limiting TS1 is ∼30% (0.88/2.95) of the total m-value for folding, consistent with structure formation in repeats four and five. The kinetic m-value for forming I from TS1 is small, which coincides with partial structure formation in some of the N-terminal repeats, suggesting a fairly high level of solvent-accessible surface area in these N-terminal repeats. The kinetic m-value for forming TS2 from I is large (Table 1), corresponding to substantial organization of the N-terminal repeats (Fig. 5A).
If PP32 folds via a polarized C-terminal pathway, removing the N-terminal region should not affect the first step of folding. To test this prediction, we measured folding kinetics of a PP32 construct that lacks the N-terminal α-helical capping motif (ΔNCap PP32) (25). The urea-dependent refolding phases of ΔNCap PP32 are similar to those of WT PP32 (Fig. 4), whereas the slow unfolding phase significantly speeds up. These observations are consistent with the results from Φ-value analysis, confirming that the N-terminal cap is not involved in the first step of folding.
Residue-Specific Hydrogen Exchange of PP32.
To map out the distribution of stability along PP32, we monitored exchange rates of backbone amide hydrogens with solvent using NMR. Hydrogen exchange (HX) provides residue-specific information on the distribution of stability within the native state (13). The amide hydrogens in PP32 exchange over very different timescales: many peaks disappear before acquisition of the first spectrum, whereas others persist for over 3 mo (Fig. 6 A and B). For peaks with measurable exchange profiles, rate constants for exchange (kex) were fitted from peak heights, and residue-specific protection factors (PFs) were calculated as the ratio of the observed kex values to those for exchange from fully solvent-exposed unstructured state (kint; Experimental Procedures).
Residue-specific hydrogen exchange of PP32. (A) Heteronuclear single quantum coherence (HSQC) spectra after PP32 is exchanged from H2O to D2O buffer for various times: 21 min (black), 19 h (blue), 10 d (yellow), and 69 d (red). (B) Exponential decays of peak heights of representative amide protons with very different exchange rates. The lines show single-exponential fits to the data. (C) Protection factors (kex/kint) for main-chain amides with quantifiable protection. The β-sheet residues are red-filled circles; non–β-sheet residues are black circles. The solid curve results from fitting a cosine-modulated Gaussian distribution to the data (Materials and Methods). The N terminus shows uniformly low protection. The region with highest protection factors is toward the C terminus. (D) Mapping of protection factors onto the structure of PP32. For residues with main-chain amides that have quantifiable protection and are not exposed to solvent, Cα’s are shown in spheres, and colored from white to blue, with blue having the highest protection factors.
Plotting the PFs as a function of sequence and mapping the PFs onto PP32 structure reveal increasing protection from the N terminus to the C terminus (Fig. 6 C and D). This trend (at pD 6.7) is also observed for pD 6.2 and pD 7.2, indicating that exchange is limited by stability, and not by opening kinetics. Therefore, the C-terminal region appears to have much a higher local stability than the N-terminal region, consistent with our previous findings that PP32 completely unfolds upon removal of part of the C-terminal cap, but retains C-terminal structure upon removal the N-terminal α-helical cap (25). In addition, there appears to be substantial variation of protection factors within each repeat. The β-strands and neighboring residues show the greatest protection (red points in Fig. 6C), and the sequences opposite the β-strand show the greatest exchange. To attempt to capture both the high C-terminal protection and the local variation within repeats, we fitted protection factors using a normal distribution (shifted toward the C terminus) modulated by a cosine function (solid curve in Fig. 6C; Experimental Procedures). Although this model does not capture all of the variation in protection factors (especially the local variation at the C terminus), the asymmetric distribution of stability is reproduced, as is the periodicity of the local variation. In particular, the fitted frequency of the cosine term–22 residues per cycle–matches the periodicity of the sequence repeats (Fig. 1).
Transition state structure can be directly mapped to local stability by comparing Φ-values with PFs at the same sites (Fig. 7). The amide hydrogens of these residues are buried or involved in hydrogen bonds. Moreover, the two substituted leucines within each repeat are structurally conserved, providing direct comparison of different regions of PP32. The stability of PP32 increases gradually toward the C terminus and is highest for repeats four and five and part of the C-terminal cap. Although the stability map is more spread out than the Φ-value map, both are polarized toward the C terminus, suggesting that the folding pathway of PP32 is determined by local stability.
Folding pathway of PP32. Ribbon representation, with Φ-value substitutions shown as spheres. Residues are shaded from white to black, with black having the highest local stability (Upper) and Φ-value (Lower). For direct comparison, only sites with both Φ-value and local stability data are shaded. Based on the coincidence of high protection factors and Φ-values, folding is initiated at the most stable region of PP32.
Consequences of Destabilizing the C Terminus on the Folding Pathway of PP32.
To further test if stability dictates the location of the transition state ensemble, we determined the folding pathway of a C-terminally destabilized PP32 variant, Y131F/D146L (YD). Because the structure of YD is highly similar to that of WT PP32 (25), the destabilizing effect should be localized to the C terminus. Although YD is significantly destabilized, it retains sufficient stability for Φ-value analysis (Fig. S2D and Table 1). For direct comparison, we made the same substitutions along the YD construct as we made in WT PP32 (Fig. 8A). The YD variants are highly destabilized, to about the same extent as in the WT background. Complete unfolding transitions are observed for all but two variants (L93A and L139A; Fig. S2E).
Effects of destabilizing the C terminus to the folding pathway of PP32. (A) Ribbon representation of PP32. Residues Y131 and D146, which we have substituted to destabilize the C terminus, are shown as black sticks. Residues substituted in Φ-value analysis in the Y131F/D146L (YD) background are shown in sphere representation. (B) Urea dependence of fluorescence-monitored rate constants for the major refolding and unfolding phases of variants in WT and YD backgrounds. (C) Φ-values for TS1 in the WT and YD backgrounds. The transition state for the YD construct is less polarized than for WT PP32.
To determine the effects of the substitutions on folding kinetics as the C terminus is destabilized, we compare the major phases of the chevron plots for the variants in the WT and YD backgrounds (Fig. 8B). In the YD background, we observe increased sensitivity to substitutions at the N terminus and decreased sensitivity to substitutions at the C terminus, compared with WT PP32. To quantitatively assess the effects of destabilizing the C terminus on the structure of transition state ensemble for the rate-limiting step in refolding of PP32, we compare Φ-values for the five variants in the YD and WT backgrounds (Fig. 8C). The Φ-values for L139A are the highest in both backgrounds, indicating that both fold via their C-termini. However, whereas Φ-values in WT PP32 show a highly polarized transition state restricted to repeat five, the significant positive Φ-values in repeat three and four of YD indicate a more dispersed transition state. Thus, destabilizing the C terminus (the preferred folding route in WT PP32) spreads out the transition state, switching from a highly polarized structure to a more extended region.
Discussion
The linear LRR domain of PP32 is well-suited for mechanistic studies of folding. It has sufficient stability to accommodate destabilizing point substitutions, permitting Φ-values to be determined reliably (Fig. 2 and Table 1). Its linear, repetitive architecture allows the results of Φ-values analysis to be evaluated in the context of local stability variation in structurally conserved fragments.
Overall Folding Mechanism of PP32.
Kinetic analysis of PP32 refolding and unfolding reveals an on-pathway intermediate. On-pathway intermediates are also observed for the α-helical ankyrin repeat domains, including Notch (16), p16 (37), and IκBα (41), but not for the two LRR proteins InlB (28) and TAPLRR (23) studied to date. For PP32, formation of this intermediate from the denatured state is rate-limiting in refolding at low to moderate urea concentrations. Although the rate constant for the I to N step (kIN) cannot be determined with great precision, given the large extrapolation in the chevron plot, it is clear that kIN exceeds the rate constant for the D to I step (kDI) by several orders of magnitude (Table 1), and may be as large as estimated “speed limits” for folding (45, 46).
In contrast, the rate constants for the two unfolding steps (kNI and kID) are similar to each other at all urea concentrations, with the first unfolding step (from N to I) about fivefold faster than the second (from I to D) (Table 1 and Fig. 3C). The similarity of these two rate constants produces a lag in unfolding at high denaturant concentrations, where I is more stable than N, and thus becomes transiently populated during unfolding. Below 4.4 M urea, I is less stable than N; thus, unfolding shifts to a single-exponential conversion from N to D, but is rate-limited by the transition state between I and D. As a result, at the urea midpoint for the reaction between N and I (see minimum in minor phase chevron, Fig. 3C), there is a rollover in the unfolding arm of the chevron for the major phase. This rollover reflects a shift from a modest denaturant dependence above this midpoint (resulting from surface area changes between I and TS1; Fig. 5B) to a more severe denaturant dependence below this midpoint (resulting from surface area changes between N and TS1).
The Discrete C-Terminal Folding Pathway Is Selected Based on Local Stability.
Φ-value analysis for 20 substitutions along PP32 reveals a highly polarized rate-limiting transition state (D to I) involving only the C-terminal repeat and cap (Figs. 4 and 5 and Table 1). Kinetic studies of a construct without the N-terminal cap (ΔNCap PP32) further support the C-terminal polarized folding pathway of PP32. The degree to which the folding nucleus of PP32 is localized is striking in comparison with other naturally occurring repeat proteins, such as InlB and the ankyrin domains of Notch, p16, and IκBα. Structure in the rate-limiting transitions states for these proteins includes roughly half of each protein (16, 18, 22, 41).
Through amide hydrogen exchange measurements monitored by NMR, we have found that PP32 is more stable toward the C terminus (Fig. 6). The correspondence between local stability and high Φ-values is consistent with stability dictating folding pathway selection. However, whereas the Φ-values for the TS1 are near zero for most of the protein, and approach the limiting value of one for a small cluster of residues (135–142), indicative of a highly polarized pathway, the stability distribution determined by HX is more spread out (Fig. 7). This difference results from the fact that native-state hydrogen exchange and Φ-value analysis probe different parts of the energy landscape. Φ-value analysis probes the minimum structural unit required to lead to downhill folding from the denatured state. In contrast, HX studies observe unfolding reactions in the context of the folded state, such that structural modules (foldons) (47) can stabilize one another. Work on consensus α-helical repeat proteins (48, 49) and on deletion studies of LRR proteins (25, 50) have shown that the interfacial interactions between repeats are strongly stabilizing. Thus, we expect the highly stable C terminus of PP32 to protect neighboring repeats from exchange, broadening the observed stability map toward the N terminus.
Previous studies have shown that changes to local stability can affect transition state structures. For the coiled coil GCN4, destabilizing point substitutions at one end of the molecule shift the folding pathway away from the site of destabilization (51), whereas site-specific metal-induced stabilization shifts the folding pathway to the site of stabilization (52). For the four-ankyrin repeat protein myotrophin, destabilizing C-terminal substitutions shift the folding pathway from the C terminus to the N terminus (17). In contrast, a set of consensus substitutions stabilizing the last two repeats of the Notch ankyrin domain shifts the folding pathway from the middle repeats to the C-terminal repeats (20, 21). Here, we show that the C-terminally destabilized variant Y131F/D146L folds via a much more diffuse transition state than WT PP32 (Fig. 8), providing additional evidence that folding pathways are determined by local energetics. However, unlike the ankyrin domains, C-terminal repeats of PP32 still participate in the transition state despite local destabilization, consistent with the C terminus being significantly more stable than the rest of PP32. To reroute the folding pathway of PP32 so that the C-terminal repeats fold after the transition state ensemble, a combination of substitutions destabilizing the C terminus and stabilizing the N-terminal repeats may be required.
Possible Determinants for Folding Pathway Selections in Other LRR Proteins.
The folding mechanisms of three LRR proteins whose pathways have been elucidated to date are very different from one another. Like PP32, the structure of the transition-state ensemble for InlB is also localized, although to the N terminus instead of the C terminus (28). In contrast, TAPLRR folds via a diffuse transition state in which the first three (of four) repeats are structured and the N-terminal cap makes nonnative contacts with the first repeat (23). It is possible that pathway selections for both InlB and TAPLRR are also based on local stability, as observed for PP32, in which case, the N-terminal capping motif and first three repeats would be the most stable region of InlB, whereas the repeats of TAPLRR would have more uniform stabilities.
Secondary structure might also contribute to the variations in folding pathways among these three LRR proteins. α-helices have been suggested to fold faster than β-stranded structure (53). Although PP32, InlB, and TAPLRR all have β-sheets spanning their concave surfaces, they differ significantly in helix content. The C-terminal LRRs of PP32 have α-helices on their convex surfaces (Fig. 1A), but the N-terminal LRRs do not. Thus, these helices may help initiate folding of the C-terminal LRRs. The transition state for folding of InlB includes the N-terminal cap, which is the only region with α-helices. The uniform distribution of α-helices along the TAPLRR domain may contribute to its diffuse transition state and may lead to parallel folding pathways, as recently reported for an α-helical ankyrin repeat protein series (14). However, helical structure alone is not enough to determine folding routes because preferred pathways are observed for α-helical repeats proteins, where repeats all have a high helix content (15⇓⇓–18, 21).
Biological Implications.
PP32 is primarily a nuclear protein. Nuclear import is mediated by a nuclear localization signal in its acidic region, C-terminal to the LRR domain (54). However, PP32 can also shuttle back to the cytoplasm through binding of the LRR domain to nuclear export protein Crm1 (55). Crm1 recognizes its cargo through leucine-rich nuclear export signals with a consensus sequence of x-Φ0-x2-Φ1-x2,3-Φ2-x2,3-Φ3-x-Φ4, where hydrophobic residues Φ (most commonly leucine) are separated by 1–3 residues x (often polar or charged; 56, 57). This consensus sequence is found in PP32 LRRs 1–4, at least one of which is likely to interact with Crm1. Structures of Crm1 complexes have shown that the conserved Φ’s of the consensus sequence dock into the pockets of the hydrophobic cleft of Crm1 (58, 59). Because the conserved Φ’s of PP32 make up its core and are not solvent-exposed, binding to Crm1 is likely to require partial unfolding of PP32. Our HX result showing that the N terminus is much less stable than the C terminus suggests the N-terminal repeats (LRRs 1–3) as likely binding motifs for Crm1.
Experimental Procedures
Subcloning, Protein Expression, and Purification.
The gene encoding PP32 was a kind gift from the laboratory of Cynthia Wolberger (Johns Hopkins University School of Medicine, Baltimore, MD). Constructs encoding PP32, ΔNCap PP32, and PP32 variants I7A, V19A, L69A, L93A, L139A, Y131F, D146L, and Y131F/D146L have been describe previously (25). Additional point substitutions were made using QuikChange (Stratagene). Protein were expressed and purified as described (25).
Fluorescence Spectroscopy.
Fluorescence emission spectra were collected on an Aviv ATF 105 spectropolarimeter in a 1.0-cm path-length cuvette. The protein concentration was 3 μM, in storage buffer [20 mM NaPO4, 150 mM NaCl, 0.1 mM tris(2-carboxyethyl)phosphine (TCEP) (pH 7.8)] with 0 or 5.4 M urea, at 20 °C.
Equilibrium Unfolding.
Urea-induced unfolding was monitored by CD (Aviv Model 400 CD spectrometer) at 220 nm or fluorescence by exciting at 295 nm and recording emission at 324 nm. Urea was deionized by chromatography over mixed-bed resin (Bio-Rad). Urea concentration was determined by refractometry. Urea titrations were carried out using a computer-controlled Microlab syringe titrator (Hamilton). Samples contained 2–4 μM protein, 20 mM sodium phosphate, 150 mM NaCl, 0.1 mM TCEP (pH 7.8). At each urea concentration, samples were equilibrated for 5 min at 20 °C and signal-averaged for 30 s. Two-state analysis of equilibrium unfolding transitions was carried out as described by Street et al. (60).
Kinetic Folding Studies.
Fluorescence-detected unfolding and refolding kinetic measurements were made on an Applied Photophysics SX18MV-R stopped-flow rapid mixing device. Emission was monitored using a 320 cutoff filter, following excitation at 280 nm to monitor changes in the environment surrounding the single tryptophan at the C terminus. Final protein concentrations were 1–3 μM. Experiments were done in 20 mM sodium phosphate, 150 mM NaCl, 0.1 mM TCEP (pH 7.8) and at 20 °C. Unfolding and refolding amplitudes and rate constants were determined using nonlinear least-squares to fit the following equation to the individual progress curves:
Where
To capture this complexity in refolding and unfolding, data were fitted with a sequential three-state model:
where the denatured (D) state is converted to the native (N) state through a single on-pathway intermediate (I) (41, 61). Microscopic rate constants from the sequential three-state model and their urea dependences were obtained from the global fitting of the observed rate constants for refolding and unfolding and, when applicable, the associated amplitudes for major and minor unfolding phases, all as a function of urea. The program Profit 6.1.16 (Quantum Soft) was used for fitting, with scripts kindly provided by Thomas Kiefhaber (Technische Universität München, Garching, Germany) and Andreas Moeglich (Humboldt-Universität zu Berlin, Berlin, Germany).
Φ-values were calculated from the following relationships:
where the rate constants and free energy changes were extrapolated to zero molar denaturant and mut and wt indicate values for variants and WT PP32, respectively. Free energies of unfolding from N to D were determined from equilibrium urea unfolding experiments.
Hydrogen Exchange of WT PP32.
15N-labeled PP32 [800 μM in 20 mM NaPO4, 50 mM NaCl, 0.2 mM TCEP (pH 6.8)] was exchanged into D2O buffer (20 mM NaPO4, 50 mM NaCl, 0.1 mM EDTA, 0.2 mM TCEP, pDs 6.2, 6.7, or 7.2) by centrifugal gel filtration as described by (62). A total of 2 mL of preswollen Sephadex G-25 Fine (GE Healthcare) was placed into a 3-mL spin column (Fisher Scientific), washed 4–5 times with 2 mL of D2O buffer by low-speed spinning (tabletop swing bucket centrifuge for 3 min at 3000 × g), followed by 560 mL of protein sample. Final protein concentration was ∼650 μM in 95% D2O sample buffer.
1H-15N HSQC spectra were immediately collected after solvent exchange on a Bruker 600 MHz spectrometer equipped with a cryoprobe at 20 °C. About 20 spectra were collected during the first 12 h. Subsequently, one spectrum was collected every few hours, everyday, then every few days for the next 3 mo. Spectra were processed using NMRPipe (63), and displayed and analyzed with Sparky (64). Assignments for amide resonances of PP32 are from ref. 25. Peak heights were determined in Sparky. To compare the exchange rates of different residues of PP32, we calculated protection factors (PF) with the following equation:
where kint is the rate constant for exchange from amino acids in random coil (65), and kex is the observed exchange rate constant obtained from fitting a single-exponential decay function to the change in the heights of amide cross-peaks in the series of HSQC spectra.
Protection factors were fitted as a function of n, the residue number (Fig. 6C) with a cosine-modulated Gaussian function:
The fitted parameters are the amplitude a, mean μ, and SD σ of the Gaussian function, and the amplitude b, repeat period nrep, and phase c of the cosine modulation. Fitted values and confidence intervals are given in Table S1.
Acknowledgments
We thank Dr. Cynthia Wolberger for providing us with the gene encoding the LRR domain PP32, and Dr. Carlos A. Castañeda for assistance in analysis of HX data. We thank The Johns Hopkins University Biomolecular NMR Center for providing facilities and resources. This research is supported by National Institutes of Health Grant GM068462 (to D.B.).
Footnotes
- ↵1To whom correspondence should be addressed. Email: barrick{at}jhu.edu.
Author contributions: T.P.D., A.M., and D.B. designed research; T.P.D. and A.M. performed research; T.P.D., A.M., and D.B. analyzed data; and T.P.D. wrote the paper.
The authors declare no conflict of interest.
↵*This Direct Submission article had a prearranged editor.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1412165112/-/DCSupplemental.
References
- ↵.
- Levinthal C
- ↵
- ↵
- ↵
- ↵
- ↵.
- Wolynes PG,
- Onuchic JN,
- Thirumalai D
- ↵.
- Hoang L,
- Bédard S,
- Krishna MMG,
- Lin Y,
- Englander SW
- ↵.
- Hu W, et al.
- ↵.
- Jennings PA,
- Wright PE
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵.
- Lowe AR,
- Itzhaki LS
- ↵
- ↵.
- Mello CC,
- Barrick D
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵.
- Schmid FX,
- Baldwin RL
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵.
- Ray Wu LG
- Matthews CR
- ↵.
- Otzen DE,
- Itzhaki LS,
- elMasry NF,
- Jackson SE,
- Fersht AR
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵.
- Moran LB,
- Schneider JP,
- Kentsis A,
- Reddy GA,
- Sosnick TR
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵.
- Xu D,
- Farmer A,
- Collett G,
- Grishin NV,
- Chook YM
- ↵
- ↵.
- Monecke T, et al.
- ↵.
- Correia JJ,
- Detrich HW
- Street TO,
- Courtemanche N,
- Barrick D
- ↵
- ↵
- ↵
- ↵.
- Goddard TD,
- Kneller DG
- ↵
Citation Manager Formats
Article Classifications
- Biological Sciences
- Biophysics and Computational Biology