Benchmarking all-atom simulations using hydrogen exchange
- Departments of aBiochemistry and Molecular Biology and
- dChemistry,
- eJames Franck Institute,
- fComputation Institute, and
- cInstitute for Biophysical Dynamics, University of Chicago, Chicago, IL 60637; and
- bCenter for Proteome Biophysics, Department of New Biology, Daegu Gyeongbuk Institute of Science and Technology, Daegu 711-873, Korea
See allHide authors and affiliations
Edited by S. Walter Englander, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, and approved September 29, 2014 (received for review March 5, 2014)

Significance
Molecular dynamics simulations have recently become capable of observing multiple protein unfolding and refolding events in a single-millisecond–long trajectory. This major advance produces atomic-level information with nanosecond resolution, a feat unmatched by experimental methods. Such simulations are being extensively analyzed to assess their description of protein folding, yet the results remain difficult to validate experimentally. We apply a combination of hydrogen exchange, NMR, and other techniques to test the simulations with a resolution of single H-bonds. Several significant discrepancies between the simulations and experimental data were uncovered for regions of the energy surface outside of the native basin. This comparison yields suggestions for improving the force fields and provides a general method for experimentally validating folding simulations.
Abstract
Long-time molecular dynamics (MD) simulations are now able to fold small proteins reversibly to their native structures [Lindorff-Larsen K, Piana S, Dror RO, Shaw DE (2011) Science 334(6055):517–520]. These results indicate that modern force fields can reproduce the energy surface near the native structure. To test how well the force fields recapitulate the other regions of the energy surface, MD trajectories for a variant of protein G are compared with data from site-resolved hydrogen exchange (HX) and other biophysical measurements. Because HX monitors the breaking of individual H-bonds, this experimental technique identifies the stability and H-bond content of excited states, thus enabling quantitative comparison with the simulations. Contrary to experimental findings of a cooperative, all-or-none unfolding process, the simulated denatured state ensemble, on average, is highly collapsed with some transient or persistent native 2° structure. The MD trajectories of this protein G variant and other small proteins exhibit excessive intramolecular H-bonding even for the most expanded conformations, suggesting that the force fields require improvements in describing H-bonding and backbone hydration. Moreover, these comparisons provide a general protocol for validating the ability of simulations to accurately capture rare structural fluctuations.
Molecular dynamics (MD) simulations can now probe protein dynamics on millisecond timescales and thereby enable investigation of a variety of biological problems, including binding, conformational changes, and folding. A landmark example is the all-atom simulations by Shaw and coworkers where multiple folding and unfolding events were observed in long time trajectories (1, 2). In addition to predicting or matching observed folding rates with a single set of parameters, these simulations produced native-like models for 12 small, fast-folding proteins. Equally impressive is their observation of multiple discrete folding and unfolding transitions, which indicates that folding proceeds on an energy landscape with two major states separated by a free energy barrier. This barrier-limited folding behavior replicates that observed for many proteins. Not surprisingly, these remarkable simulations are being extensively analyzed (3⇓–5).
The applicability of MD for many situations is limited by the extent to which the entire landscape is recapitulated. An accurate representation of native-like states does not imply a correct representation of other states (e.g., intermediates and unfolded structures). Proper validation requires a comparison with experiments that probe lowly populated conformations. NMR measurements probe subsecond dynamics with single residue resolution, although with a limitation to states with populations exceeding 0.5% (6). Fluorescence, CD, FRET, and small angle X-ray scattering (SAXS) measurements are well adapted to kinetic studies but provide limited spatial resolution.
Hydrogen exchange (HX) data report on the H-bond patterns and populations of extremely rare states, independent of the timescale on which these states are sampled (7, 8). Furthermore, the cooperativity and spatial extent of fluctuations can be assayed in a site-resolved manner by using NMR methods to measure HX as a function of denaturant concentration (9). Thus, HX is an ideal probe for characterizing excited states, such as the denatured state ensemble (DSE) and kinetically invisible intermediates, even under fully native conditions.
Here we describe a detailed comparison of the folding simulations to experimental data for a fast-folding variant of protein G [NuG2b; 76% identity (1)] with an optimized amino hairpin (10). The simulated DSE of NuG2b (and other proteins) contains mostly overly compact conformations, as recently noted by Shaw and coworkers (11) and others (3). We also find that the DSEsim exhibits high levels of H-bonding, often at levels comparable to the native state ensemble (NSE) and with some secondary structural elements arranged in a native-like manner.
For NuG2b, experimental measurements imply that folding is highly cooperative and the DSEexp is largely unstructured. Based on HX measurements, the DSE is highly solvated and devoid of stable H-bonding. The linear dependence on denaturant concentration of the stability and activation folding free energy (ΔGf‡) along with the chevron analysis indicate that DSEexp remains solvated and unstructured, even in the absence of denaturant. In addition, kinetic SAXS measurements reveal that DSEexp is highly expanded. Taken together, these measurements provide a detailed description of DSEexp, which is compared with DSEsim.
Results
Our results are presented in stages. We first analyze the level of structure present in DSEsim, primarily focusing on the extent of H-bonding. Next, we present the experimental methodology and describe the results. The section ends with a comparison between the experiment and simulation.
Properties of the Simulated DSE.
The four NuG2b MD trajectories (1) were analyzed to identify the number of native and total H-bonds, the Cα-rmsd from the native state, the radius of gyration (Rg), and the TM score [a 0-to-1 metric for structural similarity (12)] (Fig. 1A and Fig. S1). These quantities often change in concert during discrete folding transitions. The discrete character of the transitions enabled us to identify NSEsim as the set of conformations for which the rmsd is below 4.0 Å, the number of native H-bonds exceeds 20, and the TM score is greater than 0.6 (Fig. S2A). All other conformations were classified as members of DSEsim. The time-averaged H-bond fraction for each residue was calculated using all 6 × 106 conformations in the trajectories (SI Methods and Fig. S2 B and C). For each conformation, an amide proton was considered protected from HX if it participated in an intramolecular H-bond within 0.5 ns, irrespective of whether the partner was the native carbonyl oxygen.
Analysis of H-bonds and 2° structure in MD simulations. (A) H-bonding (green, native; red, total), Rg, backbone rmsd, and TM score for one MD trajectory (three others are shown in Fig. S1). The right column presents the distribution for the DSE. RgSAXS is indicated for the chemically denatured state (blue line). (B) H-bond frequencies, color-coded according to whether they are the native partners, or within ±1, ±2, or >±2 of the native partners. (C) Secondary structure frequencies (red, α, 3–10 and π helices; blue, extended; green, other; cyan, turn). (D) H-bond and contact maps.
The total number of native plus nonnative H-bonds remains relatively constant across the entire trajectory, even in many of the expanded conformations of DSEsim (Fig. 1A). On average, DSEsim structures are highly collapsed with
Both the residue–residue contact map and the pattern of H-bond partners in DSEsim resemble those of NSEsim (Fig. 1D). The major fraction (∼80%) of the conformations in DSEsim contains a folded N-terminal β hairpin, whereas about half the ensemble retains at least a portion of the helix (Fig. 1). The C-terminal hairpin frequently remains fully extended and forms nonnative H-bonds with the β1 strand and with residues that are helical in the native state (Fig. 1D).
Even when the chain contains nonnative contacts in DSEsim, H-bonds still form, often within one or two residues of the native partner for the helix and the N-terminal hairpin (Fig. 1B), reflecting helical over/underwinding and β slippage, respectively. The N- and C-terminal strands often are in contact in DSEsim, although typically with the nonnative antiparallel orientation. The largest cluster (5) in DSEsim (36%) has the N-terminal hairpin and most of the helix positioned in a native-like arrangement, with an average TM score of 0.46. The majority of highly extended structures in the DSEsim also contain one or more elements of native 2° structure. In addition, some conformations of DSEsim adopt native-like topologies with a high TM score, 0.6. The average TM score in DSEsim is 0.34 and exceeds that for a random pair of structures (0.17) (12). Overall, DSEsim is often highly collapsed and extensively H-bonded with some structural elements arranged in a native-like manner.
Experimental Characterization of the DSE.
HX, kinetic, CD, and SAXS measurements were conducted to characterize NuG2b’s energy surface to compare with the simulations. The following two sections provide conceptual background for the interpretation of the HX and kinetic data.
Background: Hydrogen exchange.
The analysis of HX data assumes that backbone amide protons (NH) can exist in states that are either HX-competent or -incompetent, termed open and closed, respectively (7, 8). Typically, closed states are involved in intramolecular H-bonds. The measured HX rate (kex) for each NH is a function of the rates of opening (kop), closing (kcl), and chemical exchange from the open state (kchem) according to the reaction
where Kop = kop/kcl is the equilibrium constant for the open↔closed reaction, and the free energy is given by ΔGHX = RTlnKop. Accordingly, a direct comparison of ΔGisim and ΔGiHX is possible for each residue i, even for reactions that are orders of magnitude faster than the HX measurement timescale.
The combined use of chemical denaturants with HX enables the quantification of the amount of exposed protein surface in the various conformational states. Denaturants such as guanidinium chloride (GdmCl) preferentially stabilize states with more exposed surface area (16), primarily peptide group ASA (17). We note that denaturants do not create new states, but rather promote preexisting ones with larger ASA (7⇓–9). The sensitivity of the population shift can be translated into a plot of ΔG versus [GdmCl], which frequently is linear with a slope termed the m-value (units of kilocalories per mole per molar), that is, ΔG([den]) = ΔG(0) − m·[den]. When monitoring equilibrium folding as a function of denaturant (e.g., by CD or fluorescence), the m-value reflects the difference in ASA exposure between the NSE and the DSE, termed “mglobal.”
In combination with NMR methods, HX also enables determination of the residue-specific mHX-values that reflect the extent of structure remaining in states where a specific amide proton is exchange competent (Fig. S3A). Accordingly, the mHX-value for each residue depends upon whether its exchange occurs in the NSE, the DSE, or an intermediate state (7⇓–9). A residue produces mHX = 0 if the amide proton exchanges from the NSE through a local perturbation that exposes little or no additional ASA and has lower free energy than the globally unfolded state. Amide protons that require global unfolding should exchange with the global stability (ΔGiHX = ΔGglobal) and with an m-value matching the global value (mHX = mglobal). Intermediate exchange behavior representing subglobal structural openings also is possible and is observed in larger proteins (7⇓–9). However, only global and local opening events appear for NuG2b.
Background: Chevron analysis to test for residual DSE structure.
The denaturant dependence of the folding and unfolding activation free energies, ΔGf‡ and ΔGu‡, are presented using the chevron analysis, a widely used method to demonstrate that a DSE is devoid of significant residual structure (18⇓–20). This test involves a comparison of the free energy difference between the DSE and NSE, measured independently in equilibrium melting and in kinetic folding experiments (ΔGeq = ΔGu‡ − ΔGf‡). A similar comparison is made for surface burial parameters (mglobal = mu – mf), where mf and mu are the denaturant dependence of the ΔGf‡ and ΔGu‡ and provide measures of the ASA that is buried going to the transition state ensemble (TSE) from the DSE or the NSE, respectively. Agreement between the equilibrium and kinetically determined parameters for ΔGeq and mglobal, irrespective of the probe used to measure folding, implies that no conformations with significant stabilization or surface burial are populated on the DSE side of the TSE, and that the DSE under refolding conditions has the same level of exposed denaturant sensitive surface area as the DSE studied by equilibrium denaturation at elevated denaturant concentrations.
Experimental folding behavior of NuG2b.
The extent of H-bonding for the experimental ensemble was determined by collecting HX data as a function of denaturant (Fig. 2, Upper). HX rates were obtained by measuring the increase in amide peak volumes in 1H-15N heteronuclear single quantum coherence (HSQC) spectra over time for predeuterated NuG2b exchanging in 90% H2O at 313 K (Fig. 2 and Fig. S3B). Based upon stability and denaturant dependence, 12 sites throughout the protein seem to exchange through a global unfolding event. They have ΔGHX = 8.1 ± 0.3 kcal⋅mol−1 and mHX = 1.31 ± 0.08 kcal⋅mol−1⋅M−1 (errors reflect SD for the 12 sites).
Experimental studies. (Upper) Dependence on GdmCl concentration for D-to-H exchange of the 12 most stable, globally exchanging amide sites in 90% H2O at 313 K (expanded view in Fig. S3B). Stabilities are compared with values obtained from equilibrium denaturation (black) and kinetic fluorescence measurements in 100% H2O (red, width of the triangle reflects the statistical error on extrapolation to zero denaturant). The pHread for each denaturant concentration were as follows: 0 mM, pH 7.51; 250 mM, pH 6.98; 500 mM, pH 6.87; 750 mM, pH 6.83; 1 M, pH 7.12; 1.25 M, pH 6.66; 1.40 M, pH 6.52. (Inset) Locations of residues undergoing global exchange are mapped onto the NuG2b structure. (Middle) Equilibrium denaturation monitored by CD at 313 K. (Lower) Folding rates at 293, 313, and 333 K.
To test whether exchange of the most stable amide protons occurs from an unstructured DSE, the ΔGHX and mHX-values were compared with those obtained from equilibrium and kinetic folding studies at 313 K in 100% H2O (Fig. 2). The CD-monitored equilibrium unfolding data fit well to a two-state model, yielding a stability of 7.3 ± 0.2 kcal⋅mol−1 and mglobal = 1.31 ± 0.04 kcal⋅mol−1⋅M−1. These values closely match those obtained from kinetic data using chevron analysis under the same conditions (Fig. 2 and Fig. S4; ΔGeq = 7.7 ± 0.4 kcal⋅mol−1, mglobal = 1.39 ± 0.04 kcal⋅mol−1⋅M−1). Hence, NuG2b folding satisfies the chevron criterion indicative of an unstructured DSE.
Overall, the similarity of the ΔG and m-values calculated from CD, kinetic, and HX measurements suggests that they are probing the same all-or-none unfolding transition. Nearly identical m-values were observed despite the different denaturant ranges accessible to each experiment, with the HX measured down to zero denaturant. Furthermore, the folding arm of the chevron plot is linear from 6 M to the lowest measurable denaturant concentration (0.8 M at 293 K, Fig. 2). This linearity indicates that no partially collapsed species accumulate before the major folding phase even for jumps to low denaturant (18, 21).
The overall dimensions of DSEexp were derived from SAXS data collected for NuG2b in 7 M GdmCl at 291 K. DSEexp has an average radius of gyration of 23–25 Å (Fig. S5A), consistent with a self-avoiding statistical coil (13), as found for many chemically denatured small proteins (22⇓⇓⇓⇓–27). The Rg of DSEexp at low denaturant was unobtainable using rapid mixing methods with millisecond resolution owing to the extremely fast folding rate of NuG2b. As a surrogate, we performed mixing measurements on the slower-folding protein G. No measurable decrease in the Rg was found when jumping from 4 to 0.5 M GdmCl before the observed folding phase (kf = 20 s−1; Fig. S5B), indicating that protein G's DSEexp remains expanded in low denaturant.
To further investigate the degree of structure in DSEexp, we performed CD measurements on a peptide corresponding to NuG2b’s redesigned N-terminal hairpin, because this region contains the most residual structure in the MD simulations. The CD spectrum of the 20-mer displays a negative minimum at 200 nm that is very similar to that of the spectra of other unfolded states, such as a thermally denatured protein G mutant, but quite distinct from the native NuG2b spectrum (Fig. S6). Moreover, the peptide’s spectrum is invariant over the measured temperature range (293–333 K). Because residual H-bonded structure is very likely to be thermally labile, the temperature invariance of the spectrum implies that the peptide does not contain residual H-bonded structure.
Limits on structure in the DSEExp.
HX, CD, and kinetic data produce similar global stabilities and denaturant dependence despite being conducted under a wide range of denaturant concentrations. The extrapolated stability from the denaturant melt at 6 M matches the HX value in water and its dependence on GdmCl. These results, together with the SAXS and peptide CD data and the chevron criterion, indicate that, under native conditions, the chains in DSEexp are expanded and expose the same amount of surface area as at high denaturant. In addition, the most protected H-bonds, which appear in all 2° structure elements, undergo exchange with the global m-value. All these results point to a cooperative unfolding process to an unstructured ensemble under all measured conditions.
However, the 12 amide sites exchanging with the global m-value have an average ΔGHX that is 0.4 kcal⋅mol−1 higher than the stability determined using the kinetic data (which requires less extrapolation than the CD data). This mild difference could reflect residual structure, although it is at the level of our statistical error and also could result from minor errors in pH and T, or the inaccuracy of the linear extrapolation. Recently, the same energy difference was observed for the Fyn SH3 domain by Kay and coworkers (28), who likewise interpreted HX as occurring from an unstructured state because the sites exhibited random coil chemical shifts in the DSE.
Alternatively, low levels of residual H-bonded structure could exist under native conditions. Six amide sites yield ΔGHX ≥0.4 kcal⋅mol−1, consistent with these H-bonds being formed with KeqHB∼1 [ΔG = RT ln (1+ KeqHB)]. The sites are located on the N- and C-terminal strands as well as on the helix, so they are unlikely to form in a concerted fashion, and hence other alternatives should be considered such as transient helix formation. Using the recent parameterization of m-values with ASA (17), we estimate that a time average of four helical or six nonlocal H-bonds would reduce the m-value by ∼0.3 kcal⋅mol−1⋅M−1 (and ∼0.04 kcal⋅mol−1⋅M−1 for each additional H-bond). This reduction would noticeably affect the mHX and mf-values of 1.3 and 1.1 kcal⋅mol−1⋅M−1, respectively, and ΔGHX. Hence, we believe this level provides a reasonable upper bound on the amount of residual structure consistent with ΔGHX and mHX matching their equilibrium counterparts, the two-state chevron fit with linear folding arms, and the N-terminal hairpin being unstructured in water. The measured Rg, which is consistent with a random coil (23), provides an additional constraint on the level of residual structure.
Comparison Between Experiment and Simulation.
Our analysis of the NuG2b simulations indicates that the DSESim is typically compact and highly H-bonded, often with some secondary structural elements arranged in a native-like manner. Experimentally, the HX, CD, and kinetic data produce similar global stabilities despite being conducted under a wide range of denaturant concentrations. These results, together with the SAXS and peptide data and the two-state chevron fit, indicate that under highly native conditions the chains in DSEexp are expanded and expose the same amount of surface area as at high denaturant.
Benchmarking simulations against experimental data are best conducted when both are collected under the same conditions. Our comparisons for NuG2b, however, use simulations conducted at 350 K (near the
Comparison of HX data to simulations. The stabilities of individual H-bonds derived from HX data (Table S1) are referenced to the global stability determined from chevron analysis (black). H-bond stabilities calculated from the MD trajectories (red) are referenced to the global stability (using the relative populations of the NSE and DSE; ΔGglobal = +0.2 kcal⋅mol−1 with Keq = [NSEsim]/[DSEsim]). Dashed lines indicate mean stability differences relative to ΔGglobal.
In addition to the differences in magnitude and pattern of H-bond protection, quite dissimilar opening events lead to H-bond breakage in the simulations and experiments. When any one of the 12 globally exchanging H-bonds is broken for 1+ ns in DSEsim, on average about two-thirds of the H-bonds remain intact. Experimentally, however, the 12 most stable amide sites exchange through a large unfolding transition involving the simultaneous breakage of most or all H-bonds and with a denaturant dependence that is consistent with global unfolding.
An analysis of surface exposure produces an even larger discrepancy. For the states where at least one of the 12 globally exchanging H-bonds is broken, ASAHX/ASANSE = 1.14–1.17, whereas ASAstatistical coil/ASANSE = 1.9 (Fig. S2D). Using these values and a recent m-value:ASA parameterization (17), we obtain msim HX = 0.20 and mglobal = 1.35 kcal⋅mol−1⋅M−1, a readily distinguishable difference. The calculated mglobal matches the experimental value, providing support for the parameterization and our use of a statistical coil model for the DSE. Hence, the structural fluctuations that break the most stable H-bonds are very different for the simulations and the experiments.
The less structured DSE observed experimentally might be argued as a consequence of a difference in conditions. However, the experimental characterization was conducted at lower temperatures where residual structure would have been stabilized. Therefore, more H-bonded structure is expected to appear at lower temperatures rather than less. In addition, the HX measurements were performed under fully aqueous conditions, and the global m-value agrees with the value at high denaturant. Finally, the N-terminal hairpin, the predominant residual structure in the simulations, is an unstructured peptide in water at 293–313 K (Fig. S6). Hence, the discrepancies between experiment and simulation are unlikely to be attributable to differences in conditions.
Nevertheless, the
Discussion
Long-time MD simulations are accurate enough to refold small proteins to within 2 Å rmsd with multiple discrete folding transitions (1). However, the DSEs found in MD simulations often contain overly compact structures (11) with extensive amounts of H-bonding compared with the experimental data. The high level of residual structure indicates that the force field requires further improvement to characterize nonnative regions of the energy landscape and to describe the high degree of folding cooperativity observed experimentally. Nevertheless, the level of H-bonding in DSEsim is physically reasonable from the standpoint that buried H-bond donors and acceptors nearly always form H-bonds (29). However, the predominance of collapsed species in DSEsim suggests that terms in the force field promoting protein–protein and/or water–water H-bonds are too strong relative to those associated with backbone hydration.
In contrast, the experimental HX, denaturation, CD, peptide, two-state chevron fit, and SAXS data are all consistent with an extremely cooperative unfolding process to an unstructured DSE. The DSEexp contains highly solvated and expanded conformations with minimal H-bonded structure for concentrations of GdmCl from 0 to 7 M. The most stable H-bonds exchange through a highly cooperative transition to a globally unfolded state that contains few, if any, H-bonds, even in the absence of denaturant. Furthermore, linear folding arms of the chevron plot also point to a highly solvated DSEexp under native conditions. A more collapsed DSE under native conditions would have produced a measureable downward curvature in the folding arm owing to smaller ΔASA between the DSE and TSE. Our studies do not rule out a subtler change in the DSE, such as a change in local structure [e.g., a shift in the population of conformations having backbone angles in the polyproline II region of the Ramachandran map to angles in the helical region (13, 30)], which explains the frequently observed sloping DSE baseline in CD222nm-monitored denaturation (31, 32).
Similar signatures of an unstructured DSE appear in other proteins. For a ubiquitin variant (33) and a coiled-coil (34), the HX determined stabilities and m-values match those derived from global measurements. Also, Pace and coworkers (35) found that HX and equilibrium-determined stability matched for 19 of 20 proteins, indicating that the two methods typically probe the same transition to a DSE that is devoid of stable H-bonds. The DSE for many [but not all (36)] small proteins remains expanded upon a shift to native conditions according to SAXS (22, 24⇓–26). In addition, most [but not all (37)] small proteins satisfy the two-state chevron criterion, sometimes even down to 0 M denaturant (18, 19). This highly cooperative behavior is not unique to experiments because some coarse-grained models now can produce nearly all-or-none folding behavior with linear (38) or near linear chevrons (39).
Analyses by Shaw and coworkers of group trajectories for λ repressor (1) and ubiquitin (40) produce similar pictures of highly collapsed and H-bonded DSEsims containing considerable native character, although to a lesser degree than observed for NuG2b (Figs. S7 and S8). Other all-atom MD simulations also yield collapsed DSEs (41⇓–43). Experimentally, λ repressor and ubiquitin satisfy the authentic two-state chevron criterion (19, 21, 44, 45), again indicating that their DSEs at low denaturant concentration are highly solvated and that early intermediates are not significantly populated. Furthermore, NMR tracking of the thermal denaturation of λ repressor beautifully demonstrated that this protein fully unfolds in a single transition without populating any other species (46). In addition, time-resolved SAXS studies of ubiquitin indicate that chains in DSEexp remain expanded down to 0.7 M GdmCl at 293 K (25). The FRET-determined Rg of the low denaturant DSE often is less than that observed by SAXS [e.g., Rg ∼22 versus ∼26 Å (47)], possibly owing to transient hydrophobic contacts (48), but it still is considerably larger than found in the all-atom simulations (27). Thus, the choice of experimental technique is unlikely to provide an alternative explanation for the discrepancy, thus suggesting the result is general.
HX–MD Comparisons.
HX is an excellent tool for verifying MD simulations because it provides the stability and cooperativity of unfolding events with individual H-bond resolution (7, 8). Past attempts to compare MD to HX have been limited by the short durations of simulations and by the uncertainty in how to relate structural ensembles and HX rates (49). Recent technical advances in MD simulations (50) combined with the modeling of HX protection presented here make it possible to directly compare these two techniques.
The H-bond pattern in the NuG2b trajectories clearly differs from what was observed experimentally. However, future simulations may differ in subtler ways. In such cases, one should use a site-by-site comparison of the ΔG and m-values by reweighting each state in DSEsim according to surface exposure, ΔΔG([den]) = (%ASA)·[den]·mglobal. The reweighted ensemble can be used to calculate the H-bond populations as a function of denaturant from which individual m-values can be determined. This method could thereby be applied to various systems to extend the comparison beyond the DSE to include states that break only a subset of H-bonds, such as subglobal openings.
However, we assume that the HX rate is governed by the H-bond fraction and the intrinsic HX rate (kchem). This assumption is valid when considering unfolding of a segment to a well solvated conformation, but may not apply for small openings where local geometry may affect kchem. We leave this issue for future studies because it does not affect the present conclusions.
Conclusions
The MD simulations by Shaw and coworkers can reversibly fold small proteins and reproduce their native structures, folding rates, and melting temperatures. However, the discrete transitions observed in the MD simulations for NuG2b and other small proteins occur between the native state and a highly collapsed, H-bonded species. In contrast, experimental data indicate that small proteins often fold much more cooperatively with a denatured state that is expanded and largely unstructured even at low denaturant concentrations. The excessive amount of intramolecular H-bonding observed in the simulations suggests that changes in the force field are warranted to produce the correct balance between H-bonding, backbone solvation, and nonbonded interactions.
Methods
Before HX measurements, NuG2b was exchanged into D2O and then lyophilized. Exchange measurements were initiated by the addition of solvent [10% (vol/vol) D2O] to lyophilized samples. HX was measured by standard NMR methods using a 500-MHz magnet with a Bruker AVANCE III console. Refolding measurements used a Biologic SFM-400, -4000 instrument integrated with a PTI light source, and CD measurements were taken on a Jasco 715 spectrometer [100% (vol/vol) H2O]. See SI Methods for additional details.
Acknowledgments
We thank S. Piana-Agostinetti and D. Shaw for helpful discussions, comments on the paper, and data sharing. This work was supported by National Institutes of General Medical Sciences (NIGMS) Research Grant R01 GM055694 and National Science Foundation Grant CHE-13630120 (to K.F.F.). Use of the Advanced Photon Source, an Office of Science User Facility, operated for the Department of Energy (DOE) Office of Science by Argonne National Laboratory, was supported by the DOE under Contract DEAC02- 06CH11357. This project was supported by National Center for Research Resources Grant 2P41RR008630-18 and NIGMS Grant 9 P41 GM103622-18, both under National Institutes of Health. W.Y. was supported in part by National Creative Research Initiatives (Center for Proteome Biophysics) of the National Research Foundation of Korea Grant 2011-0000041.
Footnotes
↵1J.J.S. and W.Y. contributed equally to this work.
- ↵2To whom correspondence may be addressed. Email: freed{at}uchicago.edu or trsosnic{at}uchicago.edu.
Author contributions: J.J.S., W.Y., K.F.F., and T.R.S. designed research; J.J.S., W.Y., E.K.G., M.C.B., J.R.H., and T.R.S. performed research; J.J.S., W.Y., E.K.G., M.C.B., J.R.H., and T.R.S. analyzed data; and J.J.S., W.Y., K.F.F., and T.R.S. wrote the paper.
The authors declare no conflict of interest.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1404213111/-/DCSupplemental.
References
- ↵.
- Lindorff-Larsen K,
- Piana S,
- Dror RO,
- Shaw DE
- ↵.
- Shaw DE, et al.
- ↵.
- Best RB,
- Hummer G,
- Eaton WA
- ↵.
- Beauchamp KA,
- McGibbon R,
- Lin YS,
- Pande VS
- ↵
- ↵
- ↵
- ↵
- ↵.
- Bai Y,
- Sosnick TR,
- Mayne L,
- Englander SW
- ↵
- ↵
- ↵
- ↵.
- Jha AK,
- Colubri A,
- Freed KF,
- Sosnick TR
- ↵
- ↵
- ↵.
- Auton M,
- Holthauzen LM,
- Bolen DW
- ↵.
- Guinn EJ,
- Kontur WS,
- Tsodikov OV,
- Shkel I,
- Record MT Jr
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵.
- Kohn JE, et al.
- ↵
- ↵
- ↵
- ↵
- ↵.
- Long D,
- Bouvignies G,
- Kay LE
- ↵
- ↵
- ↵.
- Sosnick TR,
- Shtilerman MD,
- Mayne L,
- Englander SW
- ↵
- ↵
- ↵.
- Meisner WK,
- Sosnick TR
- ↵
- ↵.
- Kimura T, et al.
- ↵
- ↵.
- Liu Z,
- Reddy G,
- O’Brien EP,
- Thirumalai D
- ↵
- ↵.
- Piana S,
- Lindorff-Larsen K,
- Shaw DE
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵.
- Merchant KA,
- Best RB,
- Louis JM,
- Gopich IV,
- Eaton WA
- ↵.
- Meng W,
- Lyle N,
- Luan B,
- Raleigh DP,
- Pappu RV
- ↵
- ↵
Citation Manager Formats
Article Classifications
- Biological Sciences
- Biophysics and Computational Biology