Previous Article |
Table of Contents
| Next Article
BIOLOGICAL SCIENCES / BIOPHYSICS
A base-excision DNA-repair protein finds intrahelical lesion bases by fast sliding in contact with DNA

,¶
Departments of *Chemistry and Chemical Biology and
Molecular and Cellular Biology, Harvard University, 12 Oxford Street, Cambridge, MA 02138
Edited by Peter H. von Hippel, University of Oregon, Eugene, OR, and approved February 14, 2006 (received for review November 8, 2005)
| Abstract |
|---|
|
|
|---|
nonspecific binding | shear flow stretching | single-molecule imaging | DNA glycosylase mechanism | hopping
The mechanism of lesion recognition has received extensive attention in the case of human oxoguanine DNA glycosylase 1 (hOgg1) (Fig. 1A), the enzyme responsible for removal of the highly mutagenic base 8-oxoguanine (oxoG) from the human genome (47). Arising from the attack of reactive oxygen species on guanine (G) residues, oxoG differs from G at only two atomic positions (Fig. 1B). Despite its structural similarity to G, oxoG mispairs almost exclusively with adenine (A) during replication, causing G:C
T:A transversion mutations. Interrogation of DNA by hOgg1 to locate oxoG residues is made difficult not only by the vast excess of structurally similar G residues but also because oxoG causes no discernible alteration to the helical structure of DNA (8, 9) and only slightly destabilizes the duplex (10). It has recently been demonstrated that the extrahelical lesion-recognition pocket of hOgg1 has remarkable selectivity for oxoG versus G (7 kcal/mol; 1 kcal = 4.18 kJ) (7), but it remains unknown whether hOgg1 also possesses the ability to distinguish intrahelical oxoG:C pairs from G:C. Here we report direct real-time observations of the behavior of single hOgg1 enzyme molecules moving along undamaged DNA that provide insight into the strategy used by the enzyme to search out lesion bases.
|
These proteins ability to bind non-target (nonspecific) DNA is key to their rapid target-binding activity. The law of mass action dictates that even proteins with modest nonspecific binding constants spend the majority of time bound to nonspecific DNA under the in vivo condition of high DNA concentration (14, 15). Multistep target location involving initial association with nonspecific DNA and subsequent binding to the target is called "facilitated diffusion." It is generally accepted that rapid target search in vivo is made possible through a combination of one-dimensional diffusion along DNA segments and three-dimensional transfer among DNA segments (12, 13, 16); the work reported here concerns the one-dimensional process.
Although facilitated diffusion has been inferred from the kinetic behavior of several classes of DNA-binding proteins (11, 1720), direct real-time observation of this phenomenon has proven difficult and has only been reported in the case of Escherichia coli RNA polymerase (2123). This lack of direct experimental data has hindered progress toward a detailed understanding of facilitated diffusion.
For proteins that do not consume biochemical energy to assist their diffusion, target search along DNA is a thermally activated and directionally unbiased process. Two distinct mechanisms for one-dimensional diffusion along DNA, sliding and hopping, have been proposed (11, 12). Sliding is a process wherein the protein undergoes diffusion while remaining bound to the DNA. Hopping refers to the protein repeatedly dissociating from the DNA and rebinding at a new location on the DNA. The likelihood of proximate dissociation and rebinding sites, particularly on stretched DNA, makes distinguishing the two mechanisms difficult. Innovative biochemical studies with restriction enzymes have demonstrated hopping and suggested that sliding, if it occurs, only contributes to movement on length scales of <173 bp (17, 24). The question remains whether sliding, in addition to hopping, contributes to one-dimensional movement in the context of facilitated diffusion.
The ability to record trajectories of single molecules moving on DNA afforded by recent advances in single-molecule microscopy (21) in combination with high-throughput data collection and analysis allows us to directly address this question. Our in vitro assay (Fig. 2), based on flow-stretching DNA, (25, 26) makes it possible to track the one-dimensional diffusion of many individual enzyme molecules along DNA with high spatial and temporal resolution across various solution conditions.
|
| Results |
|---|
|
|
|---|
x2
/2t. All data for Fig. 4 A and C were taken at a salt concentration of 0.010 M.
|
|
We calculate the upper limit of hOgg1s diffusion constant to be 1.2 x 107 bp2/s at 25°C, based on a hydrodynamic model (28) developed for a protein sliding freely along the DNA helix (Fig. 4A, dashed line). The measured diffusion constant is smaller than this theoretical value because of the presence of an activation barrier for translocation along DNA (12, 28). To determine the magnitude of this energy barrier, we consider that the protein undergoes a random walk along DNA with a step size of 1 bp. The random walk stepping rate corresponding to the free sliding diffusion constant is klim = 2.4 x 107 steps per second, and the rate corresponding to the measured diffusion constant is k = 1.0 ± 0.2 x 107 steps per second. According to the Arrhenius relation, the mean activation free energy for hOgg1 to slide is 
G
/kBT = ln (klim/k) = 0.9 ± 0.2 kBT (pH 7.8). Note that because k and klim vary in the same manner for different chosen step sizes, the value of 
G
does not depend on the step size specified. The low barrier height we have determined is consistent with previous theoretical predictions that
2 kBT is the maximum roughness of the energy landscape tolerated for fast search (29) and that sliding with a rate close to the hydrodynamic limit is possible (12).
Surprisingly, hOgg1s diffusion constant on DNA grows by more than an order of magnitude between pH 6.6 and 7.8 (Fig. 4 A and C). The only amino acid with a side-chain pKa close to this range is histidine (pKa = 6.1) (30), which becomes positively charged at low pH. The crystal structures of hOgg1DNA complexes (5, 7) reveal a conserved histidine (His-270) contacting the DNA that could explain the pH dependence. We replaced His-270 with alanine and found that the diffusion constant for the H270A mutant is reduced to 0.8 ± 0.4 x 106 bp2/s, and is independent of pH across the entire range tested (Fig. 4 A and C). This result proves that His-270 is indeed responsible for the observed pH dependence and indicates that hOgg1s specialized DNA-binding interface contributes to its rapid sliding activity. Recent observation of DNA groove tracking by a nonspecifically bound enzyme supports the notion of highly complementary proteinDNA interactions in such complexes (31). It appears that native interactions within the hOgg1DNA interface can lower the activation barrier for protein translocation along DNA in a manner analogous to the reduction of activation barriers for chemical reactions by enzymes (32). In this sense, hOgg1 seems to be fine-tuned by nature to play a catalytic role in its own translocation along DNA by making specific interactions within the DNA-binding interface that reduce the activation energy for sliding.
The mean activation energy for sliding by hOgg1 is plotted as a function of pH in Fig. 5. It is interesting to note that the measured activation barriers for sliding of WT and His-270-containing hOgg1 fall below 2 kBT at the intranuclear pH of mammalian cells, pH 7.557.79 (33). We further note that a low barrier for sliding at the pH of the mitochondrial matrix, pH 8, would allow hOgg1 to search rapidly for lesions on mitochondrial DNA, which experiences high oxidative stress.
|
| Discussion |
|---|
|
|
|---|
A protein molecules search behavior may differ in cells compared with the in vitro single-molecule assay. For instance, search in vivo may be directly or indirectly coupled to DNA-dependent molecular motors. Intracellular solution conditions and the state of cellular DNA are significantly different from those of the single-molecule assay. In the assay, we approach cellular salt concentrations and replicate physiological pH but not the compact DNA configuration, occlusion of binding sites by DNA-bound proteins, high total concentration of searching protein molecules, and macromolecular crowding known to occur in vivo.
From measurements of hOgg1s copy number in cells (34), we know that the physiological concentration is orders of magnitude higher than is used in the single-molecule assay. However, the stoichiometry of bound glycosylase to base pairs of DNA is remarkably similar in cells and in the in vitro assay: approximately one bound protein molecule per 104 bp of DNA. In vivo, much of the DNA is inaccessible due to bound proteins including histones, which constitute at least partial blockades to the motion of a sliding protein molecule. We can estimate hOgg1s in vivo sliding length based on the observed diffusion constant at physiological pH and binding lifetime measured at high salt concentration. Taking D = 5 x 106 bp2/s and the mean binding lifetime of 0.025 s at the highest salt concentration tested, we compute the mean sliding length at 440 bp (see Supporting Text). Such short sliding lengths seem compatible with searching the short segments of DNA accessible in a cellular environment. It has been noted previously that selective occlusion of non-target DNA could significantly accelerate target location (29).
If searching proteins depended solely on translational diffusion to move along DNA (e.g., by only hopping), we would expect the diffusion constant along DNA to be much lower in the crowded interior of cells. However, the rate at which a protein slides along the DNA helix is limited primarily by the proteins rotational diffusion constant (28). Because macromolecular crowding slows rotational diffusion only weakly compared with translational diffusion (35), we propose that hOgg1s diffusion along DNA in cells is slowed by a factor much smaller than the increase in solution viscosity.
These considerations predict efficient search of DNA in vivo through the synergistic interplay of hopping and sliding. The relative insensitivity of the sliding rate to macromolecular crowding allows fast local one-dimensional searching. Frequent and rapid hops, arising from the short mean binding time and rapid rebinding overcome the redundancy of sliding on long length scales and constitute an efficient mechanism for obstacle bypass.
To test the generality of rapid one-dimensional diffusion among oxoG glycosylases, we observed single molecules of hOgg1s prokaryotic counterpart, MutM, bound to undamaged DNA. Although MutM is functionally similar to hOgg1, the two proteins have completely different folds and their DNA complexes are dissimilar in overall shape (5, 36). We find that MutM slides along DNA with D = 3.5 ± 0.6 x 105 bp2/s (Fig. 6, which is published as supporting information on the PNAS web site), which corresponds to an activation energy for sliding of 3.3 ± 0.2 kBT. Although the sliding rate measured for MutM is lower than that for hOgg1 at physiological pH (all measurements made at room temperature; compare Fig. 3 with Fig. 6), the bacterial enzyme has only 1/1,000th the number of base pairs to patrol, and MutMs diffusion constant will be significantly higher at the 5565°C growth temperature of Bacillus stearothermophilus than we measured at room temperature.
The sliding exhibited by these glycosylases has implications for the mechanism of lesion recognition. Sliding along DNA with D = 5 x 106 bp2/s, hOgg1 can traverse 1,000 bp of DNA in 0.1 seconds according to
x
= (2Dt)1/2. This distance is the expected displacement after 1,000,000 random walk steps of length 1 bp along the DNA or, alternatively, 1,000,000 base pair sampling events. Such rapid sliding, with an activation barrier of only
0.5 kcal/mol (0.9 1.3 kBT, pH 7.57.8, 298 K), all but excludes the possibility that hOgg1 extrudes each base from the DNA helix and tests that base for equilibration into the enzymes extrahelical active site (1). Furthermore, rapid sliding excludes the possibility that hOgg1 captures a spontaneously formed extrahelical oxoG, a species expected to appear at low frequency (3739) (no measurements of oxoG:C opening rates are available). Consequently, hOgg1 must possess some mechanism of recognizing intrahelical oxoG residues and facilitating their extrusion from the DNA helix.
Unlike unidirectional movement along DNA by molecular motors, the Brownian target search is extraordinarily redundant. With this search mechanism, it is not important that hOgg1 recognize oxoG upon each encounter, because the protein will have many additional encounters while interrogating a particular segment of DNA. However, it is crucially important that hOgg1 minimize the frequency at which it selects normal bases for detailed interrogation because of the large number of encounters with nonlesion base pairs. Direct evidence for such "negative selection" against a time-consuming search of G:C base pairs has recently been gleaned from cocrystal structures of MutM bound to non-lesion-containing DNA (40). Collectively, these considerations argue that hOgg1 extrudes oxoG lesions into the active site with a higher rate than it does for undamaged bases, thus discriminating lesions by kinetic control.
We find that DNA glycosylases make good use of the time they spend bound to DNA, forming nonspecific complexes that support rapid sliding to search out defect bases. Given these proteins success in solving a difficult search problem, a highly efficient mechanism for target identification, such as selective kinetically controlled lesion capture, must be expected. This study highlights the promise that single-molecule tracking measurements can complement structural studies to form a deeper understanding of complex protein-DNA interactions.
| Materials and Methods |
|---|
|
|
|---|
DNA was tethered to a glass surface at one end and stretched by a laminar flow of buffer (Fig. 2 AD) (25, 26). Single hOgg1 molecules that bind to and diffuse along the DNA were illuminated by a laser beam and imaged with a fluorescence microscope (Fig. 2A). Fluorescent particles are assigned as hOgg1 monomers because they demonstrate an emission rate consistent with a single dye molecule, a unimodal intensity distribution, and photobleach in one step (see the light-scattering measurements in Supporting Text). We recorded 20100 images per second with a charge-coupled device camera (Fig. 2 A and E). In a typical movie, we observed hundreds to thousands of hOgg1 molecules bind, move along, and dissociate from individual DNA molecules (Movie 1, which is published as supporting information on the PNAS web site). For each experiment, the excitation power was adjusted such that photobleaching of the label occurred on a longer time scale than the duration of DNA-binding events.
Each image of single molecules was fit with multiple (one for each molecule) two-dimensional Gaussian functions (41) by using DIATRACK software (Semasopht, North Epping, Australia). The centroid positions of the fits from a series of images were reconstructed as spatial trajectories of each protein molecule appearing in the sequence (Fig. 2F). The accuracy of a measured centroid (1050 nm under typical conditions) was primarily limited by the number of photons collected (see Supporting Text). Because we were initially concerned about the effect of dynamic conformational fluctuations of the DNA (26) on measurements of one-dimensional diffusion, a dedicated effort was made to investigate their potential effect. Brownian dynamics simulations (Fig. 7A, which is published as supporting information on the PNAS web site) and observations of
DNA stained with a fluorescent intercalating dye (Fig. 7B) indicated that conformational fluctuations of the DNA in the flow direction were sufficiently fast such that they did not interfere with measurements of protein translocation under our experimental conditions.
DNA Preparation and Flow Stretching.
DNA (New England Biolabs) was enzymatically ligated to a modified oligo [5'-ggg cgg cga cct (aaa)4-biotin] and incubated at 1 pM in a flow cell constructed as described in ref. 25 (Fig. 2B). The poly(ethylene glycol) brush coating the coverslip was critical to suppress adsorption of labeled protein molecules to the glass coverslip. Buffer solution was drawn through the flow cell at a rate of 10.0 ml/hr by a syringe pump (model no. PHD2000, Harvard Apparatus), creating shear flow near the coverslip surface and stretching the DNA (Fig. 2C) (26). The flow velocity at the DNA was 100 µm/s (see Supporting Text).
Single-molecule measurements were made with 0.051.0 nM hOgg1Cy3B in 0.01 M Tris or phosphate buffer/0.010.1 M NaCl/0.005 M 2-mercaptoethanol/0.5 mg/ml BSA/0.0001 M EDTA/5% glycerol at final pH 6.67.8 or with 0.010.10 nM MutMCy3B in similar buffers (lacking EDTA) at final pH 7.07.5.
Protein Preparation and Labeling.
We overproduced
-hOgg1 with an N-terminal hexahistidine tag (
indicates the hOgg1 splice variant targeted to the nucleus) including amino acids 12345 in E. coli. All hOgg1 proteins contained an engineered cysteine at the C terminus for labeling; some had the point mutation K249Q, which inactivates glycosylase activity, or H270A. Single-molecule experiments performed with WT and K249Q hOgg1 demonstrated that the K249Q mutation does not affect hOgg1s pH-dependent translocation activity (Fig. 5).
Single-molecule experiments using hOgg1 with a C-terminal truncation (at residue 322) reveal that neither the presence nor the absence of the amino acids excluded from the C terminus in crystallographic studies (residues 327345) affect the proteins pH-dependent translocation activity (data not shown). From the same measurements, we conclude that the translocation dynamics observed are independent of the position of the engineered C-terminal cysteine and dye molecule label.
Purified hOgg1 was labeled by reaction with Cy3Bmaleimide (Amersham Pharmacia Biosciences) at the C-terminal engineered cysteine. Conjugation under native conditions (0.02 M Tris or phosphate buffer, pH 7.5/0.5 M NaCl/0.001 M EDTA/10% glycerol) with excess reactive dye depended on the presence of the terminal cysteine, indicating that none of the eight native cysteine residues are labeled under the conditions used. Reaction with the sterically accessible C-terminal cysteine proceeded to completion over 60 min at room temperature under argon, assayed by visible (for quantitation of the label) and UV (for quantitation of the protein) absorption after stringent removal of the free dye by protein-affinity purification of the labeled sample. The selectivity of labeling was confirmed by mass spectrometry.
hOgg1 was also labeled with tetramethylrhodamine (Molecular Probes), Rhodamine red (Molecular Probes), and Cy3 (Amersham Pharmacia Biosciences). Because the translocation activity observed did not vary with either the position or the charge of the label (data not shown), we concluded that the dye molecule label does not significantly influence the translocation dynamics.
WT MutM from B. stearothermophilus was cloned with a C-terminal cysteine residue and overproduced in E. coli. Similarly to hOgg1, the unmodified protein does not react with Cy3Bmaleimide, but that containing the engineered cysteine reacts readily at room temperature.
Fluorescence Imaging. An inverted microscope (Olympus IX70) was used for wide-field imaging. A 40-µm diameter area was illuminated with 80300 W/cm2 at the critical angle for total internal reflection by focusing the beam of a 532-nm diode laser (UGA-250; LambdaPro, Beijing) into the back aperture of a x60 objective lens (numerical aperture 1.45; PlanApo, Olympus) (Fig. 2A). Fluorescent emission was collected through a dichroic mirror (model no. Z532RDC; Chroma Technology, Rockingham, VT) and bandpass filter (model no. HQ575/50; Chroma Technology) to be imaged on a back-illuminated and amplified charge-coupled device camera (Cascade:512B; Roper Scientific, Trenton, NJ). The protein concentration was chosen for each experiment such that, on average, fewer than three hOgg1 molecules were bound per DNA molecule.
Calculation of Diffusion Constant for Free Sliding Proteins on DNA.
The calculation was performed according to
|
|
as described in the literature for the E. coli lac repressor (28), where the denominator corresponds to the "effective friction" and
is the solution viscosity. We used a = hOgg1s hydrodynamic radius of 3.2 nm (see Supporting Text) and applied two improvements to the model to accommodate aspects of hOgg1DNA complexes visible in cocrystal structures (5, 7). First, we adjusted for the fact that the center of mass of the protein is displaced 2.5 nm from the axis of the DNA. Incorporating the 5.6-fold-longer translational path implied by the offset increases the effective friction for sliding by 10%. Second, the aspheric shape of hOgg1 apparent from the crystal structure was taken into account. We approximated hOgg1 as an oblate ellipsoid of revolution with axial ratio 2.2 and chose rotation about the short axis as that coupled to sliding, increasing the effective friction by an additional 20% compared with a sphere of equal volume (42). The total adjustment of 30% reduced the theoretical upper limit on the diffusion constant for free sliding to 1.2 µm2/s, 62 times lower than that measured for free diffusion of hOgg1 in solution (see Supporting Text). The calculation for MutM was carried out in a similar manner but for modeling the protein as a prolate ellipsoid of revolution. We found the sensitivity of the free sliding rate to the size, shape, and orientation of the protein (through rotational friction) notable.
| Acknowledgements |
|---|
|
|
|---|
| Footnotes |
|---|
Abbreviations: oxoG, 8-oxoguanine; hOgg1, human oxoguanine DNA glycosylase 1.
¶To whom correspondence may be addressed. E-mail: verdine{at}chemistry.harvard.edu or xie{at}chemistry.harvard.edu
Present address: Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, 240 Longwood Avenue, Boston, MA 02115. ![]()
Author contributions: G.L.V., X.S.X., and P.C.B. designed research; P.C.B. performed research; P.C.B. and A.B. contributed new reagents/analytic tools; P.C.B. analyzed data; and P.C.B., G.L.V., X.S.X., and A.M.v.O. wrote the paper.
Conflict of interest statement: No conflicts declared.
This paper was submitted directly (Track II) to the PNAS office.
© 2006 by The National Academy of Sciences of the USA
| References |
|---|
|
|
|---|
This article has been cited by other articles in HighWire Press-hosted journals:
![]() |
Z. Wunderlich and L. A. Mirny Spatial effects on the speed and reliability of protein-DNA search Nucleic Acids Res., May 3, 2008; (2008) gkn173v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Hu, A. Ma, and A. R. Dinner A two-step nucleotide-flipping mechanism enables kinetic discrimination of DNA lesions by AGT PNAS, March 25, 2008; 105(12): 4615 - 4620. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Zhao and A. R. Dinner Apparent Directional Scanning for DNA Repair Biophys. J., January 1, 2008; 94(1): 47 - 52. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. H. Kim and R. G. Larson Single-molecule analysis of 1D diffusion and transcription elongation of T7 RNA polymerase along individual stretched DNA molecules Nucleic Acids Res., June 28, 2007; 35(11): 3848 - 3858. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Elf, G.-W. Li, and X. S. Xie Probing Transcription Factor Dynamics at the Single-Molecule Level in a Living Cell Science, May 25, 2007; 316(5828): 1191 - 1194. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Y. Ali, E. B. Krementsova, G. G. Kennedy, R. Mahaffy, T. D. Pollard, K. M. Trybus, and D. M. Warshaw From the Cover: Myosin Va maneuvers through actin intersections and diffuses along microtubules PNAS, March 13, 2007; 104(11): 4332 - 4336. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. A. Kuznetsov, V. V. Koval, G. A. Nevinsky, K. T. Douglas, D. O. Zharkov, and O. S. Fedorova Kinetic Conformational Analysis of Human 8-Oxoguanine-DNA Glycosylase J. Biol. Chem., January 12, 2007; 282(2): 1029 - 1038. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||