# FtsK-dependent XerCD-*dif* recombination unlinks replication catenanes in a stepwise manner

^{a}Department of Mathematics, Saitama University, Saitama 380-8570, Japan;^{b}Faculty of Education, Yamaguchi University, Yamaguchi 753-8512, Japan;^{c}School of Environmental and Life Sciences, University of Newcastle, Callaghan, NSW 2308, Australia;^{d}Department of Biochemistry, University of Oxford, Oxford OX1 3QU, United Kingdom; and^{e}Department of Mathematics, San Francisco State University, San Francisco, CA 94132

See allHide authors and affiliations

Edited by De Witt Sumners, Florida State University, Tallahassee, FL, and accepted by the Editorial Board October 17, 2013 (received for review May 10, 2013)

## Significance

Newly replicated circular chromosomes are topologically linked. XerC/XerD-*dif* (XerCD-*dif*)–FtsK recombination acts in the replication termination region of the *Escherichia coli* chromosome to remove links introduced during homologous recombination and replication, whereas Topoisomerase IV removes replication links only. Based on gel mobility patterns of the products of recombination, a stepwise unlinking pathway has been proposed. Here, we present a rigorous mathematical validation of this model, a significant advance over prior biological approaches. We show definitively that there is a unique shortest pathway of unlinking by XerCD-*dif*–FtsK that strictly reduces the complexity of the links at every step. We delineate the mechanism of action of the enzymes at each step along this pathway and provide a 3D interpretation of the results.

## Abstract

In *Escherichia coli*, complete unlinking of newly replicated sister chromosomes is required to ensure their proper segregation at cell division. Whereas replication links are removed primarily by topoisomerase IV, XerC/XerD-*dif* site-specific recombination can mediate sister chromosome unlinking in Topoisomerase IV-deficient cells. This reaction is activated at the division septum by the DNA translocase FtsK, which coordinates the last stages of chromosome segregation with cell division. It has been proposed that, after being activated by FtsK, XerC/XerD-*dif* recombination removes DNA links in a stepwise manner. Here, we provide a mathematically rigorous characterization of this topological mechanism of DNA unlinking. We show that stepwise unlinking is the only possible pathway that strictly reduces the complexity of the substrates at each step. Finally, we propose a topological mechanism for this unlinking reaction.

The *Escherichia coli* chromosome is a 4.6-Mbp circular double-stranded (ds) DNA duplex, in which the two DNA strands are wrapped around each other ∼420,000 times. During replication, DNA gyrase acts to remove the majority of these strand crossings, but those that remain result in two circular sister molecules that are nontrivially linked. This creates the topological problem of separating the two linked sister chromosomes to ensure proper segregation at the time of cell division. Unlinking of replication links in *E. coli* is largely achieved by Topoisomerase IV (TopoIV), a type II topoisomerase (1, 2). However, Ip et al. demonstrated that XerC/XerD-dif (XerCD-*dif*) site-specific recombination, coupled with action of the translocase FtsK, could resolve linked plasmid substrates in vitro and hypothesized that this system could work alongside, yet independently of, TopoIV during in vivo unlinking of replicative catenanes in the bacterial chromosome (3). Grainge et al. then demonstrated that increased site-specific recombination could indeed compensate for a loss of TopoIV activity in unlinking chromosomes in vivo (4). When the activity of TopoIV is blocked, the result is cell lethality. We here propose a mathematically rigorous analysis to describe the pathway and mechanisms of unlinking of replication links by XerCD–FtsK. This work places a fundamental biological process within a mathematical context.

Site-specific recombination is a process of breakage and reunion at two specific dsDNA duplexes (the recombination sites). When the DNA substrate consists of circular DNA molecules, the recombination sites may occur in a single DNA circle or in separate circles. Two sites are in direct repeat if they are in the same orientation on one DNA circle (Fig. 1). The relative orientation of the sites is harder to characterize when the two sites are on separate DNA circles. In the case of simple torus links with 2m crossings (also called 2m-catenanes, or 2m-cats) for an integer *m > 1*, the sites are said to be in parallel or antiparallel orientation with respect to each other (Fig. 1). Site-specific recombination occurs in two steps (5, 6): first, the recombination sites are brought together (synapsis); second, each site is cleaved and the DNA ends are exchanged, then rejoined.

In *E. coli*, XerCD-*dif* recombination plays an essential role in chromosome dimer resolution (reviewed in ref. 7). Furthermore, when coupled with FtsK, XerCD recombination at *dif* sites can unlink 2m-cats produced in vitro by λ-Integrase (3). These results suggested a potential in vivo role for XerCD–FtsK recombination, which was then hypothesized to work with TopoIV to unlink DNA links produced by DNA replication. To test this hypothesis, a pair of supercoiled linked plasmids, each with one *dif* site, was produced in vivo by replication in TopoIV-deficient cells, and these were then incubated in vitro with XerCD–FtsK_{50C} (4). The ATP-dependent reaction efficiently produced unlinked circles. The ATP dependence of the reaction is likely twofold: firstly the DNA translocase activity of FtsK relies upon ATP hydrolysis for movement, and in the absence of translocation there is no stimulation of recombination. Secondly, the energy from ATP hydrolysis is also used to align the two recombining *dif* sites so that subsequent recombination produces the observed stepwise reduction in complexity. In addition to right-handed (RH) torus links with parallel sites and with 2–14 crossings, unknotted dimers and a few dimeric knots were also observed. The experimental data suggested a stepwise reaction where crossings are removed one at a time, iteratively converting links into knots, into links, until two free circles are obtained (Fig. 1). A control experiment demonstrated that XerCD–FtsK_{50C} recombination could convert knotted dimers (RH torus knots with two directly repeated *dif* sites) to free circles. Separate experiments showed that chromosome unlinking in *E. coli* can be accomplished in vivo by multiple rounds of XerCD-*dif* or Cre-*loxP* site-specific recombination. The reactions required DNA translocation by FtsK. Overexpression of FtsK_{50C} in TopoIV-deficient cells was sufficient to drive the topology simplification. Furthermore, in vivo XerCD activation by actively translocating FtsK is essential to effectively unlink replication links (4). In the absence of FtsK, an active XerCD complex may produce complicated DNA knots and links, with a small yield of unlinks (8). Whereas Xer site-specific recombination on DNA plasmids in vitro has been well-characterized at a local biochemical level, the mechanism of Xer-mediated DNA unlinking in vivo remains unclear, and is extremely technically difficult to address experimentally. Mathematical analysis using tangle calculus provided evidence that the stepwise unlinking mechanism (Fig. 1) was most consistent with the experimental data (4). However, this work relied on a number of assumptions. Here, we present an expanded mathematical analysis with a minimum number of assumptions. We characterize the topology of the recombination products and show that there is a unique shortest unlinking pathway that strictly reduces the complexity of the substrates at each step. This unique mathematical analysis significantly advances previous biological approaches.

## Tangle Method and Xer Recombination

The tangle method uses tangles to model changes in topology of the synaptic complex before and after site-specific recombination (9⇓⇓–12). For the purposes of this paper, a tangle is a ball with two strings inside. In a DNA–protein complex, the strings of a tangle can represent two dsDNA duplexes bound to the enzyme(s) (the ball) thus forming a local synapse (Fig. 2*A*). The tangle method relies on a few justified assumptions and on knowledge of the topology of substrate and products. One round of recombination is translated into a system of two tangle equations, each of which corresponds to the enzyme-bound DNA substrate and product. When substrate and product are in a specific family of knots and links, and when the tangles involved are assumed to be rational, possible topological mechanisms for the enzymatic action can be computed using tangle calculus (9, 10, 13). A rational tangle is one wherein a sequence of rotations of one pair of string ends relative to the other will lead to a trivial tangle, i.e., a tangle which admits a planar projection with no crossings (Fig. 2*B*). The strings of a rational tangle can always be depicted as winding around the ball’s surface without crossing themselves, much like DNA winding around the protein surface during synapsis. The tangle method has been used extensively to model the topological mechanism of site-specific recombination enzymes, and specifically Xer recombination (4, 12, 14⇓–16). Tangle theory and tangle calculus are introduced in *Mathematical Methods*.

Colloms et al. (17) showed that, when acting on unknotted DNA circles with two *psi* sites in direct repeat, XerCD yielded products of unique topology: a RH 4-cat with antiparallel sites. Experimental data support a view where the sites wrap around accessory proteins ∼3 times before recombination (18⇓–20). This reaction can be written as a system of tangle equations. Under biologically reasonable assumptions and using results from low-dimensional topology, one can show that the tangles involved are all rational (14, 15, 21). Therefore, all solutions to the XerCD–*psi* system of equations can be computed using tangle calculus. There are only three solutions consistent with the experimental data (15). It was further shown that these solutions can be seen as different projections of the same three-dimensional (3D) object, and a unique topological mechanism for XerCD at *psi* was proposed that incorporated all three solutions (15).

In Grainge et al. (4), several systems of tangle equations were proposed for the pathway taking replication links to two open circles (the unlink). Tangle calculus was used to solve each system. For example, all possible systems of two equations converting a RH 6-cat with parallel sites into a knotted product with five or fewer crossings were considered. Using tangle calculus, only three biologically meaningful solutions were found, all of which produced the RH 5-crossing torus knot with directly repeated sites. The authors proposed that the three solutions are equivalent by 3D rigid motion (i.e., the three solutions reflect different views of the same 3D shape). This study concluded that the stepwise unlinking pathway of Fig. 1 is the most likely pathway of XerCD–FtsK recombination when acting on 2m-cats, and posited a stepwise mechanism of action.

The mathematical study in ref. 4 assumed that solutions to the tangle equations were rational, sums of rational tangles, or closely related to these, because tangle calculus is not helpful in finding solutions outside these families (9, 13). Here, we extend the methods from refs. 4 and 15 with the following two objectives: 1. Completely characterize the shortest unlinking pathways by XerCD–FtsK; 2. Determine rationality of the tangles involved and compute the exact mechanism of action at each step of the process. We first determine that any shortest pathway to unlink a 2m-cat has at least 2m steps (*Theorem 1*, *SI Text* S1 and S2, and Figs. S1 and S2). If we further assume that topological complexity of the substrate, as measured by its crossing number, declines after each step, then the enzymes unlink in exactly 2m steps and all of the intermediates are torus knots/links (*Theorem 2*; *SI Text* S3 and Figs. S3 and S4). We then proceed to fully characterize the mechanisms of unlinking from trefoil to 2-cat, to unknot, to unlink (*Propositions 1–3*; *SI Text* S4). For other transitions we assume that the solutions are rational or sums of rational tangles and revert to using tangle calculus (*Propositions 4 and 5*; *SI Text* S4). In *Discussion*, we propose a mechanism that unifies all solutions (*SI Text* S5; Fig. S5) and consider the case where the assumption on the stepwise decline in topological complexity is removed. Support for this assumption is shown in *SI Text* S6 and Figs. S6 and S7.

## Results

### The Shortest Pathway of Unlinking of 2m-Cats by XerCD–FtsK Has Exactly 2m Steps.

Here, dsDNA is modeled as the curve drawn by the axis of the double helix. XerCD–FtsK-*dif* is represented by a tangle: a ball (the enzymes) with two strings (the dsDNA) inside that intersect the boundary of the ball at four points. These strings may be intertwined in nontrivial ways. We partition this tangle into the sum of two tangles *O*_{b} *+ P*. The tangle *P* (parental) encloses only the core regions of the recombination sites, whereas the tangle *O*_{b} encloses any geometrical information captured within the enzymatic complex before recombination (i.e., all other DNA crossings held in place by the protein complex outside the core sites). Let *O*_{f} be the tangle outside XerCD–FtsK-*dif*. *O*_{f} includes all DNA outside the complex. For simplicity, let *O = O*_{f} *+ O*_{b}. Cleavage and strand exchange occur inside the tangle *P*, converting it into a tangle *R* while keeping *O* fixed. The recombination event is then represented as a system of two tangle equations: *N*(*O + P*) = substrate and *N*(*O + R*) = product (Fig. 2*A*).

In this model, *P* and *R* enclose only the core regions of the *dif* recombination sites, each of which is 28 base pairs long. The relatively short length of the core sites, combined with the fact that XerCD are tyrosine recombinases (i.e., go through a Holliday Junction Intermediate), implies that *P* and *R* are trivial tangles. Without loss of generality we assume that the sites come together as *P =* (*0*) with parallel or antiparallel orientation, and *R =* (*0,0*) or (*±1*) (Fig. 2*B*).

*Theorem 1* below considers the minimum number of recombination events needed to convert the RH *2*m-cat to the unlink. Note that the proof of this theorem does not preclude the *R* tangle from changing from one event to the next.

### Theorem 1.

*Assume each action of XerCD–FtsK is modeled as a system of equations N*(*O + P*) *= substrate and N*(*O + R*) *= product where P =* (*0*) *and R =*(*0,0*)*,* (*−1*)*, or* (*1*)*. Then at least 2m recombination events are needed to convert the RH 2m-cat with parallel dif sites to the unlink.*

##### Proof:

Detailed background needed for this proof is provided in *SI Text* S1 and S2 and Figs. S1 and S2. The proof uses the signature of a link, a link invariant denoted by *σ.* If *L* is a RH 2m-cat with parallel *dif* sites, then . A result of Murasugi (22) implies that the difference of the signatures of the substrate and the product of recombination by XerCD–FtsK is at most 1 (*SI Text*, *Lemma S1*). Because the signature of the unlink is 0, we need at least (2m−1) recombination events to go from the 2m-cat to the unlink. As each recombination changes a link into a knot and vice versa, the total number of recombination events must be even. Hence at least 2m recombination events are needed to convert the 2m-cat to the unlink.

Figs. 1 and 3 illustrate shortest pathways of unlinking starting with the 6-cat and with the 2m-cat, respectively.

### The Shortest DNA Unlinking Pathway by XerCD–FtsK Is Unique.

*Theorem 1* provides formal proof that to unlink a replication catenane with 2m crossings, the XerCD–FtsK complex must perform at least 2m single recombination events. The pathway proposed in ref. 4 is one such shortest pathway (Figs. 1 and 3). In *Theorem 2* we ask whether there are other possible pathways, assuming a stepwise decrease in topological complexity.

We assume that each product of XerCD–FtsK recombination has a smaller number of crossings than its substrate, except perhaps for the last product of recombination (i.e., the unlink). The assumption that each recombination step reduces the topological complexity of its substrates is supported by the experimental data described below and by its quantification (*SI Text* S6; Figs. S6 and S7). In ref. 4, supercoiled replication catenanes produced in vivo in TopoIV-deficient cells were incubated in vitro with XerCD–FtsK_{50C}. The reaction yielded a spectrum of knotted and linked DNA plasmids. A time-course analysis showed that, over time, the substrate catenanes were efficiently converted to unlinked circles. It was proposed that knotted dimers appeared as recombination intermediates. In a separate reaction the authors confirmed that the XerCD–FtsK complex could also efficiently unknot knotted DNA with two directly repeated *dif* sites. Decatenation data suggest that unlinking and unknotting occur gradually, with a steady decrease in topological complexity (4).

### Theorem 2.

*Consider unlinking pathways for the 2m-cat. Assume that each recombination event, other than the final one, reduces the number of crossings. Then every intermediate is a RH 2k-cat, a RH* (*2k + 1*)*-torus knot, or the unknot. Furthermore, the pathway is unique:*The proof of *Theorem 2* is based on *Theorem 3* below, which relies on properties of the Euler characteristic of a Seifert surface for a knot/link, and the signature.

### Theorem 3.

*Let O, P, and R be tangles such that:*

*If P is assumed to be* (*0*) *and R is assumed to be* (*w,0*) *for some integer w, and the orientations of K*_{1} *and K*_{2} *agree outside P and R, then the following is true:*

*a*.*If K*_{1}*= RH 2k-cat T*(*2,2k*)*with parallel sites**, and K*_{2}*is a knot with at most 2k − 1 crossings, then K*_{2}*is the RH torus knot T*(*2,2k − 1*)*with directly repeated sites;**b*.*If K*_{1}*= RH torus knot T*(*2,2k + 1*)*with directly repeated sites**, and K*_{2}*is a link with at most 2k crossings then K*_{2}*is the RH 2k-cat T*(*2,2k*)*with parallel sites.*

Proof of *Theorem 3* is given in *SI Text* S3 and Fig. S3.

##### Proof of Theorem 2:

*Theorem 3* implies that the product of XerCD–FtsK recombination on a RH 6-cat with parallel sites is the RH 5-crossing torus knot with sites in direct repeat, and that the product of recombination on the 5-noded knot is the RH 4-cat with parallel sites. Repeated application of *Theorem 3* confirms that the pathway in Fig. 1 is the only shortest pathway of unlinking for the parallel RH 6-cat if we assume that recombination reduces the topological complexity at every step but the last. This same argument can be applied to XerCD–FtsK unlinking of any parallel RH 2m-cat, thus completing the proof (Fig. 3).

The condition of *Theorem 2* can be relaxed by allowing the crossing number to remain constant or to be reduced. This assumption gives rise to other more complicated pathways.

### Full Characterization of the Last Three Steps of DNA Unlinking by XerCD–FtsK.

By *Theorem 2*, each shortest unlinking pathway starting with a RH 2m-cat ends with a sequence of RH trefoil, Hopf link, unknot, and unlink (Fig. 1). Here we show that, under biologically reasonable assumptions, each of the last three recombination steps admits a very specific topological mechanism. As in ref. 15, assumptions for *P* and *R* are based on the length of the core region of the recombination sites, and restrict the number of meaningful solutions to a small finite number.

Recombination converting a RH trefoil with directly repeated sites into a 2-cat was characterized in ref. 16. Based on this, *Proposition 1* shows that if a trefoil substrate is acted upon by XerCD–FtsK, then there are exactly three possible topological mechanisms of action.

### Proposition 1 (adapted from ref. 16).

*Suppose N*(*O + P*) *= RH trefoil*, *N*(*O + R*) *= 2-cat. Then we show the following*:

*if P =*(*0*)*parallel and R =*(*−1*),*then O =*(*3*);*if P =*(*0*)*antiparallel and R =*(*0,0*),*then O =*(*3,−1,0*);*if P =*(*0*)*parallel*and*R =*(*1*),*then O =*(*3,−2,0*).

By an argument similar to that in ref. 11, *Proposition 2* shows that if the substrate of recombination is the 2-cat with parallel sites and the product is the unknot, and if *R* is rational, then the geometry in the synapse is uniquely determined for each choice of *R*.

### Proposition 2.

*Suppose N*(*O + P*) = *2-cat*, *N*(*O + R*) = *unknot. Then we show the following:*

*if P =*(*0*)*parallel and R =*(*−1*),*then O =*(*2*);*if P =*(*0*)*antiparallel and R =*(*0,0*),*then O =*(*2,−1,0*);*if P =*(*0*)*parallel and R =*(*1*),*then O =*(*2,−2,0*).

Finally *Proposition 3* states that, if the substrate of recombination is the unknot with two sites in direct repeat and the product is the unlink, then the geometry in the synapse is uniquely determined for each choice of *R*. The proof is based on a result included in *SI Text* S4 and Fig. S4.

### Proposition 3.

*Suppose N*(*O + P*) = *unknot, and N*(*O + R*) = *unlink. Then we show the following:*

*if P =*(*0*)*parallel and R =*(*−1*),*then O =*(*1*);*if P =*(*0*)*antiparallel and R =*(*0,0*),*then O =*(*0,0*);*if P =*(*0*)*parallel and R =*(*1*),*then O =*(*−1*).

In summary, the shortest unlinking pathway takes the trefoil to the 2-cat, the 2-cat to the unknot, and the unknot to the unlink (*Theorems 1* and *2*). There are exactly three topological mechanisms of action to account for each of these unlinking steps by XerCD–FtsK (*Propositions 1–3*).

### Unlinking of 2m-Cats and (2m−1) Torus Knots.

We have proposed a shortest pathway to account for XerCD–FtsK unlinking of replication catenanes and have proved that this pathway is unique. We have fully characterized the unlinking mechanisms taking trefoil to 2-cat, 2-cat to unknot, and unknot to unlink. With the current mathematical machinery we can provide full characterization only for the last three recombination events along the pathway. In the general case where a 2k-cat is converted into a (2k−1)-torus knot (*Proposition 4*) and a (2k−1)-torus knot is converted into a (2k−2)-cat (*Proposition 5*), we can propose topological mechanisms by assuming that *O* is equivalent to a rational tangle or sum of rational tangles. In this way, we use tangle calculus to propose the most likely mechanisms; however, there is no guarantee that these are the only ones. The proofs for the following two propositions are analogous, and are given in *SI Text* S4.

### Proposition 4.

*Suppose N*(*O + P*) = *RH 2k*-*cat*, *N*(*O + R*) = *RH* (*2k − 1)*-*torus knot. Suppose that O is equivalent to rational or the sum of rational tangles. Then we show the following:*

*if P =*(*0*)*and R =*(*−1*),*then O =*(*2k*);*if P =*(*0*)*and R =*(*0,0*), then*O =*(*2k,−1,0*);*if P =*(*0*)*and R =*(*1*),*then O =*(*2k,−2,0*).

### Proposition 5.

*Suppose N*(*O + P*) = *RH (2k − 1)*-*torus knot*, *N*(*O + R*) = *RH* (*2k − 2)*-*cat*. *Suppose that O is equivalent to rational or sum of rational tangles. Then we show the following:*

*if P =*(*0*)*and R =*(*−1*),*then O =*(*2k − 1*);*if P =*(*0*)*and R =*(*0,0*),*then O =*(*2k − 1,−1,0*);*if P = (0) and R =*(*1*),*then O = (2k − 1,−2,0).*

The results of *Propositions 4* and *5* are consistent with those of *Propositions 1–3*. In the general case, the shortest unlinking pathway takes a RH 2k-cat with parallel sites into a RH (2k − 1)-knot with sites in direct repeats, and then takes that knot into a RH (2k − 2)-cat with parallel sites (*Theorems 1* and *2*). Using *Propositions 4* and *5*, each of the recombination events above admits three possible topological mechanisms of action.

## Discussion

### Unification of the Individual Stepwise Unlinking Pathways.

We have shown that for each recombination step in the unlinking pathway there are three corresponding topological mechanisms of action for the enzyme. These mechanisms can be interpreted as three views of the same 3D object (Figs. 4 and 5). The mechanisms computed for each recombination step can be clustered together to propose three stepwise unlinking mechanisms (Fig. 4). Notice that each row in Fig. 4 illustrates the topological mechanism for the last three steps of XerCD–FtsK DNA unlinking. In refs. 15 and 17 it was proposed that the topological mechanism with *P* = (*0*) with sites in parallel alignment and *R* = (*−1*) was the most consistent with the experimental data. The mechanism proposed in ref. 17 corresponds to the pathway in Fig. 4, row 1 (and is revisited in *SI Text* S5 and Fig. S5). The other two pathways (rows 2 and 3) are obtained by rigid rotation of the pathway in row 1, and correspond to *P* = (*0*) antiparallel and *R* = (*0,0*), and to *P* = (*0*) parallel and *R* = (*1*), respectively.

### Tangle Analysis of Iterative Recombination.

Because the recombination event involves two separate pairs of strand exchanges and proceeds through a Holliday Junction (HJ) Intermediate, there must be a step resetting the synapse between recombination events to maintain the same recombination mechanism at each event (i.e., if XerD is always to exchange the first pair of strands to form the HJ which is resolved by XerC, as shown in ref. 8). There is little evidence of the ability of XerCD and other tyrosine recombinases to act processively (i.e., bind and recombine a single pair of recombination sites multiple times before dissociating). However, in refs. 3, 4, and 24 the authors report an action of XerCD recombination consistent with iterative cleavage and strand exchange. If the recombination reaction is assumed to be iterative, and is modeled as processive (9, 25), then it can be proven that the tangle *O* is rational, and therefore all solutions are computable.

### 3D Interpretation of the Tangle Solutions.

In the tangle method, *P* is defined as a very small ball containing the cleavage core regions of the recombination sites. In this case, the orientation of the recombination sites is inherited into the tangle *P*. Two sites in *P =* (*0*) are in parallel alignment if both arrows point in the same direction in the tangle diagram; otherwise they are in antiparallel alignment. However, the concept of parallel and antiparallel alignment represents a local geometric property of the sites and is well-defined only in the tangle diagram (a planar projection of the 3D tangle). For the same 3D tangle, one can always obtain a planar projection with parallel sites and another with antiparallel sites. The only exception to this statement is when two sites are coplanar, in a strict mathematical sense. In ref. 15 the authors proposed a unique 3D topological mechanism for XerCD at *psi* that incorporated all three solutions to the tangle equations. Given a solution where *O* is rational, *P =* (*0*) and *R=* (*1*) or *R =* (*−1*), one can obtain a solution *S′* equivalent to *S* by rotating the synapse *P =* (*0*) to *P =* (*0,0*). In the case of XerCD at *psi*, performing this simple transformation on the two solutions for *P* parallel yields the unique solution for *P* antiparallel. Similar arguments have been proposed for the recombinase TnP1-IRS (11) and for XerCD–FtsK-*dif* (4). Here, we extend these arguments and apply them to each recombination step in the XerCD–FtsK-*dif* unlinking pathway (Figs. 4 and 5). The observed geometrical equivalence, obtained by rigid motion, between the three solutions for one of the recombination steps suggests that a unique 3D representation of the synaptic complex and topological mechanism of the enzymes can be interpreted as three different tangle solutions when viewed from different spatial directions. Computer visualization methods, combined with molecular modeling software such as PyMol and with X-ray crystallographic data, can be applied to realize these spatial equivalences and to propose a 3D molecular model for the enzymatic action, as was done in ref. 15. An animation generated with Knotplot (www.knotplot.com) is presented in Movie S1 to illustrate the transition from the 6-cat to the 5-torus knot. These studies indicate a limitation of the tangle model and suggest the need to consider equivalence classes of planar tangle diagrams related by 3D rigid motion (rotations and translations).

## Conclusions

We have presented a thorough mathematical analysis of the process of DNA unlinking by XerCD–FtsK. Based on the experimental data of ref. 4, we first prove that the shortest pathway of unlinking a substrate consisting of 2m-cats with sites in parallel orientation has exactly 2m steps. We then show that if we assume that recombination reduces the complexity of the substrate at each step, as suggested by the experimental data, then the shortest pathway is unique and every recombination intermediate is a torus knot or link. We analyze each recombination step and find three topological mechanisms at each step. The mechanisms can be unified in two different ways: (1) At each recombination step, the obtained mechanisms can be interpreted as different projections of the same 3D object, as illustrated in Figs. 4 and 5 and in the animation in Movie S1; or (2) The obtained mechanisms can be sorted by the local action of the enzyme (specified by *P, R*, and the relative orientation of the sites). This unification produces three separate mechanisms of stepwise unlinking consistent with the unique, shortest pathway previously reported.

## Mathematical Methods

Knot theory is the study of non–self-intersecting curves in 3D space, and of the spaces defined by these curves. A knot is defined as a non–self-intersecting circular path in space. A link (or catenane) is composed of two or more such curves, which can be intertwined. If the path can be laid flat without any over- or undercrossings, then *K* is the unknot or the trivial knot. Likewise, the unlink, or trivial link, is the disjoint union of two unknotted circles. Knots and links can be studied through their diagrams, i.e., planar projections that distinguish between under- and overcrossings. Two knots or links *K*_{1} and *K*_{2} are equivalent if *K*_{1} can be smoothly deformed into *K*_{2} without breaking the chain. We indistinguishably denote the knot/link, or its equivalence class, by *K*. Knots and links are classified according to their crossing numbers. The crossing number of *K* is the minimum number of crossings taken over all projections of all elements in the equivalence class *K*. Although finer classifications are attempted, it is generally difficult to determine when two knots/links are topologically identical (26).

A tangle is a pair consisting of a ball and two strings in it. Fig. 2*B* illustrates the four simplest tangles, called trivial tangles. Rational tangles are obtained by intertwining the two strings in a trivial tangle. All tangles in Fig. 6 are rational. Rational tangles most often appear in the mathematical analysis of site-specific recombination. There is a one-to-one correspondence between the set of rational tangles and the set of extended rational numbers (i.e., the union of the set of rational numbers and the infinity 1/0) (27). A rational tangle can be expressed using a vector representation, called the Conway vector. Conway vectors are illustrated in Fig. 6. Notice that the same rational tangle can admit different vector representations. The tangle method uses two operations: tangle addition *A + B;* and numerator *N*(*A*) (Fig. 6*C*). *A + B* is a tangle, whereas *N*(*A*) is a knot or a link. If *A* is rational then *N*(*A*) belongs to the well-studied family of 4-plat knots or links (28).

The tangle method uses biologically reasonable assumptions to model a site-specific recombination event as a system of two or more tangle equations on three unknowns *O*, *P,* and *R*. If *P =* (*0*), *R =* (*w*) or (*w,0*) for some integer *w*, then the system can be solved for *O* rational, sums of rational tangles (9, 13). The solution set can be very large. To obtain the most biologically relevant solutions we often resort to assumptions where *P =* (*0*) and *R =* (*k*) or (*k,0*)*.* These assumptions are backed by biological data as detailed in ref. 15. Computing the solutions is not mathematically challenging but can be very tedious. Mathematical software is available to make the method more accessible to the scientific community (29, 30). TangleSolve (http://ewok.sfsu.edu/TangleSolve) is a stand-alone Java program and web-based applet with a user-friendly interface for analyzing and visualizing site-specific recombination mechanisms using the tangle method (29).

Using arguments from low-dimensional topology, *O*, *P,* and *R* can sometimes be shown to be equivalent to rational or sums of two rational tangles (9, 11, 13⇓⇓–16, 25). In these cases one concludes that, under the assumptions of the tangle method, the solutions computed are the only possible solutions to a given system of equations. Biologically this implies that the tangle method computes all possible topological mechanisms of action for the enzyme. In some cases reasonable assumptions can be made about *P* and *R* to limit the number of solutions to a small finite number (14, 15). Mathematical analyses are often useful to characterize topological mechanisms of recombination (4, 10, 11, 13⇓⇓–16, 25, 31⇓⇓–34). A preliminary tangle analysis of Xer unlinking is given in ref. 4.

## Acknowledgments

The authors thank Rob Scharein for help with some of the figures and Movie S1, which were generated using Knotplot (www.knotplot.com), and Barbara Ustanko for editorial assistance with this manuscript. The authors also thank the referees for their careful review of the manuscript. M.V., K.S., and K.I. thank the Institute of Mathematics and Its Applications for its hospitality and support. This research was supported by the following: Japan Society for the Promotion of Science KAKENHI 22540066 and 25400080 (to K.S.); National Science Foundation DMS0920887 and CAREER Grant DMS1057284 (to M.V.); UK Engineering and Physical Sciences Research Council EP/H031367 (to K.I.); Australian Research Council FT120100153 and NHMRC APP1005697 (to I.G.); and Wellcome Trust WT099204AIA (to D.J.S.).

## Footnotes

- ↵
^{1}To whom correspondence should be addressed. E-mail: mariel{at}sfsu.edu.

Author contributions: K.S., D.J.S., and M.V. designed research; K.S., K.I., and M.V. performed research; K.S., K.I., I.G., D.J.S., and M.V. contributed new reagents/analytic tools; K.S., K.I., I.G., D.J.S., and M.V. analyzed data; and K.S., I.G., D.J.S., and M.V. wrote the paper.

The authors declare no conflict of interest.

This article is a PNAS Direct Submission. D.W.S. is a guest editor invited by the Editorial Board.

See Commentary on page 20854.

This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1308450110/-/DCSupplemental.

Freely available online through the PNAS open access option.

## References

- ↵
- ↵
- Schvartzman JB,
- Stasiak A

- ↵
- Ip SC,
- Bregu M,
- Barre FX,
- Sherratt DJ

- ↵
- Grainge I,
- et al.

- ↵
- ↵
- ↵
- Higgins PN

- Barre FX,
- Sherratt DJ

- ↵
- Grainge I,
- Lesterlin C,
- Sherratt DJ

*dif*recombination by the FtsK DNA translocase. Nucleic Acids Res 39(12):5140–5148. - ↵
- ↵
- ↵
- Kawauchi A

- Zheng W,
- Galloy C,
- Hallet B,
- Vazquez M

*OCAMI Studies*, ed Kawauchi A (Osaka Municipal Universities Press, Sakai, Japan), Vol 1, pp 251–271. - ↵
- Arsuaga J,
- Diao Y,
- Vazquez M

*Mathematics of DNA Structure, Function and Interactions*, The IMA Volumes in Mathematics and its Applications 150, eds CJ Benham et al. (Springer Science + Business Media, New York), pp 7–36. - ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- Alén C,
- Sherratt DJ,
- Colloms SD

- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- Murasugi K

- ↵
- Leech J

- Conway JH

- ↵
- Burde G,
- Zieschang H

- ↵
- Saka Y,
- Vázquez M

- ↵
- Darcy IK,
- Scharein RG

- ↵
- ↵
- ↵
- ↵

## Citation Manager Formats

## Article Classifications

- Physical Sciences
- Applied Mathematics

- Biological Sciences
- Biochemistry

## See related content: