Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2013 Nov 11;110(52):20906–20911. doi: 10.1073/pnas.1308450110

FtsK-dependent XerCD-dif recombination unlinks replication catenanes in a stepwise manner

Koya Shimokawa a, Kai Ishihara b, Ian Grainge c, David J Sherratt d, Mariel Vazquez e,1
PMCID: PMC3876235  PMID: 24218579

Significance

Newly replicated circular chromosomes are topologically linked. XerC/XerD-dif (XerCD-dif)–FtsK recombination acts in the replication termination region of the Escherichia coli chromosome to remove links introduced during homologous recombination and replication, whereas Topoisomerase IV removes replication links only. Based on gel mobility patterns of the products of recombination, a stepwise unlinking pathway has been proposed. Here, we present a rigorous mathematical validation of this model, a significant advance over prior biological approaches. We show definitively that there is a unique shortest pathway of unlinking by XerCD-dif–FtsK that strictly reduces the complexity of the links at every step. We delineate the mechanism of action of the enzymes at each step along this pathway and provide a 3D interpretation of the results.

Keywords: DNA topology, tangle method, Xer recombination, band surgery, topology simplification

Abstract

In Escherichia coli, complete unlinking of newly replicated sister chromosomes is required to ensure their proper segregation at cell division. Whereas replication links are removed primarily by topoisomerase IV, XerC/XerD-dif site-specific recombination can mediate sister chromosome unlinking in Topoisomerase IV-deficient cells. This reaction is activated at the division septum by the DNA translocase FtsK, which coordinates the last stages of chromosome segregation with cell division. It has been proposed that, after being activated by FtsK, XerC/XerD-dif recombination removes DNA links in a stepwise manner. Here, we provide a mathematically rigorous characterization of this topological mechanism of DNA unlinking. We show that stepwise unlinking is the only possible pathway that strictly reduces the complexity of the substrates at each step. Finally, we propose a topological mechanism for this unlinking reaction.


The Escherichia coli chromosome is a 4.6-Mbp circular double-stranded (ds) DNA duplex, in which the two DNA strands are wrapped around each other ∼420,000 times. During replication, DNA gyrase acts to remove the majority of these strand crossings, but those that remain result in two circular sister molecules that are nontrivially linked. This creates the topological problem of separating the two linked sister chromosomes to ensure proper segregation at the time of cell division. Unlinking of replication links in E. coli is largely achieved by Topoisomerase IV (TopoIV), a type II topoisomerase (1, 2). However, Ip et al. demonstrated that XerC/XerD-dif (XerCD-dif) site-specific recombination, coupled with action of the translocase FtsK, could resolve linked plasmid substrates in vitro and hypothesized that this system could work alongside, yet independently of, TopoIV during in vivo unlinking of replicative catenanes in the bacterial chromosome (3). Grainge et al. then demonstrated that increased site-specific recombination could indeed compensate for a loss of TopoIV activity in unlinking chromosomes in vivo (4). When the activity of TopoIV is blocked, the result is cell lethality. We here propose a mathematically rigorous analysis to describe the pathway and mechanisms of unlinking of replication links by XerCD–FtsK. This work places a fundamental biological process within a mathematical context.

Site-specific recombination is a process of breakage and reunion at two specific dsDNA duplexes (the recombination sites). When the DNA substrate consists of circular DNA molecules, the recombination sites may occur in a single DNA circle or in separate circles. Two sites are in direct repeat if they are in the same orientation on one DNA circle (Fig. 1). The relative orientation of the sites is harder to characterize when the two sites are on separate DNA circles. In the case of simple torus links with 2m crossings (also called 2m-catenanes, or 2m-cats) for an integer m > 1, the sites are said to be in parallel or antiparallel orientation with respect to each other (Fig. 1). Site-specific recombination occurs in two steps (5, 6): first, the recombination sites are brought together (synapsis); second, each site is cleaved and the DNA ends are exchanged, then rejoined.

Fig. 1.

Fig. 1.

Proposed stepwise unlinking by XerCD-dif–FtsK recombination: Parallel RH 2m-cats [e.g., T(2,6)p] are converted to RH torus knots [e.g., T(2,5)] with directly repeated sites; such knots are converted to RH cats, and so on, iteratively, until the reaction stops at two open circles. (In ref. 4, RH torus links with parallel sites and up to 14 crossings, also called parallel 2m-cats and denoted by T(2,2m)p, were used as substrates of Xer recombination.)

In E. coli, XerCD-dif recombination plays an essential role in chromosome dimer resolution (reviewed in ref. 7). Furthermore, when coupled with FtsK, XerCD recombination at dif sites can unlink 2m-cats produced in vitro by λ-Integrase (3). These results suggested a potential in vivo role for XerCD–FtsK recombination, which was then hypothesized to work with TopoIV to unlink DNA links produced by DNA replication. To test this hypothesis, a pair of supercoiled linked plasmids, each with one dif site, was produced in vivo by replication in TopoIV-deficient cells, and these were then incubated in vitro with XerCD–FtsK50C (4). The ATP-dependent reaction efficiently produced unlinked circles. The ATP dependence of the reaction is likely twofold: firstly the DNA translocase activity of FtsK relies upon ATP hydrolysis for movement, and in the absence of translocation there is no stimulation of recombination. Secondly, the energy from ATP hydrolysis is also used to align the two recombining dif sites so that subsequent recombination produces the observed stepwise reduction in complexity. In addition to right-handed (RH) torus links with parallel sites and with 2–14 crossings, unknotted dimers and a few dimeric knots were also observed. The experimental data suggested a stepwise reaction where crossings are removed one at a time, iteratively converting links into knots, into links, until two free circles are obtained (Fig. 1). A control experiment demonstrated that XerCD–FtsK50C recombination could convert knotted dimers (RH torus knots with two directly repeated dif sites) to free circles. Separate experiments showed that chromosome unlinking in E. coli can be accomplished in vivo by multiple rounds of XerCD-dif or Cre-loxP site-specific recombination. The reactions required DNA translocation by FtsK. Overexpression of FtsK50C in TopoIV-deficient cells was sufficient to drive the topology simplification. Furthermore, in vivo XerCD activation by actively translocating FtsK is essential to effectively unlink replication links (4). In the absence of FtsK, an active XerCD complex may produce complicated DNA knots and links, with a small yield of unlinks (8). Whereas Xer site-specific recombination on DNA plasmids in vitro has been well-characterized at a local biochemical level, the mechanism of Xer-mediated DNA unlinking in vivo remains unclear, and is extremely technically difficult to address experimentally. Mathematical analysis using tangle calculus provided evidence that the stepwise unlinking mechanism (Fig. 1) was most consistent with the experimental data (4). However, this work relied on a number of assumptions. Here, we present an expanded mathematical analysis with a minimum number of assumptions. We characterize the topology of the recombination products and show that there is a unique shortest unlinking pathway that strictly reduces the complexity of the substrates at each step. This unique mathematical analysis significantly advances previous biological approaches.

Tangle Method and Xer Recombination

The tangle method uses tangles to model changes in topology of the synaptic complex before and after site-specific recombination (912). For the purposes of this paper, a tangle is a ball with two strings inside. In a DNA–protein complex, the strings of a tangle can represent two dsDNA duplexes bound to the enzyme(s) (the ball) thus forming a local synapse (Fig. 2A). The tangle method relies on a few justified assumptions and on knowledge of the topology of substrate and products. One round of recombination is translated into a system of two tangle equations, each of which corresponds to the enzyme-bound DNA substrate and product. When substrate and product are in a specific family of knots and links, and when the tangles involved are assumed to be rational, possible topological mechanisms for the enzymatic action can be computed using tangle calculus (9, 10, 13). A rational tangle is one wherein a sequence of rotations of one pair of string ends relative to the other will lead to a trivial tangle, i.e., a tangle which admits a planar projection with no crossings (Fig. 2B). The strings of a rational tangle can always be depicted as winding around the ball’s surface without crossing themselves, much like DNA winding around the protein surface during synapsis. The tangle method has been used extensively to model the topological mechanism of site-specific recombination enzymes, and specifically Xer recombination (4, 12, 1416). Tangle theory and tangle calculus are introduced in Mathematical Methods.

Fig. 2.

Fig. 2.

(A) XerCD–FtsK-dif complex before recombination is modeled by the sum of tangles Ob + P. There, P encloses the core regions of the recombination sites and therefore can be seen as a ball with two very short DNA strings. Site-specific recombination occurs inside P. The action of cleavage and strand exchange is modeled by converting P into a tangle R. In the case of unknotted substrates, the two DNA strings outside the enzymatic complex have been observed to be supercoiled but not intertwined. However, in the case of knotted or linked substrates one would expect the DNA outside XerCD–FtsK not to be topologically trivial. For simplicity, we push any such topological complexity into the shaded area of the tangle in the figure, which we then refer to as the outside tangle O. We define O = Of + Ob, where Of includes the DNA outside XerCD-dif-FtsK. The recombination event is represented as a system of equations N(O + P) = substrate and N(O + R) = product, and the tangle O is not changed during the recombination. (B) Orientations of the recombination sites induce orientations to the strings inside P and R. When the two sites are inside P = (0) in parallel alignment, then R = (−1) or (1). When the sites are in P = (0) in antiparallel alignment, then R = (0,0). Tangles (0), (0,0), (−1), and (1) are the trivial tangles.

Colloms et al. (17) showed that, when acting on unknotted DNA circles with two psi sites in direct repeat, XerCD yielded products of unique topology: a RH 4-cat with antiparallel sites. Experimental data support a view where the sites wrap around accessory proteins ∼3 times before recombination (1820). This reaction can be written as a system of tangle equations. Under biologically reasonable assumptions and using results from low-dimensional topology, one can show that the tangles involved are all rational (14, 15, 21). Therefore, all solutions to the XerCD–psi system of equations can be computed using tangle calculus. There are only three solutions consistent with the experimental data (15). It was further shown that these solutions can be seen as different projections of the same three-dimensional (3D) object, and a unique topological mechanism for XerCD at psi was proposed that incorporated all three solutions (15).

In Grainge et al. (4), several systems of tangle equations were proposed for the pathway taking replication links to two open circles (the unlink). Tangle calculus was used to solve each system. For example, all possible systems of two equations converting a RH 6-cat with parallel sites into a knotted product with five or fewer crossings were considered. Using tangle calculus, only three biologically meaningful solutions were found, all of which produced the RH 5-crossing torus knot with directly repeated sites. The authors proposed that the three solutions are equivalent by 3D rigid motion (i.e., the three solutions reflect different views of the same 3D shape). This study concluded that the stepwise unlinking pathway of Fig. 1 is the most likely pathway of XerCD–FtsK recombination when acting on 2m-cats, and posited a stepwise mechanism of action.

The mathematical study in ref. 4 assumed that solutions to the tangle equations were rational, sums of rational tangles, or closely related to these, because tangle calculus is not helpful in finding solutions outside these families (9, 13). Here, we extend the methods from refs. 4 and 15 with the following two objectives: 1. Completely characterize the shortest unlinking pathways by XerCD–FtsK; 2. Determine rationality of the tangles involved and compute the exact mechanism of action at each step of the process. We first determine that any shortest pathway to unlink a 2m-cat has at least 2m steps (Theorem 1, SI Text S1 and S2, and Figs. S1 and S2). If we further assume that topological complexity of the substrate, as measured by its crossing number, declines after each step, then the enzymes unlink in exactly 2m steps and all of the intermediates are torus knots/links (Theorem 2; SI Text S3 and Figs. S3 and S4). We then proceed to fully characterize the mechanisms of unlinking from trefoil to 2-cat, to unknot, to unlink (Propositions 1–3; SI Text S4). For other transitions we assume that the solutions are rational or sums of rational tangles and revert to using tangle calculus (Propositions 4 and 5; SI Text S4). In Discussion, we propose a mechanism that unifies all solutions (SI Text S5; Fig. S5) and consider the case where the assumption on the stepwise decline in topological complexity is removed. Support for this assumption is shown in SI Text S6 and Figs. S6 and S7.

Results

The Shortest Pathway of Unlinking of 2m-Cats by XerCD–FtsK Has Exactly 2m Steps.

Here, dsDNA is modeled as the curve drawn by the axis of the double helix. XerCD–FtsK-dif is represented by a tangle: a ball (the enzymes) with two strings (the dsDNA) inside that intersect the boundary of the ball at four points. These strings may be intertwined in nontrivial ways. We partition this tangle into the sum of two tangles Ob + P. The tangle P (parental) encloses only the core regions of the recombination sites, whereas the tangle Ob encloses any geometrical information captured within the enzymatic complex before recombination (i.e., all other DNA crossings held in place by the protein complex outside the core sites). Let Of be the tangle outside XerCD–FtsK-dif. Of includes all DNA outside the complex. For simplicity, let O = Of + Ob. Cleavage and strand exchange occur inside the tangle P, converting it into a tangle R while keeping O fixed. The recombination event is then represented as a system of two tangle equations: N(O + P) = substrate and N(O + R) = product (Fig. 2A).

In this model, P and R enclose only the core regions of the dif recombination sites, each of which is 28 base pairs long. The relatively short length of the core sites, combined with the fact that XerCD are tyrosine recombinases (i.e., go through a Holliday Junction Intermediate), implies that P and R are trivial tangles. Without loss of generality we assume that the sites come together as P = (0) with parallel or antiparallel orientation, and R = (0,0) or (±1) (Fig. 2B).

Theorem 1 below considers the minimum number of recombination events needed to convert the RH 2m-cat to the unlink. Note that the proof of this theorem does not preclude the R tangle from changing from one event to the next.

Theorem 1.

Assume each action of XerCD–FtsK is modeled as a system of equations N(O + P) = substrate and N(O + R) = product where P = (0) and R =(0,0), (−1), or (1). Then at least 2m recombination events are needed to convert the RH 2m-cat with parallel dif sites to the unlink.

Proof:

Detailed background needed for this proof is provided in SI Text S1 and S2 and Figs. S1 and S2. The proof uses the signature of a link, a link invariant denoted by σ. If L is a RH 2m-cat with parallel dif sites, then Inline graphic. A result of Murasugi (22) implies that the difference of the signatures of the substrate and the product of recombination by XerCD–FtsK is at most 1 (SI Text, Lemma S1). Because the signature of the unlink is 0, we need at least (2m−1) recombination events to go from the 2m-cat to the unlink. As each recombination changes a link into a knot and vice versa, the total number of recombination events must be even. Hence at least 2m recombination events are needed to convert the 2m-cat to the unlink.

Figs. 1 and 3 illustrate shortest pathways of unlinking starting with the 6-cat and with the 2m-cat, respectively.

Fig. 3.

Fig. 3.

Unlinking pathway of RH 2m-cat: RH 2m-cat, RH (2m − 1)-torus knot, RH (2m − 2)-cat,…, RH trefoil, 2-cat, the unknot, trivial link.

The Shortest DNA Unlinking Pathway by XerCD–FtsK Is Unique.

Theorem 1 provides formal proof that to unlink a replication catenane with 2m crossings, the XerCD–FtsK complex must perform at least 2m single recombination events. The pathway proposed in ref. 4 is one such shortest pathway (Figs. 1 and 3). In Theorem 2 we ask whether there are other possible pathways, assuming a stepwise decrease in topological complexity.

We assume that each product of XerCD–FtsK recombination has a smaller number of crossings than its substrate, except perhaps for the last product of recombination (i.e., the unlink). The assumption that each recombination step reduces the topological complexity of its substrates is supported by the experimental data described below and by its quantification (SI Text S6; Figs. S6 and S7). In ref. 4, supercoiled replication catenanes produced in vivo in TopoIV-deficient cells were incubated in vitro with XerCD–FtsK50C. The reaction yielded a spectrum of knotted and linked DNA plasmids. A time-course analysis showed that, over time, the substrate catenanes were efficiently converted to unlinked circles. It was proposed that knotted dimers appeared as recombination intermediates. In a separate reaction the authors confirmed that the XerCD–FtsK complex could also efficiently unknot knotted DNA with two directly repeated dif sites. Decatenation data suggest that unlinking and unknotting occur gradually, with a steady decrease in topological complexity (4).

Theorem 2.

Consider unlinking pathways for the 2m-cat. Assume that each recombination event, other than the final one, reduces the number of crossings. Then every intermediate is a RH 2k-cat, a RH (2k + 1)-torus knot, or the unknot. Furthermore, the pathway is unique:

graphic file with name pnas.1308450110uneq1.jpg

The proof of Theorem 2 is based on Theorem 3 below, which relies on properties of the Euler characteristic of a Seifert surface for a knot/link, and the signature.

Theorem 3.

Let O, P, and R be tangles such that:

graphic file with name pnas.1308450110uneq2.jpg
graphic file with name pnas.1308450110uneq3.jpg

If P is assumed to be (0) and R is assumed to be (w,0) for some integer w, and the orientations of K1 and K2 agree outside P and R, then the following is true:

  • a.

    If K1 = RH 2k-cat T(2,2k) with parallel sites Inline graphic, and K2 is a knot with at most 2k − 1 crossings, then K2 is the RH torus knot T(2,2k − 1) with directly repeated sites;

  • b.

    If K1 = RH torus knot T(2,2k + 1) with directly repeated sites Inline graphic, and K2 is a link with at most 2k crossings then K2 is the RH 2k-cat T(2,2k) with parallel sites.

Proof of Theorem 3 is given in SI Text S3 and Fig. S3.

Proof of Theorem 2:

Theorem 3 implies that the product of XerCD–FtsK recombination on a RH 6-cat with parallel sites is the RH 5-crossing torus knot with sites in direct repeat, and that the product of recombination on the 5-noded knot is the RH 4-cat with parallel sites. Repeated application of Theorem 3 confirms that the pathway in Fig. 1 is the only shortest pathway of unlinking for the parallel RH 6-cat if we assume that recombination reduces the topological complexity at every step but the last. This same argument can be applied to XerCD–FtsK unlinking of any parallel RH 2m-cat, thus completing the proof (Fig. 3).

The condition of Theorem 2 can be relaxed by allowing the crossing number to remain constant or to be reduced. This assumption gives rise to other more complicated pathways.

Full Characterization of the Last Three Steps of DNA Unlinking by XerCD–FtsK.

By Theorem 2, each shortest unlinking pathway starting with a RH 2m-cat ends with a sequence of RH trefoil, Hopf link, unknot, and unlink (Fig. 1). Here we show that, under biologically reasonable assumptions, each of the last three recombination steps admits a very specific topological mechanism. As in ref. 15, assumptions for P and R are based on the length of the core region of the recombination sites, and restrict the number of meaningful solutions to a small finite number.

Recombination converting a RH trefoil with directly repeated sites into a 2-cat was characterized in ref. 16. Based on this, Proposition 1 shows that if a trefoil substrate is acted upon by XerCD–FtsK, then there are exactly three possible topological mechanisms of action.

Proposition 1 (adapted from ref. 16).

Suppose N(O + P) = RH trefoil, N(O + R) = 2-cat. Then we show the following:

  • if P = (0) parallel and R = (−1), then O = (3);

  • if P = (0) antiparallel and R = (0,0), then O = (3,−1,0);

  • if P = (0) parallel and R = (1), then O = (3,−2,0).

By an argument similar to that in ref. 11, Proposition 2 shows that if the substrate of recombination is the 2-cat with parallel sites and the product is the unknot, and if R is rational, then the geometry in the synapse is uniquely determined for each choice of R.

Proposition 2.

Suppose N(O + P) = 2-cat, N(O + R) = unknot. Then we show the following:

  • if P = (0) parallel and R = (−1), then O = (2);

  • if P = (0) antiparallel and R = (0,0), then O = (2,−1,0);

  • if P = (0) parallel and R = (1), then O = (2,−2,0).

Finally Proposition 3 states that, if the substrate of recombination is the unknot with two sites in direct repeat and the product is the unlink, then the geometry in the synapse is uniquely determined for each choice of R. The proof is based on a result included in SI Text S4 and Fig. S4.

Proposition 3.

Suppose N(O + P) = unknot, and N(O + R) = unlink. Then we show the following:

  • if P = (0) parallel and R = (−1), then O = (1);

  • if P = (0) antiparallel and R = (0,0), then O = (0,0);

  • if P = (0) parallel and R = (1), then O = (−1).

In summary, the shortest unlinking pathway takes the trefoil to the 2-cat, the 2-cat to the unknot, and the unknot to the unlink (Theorems 1 and 2). There are exactly three topological mechanisms of action to account for each of these unlinking steps by XerCD–FtsK (Propositions 1–3).

Unlinking of 2m-Cats and (2m−1) Torus Knots.

We have proposed a shortest pathway to account for XerCD–FtsK unlinking of replication catenanes and have proved that this pathway is unique. We have fully characterized the unlinking mechanisms taking trefoil to 2-cat, 2-cat to unknot, and unknot to unlink. With the current mathematical machinery we can provide full characterization only for the last three recombination events along the pathway. In the general case where a 2k-cat is converted into a (2k−1)-torus knot (Proposition 4) and a (2k−1)-torus knot is converted into a (2k−2)-cat (Proposition 5), we can propose topological mechanisms by assuming that O is equivalent to a rational tangle or sum of rational tangles. In this way, we use tangle calculus to propose the most likely mechanisms; however, there is no guarantee that these are the only ones. The proofs for the following two propositions are analogous, and are given in SI Text S4.

Proposition 4.

Suppose N(O + P) = RH 2k-cat, N(O + R) = RH (2k − 1)-torus knot. Suppose that O is equivalent to rational or the sum of rational tangles. Then we show the following:

  • if P = (0) and R = (−1), then O = (2k);

  • if P = (0) and R = (0,0), then O = (2k,−1,0);

  • if P = (0) and R = (1), then O = (2k,−2,0).

Proposition 5.

Suppose N(O + P) = RH (2k − 1)-torus knot, N(O + R) = RH (2k − 2)-cat. Suppose that O is equivalent to rational or sum of rational tangles. Then we show the following:

  • if P = (0) and R = (−1), then O = (2k − 1);

  • if P = (0) and R = (0,0), then O = (2k − 1,−1,0);

  • if P = (0) and R = (1), then O = (2k − 1,−2,0).

The results of Propositions 4 and 5 are consistent with those of Propositions 1–3. In the general case, the shortest unlinking pathway takes a RH 2k-cat with parallel sites into a RH (2k − 1)-knot with sites in direct repeats, and then takes that knot into a RH (2k − 2)-cat with parallel sites (Theorems 1 and 2). Using Propositions 4 and 5, each of the recombination events above admits three possible topological mechanisms of action.

Discussion

Unification of the Individual Stepwise Unlinking Pathways.

We have shown that for each recombination step in the unlinking pathway there are three corresponding topological mechanisms of action for the enzyme. These mechanisms can be interpreted as three views of the same 3D object (Figs. 4 and 5). The mechanisms computed for each recombination step can be clustered together to propose three stepwise unlinking mechanisms (Fig. 4). Notice that each row in Fig. 4 illustrates the topological mechanism for the last three steps of XerCD–FtsK DNA unlinking. In refs. 15 and 17 it was proposed that the topological mechanism with P = (0) with sites in parallel alignment and R = (−1) was the most consistent with the experimental data. The mechanism proposed in ref. 17 corresponds to the pathway in Fig. 4, row 1 (and is revisited in SI Text S5 and Fig. S5). The other two pathways (rows 2 and 3) are obtained by rigid rotation of the pathway in row 1, and correspond to P = (0) antiparallel and R = (0,0), and to P = (0) parallel and R = (1), respectively.

Fig. 4.

Fig. 4.

Last three steps of DNA unlinking by XerCD–FtsK. At the top are the last three steps in the DNA unlinking pathway. Each of the three rows below shows one of the three possible topological mechanisms by which a trefoil is converted to a 2-cat, a 2-cat to an unknot, and an unknot to an unlink. Locally, the first mechanism, where P = (0) parallel and R = (−1), is consistent with the mechanism proposed for XerCD at psi (17). The other two pathways (rows 2 and 3) can be obtained by rigid rotation of the pathway in row 1, and correspond to P = (0) antiparallel and R = (0,0), and to P = (0) parallel and R = (1), respectively. The rotation axes are m1 and m2, as in Fig. 5. These mechanisms are consistent with the three proposed for Xer at psi (15).

Fig. 5.

Fig. 5.

Local site alignment. Due to a high level of sequence homology, we here assume that the XerCD–DNA synapse can be modeled using the data for Cre–DNA. This diagram is based on the crystal structure of Cre–DNA from ref. 23, in which the recombinase core complex is slightly off-planar, so that the cores appear either parallel or antiparallel in different projections. To improve clarity, the angles are doubled in this diagram. Let m2 be the horizontal axis (dashed line), and let m1 be the vertical axis (solid line). When looking down m2, the two sites appear in antiparallel alignment, whereas when seen from the reader’s perspective, the sites are in parallel. A rigid left-hand rotation of the DNA conformation around the axis m1 will result in string IV crossing over string I (15). Then the sites appear in antiparallel alignment and one crossing is added to the domain. Further left-hand rotation around the m2 axis results in III crossing over II and the sites returning to a parallel alignment.

Tangle Analysis of Iterative Recombination.

Because the recombination event involves two separate pairs of strand exchanges and proceeds through a Holliday Junction (HJ) Intermediate, there must be a step resetting the synapse between recombination events to maintain the same recombination mechanism at each event (i.e., if XerD is always to exchange the first pair of strands to form the HJ which is resolved by XerC, as shown in ref. 8). There is little evidence of the ability of XerCD and other tyrosine recombinases to act processively (i.e., bind and recombine a single pair of recombination sites multiple times before dissociating). However, in refs. 3, 4, and 24 the authors report an action of XerCD recombination consistent with iterative cleavage and strand exchange. If the recombination reaction is assumed to be iterative, and is modeled as processive (9, 25), then it can be proven that the tangle O is rational, and therefore all solutions are computable.

3D Interpretation of the Tangle Solutions.

In the tangle method, P is defined as a very small ball containing the cleavage core regions of the recombination sites. In this case, the orientation of the recombination sites is inherited into the tangle P. Two sites in P = (0) are in parallel alignment if both arrows point in the same direction in the tangle diagram; otherwise they are in antiparallel alignment. However, the concept of parallel and antiparallel alignment represents a local geometric property of the sites and is well-defined only in the tangle diagram (a planar projection of the 3D tangle). For the same 3D tangle, one can always obtain a planar projection with parallel sites and another with antiparallel sites. The only exception to this statement is when two sites are coplanar, in a strict mathematical sense. In ref. 15 the authors proposed a unique 3D topological mechanism for XerCD at psi that incorporated all three solutions to the tangle equations. Given a solution where O is rational, P = (0) and R= (1) or R = (−1), one can obtain a solution S′ equivalent to S by rotating the synapse P = (0) to P = (0,0). In the case of XerCD at psi, performing this simple transformation on the two solutions for P parallel yields the unique solution for P antiparallel. Similar arguments have been proposed for the recombinase TnP1-IRS (11) and for XerCD–FtsK-dif (4). Here, we extend these arguments and apply them to each recombination step in the XerCD–FtsK-dif unlinking pathway (Figs. 4 and 5). The observed geometrical equivalence, obtained by rigid motion, between the three solutions for one of the recombination steps suggests that a unique 3D representation of the synaptic complex and topological mechanism of the enzymes can be interpreted as three different tangle solutions when viewed from different spatial directions. Computer visualization methods, combined with molecular modeling software such as PyMol and with X-ray crystallographic data, can be applied to realize these spatial equivalences and to propose a 3D molecular model for the enzymatic action, as was done in ref. 15. An animation generated with Knotplot (www.knotplot.com) is presented in Movie S1 to illustrate the transition from the 6-cat to the 5-torus knot. These studies indicate a limitation of the tangle model and suggest the need to consider equivalence classes of planar tangle diagrams related by 3D rigid motion (rotations and translations).

Conclusions

We have presented a thorough mathematical analysis of the process of DNA unlinking by XerCD–FtsK. Based on the experimental data of ref. 4, we first prove that the shortest pathway of unlinking a substrate consisting of 2m-cats with sites in parallel orientation has exactly 2m steps. We then show that if we assume that recombination reduces the complexity of the substrate at each step, as suggested by the experimental data, then the shortest pathway is unique and every recombination intermediate is a torus knot or link. We analyze each recombination step and find three topological mechanisms at each step. The mechanisms can be unified in two different ways: (1) At each recombination step, the obtained mechanisms can be interpreted as different projections of the same 3D object, as illustrated in Figs. 4 and 5 and in the animation in Movie S1; or (2) The obtained mechanisms can be sorted by the local action of the enzyme (specified by P, R, and the relative orientation of the sites). This unification produces three separate mechanisms of stepwise unlinking consistent with the unique, shortest pathway previously reported.

Mathematical Methods

Knot theory is the study of non–self-intersecting curves in 3D space, and of the spaces defined by these curves. A knot is defined as a non–self-intersecting circular path in space. A link (or catenane) is composed of two or more such curves, which can be intertwined. If the path can be laid flat without any over- or undercrossings, then K is the unknot or the trivial knot. Likewise, the unlink, or trivial link, is the disjoint union of two unknotted circles. Knots and links can be studied through their diagrams, i.e., planar projections that distinguish between under- and overcrossings. Two knots or links K1 and K2 are equivalent if K1 can be smoothly deformed into K2 without breaking the chain. We indistinguishably denote the knot/link, or its equivalence class, by K. Knots and links are classified according to their crossing numbers. The crossing number of K is the minimum number of crossings taken over all projections of all elements in the equivalence class K. Although finer classifications are attempted, it is generally difficult to determine when two knots/links are topologically identical (26).

A tangle is a pair consisting of a ball and two strings in it. Fig. 2B illustrates the four simplest tangles, called trivial tangles. Rational tangles are obtained by intertwining the two strings in a trivial tangle. All tangles in Fig. 6 are rational. Rational tangles most often appear in the mathematical analysis of site-specific recombination. There is a one-to-one correspondence between the set of rational tangles and the set of extended rational numbers (i.e., the union of the set of rational numbers and the infinity 1/0) (27). A rational tangle can be expressed using a vector representation, called the Conway vector. Conway vectors are illustrated in Fig. 6. Notice that the same rational tangle can admit different vector representations. The tangle method uses two operations: tangle addition A + B; and numerator N(A) (Fig. 6C). A + B is a tangle, whereas N(A) is a knot or a link. If A is rational then N(A) belongs to the well-studied family of 4-plat knots or links (28).

Fig. 6.

Fig. 6.

(A) Two rational tangles and their Conway vectors. (B) Tangle addition of rational tangles A = (−5,0) and B = (−1). A + B is equivalent to the tangles ((\x{2212}5,\x{2212}1) and (6,\x{2212}1,0)). (C) N(A + B) is a 6-crossing RH torus link (RH 6-cat).

The tangle method uses biologically reasonable assumptions to model a site-specific recombination event as a system of two or more tangle equations on three unknowns O, P, and R. If P = (0), R = (w) or (w,0) for some integer w, then the system can be solved for O rational, sums of rational tangles (9, 13). The solution set can be very large. To obtain the most biologically relevant solutions we often resort to assumptions where P = (0) and R = (k) or (k,0). These assumptions are backed by biological data as detailed in ref. 15. Computing the solutions is not mathematically challenging but can be very tedious. Mathematical software is available to make the method more accessible to the scientific community (29, 30). TangleSolve (http://ewok.sfsu.edu/TangleSolve) is a stand-alone Java program and web-based applet with a user-friendly interface for analyzing and visualizing site-specific recombination mechanisms using the tangle method (29).

Using arguments from low-dimensional topology, O, P, and R can sometimes be shown to be equivalent to rational or sums of two rational tangles (9, 11, 1316, 25). In these cases one concludes that, under the assumptions of the tangle method, the solutions computed are the only possible solutions to a given system of equations. Biologically this implies that the tangle method computes all possible topological mechanisms of action for the enzyme. In some cases reasonable assumptions can be made about P and R to limit the number of solutions to a small finite number (14, 15). Mathematical analyses are often useful to characterize topological mechanisms of recombination (4, 10, 11, 1316, 25, 3134). A preliminary tangle analysis of Xer unlinking is given in ref. 4.

Supplementary Material

Supporting Information

Acknowledgments

The authors thank Rob Scharein for help with some of the figures and Movie S1, which were generated using Knotplot (www.knotplot.com), and Barbara Ustanko for editorial assistance with this manuscript. The authors also thank the referees for their careful review of the manuscript. M.V., K.S., and K.I. thank the Institute of Mathematics and Its Applications for its hospitality and support. This research was supported by the following: Japan Society for the Promotion of Science KAKENHI 22540066 and 25400080 (to K.S.); National Science Foundation DMS0920887 and CAREER Grant DMS1057284 (to M.V.); UK Engineering and Physical Sciences Research Council EP/H031367 (to K.I.); Australian Research Council FT120100153 and NHMRC APP1005697 (to I.G.); and Wellcome Trust WT099204AIA (to D.J.S.).

Footnotes

The authors declare no conflict of interest.

This article is a PNAS Direct Submission. D.W.S. is a guest editor invited by the Editorial Board.

See Commentary on page 20854.

This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1308450110/-/DCSupplemental.

References

  • 1.Espeli O, Levine C, Hassing H, Marians KJ. Temporal regulation of topoisomerase IV activity in E. coli. Mol Cell. 2003;11(1):189–201. doi: 10.1016/s1097-2765(03)00013-3. [DOI] [PubMed] [Google Scholar]
  • 2.Schvartzman JB, Stasiak A. A topological view of the replicon. EMBO Rep. 2004;5(3):256–261. doi: 10.1038/sj.embor.7400101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Ip SC, Bregu M, Barre FX, Sherratt DJ. Decatenation of DNA circles by FtsK-dependent Xer site-specific recombination. EMBO J. 2003;22(23):6399–6407. doi: 10.1093/emboj/cdg589. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Grainge I, et al. Unlinking chromosomes catenanes in vivo by site-specific recombination. EMBO J. 2007;26(19):4228–4238. doi: 10.1038/sj.emboj.7601849. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Sadowski PD. Site-specific genetic recombination: Hops, flips, and flops. FASEB J. 1993;7(9):760–767. doi: 10.1096/fasebj.7.9.8392474. [DOI] [PubMed] [Google Scholar]
  • 6.Hallet B, Sherratt DJ. Transposition and site-specific recombination: Adapting DNA cut-and-paste mechanisms to a variety of genetic rearrangements. FEMS Microbiol Rev. 1997;21(2):157–178. doi: 10.1111/j.1574-6976.1997.tb00349.x. [DOI] [PubMed] [Google Scholar]
  • 7.Barre FX, Sherratt DJ. Chromosome dimer resolution. In: Higgins PN, editor. The Bacterial Chromosome. Washington: ASM; 2005. pp. 513–524. [Google Scholar]
  • 8.Grainge I, Lesterlin C, Sherratt DJ. Activation of XerCD-dif recombination by the FtsK DNA translocase. Nucleic Acids Res. 2011;39(12):5140–5148. doi: 10.1093/nar/gkr078. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Ernst C, Sumners DW. A calculus for rational tangles: Applications to DNA recombination. Math Proc Camb Philos Soc. 1990;108(3):489–515. [Google Scholar]
  • 10.Sumners DW, Ernst C, Spengler SJ, Cozzarelli NR. Analysis of the mechanism of DNA recombination using tangles. Q Rev Biophys. 1995;28(3):253–313. doi: 10.1017/s0033583500003498. [DOI] [PubMed] [Google Scholar]
  • 11.Zheng W, Galloy C, Hallet B, Vazquez M. The tangle model for site-specific recombination: A computer interface and the TnpI-IRS recombination system. In: Kawauchi A, editor. Knot Theory for Specific Objects, OCAMI Studies. Vol 1. Sakai, Japan: Osaka Municipal Universities Press; 2007. pp. 251–271. [Google Scholar]
  • 12.Arsuaga J, Diao Y, Vazquez M. 2007. DNA topology in recombination and chromosome organization. Mathematics of DNA Structure, Function and Interactions, The IMA Volumes in Mathematics and its Applications 150, eds CJ Benham et al. (Springer Science + Business Media, New York), pp 7–36.
  • 13.Ernst C, Sumners DW. Solving tangle equations arising in a DNA recombination model. Math Proc Camb Philos Soc. 1999;126(1):23–36. [Google Scholar]
  • 14.Darcy I. Biological distances on DNA knots and links: applications to Xer recombination. J. Knot Theory Ramifications. 2001;10(2):269–294. [Google Scholar]
  • 15.Vazquez M, Colloms SD, Sumners DW. Tangle analysis of Xer recombination reveals only three solutions, all consistent with a single three-dimensional topological pathway. J Mol Biol. 2005;346(2):493–504. doi: 10.1016/j.jmb.2004.11.055. [DOI] [PubMed] [Google Scholar]
  • 16.Darcy IK, Ishihara K, Medikonduri RK, Shimokawa K. Rational tangle surgery and Xer recombination on catenanes. Algebr. Geom. Topol. 2012;12:1183–1210. [Google Scholar]
  • 17.Colloms SD, Bath J, Sherratt DJ. Topological selectivity in Xer site-specific recombination. Cell. 1997;88(6):855–864. doi: 10.1016/s0092-8674(00)81931-5. [DOI] [PubMed] [Google Scholar]
  • 18.Alén C, Sherratt DJ, Colloms SD. Direct interaction of aminopeptidase A with recombination site DNA in Xer site-specific recombination. EMBO J. 1997;16(17):5188–5197. doi: 10.1093/emboj/16.17.5188. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Colloms SD, Alén C, Sherratt DJ. The ArcA/ArcB two-component regulatory system of Escherichia coli is essential for Xer site-specific recombination at psi. Mol Microbiol. 1998;28(3):521–530. doi: 10.1046/j.1365-2958.1998.00812.x. [DOI] [PubMed] [Google Scholar]
  • 20.Colloms SD. The topology of plasmid-monomerizing Xer site-specific recombination. Biochem Soc Trans. 2013;41(2):589–594. doi: 10.1042/BST20120340. [DOI] [PubMed] [Google Scholar]
  • 21.Hirasawa M, Shimokawa K. Dehn surgeries on strongly invertible knots which yield lens spaces. Proc Am Math Soc. 2000;128(11):3445–3451. [Google Scholar]
  • 22.Murasugi K. On a certain numerical invariant of link types. Trans Am Math Soc. 1965;117:387–422. [Google Scholar]
  • 23.Van Duyne GD. A structural view of Cre-loxP site-specific recombination. Annu Rev Biophys Biomol Struct. 2001;30:87–104. doi: 10.1146/annurev.biophys.30.1.87. [DOI] [PubMed] [Google Scholar]
  • 24.Gourlay SC, Colloms SD. Control of Cre recombination by regulatory elements from Xer recombination systems. Mol Microbiol. 2004;52(1):53–65. doi: 10.1111/j.1365-2958.2003.03962.x. [DOI] [PubMed] [Google Scholar]
  • 25.Vazquez M, Sumners DW. Tangle analysis of Gin recombination. Math Proc Camb Philos Soc. 2004;136(3):565–582. [Google Scholar]
  • 26.Murasugi K. Knot Theory and Its Applications. Boston: Birkhäuser; 2008. [Google Scholar]
  • 27.Conway JH. An enumeration of knots and links, and some of their algebraic properties. In: Leech J, editor. Computational Problems in Abstract Algebra. Oxford: Pergamon; 1970. pp. 329–358. [Google Scholar]
  • 28.Burde G, Zieschang H. Knots. Berlin: de Gruyter; 2003. [Google Scholar]
  • 29.Saka Y, Vázquez M. TangleSolve: Topological analysis of site-specific recombination. Bioinformatics. 2002;18(7):1011–1012. doi: 10.1093/bioinformatics/18.7.1011. [DOI] [PubMed] [Google Scholar]
  • 30.Darcy IK, Scharein RG. TopoICE-R: 3D visualization modeling the topology of DNA recombination. Bioinformatics. 2006;22(14):1790–1791. doi: 10.1093/bioinformatics/btl164. [DOI] [PubMed] [Google Scholar]
  • 31.Buck D, Flapan E. Predicting knot or catenane type of site-specific recombination products. J Mol Biol. 2007;374(5):1186–1199. doi: 10.1016/j.jmb.2007.10.016. [DOI] [PubMed] [Google Scholar]
  • 32.Darcy I, Luecke J, Vazquez M. Tangle analysis of difference topology experiments: Applications to a Mu protein-DNA complex. Algebr. Geom. Topol. 2009;9:2247–2309. [Google Scholar]
  • 33.Valencia K, Buck D. Predicting knot and catenane type of products of site-specific recombination on twist knot substrates. J Mol Biol. 2011;411(2):350–367. doi: 10.1016/j.jmb.2011.05.048. [DOI] [PubMed] [Google Scholar]
  • 34.Darcy IK, Vazquez M. Determining the topology of stable protein-DNA complexes. Biochem Soc Trans. 2013;41(2):601–605. doi: 10.1042/BST20130004. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Information
Download video file (4.7MB, mp4)

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES