Abstract
We theoretically study the integration of short viral DNA in a DNA braid made up by two entwined double-stranded DNA molecules. We show that the statistics of single integration events substantially differ in the straight and buckled, or plectonemic, phase of the braid and are more likely in the latter. We further discover that integration is most likely close to plectoneme tips, where the larger bending energy helps overcome the associated energy barrier and that successive integration events are spatio-temporally correlated, suggesting a potential mechanistic explanation of clustered integration sites in host genomes. The braid geometry we consider provides a novel experimental set-up to quantify integration in a supercoiled substrate in vitro, and to better understand the role of double-stranded DNA topology during this process.
Keywords: DNA modelling, topology, supercoiling, DNA integration, DNA braids
1. Introduction
Retroviruses, such as HIV, are common pathogens with a significant medical and economic impact on society [1]. In spite of decades of research, there is still a lot that is poorly understood regarding the mechanisms of retroviral infection [2]. One of the essential steps in the infection process is the integration of the viral DNA (vDNA) into the host genome. The integration step is mediated by the intasome, a nucleoprotein complex containing the viral enzyme integrase and vDNA ends, each one carrying a copy of the long terminal repeat [3,4]. Once both 3’-dinucleotides have been removed from the vDNA ends, the intasome together with the tDNA forms a target capture complex (TCC) whose integrase catalyses the strand transfer between the viral DNA ends and the tDNA, resulting in the strand transfer complex (STC). An outstanding challenge in this field is understanding how the three-dimensional organization of the tDNA in vivo affects these integration steps [5,6]. Importantly, it is by now well established that different families of retroviruses display different preferences for integration sites [7] and that the selection of the latter is non-random. Mechanistic understanding of integration site selection is important, because it can potentially inform more efficient cures against retroviruses, as well as new strategies for gene therapy (which often employs retroviruses as carriers to deliver desired genes into the nucleus).
In eukaryotes, DNA and histone proteins associate thereby assembling the chromatin fibre which, at short length scales (approx. 10−30 kb or 10–30 nm), is thought to assume heteromorphic conformations, such as one- or two-start helices [8–11]. At larger length scales (approx. 100 kb–1 Mbp or 100 nm) chromosome conformation capture (HiC) revealed the presence of ‘topologically associated domains’ [12–15]. At even larger scales (1 μm), chromosome territories emerge [16]. This hierarchical and non-random organization is thought to affect the site selection of integration events [17–19], yet quantitative assays dissecting the role of each of these levels of organization on the integration selection are largely lacking to date. Most of the existing in vitro biophysical assays investigate the effect of naked DNA features, such as flexibility [20] and intrinsic curvature [21].
Recent works have allowed the investigation of the relationship between tDNA organization and retrovirus integration. Prototype foamy viruses (PFVs) use one-dimensional (1D) diffusion mechanisms to enhance the search for integration sites [22] and it was found a greater (two- to fivefold) integration efficiency into supercoiled substrates with respect to torsionally relaxed ones [22,23]. Moreover, magnetic tweezers (MT) assays have been used to study the strand transfer time [22] and the stability of TCCs and STCs [23]. Interestingly, STCs were found to be stable even when large pulling forces were applied to the tDNA tethered between the two magnetic beads composing the MT set-up. This stability is also facilitated by the presence of auxiliary binding interfaces [23,24] allowing the viral intasome to bind the supercoiled tDNA while relaxing torsional stress. The preference for retroviruses to integrate into supercoiled tDNA is in line with the known fact that DNA substrates displaying pre-bent and distorted regions are preferentially targeted by lentiviruses [7,25], including HIV [20,21], and foamy viruses, such as PFV [7].
In spite of this, the precise location at which retroviruses integrate their genetic material as a function of mechanical stress is still debated. Here, we propose and theoretically analyse a different single-molecule biophysical assay. We consider two (torsionally unconstrained) dsDNA molecules whose ends are anchored to walls (or equivalently to macroscopic beads of a MT) and interwoven to form braids of fixed linking number, or catenation Ca (figure 1). This system is known to display a buckling transition at a critical catenation number Ca* which depends on the stretching force f, i.e. Ca* = Ca*(f) [26–30]. For Ca < Ca*(f), the braid is straight whereas for Ca > Ca*(f) the braid buckles and forms plectonemes that resemble those shown by single dsDNA molecules in MT assays [31–33].
Figure 1.

Retroviral integration in DNA braids. (a) Two chains representing two dsDNA molecules (one coloured red and the other blue to ease visualization) are tethered to two impenetrable walls. (b) The top wall is rotated n times to introduce a catenation between the dsDNA strands equal to Ca = n, while a force f is simultaneously applied to stretch the braid. (c) For Ca larger than a critical Ca*(f), the braid buckles and forms plectonemes (see electronic supplementary material, figure S1, for the corresponding phase diagram). (d) A snapshot from BD simulations of the system under investigation. (e,f) Example of an integration event, illustrated by successive snapshots in a BD simulation.
The strength of this simple approach is that, on top of simulating the dynamics of plectonemes in braided DNAs [30], we can monitor the precise integration site and the integration efficiency as a function of the supercoiling injected in the braid. Even if this set-up shows some differences with respect to a single supercoiled ds-DNA, its investigation should shed light on properties such as the preferred position of the integration sites, the statistics of repeated integration events and the interplay between supercoiling and integration that are hardly accessible in current experiments. As a result, our simulations can be used to build a mechanistic model for retroviral integration in supercoiled substrates. Additionally, single-molecule experiments studying integration in a DNA braid, which we study here, are in principle possible, and we hope they can be performed in the future to test our predictions.
2. Material and methods
Integration of vDNA into DNA braids is simulated via Brownian dynamics (BD) simulations and a sketch of the set-up is shown in figure 1. Two dsDNA molecules with 250 beads each (each bead of size σ represents 2.5 nm or 7.3 bp) are tethered to two walls, with the top wall being spun in order to interweave the molecules to form a braid characterized by a desired catenation number Ca, which is a key parameter in our model. The catenation number is kept constant throughout a simulation and a stretching force f is applied to the top wall (figure 1). A Kratky–Porod potential is used to impose a persistence length equal to lp = 20σ = 50 nm, which is typical for dsDNA [34]. Viral DNA is modelled as circular chains composed of 40 beads, or about 300 bp. This length is chosen so that it is larger than the persistence length of DNA but short enough to allow efficient and feasible simulations. In each simulation, we introduce 10 vDNA rings which can integrate into either of the two dsDNA molecules. The numerical density of rings in the simulation box ranges between 10−4σ−3 and 10−3σ−3: these low concentrations are chosen in order to minimize the probability of simultaneous integration events, and to be able to study correlations between consecutive integration events. The stochastic integration process is simulated by using a Monte Carlo algorithm: whenever a pair of bonds, one belonging to the tDNA braid and the other to the vDNA, are proximate in three dimensions, a swap between them is attempted in a single simulation step [19,35]. The attempted swap is then accepted or rejected based on the difference in energy between the initial and final conformations, i.e. according to a Metropolis scheme with probability p = exp(−(E′−E)/kBT), where kB is the Boltzmann constant, T the room temperature and E and E′ are the energy of the configurations respectively before and after the bond-swapping move. Additional details are given in the electronic supplementary material.
3. Results
3.1. Integration affects plectonemic dynamics in the buckled phase
For catenation numbers greater than the critical value Ca*(f), the system undergoes a buckling transition which lowers the total free energy by converting twist into writhe [27,29,36]. From now on, all simulations will be performed with a pulling force whose critical catenation is Ca* = 29. This parameter choice is representative of the general behaviour. A buckled state is characterized by the formation of plectonomes which can be identified by computing the instantaneous map of contacts between DNA segments in the same molecule (see figure S1 and electronic supplementary material for details about the algorithm). Plectoneme dynamics are rich, and encompass fusion, fission and 1D Ostwald ripening [30]. In this section, we study how viral integration qualitatively affects the dynamics of plectonemes and braided DNA in the buckled phase. More specifically, we analyse the plectoneme positions (figure 2a) and the end-to-end distance Z (figure 2b) over time, following multiple integration events. When the first integration occurs the length of the blue dsDNA molecule displays a sudden increase, in turn yielding a jump in the end-to-end distance Z (figure 2b). After each integration, the braid displays a slow relaxation to a new steady state in Z, which can be resolved if integration events are sufficiently spaced in time. Concomitantly with the increase in end-to-end distance, plectonemes decrease in number and size following integration. Thus, the simulation shown in figure 2 started with three stable plectonemes, but after three integrations none remained. The dynamical pathway through which plectonemes disappear is nontrivial. For instance, immediately after the first integration (between snapshots (1) and (2) in figure 2) a fusion and a hopping event result in the formation of a single larger plectoneme whose size is reduced when the second integration occurs. The reason why plectonemes disappear following successful integration events is that the total length of the braid L increases after each event, which reduces both the catenation density Ca/L and the ratio of d/L where d is the lateral spacing between the tethering points on the same wall. As shown in previous works [27,29], both these parameter changes drive the system away from the buckled phase.
Figure 2.
Effect of integrations on braid dynamics in the buckled state. (a) Kymograph showing the evolution of plectonemes (monitored on the red DNA, see electronic supplementary material) for a simulation with and Ca = 36. Yellow regions represent the segments inside plectonemes while black dots their boundaries. During this run four integration events are observed, three in the blue DNA and one in the red one. The red arrow indicates the moment and the position of the integration in the red strand. The numbers (1), (2), (3) and (4) refer to the top snapshots. Between (1) and (2) there is an integration inside a plectoneme, while between (3) and (4) another integration takes place in the braided part of the system. The grey background shows the step-wise increase in length of the red strand, which is initially equal to 250σ (see y axis) and it increases at time t ∼ 80000 τLJ when a ring integrates. (b) End-to-end Z distance versus time. Blue and red arrows show the moments of integration in the blue and red strand respectively. At each integration event, we observe a jump in Z.
The qualitative effects of integration on plectoneme and braid dynamics shown in figure 2 could be studied experimentally by using methods similar to those in [37], and we hope that similar studies will be performed in the future. We now proceed to perform a more quantitative investigation of the statistics of single and multiple integration events in our simulations.
3.2. Statistics of single integration events
In [22], the authors find that for PFV intasome on stretched single-molecule DNA, more than 97% of the search events do not generate successful integrations. This is in line with the fact that integration events need to significantly distort the DNA substrate [19,21] and hence must overcome a substantial energy barrier.
With our model, we can explore this process in more detail and on braided structures. In figure 3a, we report the trajectories of three vDNA rings for the simulation represented in figure 2 described in terms of their minimal distance dbr from the braid. It can be noted that, while ring1 integrates as soon as (the maximum distance at which integration may occur), ring3 never gets sufficiently close to the braid to attempt an integration (). Interestingly, ring2 displays a sufficient small distance dbr multiple times, but it never integrates. This highlights the stochastic nature of integration, which we now address more quantitatively by first analysing the statistics of single integration events. In particular, we are interested in understanding how the searching and integration efficiency of the vDNA rings depend on the braid configuration, and on whether the braid is in the straight or buckled phase. To quantify these features, we perform 150 independent simulations in which 10 vDNA rings can diffuse within the system and integrate into substrates with 22 ≤ Ca ≤ 38. We count the number of times any of these rings reaches a distance and identify this quantity as the number of integration attempts Nattem (in line with [22]). Denoting by Nint the number of successful integration events we compute the integration probability as the ratio P(int) = Nint/Nattem averaged over all trajectories. This is reported as a function of the catenation number Ca in figure 3b. The overall trend is that the integration probability increases with Ca. Note that at low catenation number just 10% of the vDNA rings approaching the braid actually integrate, while for higher Ca we observe up to 60% of successful integrations. This is in line with the experiments performed in [22] where authors observed that 90–97% of the times PFV intasome search events on stretched non-supercoiled DNA fail to produce successful integrations and that supercoiling generally increases this integration probability between two- and fivefold. The agreement is significant, given that we are considering a different system (braided DNA instead of single supercoiled double-stranded DNA) and that we are using a simple coarse-grained model for both tDNAs and vDNA rings. Intriguingly, we also find that the slope of P(int) versus Ca flattens in proximity of the critical value separating the straight and buckled phase (Ca* = 29), which may underlie a qualitative difference between the integration in the two phases.
Figure 3.
Statistics of single integrations. (a) Trajectory of three rings diffusing in the system corresponding to the kymograph in figure 2. The distance between the ring and the braid drb is monitored over time: ring1 integrates, ring2 gets close to the braid three times without integrating, while ring3 never gets close to the braid. (b) Plot of the integration probability against the catenation number. The grey background refers to Ca < Ca* (straight braid phase), while the orange one to Ca ≥ Ca* (buckled phase). It is interesting to note that the curve significantly changes its slope in proximity of Ca*. (c) Plot of the average integration time (i.e. the average time to the first successful integration) as a function of Ca. The higher the catenation number, the shorter the time needed to observe the first integration.
The catenation number Ca also strongly affects the time needed for integrations to take place. In figure 3c, we show that the average time to integrate into the braid for the first time, Tfi, is shorter for larger catenation numbers, again highlighting the fact that plectonemic supercoiling aids integration, making it kinetically faster as well as more likely. An explanation of the trends observed in figure 3b,c will be given in the next paragraphs.
3.3. Statistics of multiple integration events
Within our simulations, we can explore multiple rounds of integration events within the same braid, and therefore investigate cooperative effects and spatial correlations between subsequent integration events. That such correlations may exist in reality is suggested by the observations of clustered distributions of retroviral integration sites [38] and transposable elements [6] in the genome.
To quantify how an integration event affects the next one, we performed simulations for braids with catenation number 22 ≤ Ca ≤ 38 where we only allowed two rings to integrate. The results of these simulations show that for each Ca the probability that the two integrations occur in the same molecule is . Instead, in the case of random events, the first integration would yield a 40 σ increase in the strand length and the second ring would just have a ∼54% probability of integrating into the same strand.
By analysing the distance along the braid backbone separating the sites of two subsequent integration events, which we call Δfs, we find that the corresponding distributions are significantly different from those expected for random, uncorrelated, integration events, for which we expect a distance distribution given by P0(x) = (2(L − x))/L2 (figure 4). The curves in figure 4 do not depend appreciably on the exact values of Ca (data correspond to Ca = 22, 31, 36), and all show a maximum for intermediate distances, i.e. for Δfs ∼ 50σ. We interpret this effect as due to the transient creation of a writhing segment in proximity of the first integration; this effectively favours a second integration in close proximity to the first before the excess in contour length relaxes.
Figure 4.

Statistics of multiple integrations. Probability distribution for Δfs—the distance along the braid between sites of successive integrations—for three different values of Ca compared to the case in which integrations occur randomly along the polymer, i.e. with a probability P0(x) = (2(L−x))/L2. For the case Ca = 36, we report also the errorbars. We observe a maximum for intermediate values of Δfs, suggesting that there is a favoured typical spatial separation between successive integration events. As a convention, if the integration breaks the bond between the beads with ID number id1 < id2, we identify the integration site with id1. Inset: average of the contour distance Δfs between successive integrations for the same three values of Ca used in the main figure.
3.4. The spatio-temporal pattern of bending energy explains the integration statistics
Integration events in our model are associated with an energy barrier, which is related at least in part to the need to locally bend the DNA molecule where the retrovirus gets incorporated [19] (see electronic supplementary material, figure S3). We therefore ask to what extent we can understand the nontrivial integration statistics described in the two previous sections by analysing the spatio-temporal pattern of bending within the braid, as this should determine the energy barrier along the latter. To answer this question, we first monitor the total bending energy (per bead) of each braided DNA molecule, which can be computed as
| 3.1 |
where θi is the angle between two consecutive pairs of beads having bead i in common. In figure 5a, we report the value of Eb(x) for the trajectory shown in figure 2. Note that, in response to the first integration occurring on one molecule (labelled ‘blue’) the bending energy of the second molecule, Eb(red), drops markedly, indicating that the second dsDNA molecule (labelled ‘red’) becomes straighter. Similarly, an integration on the red strand produces a sudden drop of the bending energy of the blue counterpart, Eb(blue). These findings indicate that integrations within one dsDNA render the molecule effectively floppier and hence easier to deform further. This explains the observed persistence of multiple integrations on the same molecule: the first integration breaks the symmetry between the two DNAs thereby reducing the effective persistence length of the one where integration took place (figure 5a). Thus, it is natural to expect that a second integration will preferably occur in the same, now more flexible, DNA where the elastic energy barrier to overcome is smaller [19]. It is worth noting that this result holds for a system composed by a dsDNA braid and it cannot be compared directly to MT experiments performed on a single dsDNA molecule. In the latter case, indeed, a retroviral integration cuts both strands of the dsDNA molecule leading to supercoiling relaxation. Consequently, the flexibility of the molecule decreases, as the latter is tethered between the two magnetic beads [23]. Since variations in the bending energy favour multiple integrations within the same strand, it is of interest to ask whether they can also affect the specific location (along the backbone) where integrations occur. To see whether this is the case, we compute the average bending energies for the different geometrical motifs present in the system: the braids, the plectonemes and their tips. Note that in the straight braid phase only the braided part is present, while in the buckled phase a typical configuration can be described as an alternating sequence of braids, plectonemes and tips (figure 1). In figure 5b, we show the distributions of the bending energy for the three motifs: as expected, the straight braids have the smallest bending energy, plectonemes carry intermediate values whereas the largest bending energy is stored at the tips. In light of what seen above, it is natural to expect that integrations may occur more likely in regions with higher bending energies (i.e. more prone to thermal deformations) and so in plectonemes and tips. To verify this prediction, we classify all integration events according to the region in the braid where they occur, and report the results in figure 5c: the data show that the integration probability within tips is the largest, while integrations into braids or plectonemes have a similar probability. It is important to note that this analysis identifies a tip as the five beads surrounding the centre of mass of a plectoneme. This is just an arbitrary choice as there is not an objective way to define tips. Nonetheless, it might happen that in different set-ups (i.e. different Ca) the tip size changes producing the presence of peaks in the green curve of figure 5c.
Figure 5.
Bending energy and integration sites. (a) Bending energies per bead for the same run reported in figure 2. Blue and red curves are bending energies Eblue, Ered of the blue and red DNA, respectively. Blue and red arrows indicate the moments of integration events. (b) Histograms of local bending energies per bead in the braided part (blue), plectonemic part (red) and plectonemic tips (green). (c) Integration probabilities in the braided part (blue), plectenemic part (red) and plectonemic tips (green), normalized by the number of beads involved in each of the three motifs, versus Ca. (d) Distribution of Δit, the distance between the integration site and the tip of the closest plectoneme, for two values of the catenation number Ca = 31 and Ca = 36. In both cases there is a peak close to zero, indicating that rings prefer to integrate near plectonemic tips. Inset in (a) represents a zoom, while insets in (b,d) report the average values of the quantity plotted in the corresponding panel.
The preference for tip integration can alternatively be quantified by analysing the distribution of the distance between the integration sites and the nearest plectoneme tip, which we call Δit (figure 5d). The data show that the larger the value of the catenation number, the stronger the preference tends to be. We can further rationalize the statistical preference for integration at plectoneme tips by computing the local curvature of a tip of length l as a function of temperature T and external stretching force f. This can be done by minimizing the free energy
| 3.2 |
with respect to l. In equation (3.2), the first term is the elastic bending energy required to bend the tip into a tear-drop shape (ζ = 16) while the second term accounts for the fact that the tip does not feel the effect of the stretching force. Minimization of the free energy gives and a local curvature [29]. This means that higher forces and smaller persistence lengths (or even mismatches and defects [32,33,39]) increase the curvature of the tip and hence the probability of integrations within this motif.
The preference for integration within the tip might also qualitatively explain the entropic selection mechanism for the separation between successive integration events (figure 4). Indeed, immediately following an integration event, the region of the braid where this has occurred has extra uncompensated contour length in one of the DNAs, which does not feel the tension from the stretching force before the braid reorganizes locally. Because stretching suppresses writhing, the transient absence of stretching will favour local writhing and this writhe excess might, in turn, attract retroviral DNA during the timescale of writhe relaxation, hence causing the observed cooperativity between successive integration events (see electronic supplementary material, movie S1). The role that the local bending energy profile has in the integration process can also account for the statistics of single and multiple integration events reported in figure 3. For instance, the monotonic increase in successful integrations versus attempts, Nint/Nattem, with Ca can be explained by noting that increasing the catenation number leads to a linear increase in the average amount of writhe in the braid, given the decomposition Ca = Tw + Wr [40]. In turn, as the increased writhe translates into an increase in local DNA bending energy (see electronic supplementary material, figure S3), it also leads to an increase in the integration probability and to a decrease in integration time (as seen in figure 3b,c).
The model considered here does not include a 1D diffusing sliding mechanism (i.e. along the backbones of the two chains) as experimentally observed in [22]. However, the details of the diffusive process—whether it is three-dimensional diffusion in the bulk, 1D diffusion along the braid, or a combination of the two as in facilitated diffusion—should not change either the integration probability Pint (figure 3b) or the relative probabilities of having integrations in tips, plectonemes and braided parts (figure 5c) as long as we use the same simulation parameters. In line with these considerations, we also expect that 1D diffusive sliding mechanism could reduce the integration time Tfi at fixed catenation number, but should leave the general behaviour in figure 3b,c unaltered.
4. Conclusion
To summarize, in this work, we used BD simulations to study the integration of viral DNA in a DNA braid made up by two interwoven double-stranded DNA molecules. With these simulations we could study the relationship between supercoiling and integration efficiency, investigating the statistics of the exact integration site. We have found that integration events in buckled braids qualitatively affect the dynamics of plectonemes, because they modify the length of one or both the polymers in the braid, pushing the system away from the buckled phase dynamically (figure 2). Such qualitative effects should be experimentally testable by microscopy (e.g. by using similar methods as in [37]), or via optical tweezer experiments measuring the braid extension as a function of time.
We have also quantified the statistics of single and multiple integration events in the braid. Our results show that integration is facilitated by supercoiling and writhing, as the integration probability increases in the buckled phase, and the typical integration time concomitantly decreases sharply. Within the buckled phase, integration is also favoured close to the tips of plectonemes. Pleasingly, these results are quantitatively in line with those found experimentally for integration in single supercoiled DNA substrates [22]. An analysis of the pattern of local bending energies along the braid for different values of the catenation number shows that these determine in large part the observed pattern of integration sites. This is because integration requires DNA bending, hence larger local bending energies correlate with lower energy barriers opposing integration (and hence with higher integration probabilities). Thus, the buckled phase has a larger stored bending energy with respect to the straight braid phase, and this is why integration in the former is more likely (and faster). Additionally, the largest bending energy in a braid is stored at plectoneme tips and we predict that retroviruses tend to integrate there more often than at another random position in the braid.
Finally, we predict that successive integration events should be correlated. First, integration in one of the two molecules in a braid breaks the symmetry in the system, and favours successive integrations in the same molecule. Second, in the buckled phase, we have found that there is an enhanced probability for the second integration event to occur at a typical distance from the site of first integration. This is because the first integration event promotes the local increase of writhe. We thus argue that it is more energetically favourable for the second vDNA to integrate in the proximity of the first, while the writhe is relaxing and redistributing along the chain (see also electronic supplementary material, movie S1). Our results, that should be testable with single molecule experiments, provide a first indication of how retroviral integration events are affected by the presence of plectonemic structures in the substrate. Moreover, it would be interesting to perform simulations that include more details about the biology of retrovirus integration. For instance, one can investigate the role played by auxiliary binding interfaces [23,24] to better understand the dynamics of STCs.
Acknowledgements
G.F., E.O. and D.Mi. acknowledge the networking support by the Cooperation in Science and Technology (COST) action ‘European Topology Interdisciplinary Action’ (EUTOPIA) CA17139.
Contributor Information
G. Forte, Email: G.Forte@sms.ed.ac.uk.
E. Orlandini, Email: orlandini@pd.infn.it.
Data accessibility
The data generated to produce the figures in this paper and computer codes can be found at https://git.ecdf.ed.ac.uk/dmichiel/integrations_in_braids.
Authors' contributions
E.O. and G.F. conceived the project. G.F. performed simulations. G.F. and D.Mi. analysed the data, and all authors wrote the paper.
Competing interests
We declare we have no competing interests.
Funding
D.Mi. is a Royal Society University Research Fellow and is supported by the ERC (Starting Grant, Topologically Active Polymers, Ref. 947918).
References
- 1.Dixon S, McDonald S, Roberts J. 2002The impact of HIV and AIDS on Africa’s economic development. Br. Med. J. 324, 232-234. ( 10.1136/bmj.324.7331.232) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Craigie R, Bushman FD. 2012HIV DNA integration. Cold Spring Harb. Persp. Med. 2, a006890. ( 10.1101/cshperspect.a006890) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Bushman FD, Fujiwara T, Craigie R. 1990Retroviral DNA integration directed by HIV integration protein in vitro. Science 249, 1555-1558. ( 10.1126/science.2171144) [DOI] [PubMed] [Google Scholar]
- 4.Lesbats P, Engelman AN, Cherepanov P. 2016Retroviral DNA integration. Chem. Rev. 116, 12 730-12 757. ( 10.1021/acs.chemrev.6b00125) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Lusic M, Siliciano RF. 2017Nuclear landscape of HIV-1 infection and integration. Nat. Rev. Microbiol. 15, 69-82. ( 10.1038/nrmicro.2016.162) [DOI] [PubMed] [Google Scholar]
- 6.Bousios A, Nuetzmann H-W, Buck D, Michieletto D. 2020Integrating transposable elements in the 3D genome. Mob. DNA 11, 1-10. ( 10.1186/s13100-020-0202-3) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Kvaratskhelia M, Sharma A, Larue RC, Serrao E, Engelman A. 2014Molecular mechanisms of retroviral integration site selection. Nucleic Acids Res. 42, 10 209-10 225. ( 10.1093/nar/gku769) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Kruithof M, Chien F-T, Routh A, Logie C, Rhodes D, van Noort J. 2009Single-molecule force spectroscopy reveals a highly compliant helical folding for the 30-nm chromatin fiber. Nat. Struct. Mol. Biol. 16, 534-540. ( 10.1038/nsmb.1590) [DOI] [PubMed] [Google Scholar]
- 9.Collepardo-Guevara R, Schlick T. 2014Chromatin fiber polymorphism triggered by variations of DNA linker lengths. Proc. Natl Acad. Sci. USA 111, 8061-8086. ( 10.1073/pnas.1315872111) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Garcés R, Podgornik R, Lorman V. 2015Antipolar and anticlinic mesophase order in chromatin induced by nucleosome polarity and chirality correlations. Phys. Rev. Lett. 114, 238102. ( 10.1103/physrevlett.114.238102) [DOI] [PubMed] [Google Scholar]
- 11.Buckle A, Brackley CA, Boyle S, Marenduzzo D, Gilbert N. 2018Polymer simulations of heteromorphic chromatin predict the 3D folding of complex genomic loci. Mol. Cell 72, 786-797. ( 10.1016/j.molcel.2018.09.016) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Rao SSPet al.2014A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 159, 1665-1680. ( 10.1016/j.cell.2014.11.021) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Jost D, Carrivain P, Cavalli G, Vaillant C. 2014Modeling epigenome folding: formation and dynamics of topologically associated chromatin domains. Nucleic Acids Res. 42, 9553-9563. ( 10.1093/nar/gku698) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Michieletto D, Orlandini E, Marenduzzo D. 2016Polymer model with epigenetic recoloring reveals a pathway for the de novo establishment and 3D organization of chromatin domains. Phys. Rev. X 6, 041047. ( 10.1103/physrevx.6.041047) [DOI] [Google Scholar]
- 15.Nuebler J, Fudenberg G, Imakaev M, Abdennur N, Mirny LA. 2018Chromatin organization by an interplay of loop extrusion and compartmental segregation. Proc. Natl Acad. Sci. USA 115, E6697-E6706. ( 10.1073/pnas.1717730115) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Cremer T, Cremer C. 2001Chromosome territories, nuclear architecture and gene regulation in mammalian cells. Nat. Rev. Genet. 2, 292-301. ( 10.1038/35066075) [DOI] [PubMed] [Google Scholar]
- 17.Marini Bet al.2015Nuclear architecture dictates HIV-1 integration site selection. Nature 521, 227-231. ( 10.1038/nature14226) [DOI] [PubMed] [Google Scholar]
- 18.Lucic Bet al.2019Spatially clustered loci with multiple enhancers are frequent targets of HIV-1 integration. Nat. Commun. 10, 4059. ( 10.1038/s41467-019-12046-3) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Michieletto D, Lusic M, Marenduzzo D, Orlandini E. 2019Physical principles of retroviral integration in the human genome. Nat. Commun. 10, 575. ( 10.1038/s41467-019-08333-8) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Pruss D, Reeves R, Bushman F, Wolffe A. 1994The influence of DNA and nucleosome structure on integration events directed by HIV integrase. J. Biol. Chem. 269, 25 031-25 041. ( 10.1016/S0021-9258(17)31494-1) [DOI] [PubMed] [Google Scholar]
- 21.Pasi Met al.2016DNA minicircles clarify the specific role of DNA structure on retroviral integration. Nucleic Acids Res. 44, 7830-7847. ( 10.1093/nar/gkw651) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Jones ND, Lopez MA Jr, Hanne J, Peake MB, Lee J-b, Fishel R, Yoder KE. 2016Retroviral intasomes search for a target DNA by 1D diffusion which rarely results in integration. Nat. Commun. 7, 11409. ( 10.1038/ncomms11409) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Vanderlinden W, Brouns T, Walker PU, Kolbeck PJ, Milles LF, Ott W, Nickels PC, Debyser Z, Lipfert J. 2019The free energy landscape of retroviral integration. Nat. Commun. 10, 4738. ( 10.1038/s41467-019-12649-w) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Maskell DPet al.2015Structural basis for retroviral integration into nucleosomes. Nature 523, 366-369. ( 10.1038/nature14495) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Serrao E, Krishnan L, Shun MC, Li X, Cherepanov P, Engelman A, Maertens GN. 2014Integrase residues that determine nucleotide preferences at sites of HIV-1 integration: implications for the mechanism of target DNA binding. Nucleic Acids Res. 42, 5164-5176. ( 10.1093/nar/gku136) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Stone MD, Bryant Z, Crisona NJ, Smith SB, Vologodskii A, Bustamante C, Cozzarelli NR. 2003Chirality sensing by Escherichia coli topoisomerase IV and the mechanism of type II topoisomerases. Proc. Natl Acad. Sci. USA 100, 8654-8659. ( 10.1073/pnas.1133178100) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Charvin G, Vologodskii A, Bensimon D, Croquette V. 2005Braiding DNA: experiments, simulations, and models. Biophys. J. 88, 4124-4136. ( 10.1529/biophysj.104.056945) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Charvin G, Bensimon D, Croquette V. 2003Single-molecule study of DNA unlinking by eukaryotic and prokaryotic type-II topoisomerases. Proc. Natl Acad. Sci. USA 100, 9820-9825. ( 10.1073/pnas.1631550100) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Brahmachari S, Marko JF. 2017Torque and buckling in stretched intertwined double-helix DNAs. Phys. Rev. E 95, 052401. ( 10.1103/PhysRevE.95.052401) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Forte G, Caraglio M, Marenduzzo D, Orlandini E. 2019Plectoneme dynamics and statistics in braided polymers. Phys. Rev. E 99, 52503. ( 10.1103/PhysRevE.99.052503) [DOI] [PubMed] [Google Scholar]
- 31.Strick TR, Allemand JF, Bensimon D, Bensimon A, Croquette V. 1996The elasticity of a single supercoiled DNA molecule. Science 271, 1835-1837. ( 10.1126/science.271.5257.1835) [DOI] [PubMed] [Google Scholar]
- 32.Dittmore A, Brahmachari S, Takagi Y, Marko JF, Neuman KC. 2017Supercoiling DNA locates mismatches. Phys. Rev. Lett. 119, 147801. ( 10.1103/PhysRevLett.119.147801) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Achari S, Dittmore A, Takagi Y, Neuman KC, Marko JF. 2018Defect-facilitated buckling in supercoiled double-helix DNA. Phys. Rev. E 97, 022416. ( 10.1103/PhysRevE.97.022416) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Calladine CR, Drew H, Luisi FB, Travers AA, Bash E. 1997Understanding DNA: the molecule and how it works, vol. 1. San Diego, CA: Elsevier Academic Press.
- 35.Sides SW, Grest GS, Stevens MJ, Plimpton SJ. 2004Effect of end-tethered polymers on surface adhesion of glassy polymers. J. Polym. Sci. Part B Polym. Phys. 42, 199. ( 10.1002/polb.10672) [DOI] [Google Scholar]
- 36.Brahmachari S, Gunn KH, Giuntoli RD, Mondragón A, Marko JF. 2017Nucleation of multiple buckled structures in intertwined DNA double helices. Phys. Rev. Lett. 119, 188103. ( 10.1103/PhysRevLett.119.188103) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.van Loenhout MT, de Grunt M, Dekker C. 2012Dynamics of DNA supercoils. Science 338, 94-97. ( 10.1126/science.1225810) [DOI] [PubMed] [Google Scholar]
- 38.Wang GP, Ciuffi A, Leipzig J, Berry CC, Bushman FD. 2007HIV integration site selection: analysis by massively parallel pyrosequencing reveals association with epigenetic modifications. Genome Res. 17, 1186-1194. ( 10.1101/gr.6286907) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Irobalieva RNet al.2015Structural diversity of supercoiled DNA. Nat. Commun. 6, 8440. ( 10.1038/ncomms9440) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Fuller FB. 1978Decomposition of the linking number of a closed ribbon: a problem from molecular biology. Proc. Natl Acad. Sci. USA 75, 3557-3561. ( 10.1073/pnas.75.8.3557) [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The data generated to produce the figures in this paper and computer codes can be found at https://git.ecdf.ed.ac.uk/dmichiel/integrations_in_braids.



