A Polymer Model for the Quantitative Reconstruction of Chromosome Architecture from HiC and GAM Data

Guillaume Le Treut; François Képès; Henri Orland

doi:10.1016/j.bpj.2018.10.032

. 2018 Nov 10;115(12):2286–2294. doi: 10.1016/j.bpj.2018.10.032

A Polymer Model for the Quantitative Reconstruction of Chromosome Architecture from HiC and GAM Data

Guillaume Le Treut ^1,^∗, François Képès ², Henri Orland ^3,⁴

PMCID: PMC6301988 PMID: 30527448

Abstract

It is widely believed that the folding of the chromosome in the nucleus has a major effect on genetic expression. For example, coregulated genes in several species have been shown to colocalize in space despite being far away on the DNA sequence. In this manuscript, we present a new, to our knowledge, method to model the three-dimensional structure of the chromosome in live cells based on DNA-DNA interactions measured in high-throughput chromosome conformation capture experiments and genome architecture mapping. Our approach incorporates a polymer model and directly uses the contact probabilities measured in high-throughput chromosome conformation capture experiments and genome architecture mapping experiments rather than estimates of average distances between genomic loci. Specifically, we model the chromosome as a Gaussian polymer with harmonic interactions and extract the coupling coefficients best reproducing the experimental contact probabilities. In contrast to existing methods, we give an exact expression of the contact probabilities at thermodynamic equilibrium. The Gaussian effective model reconstructed with our method reproduces experimental contacts with high accuracy. We also show how Brownian dynamics simulations of our reconstructed Gaussian effective model can be used to study chromatin organization and possibly give some clue about its dynamics.

Introduction

Although the chromosome has been classically seen as the carrier of the genetic information, there has been increasing evidence that its folding is a determinant of genetic regulation (1, 2). In particular, coexpressed genes were found to be more often in contact than unrelated genes (3, 4, 5), and the epigenetic state of the chromatin was shown to be related to its folding (6). The advent of chromosome conformation capture (3C) experiments has provided unprecedented insights on chromosome architecture in live cells (7), and the combination of 3C techniques with high-throughput sequencing methods (high-throughput chromosome conformation capture experiments; Hi-C) has enabled the measurement of contacts between thousands of loci on the chromosome. Extensive Hi-C data have now been generated for several eukaryotic cells including human (8, 9), yeast (10), and fly (11) but also bacteria (12, 13, 14). In eukaryotes, the patterns observed in contact matrices generated from Hi-C experiments have revealed a high-level organization in sub-megabasepair topologically associated domains (15, 16). This organization displays significant changes throughout the cell cycle (17) but also during cell differentiation (18) and in the context of cell pluripotency (19) or cell senescence (20). More recently, the genome architecture mapping (GAM) technique was developed, representing an alternative way to measure interactions between chromosomal loci (21). Its application to mouse embryonic stem cells confirmed that actively transcribed genes sometimes separated by large genomic distances were more often in contact. Based on these experimental findings, several studies have suggested that chromosome architecture and genetic expression are intimately connected (22, 23, 24, 25, 26, 27, 28).

Several methods have been proposed to reconstruct the chromosome folding from Hi-C data (see Supporting Materials and Methods, Section 2 for a short review). A first class of models aimed at reconstructing chromosome configurations such that the distances d_ij between chromosomal loci take prescribed values, inferred from the Hi-C contact probabilities c_ij (10, 12, 29, 30, 31). Those studies generally assumed that these average distances would scale like d_ij ∼ 1/c_ij. Yet a scaling analysis tells us that $d_{i j} \sim c_{i j}^{- γ}$ , with γ = 0.3 for a self-avoiding chain (see Supporting Materials and Methods, Section 3). Another class of models aimed at finding an ensemble of chromosome configurations that reproduces the experimental contact probabilities, $c_{i j}^{e x p}$ (32, 33). Yet, most of these methods did not incorporate a realistic polymer model of the chromosome. Thus, the configurations obtained may violate topological constraints imposed by the chain structure of the chromosome.

Here, we model the chromosome as a Gaussian polymer and introduce harmonic interactions to constrain its folding (see Fig. 1). The rigidity of these interactions will be determined by the cross-linking frequency between pairs of genomic loci obtained from the Hi-C protocol. This defines our Gaussian effective model (GEM). The inverse problem to solve consists in finding the effective couplings such that the contact probabilities of the model, c_ij, reproduce the contact probabilities obtained from a Hi-C experiment, $c_{i j}^{e x p}$ , similarly to previous studies (34, 35, 36). Yet, in those methods, the contact probabilities of the model could only be computed through Monte Carlo or Brownian dynamics (BD) simulations. In contrast, we provide an exact relation between the contact probabilities and the harmonic couplings of our model. Based on this relation, we propose a minimization scheme to find a physical GEM with contact probabilities as close as possible to the experimental ones. We then apply our method to Hi-C and GAM data, thus demonstrating that experimental contact probability matrices can be quantitatively reproduced by our effective polymer model.

(A) Configurations adopted by a chromosome in a cell population are retrieved using 3C techniques. (B) We use the count matrix generated by the Hi-C protocol, containing information on the ensemble of chromosome configurations, to reconstruct a GEM. Harmonic interactions with elastic coefficients k_ij are added on top of a Gaussian polymer model and adjusted to reproduce the experimental contacts. To see this figure in color, go online.

We suggest that our reconstructed GEM can be used to study chromatin organization. Typically, coarse-grained models of the chromosome are simulated by BD (37, 38). Because of the complexity of the DNA-DNA and DNA-protein interactions, practical implementations generally require some dimensional reduction or arbitrary choices for unknown parameters such as binding energies or protein binding sites. In contrast, BD simulations of the reconstructed GEM offer a simple alternative that reproduces faithfully the contacts observed in Hi-C or GAM experiments.

Methods

GEM

We model the chromosome as a beads-on-string polymer comprising N + 1 monomers with coordinates {r_i}_{i = 0...N}, each monomer corresponding to a genomic bin with size b, which, depending on the resolution, may represent from 5 kbp to 1 Mbp. Despite some controversy (39), euchromatin is generally regarded as a fiber of diameter 30 nm and persistence length l_p = 60 nm ≈ 6 kbp (40). Thus, we choose to neglect the bending rigidity of the chromosome and consider the Gaussian chain potential for the chromosome backbone:

β U_{0} [{r_{i}}] = \frac{3}{2 b^{2}} \sum_{i = 1}^{N} {(r_{i} - r_{i - 1})}^{2},

(1)

where β = (k_BT)⁻¹ is the inverse temperature.

The Hi-C protocol uses a cross-linking agent to induce proximity ligations between DNA fragments that are close to each other in the nucleus (Fig. 1 A). The matrix of contacts generated subsequently encodes information on the ensemble of configurations adopted by the chromosome (Fig. 1 B). We represent the underlying interactions that constrain its folding as harmonic springs with rigidity 3k_ij/b², leading to the interaction potential

β U_{I} [{r_{i}}] = \frac{3}{2 b^{2}} \sum_{0 \leq i < j \leq N} k_{i j} {(r_{i} - r_{j})}^{2} .

(2)

The probability of a particular configuration at equilibrium is given by a Boltzmann weight. Namely, if we denote the total energy as U = U₀ + U_I, we have

Pr ({r_{i}}) = \frac{1}{Z} e^{- β U [{r_{i}}]} .

(3)

Actually, the total energy is quadratic in the r_i variables and may be written

\begin{matrix} β U [\{r_{i}\}] & = \frac{3}{2 b^{2}} \sum_{i, j} σ_{i j}^{- 1} r_{i} \cdot r_{j} \end{matrix} .

(4)

As a result, the probability distribution in Eq. 3 is Gaussian, hence the name GEM. The GEM is completely determined by its covariance matrix $Σ = {[σ_{i j}]}_{i, j = 1 \dots N}$ or equivalently its two-point correlation functions. In particular, we have $〈 r_{i} \cdot r_{j} 〉 = σ_{i j} b^{2}$ and $〈 r_{i}^{2} 〉 = σ_{i i}$ , where the brackets denote an average taken over the Gaussian distribution in Eq. 3. Its inverse is expressed as

Σ^{- 1} = T + W,

(5)

where T is a tridiagonal matrix enforcing the chain structure from Eq. 1 and W is a matrix of reduced couplings enforcing the interactions from Eq. 2. The matrix W has the structure of a Kirchhoff (or valency-adjacency) matrix as defined in graph theory (41). These matrices read as follows:

\begin{matrix} T = (\begin{matrix} 2 & - 1 & \dots & 0 & 0 \\ - 1 & 2 & \dots & 0 & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ 0 & 0 & \dots & 2 & - 1 \\ 0 & 0 & \dots & - 1 & 1 \end{matrix}), \\ W = (\begin{matrix} \sum_{\begin{matrix} j = 0 \\ j \neq 1 \end{matrix}} k_{1 j} & - k_{12} & \dots & - k_{1 N - 1} & - k_{1 N} \\ - k_{21} & \sum_{\begin{matrix} j = 0 \\ j \neq 2 \end{matrix}} k_{2 j} & \dots & - k_{2 N - 1} & - k_{2 N} \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ \\ - k_{N - 11} & - k_{N - 12} & \dots & \sum_{\begin{matrix} j = 0 \\ j \neq N - 1 \end{matrix}} k_{N - 1 j} & - k_{N - 1 N} \\ - k_{N 1} & - k_{N 2} & \dots & - k_{N N - 1} & \sum_{\begin{matrix} j = 0 \\ j \neq N \end{matrix}} k_{N j} \end{matrix}) \end{matrix} .

(6)

As an essential feature of the GEM, the pair distances have Gaussian distributions:

Pr (r_{i j} = r) = {(\frac{2 π 〈 r_{i j}^{2} 〉}{3})}^{- 3 / 2} exp (- \frac{3}{2} \frac{r^{2}}{〈 r_{i j}^{2} 〉}),

(7)

where the mean-square distance $〈 r_{i j}^{2} 〉$ is related to the covariance matrix through the classical identities $〈 r_{i j}^{2} 〉 = 〈 r_{i}^{2} 〉 + 〈 r_{j}^{2} 〉 - 2 〈 r_{i} \cdot r_{j} 〉$ .

We now formally express the contact probability between monomers i and j as

\begin{matrix} c_{i j} & = & 〈 μ (r_{i j}) 〉, \\ = & \int d^{3} r μ (r) 〈 δ (r_{i j} - r) 〉 . \end{matrix}

(8)

In Eq. 8, μ(r_ij) is the probability that a cross-link is formed between monomers i and j that are separated by a distance r_ij. The cross-linking agent used in Hi-C experiments, namely formaldehyde, is known to polymerize in solution, resulting in cross-links of variable lengths (42). Therefore, in this work, we have considered a Gaussian form factor

μ_{ξ} (r) = exp (- \frac{3}{2} \frac{r^{2}}{ξ^{2}}),

(9)

where the threshold ξ represents the typical distance under which two monomers can be cross-linked. With this definition, we can compute the thermodynamic average in Eq. 8 and obtain (see Supporting Materials and Methods, Section 5) the following:

c_{i j} = {(1 + \frac{〈 r_{i j}^{2} 〉}{ξ^{2}})}^{- 3 / 2} .

(10)

We have thus expressed explicitly the contact probability between monomers i and j as a function of their mean-square distance. As might be expected, the contact probability c_ij is a decreasing function of $〈 r_{i j}^{2} 〉$ . Similar expressions can be obtained for other choices of form factors (see Supporting Materials and Methods, Section 5).

In summary, Eqs. 5 and 10 define a unique correspondence between the coupling matrix [k_ij]_{i, j = 0...N} and the contact probability matrix [c_ij]_{i, j = 0...N}. The only free parameter is the threshold ξ. We can therefore reconstruct the GEM reproducing a given contact probability matrix. For example, we have successfully applied this method to contact probabilities obtained by sampling configurations of a predefined GEM through BD simulations (see Supporting Materials and Methods, Section 5). We note that our model does not take into account excluded volume effects.

Reconstruction of an admissible GEM

We realized that the presence of noise in the contact probabilities could lead to an unstable GEM having a covariance matrix with negative eigenvalues and therefore a nonfinite free energy (see Supporting Materials and Methods, Section 6). To solve this issue, we reasoned that although a GEM is unstable, there may exist a stable GEM with very close contact probabilities. We therefore introduce the least-square estimator (LSE) between some experimental contact probability matrix and the one of a candidate (stable) GEM:

LSE = \frac{1}{{(N + 1)}^{2}} \sum_{i, j} {(c_{i j} - c_{i j}^{e x p})}^{2} .

(11)

In Eq. 11, the LSE is a function of the k_ij variables because the c_ij are computed from the coupling matrix using the GEM mapping introduced above. Our goal is then to minimize the LSE under the constraint that the GEM is stable. A rigorous enforcement of this principle would be to ensure that its covariance matrix Σ has strictly positive eigenvalues, which is difficult to implement in practice. Instead, we consider the more restrictive condition

k_{i j} \geq 0,

(12)

which is a sufficient condition of stability of the GEM.

Implementation

We use a steepest descent algorithm with projection to minimize Eq. 11 under the constraint in Eq. 12 (see Supporting Materials and Methods, Section 7). We thus obtain the positive couplings $k_{i j}^{*}$ , minimizing the LSE. As seen earlier, computing the c_ij as a function of the k_ij relies on the choice of a threshold ξ. Therefore, we repeat the above minimization procedure for several values of ξ and choose the one with the smallest LSE. In fine, the reconstructed couplings $k_{i j}^{o p t}$ define the best physically admissible GEM with contact probabilities $c_{i j}^{o p t}$ , reproducing the experimental values of the contact probabilities.

Results

We have applied our reconstruction method to Hi-C data generated from human lymphoblastoid cells (type GM12878) (9). For a given chromosome, these data come under the form of count matrices, in which each entry n_ij corresponds to the number of contacts detected between bins i and j on the chromosome. To compute the contact probability matrix, we applied a global normalization factor N_c to the Hi-C count matrices, c_ij = n_ij/N_c (see Supporting Materials and Methods, Section 4). One may picture N_c as the number of cells in the experimental sample. Because this normalization is not known, we adjusted both free parameters ξ and N_c when applying our reconstruction method so as to minimize the LSE between experimental and GEM contact probabilities. For data of chromosome 8 at a bin resolution of 5 kbp, the best reconstructed GEM was obtained for N_c = 10³ and ξ = 0.96 (see Fig. 2).

Application of the GEM reconstruction method to Hi-C data from (9) for chromosome 8 at bin resolution 5 kbp. The best GEM is obtained for values of ξ and N_c that minimize the LSE between experimental and GEM contact probabilities. The maximal number of contacts detected among (i, j) bin pairs is denoted as max(n_ij). To see this figure in color, go online.

The typical discrepancy between experimental and GEM contact probabilities was small, LSE^1/2 = 0.022, suggesting that this chromosome region can be well represented by a GEM. Much of the structure found in the experimental contact probability matrix was indeed well captured in the reconstructed model (Fig. 3 A). This agreement was also readily seen when considering the average contact probability $〈 c_{i j} 〉$ at a given contour length (Fig. 3 C).

Best reconstructed GEM for Hi-C data of human chromosome 8 at 5 kbp resolution (9). (A) A comparison between experimental (*lower left*) and GEM (*upper right*) contact probabilities. (B) A comparison of experimental and GEM contact probabilities (two-dimensional (2D) histogram). We give the Pearson correlation coefficient. (C) A comparison of the average contact probability as a function of the contour length. To see this figure in color, go online.

Other methods, more sophisticated than the one used above, have been proposed to estimate contact probabilities from Hi-C count matrices (9, 43, 44, 45). For completeness, we have also applied our reconstruction procedure to contact probabilities generated from the same Hi-C data but using the matrix balancing normalization, which produces a stochastic matrix of contact probabilities (see Supporting Materials and Methods, Section 4). In this case, the only free parameter to adjust was the threshold ξ. We found that the reconstructed GEM also reproduced well the experimental contact probabilities (see Fig. S11). Yet, the LSE was larger than for the previous normalization. A possible explanation for this increased value may be that a stochastic contact probability matrix is a poor representation of a cross-linked polymer.

To demonstrate that the effectiveness of our method is not limited to Hi-C data only, we have also applied our reconstruction procedure to GAM experimental data of mouse embryonic stem cells (21). Briefly, with this technique, slices of cell nuclei are obtained by making cryosections, and their DNA content is sequenced. The main output is an array of cosegregation frequencies, representing the probability for two genomic bins to be present in the same slice. We developed a normalization scheme to convert these cosegregation frequencies into contact probabilities (see Supporting Materials and Methods, Section 4). This does not introduce additional parameters, so when applying our reconstruction procedure, we only had to adjust the threshold ξ. For example, we applied our method to GAM data generated from mouse embryonic stems cells for chromosome 19 with a bin resolution of 30 kbp (Fig. 4). Again, the reconstructed model well reproduced the experimental contact probabilities, with a typical discrepancy LSE^1/2 = 0.032. Although this value is slightly greater than in the Hi-C case presented above, the size of the corresponding polymer is larger, with N = 1000. Therefore, the quantitative agreement between experiment and reconstructed model remains very good. Note that the optimal threshold of the reconstruction was quite small, ξ^opt = 0.48. Yet it appears that the precise value of the threshold is not critical. Indeed, below $ξ \underset{˜}{<} 1.0$ , the relative variations of the LSE became very small (see Fig. S17). Hence, the threshold may actually be seen as a regularization parameter for the reconstructed contact probability matrix.

Best reconstructed GEM for GAM data of mouse chromosome 19 at 30 kbp resolution (21). (A) A comparison between experimental (*lower left*) and GEM (*upper right*) contact probabilities. (B) A comparison of experimental and GEM contact probabilities (2D histogram). We give the Pearson correlation coefficient. (C) A comparison of the average contact probability as a function of the contour length. To see this figure in color, go online.

We have applied our reconstruction procedure to various chromosomes and bin resolutions from either Hi-C or GAM data sets (see Table S1 together with Figs. S1–S25). Overall, the contact probabilities of the reconstructed GEMs quantitatively reproduced the experimental ones. We found in general that the typical distance between experimental and reconstructed model contact probabilities was LSE^1/2 ∼ 0.01–0.05. Thus, we conclude that our method allows us to represent to a quantifiable accuracy the ensemble of configurations adopted by the chromosome.

To illustrate possible applications of our method to study chromosome organization, we used the reconstructed coupling matrices to perform BD simulations of the chromosome (see Supporting Materials and Methods, Section 8). To do so, we replaced the Gaussian chain potential in Eq. 1 with a finitely-extensible non-linear elastic bond potential, we took into account the polymer bending rigidity, and we introduced excluded volume interactions. We then performed BD simulations and used the sampled configurations to compute the equilibrium contact probabilities, which we compared to the ones of the GEM (see Fig. 5 A; Figs. S26 and S27). In the presence of excluded volume and semiflexibility, the obtained contact probabilities were not as close to the GEM ones. Yet, the essential structure of the contact probability matrix remained. In Fig. 5 B, we show a typical configuration for human chromosome 16.

BD of the reconstructed GEM for Hi-C data of human chromosome 16 (9) (5 kbp resolution). (A) Contact probability matrices obtained through BD simulation of 1) the GEM, 2) the GEM with bending rigidity, and 3) the GEM with bending rigidity and with excluded volume. The contact probabilities were computed from BD trajectories and are compared with the theoretical values for the GEM. (B) A snapshot of a configuration obtained by BD of the reconstructed GEM with bending rigidity and excluded volume. The couplings are represented by tie lines, from weak couplings (in *blue*) to strong couplings (in *red*). (C) LSE as a function of the threshold ξ between contact probabilities computed from the BD trajectory and the theoretical values. To see this figure in color, go online.

Discussion

In this article, we have proposed a polymer model constrained by Hi-C or GAM experimental measurements to represent the chromosome. We modeled the DNA as a flexible polymer (because the resolution is much larger than the persistence length of the DNA), with harmonic interactions between chromosomal loci encoding the contact frequency in Hi-C and GAM experiments. The spring constants are chosen so as to best reproduce the experimentally measured contact probabilities. We computed the explicit mapping defined in Eqs. 5 and 10, which relates the harmonic couplings to the contact probabilities between monomers. We then used this property to reconstruct a physically admissible GEM of the chromosome by minimizing the distance between experimental and model contact probabilities. We applied this method to many chromosomes and data sets. Overall, the quantitative agreement obtained suggested that the GEM offers a good representation of the chromosome. To illustrate potential applications of our method, we then used the reconstructed GEM to perform BD simulations of the chromosome. Although it is not a substitute for first-principles molecular dynamics simulations, this approach is valuable because the trajectories simulated by BD reproduce the experimental contact probabilities.

Models for cross-linked polymer

Properties of cross-linked polymers have been extensively studied (46, 47, 48). However, in those studies, the rigidities of the harmonic interactions were uniform (i.e., k_ij = k in Eq. 4). A similar model was also reintroduced to account for the particular scaling of the radius of gyration of the chromosome in the interphase nucleus, in which the k_ij were distributed as Bernoulli variables and hence defined random loops (49, 50). Recently, another model with quadratic interactions was proposed to obtain polymer states with arbitrary fractal dimension (51), in which the harmonic couplings followed a power law of the contour distances. Yet, these studies did not attempt to compute Hi-C contact probabilities or to predict chromatin conformations. Our model also presents some similarities with the Gaussian elastic network model used in the context of protein folding (52, 53).

Do the reconstructed couplings represent biological interactions?

Hi-C data are often generated from a population of cells. Thus, if a pair of chromosomal loci has a number of contacts that is statistically significant, it means that specific interactions should favor their colocalization. Therefore, the couplings k_ij can be seen as defining coarse-grained potentials representing the superimposition of many microscopical interactions, such as the bridging by divalent proteins, and used as effective interactions in coarse-grained models of the chromosome. Yet, the mean pair potentials $e_{i j} = 3 / 2 k_{i j} 〈 r_{i j}^{2} 〉$ , expressed in k_BT, provide a more physical interpretation of the reconstructed interactions. Eventually, the effective model obtained can give clues about where the major constraints that determine the folding of the chromosome are applied.

Fractal globule scaling of the contact probabilities

It is believed that the so-called fractal globule model (or crumpled polymer) provides a more realistic framework to describe the chromosome than classical polymer models (54, 55). In short, the presence of excluded volume and confinement results in high energy barriers from one configuration to the other, leading to a behavior different from an ideal polymer. In particular, the fractal globule was shown to reproduce the scaling for the mean contact probability as a function of the contour length, c_ij ∝ |i − j|⁻¹, observed in Hi-C experiments (8). We note that although our GEM does not incorporate excluded volume, it reproduces the experimental scaling because the couplings are reconstructed from the experimental contacts.

Robustness of the method

To investigate the robustness of the reconstructed GEM, we repeated the minimization procedure but considered only a subset of the experimental contacts in the sum from Eq. 11. Specifically, we retained only the top fraction of the experimental contact probabilities. In Fig. 6 A, we compared the contact probabilities of the original reconstructed GEM for human chromosome 8 with the contact probabilities of the GEMs reconstructed by considering only the top 90, 50, and 10%. Starting from 50%, we noticed that some artifacts appear in the reconstructed GEM for long-range contacts. These are located in regions that are sparse in contacts in the experimental contact probability matrix. As a result, very few significant contacts are retained in those regions for the minimization procedure. In fact, contacts below the thresholding quantile, which were discarded from the reconstruction, tend to be overestimated in the newly reconstructed GEM (Fig. 6 B). This suggests that regions of the contact probability matrix that contain little meaningful information (significant contacts in our case) will be poorly reconstructed. Overall, Fig. 6 C shows that the distance to the original reconstructed GEM increases as the fraction of contacts retained shrinks, and Fig. 6 D illustrates that long-range contacts are indeed the first to suffer from reconstruction artifacts. The same analysis for other data sets is given in Figs. S28 and S29.

Robustness of GEM reconstruction for Hi-C data of human chromosome 8 (9) (5 kbp resolution). For all GEM reconstructions, we used a threshold ξ = 1 and a normalization factor N_c = 10³. (A) A comparison of the contact probabilities of the reconstructed GEM with those of a GEM obtained by performing the minimization only on the top 90, 50, and 10% experimental contacts. (B) 2D histograms corresponding to the matrices shown in (A). We give the Pearson correlation coefficients. The thresholding quantiles are represented by vertical dashed lines. (C) A comparison of the GEMs reconstructed from a decreasing fraction of the experimental contacts with the original GEM. LSE^1/2 is the Euclidean distance between contact probabilities divided by (N + 1). (D) Average contact probability as a function of the contour length for GEMs reconstructed from a decreasing fraction of the experimental contacts. To see this figure in color, go online.

Future improvements

A first improvement to our model would be to explicitly include semiflexibility in the polymer structure. This can be done by adding harmonic interactions extending to second-nearest neighbors in Eq. 1. However, this refinement might appear superfluous as long as we consider bin resolutions beyond ∼5 kbp. A second improvement would be to extend the method to several chromosomes by adjusting the matrix T, which defines the chain structure.

The code used to perform the reconstruction of a GEM by minimization is available at https://github.com/gletreut/gem_reconstruction. Other data and code involved in this study are available upon request.

Author Contributions

F.K. and H.O. designed the research. G.L.T. and H.O. performed the research. G.L.T. wrote the code and analyzed the data. All authors contributed to the writing of the article.

Acknowledgments

This work was supported by the “IDI 2013” project funded by the IDEX Paris-Saclay, ANR-11-IDEX-0003-02. G.L.T. is grateful to the institute of Systems and Synthetic Biology and the Institut de Physique Théorique for giving him access to their computing facilities.

Editor: Tamar Schlick.

Footnotes

François Képès’s present address is Synovance, Évry, France.

Supporting Materials and Methods, 37 figures, one table, and one data file are available at http://www.biophysj.org/biophysj/supplemental/S0006-3495(18)31225-6.

Supporting Citations

References (56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66) appear in the Supporting Material.

Supporting Material

Document S1. Supporting Materials and Methods, Figs. S1–S37, and Table S1

mmc1.pdf^{(22MB, pdf)}

Data S1. LAMMPS Configuration Files

The LAMMPS configuration files used to perform the BD simulations in Fig. 5 are provided.

mmc2.zip^{(42.2KB, zip)}

Document S2. Article plus Supporting Material

mmc3.pdf^{(23.7MB, pdf)}

References

1.Képès F., Vaillant C. Transcription-based solenoidal model of chromosomes. Complexus. 2003;1:171–180. [Google Scholar]
2.Junier I., Martin O., Képès F. Spatial and topological organization of DNA chains induced by gene co-localization. PLoS Comput. Biol. 2010;6:e1000678. doi: 10.1371/journal.pcbi.1000678. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Spilianakis C.G., Lalioti M.D., Flavell R.A. Interchromosomal associations between alternatively expressed loci. Nature. 2005;435:637–645. doi: 10.1038/nature03574. [DOI] [PubMed] [Google Scholar]
4.Montero Llopis P., Jackson A.F., Jacobs-Wagner C. Spatial organization of the flow of genetic information in bacteria. Nature. 2010;466:77–81. doi: 10.1038/nature09152. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Schoenfelder S., Sexton T., Fraser P. Preferential associations between co-regulated genes reveal a transcriptional interactome in erythroid cells. Nat. Genet. 2010;42:53–61. doi: 10.1038/ng.496. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Boettiger A.N., Bintu B., Zhuang X. Super-resolution imaging reveals distinct chromatin folding for different epigenetic states. Nature. 2016;529:418–422. doi: 10.1038/nature16496. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Dekker J., Marti-Renom M.A., Mirny L.A. Exploring the three-dimensional organization of genomes: interpreting chromatin interaction data. Nat. Rev. Genet. 2013;14:390–403. doi: 10.1038/nrg3454. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Lieberman-Aiden E., van Berkum N.L., Dekker J. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science. 2009;326:289–293. doi: 10.1126/science.1181369. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Rao S.S., Huntley M.H., Aiden E.L. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell. 2014;159:1665–1680. doi: 10.1016/j.cell.2014.11.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Duan Z., Andronescu M., Noble W.S. A three-dimensional model of the yeast genome. Nature. 2010;465:363–367. doi: 10.1038/nature08973. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Sexton T., Yaffe E., Cavalli G. Three-dimensional folding and functional organization principles of the Drosophila genome. Cell. 2012;148:458–472. doi: 10.1016/j.cell.2012.01.010. [DOI] [PubMed] [Google Scholar]
12.Umbarger M.A., Toro E., Church G.M. The three-dimensional architecture of a bacterial genome and its alteration by genetic perturbation. Mol. Cell. 2011;44:252–264. doi: 10.1016/j.molcel.2011.09.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Cagliero C., Grand R.S., O’Sullivan J.M. Genome conformation capture reveals that the Escherichia coli chromosome is organized by replication and transcription. Nucleic Acids Res. 2013;41:6058–6071. doi: 10.1093/nar/gkt325. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Marbouty M., Le Gall A., Nollmann M. Condensin- and replication-mediated bacterial chromosome folding and origin condensation revealed by Hi-C and super-resolution imaging. Mol. Cell. 2015;59:588–602. doi: 10.1016/j.molcel.2015.07.020. [DOI] [PubMed] [Google Scholar]
15.Dixon J.R., Selvaraj S., Ren B. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature. 2012;485:376–380. doi: 10.1038/nature11082. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Olivares-Chauvet P., Mukamel Z., Tanay A. Capturing pairwise and multi-way chromosomal conformations using chromosomal walks. Nature. 2016;540:296–300. doi: 10.1038/nature20158. [DOI] [PubMed] [Google Scholar]
17.Nagano T., Lubling Y., Tanay A. Cell-cycle dynamics of chromosomal organization at single-cell resolution. Nature. 2017;547:61–67. doi: 10.1038/nature23001. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Fraser J., Ferrai C., Nicodemi M., FANTOM Consortium Hierarchical folding and reorganization of chromosomes are linked to transcriptional changes in cellular differentiation. Mol. Syst. Biol. 2015;11:852. doi: 10.15252/msb.20156492. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Sexton T., Cavalli G. The 3D genome shapes up for pluripotency. Cell Stem Cell. 2013;13:3–4. doi: 10.1016/j.stem.2013.06.013. [DOI] [PubMed] [Google Scholar]
20.Chandra T., Ewels P.A., Reik W. Global reorganization of the nuclear landscape in senescent cells. Cell Rep. 2015;10:471–483. doi: 10.1016/j.celrep.2014.12.055. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Beagrie R.A., Scialdone A., Pombo A. Complex multi-enhancer contacts captured by genome architecture mapping. Nature. 2017;543:519–524. doi: 10.1038/nature21411. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Cavalli G. Chromosome kissing. Curr. Opin. Genet. Dev. 2007;17:443–450. doi: 10.1016/j.gde.2007.08.013. [DOI] [PubMed] [Google Scholar]
23.Baù D., Sanyal A., Marti-Renom M.A. The three-dimensional folding of the α-globin gene domain reveals formation of chromatin globules. Nat. Struct. Mol. Biol. 2011;18:107–114. doi: 10.1038/nsmb.1936. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Nora E.P., Lajoie B.R., Heard E. Spatial partitioning of the regulatory landscape of the X-inactivation centre. Nature. 2012;485:381–385. doi: 10.1038/nature11049. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Di Stefano M., Rosa A., Micheletti C. Colocalization of coregulated genes: a steered molecular dynamics study of human chromosome 19. PLoS Comput. Biol. 2013;9:e1003019. doi: 10.1371/journal.pcbi.1003019. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Jost D., Carrivain P., Vaillant C. Modeling epigenome folding: formation and dynamics of topologically associated chromatin domains. Nucleic Acids Res. 2014;42:9553–9561. doi: 10.1093/nar/gku698. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Di Stefano M., Paulsen J., Micheletti C. Hi-C-constrained physical models of human chromosomes recover functionally-related properties of genome organization. Sci. Rep. 2016;6:35985. doi: 10.1038/srep35985. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Soler-Oliva M.E., Guerrero-Martínez J.A., Reyes J.C. Analysis of the relationship between coexpression domains and chromatin 3D organization. PLoS Comput. Biol. 2017;13:e1005708. doi: 10.1371/journal.pcbi.1005708. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Baù D., Marti-Renom M.A. Genome structure determination via 3C-based data integration by the integrative modeling platform. Methods. 2012;58:300–306. doi: 10.1016/j.ymeth.2012.04.004. [DOI] [PubMed] [Google Scholar]
30.Lesne A., Riposo J., Mozziconacci J. 3D genome reconstruction from chromosomal contacts. Nat. Methods. 2014;11:1141–1143. doi: 10.1038/nmeth.3104. [DOI] [PubMed] [Google Scholar]
31.Wang S., Xu J., Zeng J. Inferential modeling of 3D chromatin structure. Nucleic Acids Res. 2015;43:e54. doi: 10.1093/nar/gkv100. [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Varoquaux N., Ay F., Vert J.P. A statistical approach for inferring the 3D structure of the genome. Bioinformatics. 2014;30:i26–i33. doi: 10.1093/bioinformatics/btu268. [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Tjong H., Li W., Alber F. Population-based 3D genome structure analysis reveals driving forces in spatial genome organization. Proc. Natl. Acad. Sci. USA. 2016;113:E1663–E1672. doi: 10.1073/pnas.1512577113. [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Giorgetti L., Galupa R., Heard E. Predictive polymer modeling reveals coupled fluctuations in chromosome conformation and transcription. Cell. 2014;157:950–963. doi: 10.1016/j.cell.2014.03.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Meluzzi D., Arya G. Recovering ensembles of chromatin conformations from contact probabilities. Nucleic Acids Res. 2013;41:63–75. doi: 10.1093/nar/gks1029. [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Chiariello A.M., Annunziatella C., Nicodemi M. Polymer physics of chromosome large-scale 3D organisation. Sci. Rep. 2016;6:29775. doi: 10.1038/srep29775. [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Brackley C.A., Brown J.M., Marenduzzo D. Predicting the three-dimensional folding of cis-regulatory regions in mammalian genomes using bioinformatic data and polymer models. Genome Biol. 2016;17:59. doi: 10.1186/s13059-016-0909-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
38.Michieletto D., Orlandini E., Marenduzzo D. Polymer model with epigenetic recoloring reveals a pathway for the de novo establishment and 3D organization of chromatin domains. Phys. Rev. X. 2016;6:041047. [Google Scholar]
39.Fussner E., Ching R.W., Bazett-Jones D.P. Living without 30nm chromatin fibers. Trends Biochem. Sci. 2011;36:1–6. doi: 10.1016/j.tibs.2010.09.002. [DOI] [PubMed] [Google Scholar]
40.Langowski J. Polymer chain models of DNA and chromatin. Eur. Phys. J. E Soft Matter. 2006;19:241–249. doi: 10.1140/epje/i2005-10067-9. [DOI] [PubMed] [Google Scholar]
41.Kasteleyn P. Academic Press; New York: 1967. Graph Theory and Crystal Physics. [Google Scholar]
42.Jackson V. Formaldehyde cross-linking for studying nucleosomal dynamics. Methods. 1999;17:125–139. doi: 10.1006/meth.1998.0724. [DOI] [PubMed] [Google Scholar]
43.Imakaev M., Fudenberg G., Mirny L.A. Iterative correction of Hi-C data reveals hallmarks of chromosome organization. Nat. Methods. 2012;9:999–1003. doi: 10.1038/nmeth.2148. [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Yaffe E., Tanay A. Probabilistic modeling of Hi-C contact maps eliminates systematic biases to characterize global chromosomal architecture. Nat. Genet. 2011;43:1059–1065. doi: 10.1038/ng.947. [DOI] [PubMed] [Google Scholar]
45.Cournac A., Marie-Nelly H., Mozziconacci J. Normalization of a chromosomal contact map. BMC Genomics. 2012;13:436. doi: 10.1186/1471-2164-13-436. [DOI] [PMC free article] [PubMed] [Google Scholar]
46.Solf M.P., Vilgis T.A. Statistical mechanics of macromolecular networks without replicas. J. Phys. Math. Gen. 1995;28:6655–6668. [Google Scholar]
47.Kantor Y., Kardar M. Conformations of randomly linked polymers. Phys. Rev. E Stat. Phys. Plasmas Fluids Relat. Interdiscip. Topics. 1996;54:5263–5267. doi: 10.1103/physreve.54.5263. [DOI] [PubMed] [Google Scholar]
48.Bryngelson J.D., Thirumalai D. Internal constraints induce localization in an isolated polymer molecule. Phys. Rev. Lett. 1996;76:542–545. doi: 10.1103/PhysRevLett.76.542. [DOI] [PubMed] [Google Scholar]
49.Bohn M., Heermann D.W., van Driel R. Random loop model for long polymers. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 2007;76:051805. doi: 10.1103/PhysRevE.76.051805. [DOI] [PubMed] [Google Scholar]
50.Mateos-Langerak J., Bohn M., Goetze S. Spatially confined folding of chromatin in the interphase nucleus. Proc. Natl. Acad. Sci. USA. 2009;106:3812–3817. doi: 10.1073/pnas.0809501106. [DOI] [PMC free article] [PubMed] [Google Scholar]
51.Polovnikov K., Nechaev S., Tamm M.V. Effective Hamiltonian of topologically stabilized polymer states. Soft Matter. 2018;14:6561–6570. doi: 10.1039/c8sm00785c. [DOI] [PubMed] [Google Scholar]
52.Bahar I., Atilgan A.R., Erman B. Direct evaluation of thermal fluctuations in proteins using a single-parameter harmonic potential. Fold. Des. 1997;2:173–181. doi: 10.1016/S1359-0278(97)00024-2. [DOI] [PubMed] [Google Scholar]
53.Haliloglu T., Bahar I., Erman B. Gaussian dynamics of folded proteins. Phys. Rev. Lett. 1997;79:3090–3093. [Google Scholar]
54.Grosberg A., Rabin Y., Neer A. Crumpled globule model of the three-dimensional structure of DNA. EPL. 1993;23:373–378. [Google Scholar]
55.Mirny L.A. The fractal globule as a model of chromatin architecture in the cell. Chromosome Res. 2011;19:37–51. doi: 10.1007/s10577-010-9177-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
56.Serra F., Di Stefano M., Marti-Renom M.A. Restraint-based three-dimensional modeling of genomes and genomic domains. FEBS Lett. 2015;589:2987–2995. doi: 10.1016/j.febslet.2015.05.012. [DOI] [PubMed] [Google Scholar]
57.Jhunjhunwala S., van Zelm M.C., Murre C. The 3D structure of the immunoglobulin heavy-chain locus: implications for long-range genomic interactions. Cell. 2008;133:265–279. doi: 10.1016/j.cell.2008.03.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
58.de Gennes P. Cornell University Press; Ithaca, NY: 1979. Scaling Concepts in Polymer Physics. [Google Scholar]
59.Sheinman M., Bénichou O., Voituriez R. Classes of fast and specific search mechanisms for proteins on DNA. Rep. Prog. Phys. 2012;75:026601. doi: 10.1088/0034-4885/75/2/026601. [DOI] [PubMed] [Google Scholar]
60.Knight P.A., Ruiz D. A fast algorithm for matrix balancing. IMA J. Numer. Anal. 2013;33:1029–1047. [Google Scholar]
61.Mirny Lab. 2018. Cooler package. https://github.com/mirnylab/cooler.
62.Reuss G., Disteldorf W., Hilt A. Wiley-VCH Verlag GmbH & Co. KGaA; Weinheim, Germany: 2000. Formaldehyde. [Google Scholar]
63.Kremer K., Grest G.S. Dynamics of entangled linear polymer melts: a molecular dynamics simulation. J. Chem. Phys. 1990;92:5057–5086. [Google Scholar]
64.Plimpton S. Fast parallel algorithms for short-range molecular dynamics. J. Comput. Phys. 1995;117:1–19. [Google Scholar]
65.Press W.H. Cambridge University Press; Cambridge, UK: 2007. Numerical Recipes, 3rd Edition: The Art of Scientific Computing. [Google Scholar]
66.Elowitz M.B., Surette M.G., Leibler S. Protein mobility in the cytoplasm of Escherichia coli. J. Bacteriol. 1999;181:197–203. doi: 10.1128/jb.181.1.197-203.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. Supporting Materials and Methods, Figs. S1–S37, and Table S1

mmc1.pdf^{(22MB, pdf)}

Data S1. LAMMPS Configuration Files

The LAMMPS configuration files used to perform the BD simulations in Fig. 5 are provided.

mmc2.zip^{(42.2KB, zip)}

Document S2. Article plus Supporting Material

mmc3.pdf^{(23.7MB, pdf)}

[bib1] 1.Képès F., Vaillant C. Transcription-based solenoidal model of chromosomes. Complexus. 2003;1:171–180. [Google Scholar]

[bib2] 2.Junier I., Martin O., Képès F. Spatial and topological organization of DNA chains induced by gene co-localization. PLoS Comput. Biol. 2010;6:e1000678. doi: 10.1371/journal.pcbi.1000678. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib3] 3.Spilianakis C.G., Lalioti M.D., Flavell R.A. Interchromosomal associations between alternatively expressed loci. Nature. 2005;435:637–645. doi: 10.1038/nature03574. [DOI] [PubMed] [Google Scholar]

[bib4] 4.Montero Llopis P., Jackson A.F., Jacobs-Wagner C. Spatial organization of the flow of genetic information in bacteria. Nature. 2010;466:77–81. doi: 10.1038/nature09152. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib5] 5.Schoenfelder S., Sexton T., Fraser P. Preferential associations between co-regulated genes reveal a transcriptional interactome in erythroid cells. Nat. Genet. 2010;42:53–61. doi: 10.1038/ng.496. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib6] 6.Boettiger A.N., Bintu B., Zhuang X. Super-resolution imaging reveals distinct chromatin folding for different epigenetic states. Nature. 2016;529:418–422. doi: 10.1038/nature16496. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib7] 7.Dekker J., Marti-Renom M.A., Mirny L.A. Exploring the three-dimensional organization of genomes: interpreting chromatin interaction data. Nat. Rev. Genet. 2013;14:390–403. doi: 10.1038/nrg3454. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib8] 8.Lieberman-Aiden E., van Berkum N.L., Dekker J. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science. 2009;326:289–293. doi: 10.1126/science.1181369. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib9] 9.Rao S.S., Huntley M.H., Aiden E.L. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell. 2014;159:1665–1680. doi: 10.1016/j.cell.2014.11.021. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib10] 10.Duan Z., Andronescu M., Noble W.S. A three-dimensional model of the yeast genome. Nature. 2010;465:363–367. doi: 10.1038/nature08973. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib11] 11.Sexton T., Yaffe E., Cavalli G. Three-dimensional folding and functional organization principles of the Drosophila genome. Cell. 2012;148:458–472. doi: 10.1016/j.cell.2012.01.010. [DOI] [PubMed] [Google Scholar]

[bib12] 12.Umbarger M.A., Toro E., Church G.M. The three-dimensional architecture of a bacterial genome and its alteration by genetic perturbation. Mol. Cell. 2011;44:252–264. doi: 10.1016/j.molcel.2011.09.010. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib13] 13.Cagliero C., Grand R.S., O’Sullivan J.M. Genome conformation capture reveals that the Escherichia coli chromosome is organized by replication and transcription. Nucleic Acids Res. 2013;41:6058–6071. doi: 10.1093/nar/gkt325. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib14] 14.Marbouty M., Le Gall A., Nollmann M. Condensin- and replication-mediated bacterial chromosome folding and origin condensation revealed by Hi-C and super-resolution imaging. Mol. Cell. 2015;59:588–602. doi: 10.1016/j.molcel.2015.07.020. [DOI] [PubMed] [Google Scholar]

[bib15] 15.Dixon J.R., Selvaraj S., Ren B. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature. 2012;485:376–380. doi: 10.1038/nature11082. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib16] 16.Olivares-Chauvet P., Mukamel Z., Tanay A. Capturing pairwise and multi-way chromosomal conformations using chromosomal walks. Nature. 2016;540:296–300. doi: 10.1038/nature20158. [DOI] [PubMed] [Google Scholar]

[bib17] 17.Nagano T., Lubling Y., Tanay A. Cell-cycle dynamics of chromosomal organization at single-cell resolution. Nature. 2017;547:61–67. doi: 10.1038/nature23001. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib18] 18.Fraser J., Ferrai C., Nicodemi M., FANTOM Consortium Hierarchical folding and reorganization of chromosomes are linked to transcriptional changes in cellular differentiation. Mol. Syst. Biol. 2015;11:852. doi: 10.15252/msb.20156492. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib19] 19.Sexton T., Cavalli G. The 3D genome shapes up for pluripotency. Cell Stem Cell. 2013;13:3–4. doi: 10.1016/j.stem.2013.06.013. [DOI] [PubMed] [Google Scholar]

[bib20] 20.Chandra T., Ewels P.A., Reik W. Global reorganization of the nuclear landscape in senescent cells. Cell Rep. 2015;10:471–483. doi: 10.1016/j.celrep.2014.12.055. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib21] 21.Beagrie R.A., Scialdone A., Pombo A. Complex multi-enhancer contacts captured by genome architecture mapping. Nature. 2017;543:519–524. doi: 10.1038/nature21411. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib22] 22.Cavalli G. Chromosome kissing. Curr. Opin. Genet. Dev. 2007;17:443–450. doi: 10.1016/j.gde.2007.08.013. [DOI] [PubMed] [Google Scholar]

[bib23] 23.Baù D., Sanyal A., Marti-Renom M.A. The three-dimensional folding of the α-globin gene domain reveals formation of chromatin globules. Nat. Struct. Mol. Biol. 2011;18:107–114. doi: 10.1038/nsmb.1936. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib24] 24.Nora E.P., Lajoie B.R., Heard E. Spatial partitioning of the regulatory landscape of the X-inactivation centre. Nature. 2012;485:381–385. doi: 10.1038/nature11049. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib25] 25.Di Stefano M., Rosa A., Micheletti C. Colocalization of coregulated genes: a steered molecular dynamics study of human chromosome 19. PLoS Comput. Biol. 2013;9:e1003019. doi: 10.1371/journal.pcbi.1003019. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib26] 26.Jost D., Carrivain P., Vaillant C. Modeling epigenome folding: formation and dynamics of topologically associated chromatin domains. Nucleic Acids Res. 2014;42:9553–9561. doi: 10.1093/nar/gku698. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib27] 27.Di Stefano M., Paulsen J., Micheletti C. Hi-C-constrained physical models of human chromosomes recover functionally-related properties of genome organization. Sci. Rep. 2016;6:35985. doi: 10.1038/srep35985. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib28] 28.Soler-Oliva M.E., Guerrero-Martínez J.A., Reyes J.C. Analysis of the relationship between coexpression domains and chromatin 3D organization. PLoS Comput. Biol. 2017;13:e1005708. doi: 10.1371/journal.pcbi.1005708. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib29] 29.Baù D., Marti-Renom M.A. Genome structure determination via 3C-based data integration by the integrative modeling platform. Methods. 2012;58:300–306. doi: 10.1016/j.ymeth.2012.04.004. [DOI] [PubMed] [Google Scholar]

[bib30] 30.Lesne A., Riposo J., Mozziconacci J. 3D genome reconstruction from chromosomal contacts. Nat. Methods. 2014;11:1141–1143. doi: 10.1038/nmeth.3104. [DOI] [PubMed] [Google Scholar]

[bib31] 31.Wang S., Xu J., Zeng J. Inferential modeling of 3D chromatin structure. Nucleic Acids Res. 2015;43:e54. doi: 10.1093/nar/gkv100. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib32] 32.Varoquaux N., Ay F., Vert J.P. A statistical approach for inferring the 3D structure of the genome. Bioinformatics. 2014;30:i26–i33. doi: 10.1093/bioinformatics/btu268. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib33] 33.Tjong H., Li W., Alber F. Population-based 3D genome structure analysis reveals driving forces in spatial genome organization. Proc. Natl. Acad. Sci. USA. 2016;113:E1663–E1672. doi: 10.1073/pnas.1512577113. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib34] 34.Giorgetti L., Galupa R., Heard E. Predictive polymer modeling reveals coupled fluctuations in chromosome conformation and transcription. Cell. 2014;157:950–963. doi: 10.1016/j.cell.2014.03.025. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib35] 35.Meluzzi D., Arya G. Recovering ensembles of chromatin conformations from contact probabilities. Nucleic Acids Res. 2013;41:63–75. doi: 10.1093/nar/gks1029. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib36] 36.Chiariello A.M., Annunziatella C., Nicodemi M. Polymer physics of chromosome large-scale 3D organisation. Sci. Rep. 2016;6:29775. doi: 10.1038/srep29775. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib37] 37.Brackley C.A., Brown J.M., Marenduzzo D. Predicting the three-dimensional folding of cis-regulatory regions in mammalian genomes using bioinformatic data and polymer models. Genome Biol. 2016;17:59. doi: 10.1186/s13059-016-0909-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib38] 38.Michieletto D., Orlandini E., Marenduzzo D. Polymer model with epigenetic recoloring reveals a pathway for the de novo establishment and 3D organization of chromatin domains. Phys. Rev. X. 2016;6:041047. [Google Scholar]

[bib39] 39.Fussner E., Ching R.W., Bazett-Jones D.P. Living without 30nm chromatin fibers. Trends Biochem. Sci. 2011;36:1–6. doi: 10.1016/j.tibs.2010.09.002. [DOI] [PubMed] [Google Scholar]

[bib40] 40.Langowski J. Polymer chain models of DNA and chromatin. Eur. Phys. J. E Soft Matter. 2006;19:241–249. doi: 10.1140/epje/i2005-10067-9. [DOI] [PubMed] [Google Scholar]

[bib41] 41.Kasteleyn P. Academic Press; New York: 1967. Graph Theory and Crystal Physics. [Google Scholar]

[bib42] 42.Jackson V. Formaldehyde cross-linking for studying nucleosomal dynamics. Methods. 1999;17:125–139. doi: 10.1006/meth.1998.0724. [DOI] [PubMed] [Google Scholar]

[bib43] 43.Imakaev M., Fudenberg G., Mirny L.A. Iterative correction of Hi-C data reveals hallmarks of chromosome organization. Nat. Methods. 2012;9:999–1003. doi: 10.1038/nmeth.2148. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib44] 44.Yaffe E., Tanay A. Probabilistic modeling of Hi-C contact maps eliminates systematic biases to characterize global chromosomal architecture. Nat. Genet. 2011;43:1059–1065. doi: 10.1038/ng.947. [DOI] [PubMed] [Google Scholar]

[bib45] 45.Cournac A., Marie-Nelly H., Mozziconacci J. Normalization of a chromosomal contact map. BMC Genomics. 2012;13:436. doi: 10.1186/1471-2164-13-436. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib46] 46.Solf M.P., Vilgis T.A. Statistical mechanics of macromolecular networks without replicas. J. Phys. Math. Gen. 1995;28:6655–6668. [Google Scholar]

[bib47] 47.Kantor Y., Kardar M. Conformations of randomly linked polymers. Phys. Rev. E Stat. Phys. Plasmas Fluids Relat. Interdiscip. Topics. 1996;54:5263–5267. doi: 10.1103/physreve.54.5263. [DOI] [PubMed] [Google Scholar]

[bib48] 48.Bryngelson J.D., Thirumalai D. Internal constraints induce localization in an isolated polymer molecule. Phys. Rev. Lett. 1996;76:542–545. doi: 10.1103/PhysRevLett.76.542. [DOI] [PubMed] [Google Scholar]

[bib49] 49.Bohn M., Heermann D.W., van Driel R. Random loop model for long polymers. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 2007;76:051805. doi: 10.1103/PhysRevE.76.051805. [DOI] [PubMed] [Google Scholar]

[bib50] 50.Mateos-Langerak J., Bohn M., Goetze S. Spatially confined folding of chromatin in the interphase nucleus. Proc. Natl. Acad. Sci. USA. 2009;106:3812–3817. doi: 10.1073/pnas.0809501106. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib51] 51.Polovnikov K., Nechaev S., Tamm M.V. Effective Hamiltonian of topologically stabilized polymer states. Soft Matter. 2018;14:6561–6570. doi: 10.1039/c8sm00785c. [DOI] [PubMed] [Google Scholar]

[bib52] 52.Bahar I., Atilgan A.R., Erman B. Direct evaluation of thermal fluctuations in proteins using a single-parameter harmonic potential. Fold. Des. 1997;2:173–181. doi: 10.1016/S1359-0278(97)00024-2. [DOI] [PubMed] [Google Scholar]

[bib53] 53.Haliloglu T., Bahar I., Erman B. Gaussian dynamics of folded proteins. Phys. Rev. Lett. 1997;79:3090–3093. [Google Scholar]

[bib54] 54.Grosberg A., Rabin Y., Neer A. Crumpled globule model of the three-dimensional structure of DNA. EPL. 1993;23:373–378. [Google Scholar]

[bib55] 55.Mirny L.A. The fractal globule as a model of chromatin architecture in the cell. Chromosome Res. 2011;19:37–51. doi: 10.1007/s10577-010-9177-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib56] 56.Serra F., Di Stefano M., Marti-Renom M.A. Restraint-based three-dimensional modeling of genomes and genomic domains. FEBS Lett. 2015;589:2987–2995. doi: 10.1016/j.febslet.2015.05.012. [DOI] [PubMed] [Google Scholar]

[bib57] 57.Jhunjhunwala S., van Zelm M.C., Murre C. The 3D structure of the immunoglobulin heavy-chain locus: implications for long-range genomic interactions. Cell. 2008;133:265–279. doi: 10.1016/j.cell.2008.03.024. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib58] 58.de Gennes P. Cornell University Press; Ithaca, NY: 1979. Scaling Concepts in Polymer Physics. [Google Scholar]

[bib59] 59.Sheinman M., Bénichou O., Voituriez R. Classes of fast and specific search mechanisms for proteins on DNA. Rep. Prog. Phys. 2012;75:026601. doi: 10.1088/0034-4885/75/2/026601. [DOI] [PubMed] [Google Scholar]

[bib60] 60.Knight P.A., Ruiz D. A fast algorithm for matrix balancing. IMA J. Numer. Anal. 2013;33:1029–1047. [Google Scholar]

[bib61] 61.Mirny Lab. 2018. Cooler package. https://github.com/mirnylab/cooler.

[bib62] 62.Reuss G., Disteldorf W., Hilt A. Wiley-VCH Verlag GmbH & Co. KGaA; Weinheim, Germany: 2000. Formaldehyde. [Google Scholar]

[bib63] 63.Kremer K., Grest G.S. Dynamics of entangled linear polymer melts: a molecular dynamics simulation. J. Chem. Phys. 1990;92:5057–5086. [Google Scholar]

[bib64] 64.Plimpton S. Fast parallel algorithms for short-range molecular dynamics. J. Comput. Phys. 1995;117:1–19. [Google Scholar]

[bib65] 65.Press W.H. Cambridge University Press; Cambridge, UK: 2007. Numerical Recipes, 3rd Edition: The Art of Scientific Computing. [Google Scholar]

[bib66] 66.Elowitz M.B., Surette M.G., Leibler S. Protein mobility in the cytoplasm of Escherichia coli. J. Bacteriol. 1999;181:197–203. doi: 10.1128/jb.181.1.197-203.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

A Polymer Model for the Quantitative Reconstruction of Chromosome Architecture from HiC and GAM Data

Guillaume Le Treut

François Képès

Henri Orland

Abstract

Introduction

Figure 1.

Methods