Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2018 Jul 1.
Published in final edited form as: Methods. 2017 Feb 5;123:47–55. doi: 10.1016/j.ymeth.2017.01.007

Comparative analysis of 2D and 3D distance measurements to study spatial genome organization

Elizabeth H Finn 1, Gianluca Pegoraro 1, Sigal Shachar 1, Tom Misteli 1,*
PMCID: PMC5522766  NIHMSID: NIHMS852710  PMID: 28179124

Abstract

The spatial organization of eukaryotic genomes is non-random, cell-type specific, and has been linked to cellular function. The investigation of spatial organization has traditionally relied extensively on fluorescence microscopy. The validity of the imaging methods used to probe spatial genome organization often depends on the accuracy and precision of distance measurements. Imaging-based measurements may either use 2 dimensional datasets or 3D datasets including the z-axis information in image stacks. Here we compare the suitability of 2D versus 3D distance measurements in the analysis of various features of spatial genome organization. We find in general good agreement between 2D and 3D analysis with higher convergence of measurements as the interrogated distance increases, especially in flat cells. Overall, 3D distance measurements are more accurate than 2D distances, but are also more prone to noise. In particular, z-stacks are prone to error due to imaging properties such as limited resolution along the z-axis and optical aberrations, and we also find significant deviations from unimodal distance distributions caused by low sampling frequency in z. These deviations can be ameliorated by sampling at much higher frequency in the z-direction. We conclude that 2D distances are preferred for comparative analyses between cells, but 3D distances are preferred when comparing to theoretical models in large samples of cells. In general, 2D distance measurements remain preferable for many applications of analysis of spatial genome organization.

INTRODUCTION

The eukaryotic genome is functionally organized across several length scales [1,2]. Double-stranded DNA is wrapped around nucleosomes, which are composed of octameric core histones, and further coiled into a chromatin fiber [3], which forms higher order functional conformations, most prominently loops between promoters and enhancers [4,5,6], or between co-regulated genes [711]. Furthermore, chromatin forms distinct domains with variable density related to transcriptional activity and histone modifications [12,13]. Large domains of heterochromatin and euchromatin appear to self-associate and also associate with particular nuclear landmarks such as the nuclear lamina [14], the nucleolus [15], and nuclear bodies [16,17]. At the highest level of organization, chromosomes form territories which assume preferred positions within the nucleus [18]. Many of these organizational features have been observed to change during differentiation [19], to be associated with changes in transcription [20,21], and to be disordered in disease [22,23], suggesting that the physical distance between specific genomic locations or relative to nuclear landmarks is an important regulatory feature.

Given their potential regulatory function, measurements of physical distances are of considerable interest. While recently developed biochemical C-methods detect physical interactions between genome regions globally and with relatively high resolution [24], they do not provide spatial distance measurements and can only provide information on pair-wise interactions. The most commonly used means to measure physical distances in the genome is by microscopy-based imaging techniques, particularly fluorescence in-situ hybridization (FISH; [2]).

Measurements of spatial genomic distances can either be performed in 2D or in 3D from a z-stack of images. For most 2D distance measurements, single images for analysis are generated either by maximal projection of z-stacks or by selection of a representative slice. On the one hand, this often simplifies image processing steps and reduces the computational resources needed, especially when high-throughput, automated image analysis is used on large image datasets. On the other, while all the signals in the image are captured, vertical distance information is lost in the projection process. As a consequence, objects spatially separated in 3D space may be detected as co-localizing in the 2D projection if aligned closely along the z- optical axis (Fig. 1A).

Figure 1.

Figure 1

2D and 3D measurements of z-stacks. A: Three orthogonal slices through a field of cells stained for three regions on chromosome 1 (xy, yz, and xz as labeled; xy position of relevant Z-stack marked with white bar). B: Three orthogonal slices through a field of 2.5 μm diameter fluorescent beads (xy, yz, and xz as labeled; xy position marked with white bar). C: Example pipeline for generating 2D and 3D distances from a z-stack.

Alternatively, distances between signals can be measured in three dimensions. While 3D datasets preserve all the information contained in the stack, they can introduce artifacts due to low resolution (depth of field), diffraction of signals, and collection of out-of-focus light. Axial resolution is also sensitive to spherical aberrations in lenses, which elongate z-signals ([25,26], Fig. 1A,B), and by chromatic aberrations, which may lead to signal shifts between multiple channels [26]. In addition, when using confocal microscopes, axial resolution depends on the size of the pinhole [27, 28], with a smaller pinhole increasing axial resolution but also decreasing signal, which may lead to signal loss. Finally, variation in the refractive index between cells and media, or between different organelles of the cell, including the nucleus, cause considerable axial distortion in confocal images [27].

Given the advantages and disadvantages of 2D and 3D measurements, we sought to empirically compare the two measurement modalities in different experimental contexts using a spinning-disk confocal microscope, in order to determine the most suitable measurement approach.

MATERIAL AND METHODS

Cell Culture

Human foreskin fibroblasts immortalized with hTert (neomycin resistance; [29]) were grown in DMEM media with 10% FBS, 2mM glutamine, and penicillin/streptomycin, and split 1:4 every 3–4 days. These cells have a normal karyotype as verified by SKY staining (Figure S1). Cells were plated in 384 well plates (CellCarrier Ultra, PerkinElmer) at a density of approximately 5000 cells/well between passages 40 and 45 and grown overnight. Plates of cells were then fixed for 10 minutes in 4% paraformaldehyde, washed, and stored in 70% ethanol at −20°C.

HiFISH Imaging

High-throughput fluorescence in-situ hybridization (HiFISH) was performed in triplicate as described previously [22,30,31]. Probes were generated via nick translation as described previously [32] from bacterial artificial chromosomes (BACs) to several regions across chromosome 1 (see Table 1). Mixes, reagents, and conditions are exactly as in Meaburn [32], except fluorescently labelled dUTP was substituted for biotin- or digoxigenin- labelled dUTP (Green: ChromaTide Alexa Fluor 488-5-dUTP, ThermoFisher; Red: ChromaTide Alexa Fluor 568-dUTP, ThermoFisher; Far-red: Amersham CyDye Cy5-dUTP, GE Healthcare) and nucleotide mixes were used at a 1/3 dilution. Probes were mixed, precipitated, and resuspended at a final concentration of 6.67ng/μL in hybridization buffer (50% formamide, 10% dextran sulfate, 1% tween-20, 2× SSC).

Table 1.

Probes used

Label Chr Start End BAC ID
11 1 2,435,023 2,569,510 RP11-1012C20
52 1 12,768,721 12,925,598 RP11-380A17
74 1 18,566,410 18,766,881 RP11-164D21
80 1 20,012,370 20,177,633 RP11-451I3
88 1 22,000,503 22,169,684 RP11-655O19
91 1 22,549,855 22,721,150 RP11-702L8
354 1 88,222,723 88,418,435 RP11-647L1
360 1 89,759,076 89,913,487 RP11-465E12
375 1 93,528,778 93,687,931 RP11-657I7
385 1 96,067,993 96,226,767 RP11-489I11
389 1 97,025,756 97,207,861 RP11-357E20
422 1 105,256,971 105,427,281 RP11-75M16
463 1 115,549,942 115,729,562 RP11-675C19
476 1 118,815,747 118,989,849 RP11-419G7

Fixed cells were permeabilized for 20 min in 0.5% saponin/0.5% Triton X-100/PBS, washed twice in PBS, incubated for 15 min in 0.1N HCl, neutralized for 5 min in 2× SSC, and equilibrated for at least 30 min in 50% formamide/2× SSC before probes were added. Probes and nuclear DNA were denatured at 85°C for 7.5 min and plates were immediately moved to a 37°C humid chamber for hybridization overnight. The next day, plates were washed thrice in 1× SSC, thrice in 0.1× SSC, stained with DAPI, mounted in PBS, and imaged.

Imaging was performed in four channels (405, 488, 561, 640 nm excitation lasers) in an automated fashion using a spinning disk high-throughput confocal microscope (PerkinElmer Opera QEHS) using a 40× water immersion lens (NA = 0.9) and pixel binning of 2 (pixel size = 320 nm). 20–40 fields were imaged per well. For sparse sampling, z-stacks of 7 μm thickness with images spaced 1μm apart were acquired in three separate exposures. For Nyquist sampling, z-stacks with total 4.2 μm thickness and 300 nm image intervals were generated. In all exposures the light path included a primary excitation dichroic (405/488/561/640 nm), a 1st emission dichroic longpass mirror: 650/660–780, HR 400–640 nm and a secondary emission dichroic shortpass mirror: 568/HT 400–550, HR 620–790 nm. In exposure 1, samples were excited with the 405 and 640 nm lasers, and the emitted signal was detected by two separate 1.3 Mp CCD cameras (Detection filters: bandpass 450/50 nm and 690/70 nm, respectively). In exposure 2, samples were excited with the 488 nm laser and the emitted light was detected through a 1.3 Mp CCD camera (Detection filter: bandpass 520/35). In exposure 3, samples were excited with the 561 nm laser and the emitted light was detected through a 1.3 Mp CCD camera (Detection filter: bandpass 600/40).

2D and 3D Image Analysis

Automated analysis of all images was performed based on a modified version of a previously described Acapella 2.6 (PerkinElmer) custom script [30, 3335]. This custom script performed automated nucleus detection based on the maximal projection of the DAPI image (ex. 405 nm) to identify cells. Spots within these cells were subsequently identified in maximal projections of the Green (ex 488 nm), Red (ex. 561 nm) and Far Red (ex. 640 nm) images, using local (relative to the surrounding pixels) and global (relative to the entire nucleus) contrast filters. The x and y coordinates of the brightest pixel in each spot were calculated. The z coordinate of the spot center was then calculated by identifying the slice in the z-stack with the highest value in fluorescence intensity for each of the spot centers. Datasets containing x,y and z coordinates for spots in the Green, Red and Far Red channels as well as experiment, row, column, field, cell, and spot indices, were exported from Acapella as tab separated tabular text files. These coordinates datasets were imported in R [36]. 2D and 3D distances for each pair of Red:Green, Red:Far Red, or Green:Far Red probes within a cell were generated on a per-spot basis using the SpatialTools R package [37]. Subsequent analyses were performed in R using the plyr [38], dplyr [39], ggplot2 [40], data.table [41], knitr [42] and stringr [43] packages. All images, scripts, and datasets are available upon request.

Statistical Analysis

For 2D/3D scatterplots, 2D and 3D distances were calculated on a per-green-spot basis using the SpatialTools R package [37]. These distances were plotted using ggplot2 [40].

For modelled 2D/3D scatterplots, random pairs of coordinates were generated with a normal distribution; standard deviation was 100 for x and y and 10, 30, 50, or 100 for z. Distances between these pairs were calculated in 2D and 3D using the SpatialTools R package [37] and these points were plotted using ggplot2 [40].

For 2D/3D colocalization frequencies in fibroblasts, both 2D and 3D distances were calculated using the SpatialTools R package [37]. Minimal distances were calculated on a per-green-spot basis using the data.table R package [41]. The percentage of spot pairs within thresholds of 350 nm, 700 nm, 1 μm, 2 μm, 3 μm, and 4 μm was calculated using the data.table R package [41] and plotted using ggplot2 [40].

For triplet associations in fibroblasts, both 2D and 3D minimal distances were calculated on a per-green-spot basis as above. Triplets were defined as events where a single green spot was within 1 μm of a red spot and a far-red spot. Both pairing frequencies within 1 μm, and triplets, were counted. For each probe set, expected proportions of triplets were calculated as p(G:R)*p(G:F) where p(G:R) is the proportion of green and red spots colocalizing within 1μm and p(G:F) is the proportion of green and far red spots colocalizing within 1μm. 95% confidence intervals were calculated with the modified Wald method.

For distance distribution histograms in fibroblasts, minimal distances were calculated on a per-green-spot basis and plotted.

Data Availability

The data generated has been uploaded to dryad for public sharing (doi: 10.5061/dryad.236j7). This contains the original .flex image files generated by the PerkinElmer Opera QEHS high-throughput microscope, the PerkinElmer Acapella script and parameter files used for image analysis, the .txt single-object level image analysis results files generated by Acapella, the .rmd files containing the R code used to calculate 2D and 3D distances and perform subsequent analyses, and the summary experiment-level spot-to-spot distance tables for each experiment generated by R. We have included the following files:

  1. AcapellaScriptsAndParameters.zip: This file contains the Acapella scripts and paramaters used to analyze each of the experiments. The files are named relative to the image dataset used.

  2. Image file folders: These files are named according to the date of acquisition and the z-stack resolution, in the format “BC_YYMMDD_Res.zip”. Each folder contains all the .flex files acquired in that experiment. The experiments used to generate each figure panel are as follows:
    1. Figure 2: All experiments were used to generate panel A.
    2. Figure 3: BC_160525_300.zip, BC_160531_300.zip, and BC_160614_300.zip were used.
    3. Figure 4: BC_160524_1000.zip and BC_160525_300.zip were used.
    4. Figure S3: BC_160614_300.zip was used.
  3. SpotPositionMeasurements.zip: This file contains the output of the Acapella scripts (.txt files), the experimental metadata assigning probes to wells, and the summarized per-spot-pair distance measurements generated by R. They are organized into folders by experiment.

  4. RmdScripts.zip: R markdown scripts used to calculate spot distances and perform subsequent analyses.

Figure 2.

Figure 2

2D vs 3D distances in nuclei of multiple shapes. A: Scatterplot showing 2D vs 3D distance for minimal distances between FISH signals in fibroblasts. B–E: Scatterplots showing modeled relationship between 2D and 3D distances at random points in volumes of various sizes.

Figure 3.

Figure 3

Co-localization proportions in 2D and 3D. A: Scatterplot showing co-localization proportion based on 2D distances vs. co-localization proportion based on 3D distances. Distance cutoff color coded. B: Ideogram showing probe assignment into triplets. C: Bar graph showing proportion of green spots interacting with red and far-red spots simultaneously. Orange: observed; blue: expected.

Figure 4.

Figure 4

Various observed artifacts in 3D. A: Ideogram showing probes used. B: Histograms of distance distributions for minimal distances in fibroblasts for various probe sets with various average distances. Discontinuities due to lowered resolution in z are visible in 3D distance distributions with 1 μm z-slices but not 2D distance distributions or 3D distance distributions with 300 nm z-slices. C: Discontinuities due to resolution in x-y are visible at very short distances with very small binning regardless of the method for calculating distances.

RESULTS

2D vs 3D distance measurement over varying length scales

In order to determine how well 2D and 3D distance measurements correspond to each other, we systematically compared distance measurements across several length scales using 2D or 3D measurement regimes. As a model system we used a series of probes tiling two regions of approximately 20 Mb each on chromosome 1 (diagrammed in Fig. S2). Each probe pair was separated by at least 10 Mb and in total 20 different probe pairs were examined in approximately 1,000 cells per pair. Pairs were selected to have a variety of chromatin contexts and interaction profiles. We determined pairwise distances between each pair of loci, both in 2D from maximal projections and in 3D using automated high-throughput spinning disc confocal microscopy as described in figure 1C (Materials and Methods). We then compared 2D and 3D distances for each pair of spots in each cell by pooling in silico all the probe pairs studied (Fig. 2A). We observe a strong distance dependence for the concordance of 2D and 3D distances. 2D and 3D distances between spot centers are in good agreement for distances above approximately 5 μm with average discrepancy between measurements of less than 1% (Fig. 2A). Below 5 μm we see on average a 29% difference between 2D and 3D distances, and below 1 μm we see on average an 83% difference. Similarly, while we observe a statistically significant difference between 2D and 3D distances overall (two sample t-test, p=7.057e-13), this difference is much less significant for points separated by at least 5 μm (two sample t-test, p=0.01744).

Prompted by the fact that the 5 μm cut-off is within the dimensions of the average height of the fibroblast cell nucleus, we interrogated the effect of nuclear shape on 2D and 3D distance measurements by generating theoretical models for 2D and 3D distance distributions in nuclei with varying degrees of flatness (Fig. 2B). In our simulated models, the x and y coordinates were sampled from a random normal distribution with a mean of 300 arbitrary units and a standard deviation of 100 arbitrary units; the z coordinate was set with a mean of 300 and standard deviations of 10 (z/x = 0.1), 30 (z/x = 0.3), 50 (z/x = 0.5) and 100 (z/x = 1). In scatter plots of 2D versus 3D distance for all modeled points, we observe that flatter nuclei show more agreement between 2D and 3D analysis, and that agreement is stronger at larger distances. Consistent with our observations in fibroblasts, deviations are greater at shorter distances, and in addition we observe a striking effect of nuclear shape: considering all spots, average discrepancy between measurements was 21% for a model of perfectly round nuclei (z/x = 1), 10% for z/x = 0.5, 6% for z/x = 0.3, and only 1% for our flattest model (z/x = 0.1). Considering only spot pairs within 75 units, which is 15% of the maximum diameter, or roughly 5 μm in a fibroblast with maximum distance between points around 30 μm, average discrepancy between measurements is still 21% for the round model, but increases to 21% for z/x = 0.5, 17% for z/x = 0.3, and 5% for the flattest model. Similarly, the statistically significant difference between 2D and 3D distances is observable mostly in round cells and at short distances. When considering all pairs, we observe highly significant differences between 2D and 3D modeled differences for z/x = 1 (p < 2.2e-16), and z/x = 0.5 (p = 1.379e-13), significant differences when z/x = 0.3 (p = 0.001228), and no significant differences when z/x = 0.1 (p = 0.7132, two sample t-test). Considering only pairs separated by at least 75 units, we see a general decrease in significance (z/x = 1: p < 2.2e-16, z/x = 0.5: 2.067e-10, z/x = 0.3: 0.009486, z/x = 0.1: p = 0.7753). It is worth noting here as well that the difference in the means for z/x = 0.3 with point pairs separated by at least 75 units is only 5 units (2D mean: 197.0418, 3D mean: 202.2165) which corresponds to approximately 333 nm, which may be within normal noise due to drift of the microscope and aberrations in mirrors and lenses. Thus, both cell shape and average distance between spots affect the ratio of measurements made in 2D to those made in 3D. Overall, at longer distances and in flatter cells, 2D and 3D measurements are not significantly different.

The use of central slices versus maximal projections

The principle advantage of using a sparse z-stack, and calculating 2D distances from the maximal projection, is a time savings in image acquisition. This time savings would be even more significant if a single central slice could be imaged and spot centers determined from a single image per field. However, as spots do not always span the entire height of the cell, some spots will be lost when a single central slice is taken. To determine the extent of spot loss upon using a central slice rather than a maximal projection, we reanalyzed a subset (630 fields) of our data using a single central slice. These samples probe interactions between our common upstream “Bait” probe and four probes roughly 10Mb downstream (referred to as A, B, C, and D; Figure S3A). Whereas all probes yielded predominantly diploid signals in maximal projections, fewer than half of the cells showed diploid signals in the central slice (Figure S3B). Across all probes and channels, we identified 37,996 spots in these fields in maximal projections and only 22,032 spots in central slices. It is worth noting, however, that this decrease in data density is not sufficient to significantly change distance distributions (Figure S3C). It is likely that acquiring precise 2D distance distributions depends more on the number of images taken rather than the number of cells imaged: imaging more z-positions per cell will ensure that diploid signals are determined more frequently, but imaging more cells at only one z-position will yield sufficient diploid cells. Finding this balance, between taking sufficient z-slices to acquire most FISH spots and imaging sufficient fields to sample a large number of cells, will depend entirely on the application being tested.

2D and 3D association probabilities

Since 2D and 3D distances diverge at short ranges, it seems likely that association probabilities between genome regions in 3D space, which rely on detection of short distance measurements, would differ depending on whether they are calculated based on 2D or 3D distances. We tested this hypothesis by comparing 2D and 3D co-localization frequencies for pairs of regions on chromosome 1, separated by between 2 and 20 Mb (Fig. S2A). For analysis, we considered varying association thresholds between 350 nm and 4 μm (Fig. 3). We observe that the distance threshold has a notable effect on the frequencies calculated from 2D or 3D distances. In particular, deviations were more noticeable at shorter thresholds. With a threshold of 3 or 4 μm, we see little difference between 2D and 3D frequencies (Fig. 3A, green and yellow dots) whereas with a threshold of 1 μm or less, deviations are greater (Fig. 3A, orange, dark blue, and gray dots). Furthermore, we observe greater effects at loci which interact more frequently; for example, two loci which colocalize in 25% of cells in 3D appear to interact in as many as 55% of cells in 2D, whereas two loci which colocalize in under 10% of cells in 3D colocalize in under 20% of cells in 2D (Fig. 3A). These observations suggest that 2D and 3D analysis of rare associations and long-range interactions are equivalent, however, frequent, short-range associations may be overestimated in 2D measurements. Hence, for the study of close physical associations between loci, 3D distance measurements are likely to be more accurate.

Accuracy of 2D vs 3D distance measurements

The accuracy of a measurement is defined as closeness of agreement between the measurement and the true value of the measured entity. The higher accuracy of 3D measurements should lead to an improved ability to detect deviations from expected relationships. One very simple such theoretical model is that two pairwise interactions between one bait and two targets are independent. That is to say, that a co-localization between locus A and locus B does not change the probability of a co-localization between locus A and locus C. Under this assumption, the “clustering” frequency will be equal to the product of the pairwise association frequencies: p(A:B&A:C) = p(A:B)*p(A:C). However, it is likely that biological factors, such as chromatin state, transcriptional activity, and protein binding play a role in both of the pairwise interactions, and that in some cases the triplets will occur more often, or less often, than expected. If 3D measurements are more accurate than 2D measurements, they will be more sensitive to deviations from a neutral model. To test this hypothesis, we used 2D and 3D measurements to examine clustering behavior at four triplets spanning 20Mb on chromosome 1 (from 2,301,890 to 22,549,855). It is worth noting that this comparison of deviations from a model of independence between pairwise associations does not depend on the specific chromatin state or gene expression at the triplets tested. In fact, frequent observations from numerous studies exploring the correlation between gene activity and physical distance suggest that at both active and inactive regions, the phenomena bringing two loci together are common, and as such a deviation from the model of independence may be expected at many triplets independent of chromatin state.

Triplets were chosen from among our pairwise associations (Fig. 3B). Each probe triplet contained our most frequently occurring interaction, consisting of a bait between 12.75 and 13 Mb on chromosome 1 and a target 10 Mb upstream. The third probes mapped to a region approximately 10 Mb downstream from the bait and were spaced by 0.5–2 Mb. We modeled triplet associations, assuming independence between each pairwise association, and compared observed frequencies of clustered interactions detected by 2D or 3D analysis to modeled frequencies. We observe a statistically significant tendency for probes to cluster more than expected in both 2D and 3D (Fig. 3C). The enrichment and significance were greatly enhanced in 3D. The interaction frequency in 2D is on average 1.12× that of the expected value (p-value: 5.133E-5), in 3D the interaction frequency is on average 1.29× that of the expected value (p-value: 1.329E-10, two-tailed proportion t-test). Finally, while we see differences between observed rates of triplet associations and those modeled based on pairwise association frequencies at all four triplets in 3D, we find individual triplets were rarely significantly enriched (p-value: 0.01) in 2D (Fig. 3C). Thus, the inaccuracy inherent in 2D distance metrics makes it difficult to detect statistically significant trends in the data, whereas the improved accuracy with 3D distance thresholds identifies such interactions.

Variability and noise in 2D vs 3D datasets

When examining overall distance distributions in 2D and 3D, we noticed periodic noise in 3D distances taken with a 1 μm z-stack which was absent in 2D (Fig. 4A middle column). This noise could plausibly come from poor z-resolution. To determine whether noise caused by decreased resolution in Z significantly deforms distance distributions, we compared distance distributions for 2D distance measurements and 3D distance measurements generated from imaging different fields in the same wells using z-stacks with either a 1 μm slice or a 300 nm slice (Fig. 4A). Data were classified into 300nm bins starting from 0 and histograms were plotted. 3D distance measurements generated from z-stacks with a 1 μm slice contained prominent periodicity in distance distributions which were absent in 2D distance measurements and more densely sampled 3D distance measurements (Fig. 4A). The discontinuities in 3D distances were most visible at 1 and 2 μm distances, but were visible in distances up to 4 μm. The 3D distance measurements have a period of approximately 1 μm, equivalent to the spacing between z-sections in our dataset. It is worth noting that our x-y distances also represent discrete measurements with an x-y pixel size of 320 nm. As such, we observe discontinuities in all distance measurements at very short length scales (<1μm) (Fig. 4B), however, when the resolution in the x-y is greater than in z, discontinuities at very low distances in the x-y plane do not significantly alter the overall distance distributions. We conclude that 3D distances are sensitive to noise generated by decreased resolution across the optical axis.

DISCUSSION

Determination of distances in the cell nucleus is routinely performed in 2D, mostly due to the ease of measurements and the ability to analyze a larger number of individual cells. The use of 2D measurements has become standard, despite the fact that, intuitively, measurements of distance in 3D should be more accurate than in 2D. However, technical and physical limitations of microscopes mean that the vertical position of a signal is difficult to pinpoint exactly, and as such measurements in 3D may be overall less precise than measurements in 2D. We have here systematically compared 2D vs 3D distance measurements using FISH datasets.

We do not find that 3D distance measurements are always better than 2D measurements, suggesting that use of 2D analysis is a valid approach to studying nuclear organization. In fact, for some cases, such as determination of large distances, particularly in flat cells, 2D and 3D distances will yield very similar results. As a rough guideline based on our modeling and observations, distances greater than 5 μm on average, and cells with a height to width ratio of 1/3 or less, fall into this category. Furthermore, when the computational requirements for 3D cells become onerous, when the need for precision is high, or when only a few cells can be sampled, 2D measurements are advantageous. In addition, calculating 2D distances using only a single slice may speed acquisition time, reduce photobleaching and phototoxicity, and facilitate applications such as live cell microscopy. In addition, most practical applications are comparative in nature: the distances or association frequencies between a test pair of loci are compared with a set of control loci analyzed the same way. Provided that the measured distances for the test and the control pairs are similarly distributed they should be affected by the same systematic errors in measurements, thus making an internal comparison valid, regardless of whether 2D or 3D measurements are used.

The most accurate and precise data will be generated by imaging with thin z-slices and measuring distances in 3D. The advantages of imaging this way are two-fold: not only do denser z-stacks ameliorate the discontinuities in distance distributions which we observed, but they also minimize problems due to slight spherical aberrations present even in PLAN APO lenses. In short, a slight spherical aberration can be worsened by taking a thick z-slice and choosing the brightest pixel as the spot center if the ‘true’ position of the spot is between two z-slices. Dense z-slices can lessen the extent of this problem, as can computational techniques such as fitting a Gaussian curve to determine z-position. In particular, the optimal height for a z-slice can be determined analytically based on the objective and the wavelength of the fluorophores used, according to the Nyquist-Shannon sampling theorem, which determines the minimal sampling density needed to capture all information from the microscope into the image. One rule of thumb is to sample at one half the Rayleigh criterion, r = 1.4 ηλ/NA2 for wavelength λ, refractive index of mounting/immersion media η, and numerical aperture NA (for our experimental set up this yields a suggested sampling rate between 500 and 800 nm). Another suggestion, which addresses the fact that z-resolution is often limited by pinhole size and depth of field rather than diffraction of light, is to sample at one half or one third the maximal resolution (for our experimental set up with 1 μm depth of field this yields a suggested sampling rate between 300 and 500 nm). In our hands, a z-slice of 300nm worked well to yield both accurate and precise data.

However, in higher-throughput conditions, when hundreds of thousands or even millions of images must be generated, imaging at such a depth may prove prohibitive. In these situations, the choice between 2D and 3D distances is determined by the need of accuracy, defined as the reflection of the measurement of the ground truth, versus precision, defined as the internal consistency of multiple independent measurements (Table 2). 2D distances are precise by nature, due to better resolution in x and y, and easier normalization for optical aberrations in the x-y plane. 3D distances, even with thicker z slices, are by their nature more accurate – although the discrete nature of the data and low resolution in z can add noise to the distributions. Thus, when precision is crucial, for instance when examining the correlation between two parameters, 2D distances may be preferred. On the other hand, when accuracy is crucial, for instance when comparing to a theoretical model, sparse 3D distances are preferred. Overall, when imaging time must be optimized, and given the generally good alignment of 2D and 3D measurements, the choice between 2D and 3D distance measurements for frequent interactions depends on the biological and statistical question asked (Table 2). It is worth noting that our imaging set-up was not designed to maximize resolution. The use of higher-magnification, unbinned images and/or widefield images that have been deconvolved can improve resolution. With respect to increasing magnification and removing binning, we see the same discontinuities and noise in sparse z-stacks of unbinned 60× images (data not shown). As these discontinuities are likely due to the difference between axial and lateral resolution, this is to be expected. As the microscope used in this study does not have a widefield setting, we were unable to test deconvolved widefield images. However, even these images show a difference between axial and lateral resolution, and in addition the deconvolution process would add significant time to the analysis pipeline when applied to hundreds of thousands or millions of images gathered in a fairly typical high-throughput screening study. The best way to currently maximize accuracy and precision in localization experiments is to use super-resolution microscopy, but until these techniques can be applied in an automated fashion to generate large data sets, studying populations of cells with these techniques will remain difficult.

Table 2.

Comparison of imaging modalities

2D 3D
1μm z-stack resolution
3D
300nm z-stack resolution
Fast? Yes Yes No
Precise? Yes No Yes
Accurate? No Yes Yes

Although not specifically tested here, measurements of radial gene positioning, i.e. the location of a gene relative to the center of the nucleus, likely benefit from 2D analysis compared to 3D measurements [24]. Because radial position is often computed as a percent of the radius of the cell, in each measurement a long distance is being considered. The edge of the nucleus is more computationally difficult to find than a second spot center, and such computations become much more difficult in 3D, especially in a high-throughput format, putting a premium on computational resources. In addition, there are several ways to reduce the bias generated by the maximal projection. Since radial position is most often considered in a comparative fashion, the systematic bias resulting from loci at the ‘top’ or ‘bottom’ of the nucleus appearing in the center in a focal plane or maximal projection will be, essentially, considered in the background measurements. In addition, since radial position is measured on a per-spot basis, it is possible to remove some of the most biased spots by selecting a single central focal plane, ruling out spots at the very top or bottom of the nucleus. Thus, for measurements of radial position it is likely that the advantages of using 3D distances are limited and 2D distance measurements are preferred in most cases.

In sum, we find here, that both 2D and sparse 3D measurements have systematic biases that must be taken into account. In 2D, interactions appear more likely than they are in reality due to the overlay effect created when generating a maximal projection. In sparse 3D analysis, noise is added due to poor z-resolution. To properly use 2D distances, systematic errors must be eliminated either by empirically measuring background values, or by cross-comparing multiple samples which will all have the same bias, rather than using a theoretical model. To properly use 3D distances, it is imperative that a large number of cells is imaged in order to minimize the effect of the additional sources of noise such as chromatic aberrations or imperfections in the imaging light path. Taken together, our comparative analysis shows that both 2D and 3D distance measurements are imperfect, but suggests good agreement between the two approaches. While specific experimental designs may require the strict use of either method, the logistically simpler and practically faster 2D distance measurements will likely be preferred in many situations and will be appropriate in most cases.

Supplementary Material

1
2
3

HIGHLIGHTS.

  • 2D and 3D measurements yield similar results in flat cells and at long distances.

  • 2D measurements provide more precise data.

  • 3D measurements provide more accurate data, but are noisier.

  • To gain precision in 3D distances, it is crucial to sample at 1/3 the depth of field.

Acknowledgments

This research was supported, in part, by the Intramural Research Program of the National Institutes of Health (NIH), National Cancer Institute, and Center for Cancer Research and by the 4D Nucleome Common Fund (5U54DK107980-01). The SKY staining was performed by Danny Wangsa in Thomas Ried’s lab (CCR, NCI, NIH).

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  • 1.Misteli T. Beyond the Sequence: Cellular Organization of Genome Function. Cell. 2007;128:787–800. doi: 10.1016/j.cell.2007.01.028. [DOI] [PubMed] [Google Scholar]
  • 2.Bickmore WA. The spatial organization of the human genome. Annual review of genomics and human genetics. 2013;14:67–84. doi: 10.1146/annurev-genom-091212-153515. [DOI] [PubMed] [Google Scholar]
  • 3.Ricci MA, Manzo C, García-Parajo MF, Lakadamyali M, Cosma MP. Chromatin Fibers Are Formed by Heterogeneous Groups of Nucleosomes In Vivo. Cell. 2015;160:1145–1158. doi: 10.1016/j.cell.2015.01.054. [DOI] [PubMed] [Google Scholar]
  • 4.Tolhuis B, Palstra RJ, Splinter E, Grosveld F, De Laat W. Looping and interaction between hypersensitive sites in the active b-globin locus. Molecular Cell. 2002;10:1453–1465. doi: 10.1016/s1097-2765(02)00781-5. [DOI] [PubMed] [Google Scholar]
  • 5.Baù D, Sanyal A, Lajoie BR, Capriotti E, Byron M, Lawrence JB, Dekker J, Marti-Renom Ma. The three-dimensional folding of the α-globin gene domain reveals formation of chromatin globules. Nature structural & molecular biology. 2011;18:107–114. doi: 10.1038/nsmb.1936. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Sanyal A, Lajoie BR, Jain G, Dekker J. The long-range interaction landscape of gene promoters. Nature. 2012;489:109–113. doi: 10.1038/nature11279. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Osborne CS, Chakalova L, Brown KE, Carter D, Horton A, Debrand E, Goyenechea B, Mitchell Ja, Lopes S, Reik W, et al. Active genes dynamically colocalize to shared sites of ongoing transcription. Nature genetics. 2004;36:1065–1071. doi: 10.1038/ng1423. [DOI] [PubMed] [Google Scholar]
  • 8.Schoenfelder S, Sugar R, Dimond A, Javierre B-m, Armstrong H, Mifsud B, Dimitrova E, Matheson L, Tavares-Cadete F, Furlan-Magaril M, et al. Polycomb repressive complex PRC1 spatially constrains the mouse embryonic stem cell genome. Nature Genetics. 2015;47:1179–1186. doi: 10.1038/ng.3393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Volpi EV, Chevret E, Jones T, Vatcheva R, Williamson J, Beck S, Campbell RD, Goldsworthy M, Powis SH, Ragoussis J, et al. Large-scale chromatin organization of the major histocompatibility complex and other regions of human chromosome 6 and its response to interferon in interphase nuclei. Journal of cell science. 2000;113:1565–1576. doi: 10.1242/jcs.113.9.1565. [DOI] [PubMed] [Google Scholar]
  • 10.Montefiori L, Wuerffel R, Roqueiro D, Lajoie B, Guo C, Gerasimova T, De S, Wood W, Becker KG, Dekker J, et al. Extremely Long-Range Chromatin Loops Link Topological Domains to Facilitate a Diverse Antibody Repertoire. Cell Reports. 2016;14:1–11. doi: 10.1016/j.celrep.2015.12.083. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Hakim O, Sung M-H, Voss TC, Splinter E, John S, Sabo PJ, Thurman RE, Stamatoyannopoulos JA, de Laat W, Hager GL. Diverse gene reprogramming events occur in the same spatial clusters of distal regulatory elements. Genome Research. 2011;21:697–706. doi: 10.1101/gr.111153.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Francis NJ, Kingston RE, Woodcock CL. Chromatin compaction by a polycomb group protein complex. Science. 2004;306:1574–1577. doi: 10.1126/science.1100576. [DOI] [PubMed] [Google Scholar]
  • 13.Boettiger AN, Bintu B, Moffitt JR, Wang S, Beliveau BJ, Fudenberg G, Imakaev M, Mirny La, Wu C-t, Zhuang X. Super-resolution imaging reveals distinct chromatin folding for different epigenetic states. Nature. 2016;529:418–422. doi: 10.1038/nature16496. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Kosak ST, Skok JA, Medina KL, Riblet R, Le Beau MM, Fisher AG, Singh H. Subnuclear Compartmentalization of Immunoglobulin Loci During Lymphocyte Development. Science. 2002;296:158–162. doi: 10.1126/science.1068768. [DOI] [PubMed] [Google Scholar]
  • 15.Manuelidis L, Borden J. Reproducible compartmentalization of individual chromosome domains in human CNS cells revealed by in situ hybridization and three-dimensional reconstruction. Chromosoma. 1988;96:397–410. doi: 10.1007/BF00303033. [DOI] [PubMed] [Google Scholar]
  • 16.Rieder D, Ploner C, Krogsdam AM, Stocker G, Fischer M, Scheideler M, Dani C, Amri E-Z, Müller WG, McNally JG, et al. Co-expressed genes prepositioned in spatial neighborhoods stochastically associate with SC35 speckles and RNA polymerase II factories. Cellular and Molecular Life Sciences. 2013;71:1741–1759. doi: 10.1007/s00018-013-1465-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Wang Q, Sawyer IA, Sung M-H, Sturgill D, Shevtsov SP, Pegoraro G, Hakim O, Baek S, Hager GL, Dundr M. Cajal bodies are linked to genome conformation. Nature Communications. 2016;7:10966. doi: 10.1038/ncomms10966. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Cremer T, Cremer M, Dietzel S, Müller S, Solovei I, Fakan S. Chromosome territories - a functional nuclear landscape. Current Opinion in Cell Biology. 2006;18:307–316. doi: 10.1016/j.ceb.2006.04.007. [DOI] [PubMed] [Google Scholar]
  • 19.Bartova E, Kozubek S, Jirsova P, Kozubek M, Gajova H, Lukasova E, Skalnikova M, Ganova A, Koutna I, Hausmann M. Nuclear structure and gene activity in human differentiated cells. J Struct Biol. 2002;139 doi: 10.1016/s1047-8477(02)00560-9. [DOI] [PubMed] [Google Scholar]
  • 20.Baxter J, Merkenschlager M, Fisher AG. Nuclear organisation and gene expression. Current Opinion in Cell Biology. 2002;14:372–376. doi: 10.1016/s0955-0674(02)00339-3. [DOI] [PubMed] [Google Scholar]
  • 21.Noordermeer D, de Wit E, Klous P, van de Werken H, Simonis M, Lopez-Jones M, Eussen B, de Klein A, Singer RH, de Laat W. Variegated gene expression caused by cell-specific long-range DNA interactions. Nature cell biology. 2011;13:944–951. doi: 10.1038/ncb2278. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Wright JB, Brown SJ, Cole MD. Upregulation of c-MYC in cis through a large chromatin loop linked to a cancer risk-associated single-nucleotide polymorphism in colorectal cancer cells. Molecular and cellular biology. 2010;30:1411–1420. doi: 10.1128/MCB.01384-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Meaburn KJ, Agunloye O, Devine M, Leshner M, Roloff GW, True LD, Misteli T. Tissue-of-origin-specific gene repositioning in breast and prostate cancer. Histochemistry and Cell Biology. 2016;145:433–446. doi: 10.1007/s00418-015-1401-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Lieberman-Aiden E, Van Berkum NL, Williams L, Imakaev M, Ragoczy T, Telling A, Amit I, Lajoie BR, Sabo PJ, Dorschnew MO, et al. Comprehensive Mapping of Long-Range Interactions Reveals Folding Principles of the Human Genome. Science. 2009;326(5950):289–293. doi: 10.1126/science.1181369. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Hibbs AR, MacDonald G, Garsha K. Practical Confocal Microscopy. In: Pawley BJ, editor. Handbook Of Biological Confocal Microscopy. Springer; US, Boston, MA: 2006. pp. 650–671. [DOI] [Google Scholar]
  • 26.North AJ. Seeing is believing? A beginners’ guide to practical pitfalls in image acquisition. The Journal of Cell Biology. 2006;172:9–18. doi: 10.1083/jcb.200507103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Pawley JB. Fundamental Limits in Confocal Microscopy. In: Pawley BJ, editor. Handbook Of Biological Confocal Microscopy. Springer; US, Boston, MA: 2006. pp. 20–42. [DOI] [Google Scholar]
  • 28.Carlini AR, Wilson T. The Role Of Pinhole Size And Position In Confocal Imaging Systems. 1987;0809:97–100. [Google Scholar]
  • 29.Benanti JA, Galloway DA. Normal Human Fibroblasts Are Resistant to RAS-Induced Senesence. Mol Cell Biol. 2004;24:2842–2852. doi: 10.1128/MCB.24.7.2842-2852.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Shachar S, Voss TC, Pegoraro G, Sciascia N, Misteli T. Identification of Gene Positioning Factors Using High-Throughput Imaging Mapping. 2015 doi: 10.1016/j.cell.2015.07.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Shachar S, Pegoraro G, Misteli T. HIPMap: A High-Throughput Imaging Method for Mapping Spatial Gene Positions. Cold Spring Harbor symposia on quantitative biology. 2015 doi: 10.1101/sqb.2015.80.027417. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Meaburn KJ. Fluorescence in situ hybridization on 3D cultures of tumor cells. Methods Mol Biol. 2010;659:323–36. doi: 10.1007/978-1-60761-789-1_25. [DOI] [PubMed] [Google Scholar]
  • 33.Hakim O, Resch W, Yamane A, Klein I, Kieffer-Kwon K-R, Jankovic M, Oliveira T, Bothmer A, Voss TC, Ansarah-Sobrinho C, et al. DNA damage defines sites of recurrent chromosomal translocations in B lymphocytes. Nature. 2012;484:69–74. doi: 10.1038/nature10909. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Roukos V, Burman B, Misteli T. The cellular etiology of chromosome translocations. Current opinion in cell biology. 2013;25:357–364. doi: 10.1016/j.ceb.2013.02.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Burman B, Misteli T, Pegoraro G. Quantitative detection of rare interphase chromosome breaks and translocations by high-throughput imaging. Genome Biology. 2015;16:146–146. doi: 10.1186/s13059-015-0718-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing; Vienna, Austria: 2015. [Google Scholar]
  • 37.French J. SpatialTools: Tools for Spatial Data Analysis R package version 1.0.2. 2015 https://CRAN.R-project.org/package=SpatialTools.
  • 38.Wickham H. The Split-Apply-Combine Strategy for Data Analysis. Journal of Statistical Software. 2011;40:1–29. [Google Scholar]
  • 39.Wickham H, Francois R. dplyr: A Grammar of Data Manipulation R package version 0.4.3 2015 [Google Scholar]
  • 40.Wickham H. ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag; New York: 2009. [Google Scholar]
  • 41.Dowle M, Srinivasan A, Short T, Liangolou S, Saporta R, Antonyan E. data.table: Extension of Data.frame R package version 1.9.6 2015 [Google Scholar]
  • 42.Xie Y. knitr: A Comprehensive Tool for Reproducible Research in R. In: Stodden V, et al., editors. Implementing Reprodubible Computational Research. Chapman and Hall/CRC; 2014. [Google Scholar]
  • 43.Wickham H. stringr: Simple, Consistent Wrappers for Common String Operations R package version 1.0.0 2015 [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1
2
3

Data Availability Statement

The data generated has been uploaded to dryad for public sharing (doi: 10.5061/dryad.236j7). This contains the original .flex image files generated by the PerkinElmer Opera QEHS high-throughput microscope, the PerkinElmer Acapella script and parameter files used for image analysis, the .txt single-object level image analysis results files generated by Acapella, the .rmd files containing the R code used to calculate 2D and 3D distances and perform subsequent analyses, and the summary experiment-level spot-to-spot distance tables for each experiment generated by R. We have included the following files:

  1. AcapellaScriptsAndParameters.zip: This file contains the Acapella scripts and paramaters used to analyze each of the experiments. The files are named relative to the image dataset used.

  2. Image file folders: These files are named according to the date of acquisition and the z-stack resolution, in the format “BC_YYMMDD_Res.zip”. Each folder contains all the .flex files acquired in that experiment. The experiments used to generate each figure panel are as follows:
    1. Figure 2: All experiments were used to generate panel A.
    2. Figure 3: BC_160525_300.zip, BC_160531_300.zip, and BC_160614_300.zip were used.
    3. Figure 4: BC_160524_1000.zip and BC_160525_300.zip were used.
    4. Figure S3: BC_160614_300.zip was used.
  3. SpotPositionMeasurements.zip: This file contains the output of the Acapella scripts (.txt files), the experimental metadata assigning probes to wells, and the summarized per-spot-pair distance measurements generated by R. They are organized into folders by experiment.

  4. RmdScripts.zip: R markdown scripts used to calculate spot distances and perform subsequent analyses.

Figure 2.

Figure 2

2D vs 3D distances in nuclei of multiple shapes. A: Scatterplot showing 2D vs 3D distance for minimal distances between FISH signals in fibroblasts. B–E: Scatterplots showing modeled relationship between 2D and 3D distances at random points in volumes of various sizes.

Figure 3.

Figure 3

Co-localization proportions in 2D and 3D. A: Scatterplot showing co-localization proportion based on 2D distances vs. co-localization proportion based on 3D distances. Distance cutoff color coded. B: Ideogram showing probe assignment into triplets. C: Bar graph showing proportion of green spots interacting with red and far-red spots simultaneously. Orange: observed; blue: expected.

Figure 4.

Figure 4

Various observed artifacts in 3D. A: Ideogram showing probes used. B: Histograms of distance distributions for minimal distances in fibroblasts for various probe sets with various average distances. Discontinuities due to lowered resolution in z are visible in 3D distance distributions with 1 μm z-slices but not 2D distance distributions or 3D distance distributions with 300 nm z-slices. C: Discontinuities due to resolution in x-y are visible at very short distances with very small binning regardless of the method for calculating distances.

RESOURCES