SUMMARY
Cell fate change involves significant genome reorganization, including change in replication timing, but how these changes are related to genetic variation has not been examined. To study how change in replication timing that occurs during reprogramming impacts the copy number variation (CNV) landscape, we generated genome-wide replication timing profiles of induced pluripotent stem cells (iPSCs) and their parental fibroblasts. A significant portion of the genome changes replication timing as a result of reprogramming, indicative of overall genome reorganization. We found that early and late replicating domains in iPSCs are differentially affected by copy number gains and losses, and that in particular CNV gains accumulate in regions of the genome that change to earlier replication during the reprogramming process. This differential relationship was present irrespective of reprogramming method. Overall, our findings reveal a functional association between reorganization of replication timing and the CNV landscape that emerges during reprogramming.
INTRODUCTION
The genome is topologically organized in three-dimensional space within the nucleus and is highly dynamic as cell fate changes during normal development, as well as in disease states, such as cancer (Ryba et al., 2012). The significant cell fate changes that occur during embryonic stem cell (ESC) differentiation and reprogramming of somatic cells to pluripotency also involve genome-wide resetting of the higher order chromatin structure (Watanabe et al., 2013). In both scenarios, genome reorganization precedes gene expression changes, suggesting a potential causal relationship (Apostolou et al., 2013; Phillips-Cremins et al., 2013; Wei et al., 2013; Zhang et al., 2013).
Higher order genome organization has been shown to significantly influence the distribution of genomic aberrations in both immortalized somatic cells and cancer cells, but the underlying mechanism remains elusive (De and Michor, 2011; Fudenberg et al., 2011; Koren et al.; Schuster-Bockler and Lehner, 2012). One method of mapping genomic organization is by segmenting the genome into domains based on replication timing, which occurs in a tightly regulated cell type-specific manner (Gilbert et al., 2010). Domains with similar replication timing tend to co-localize within the nucleus and as a result, genome organization defined by replication timing overlaps with other methods of defining chromatin topology including DNase I hypersensitivity profiles, various epigenetic markers, and genome-wide chromatin interaction mapping (e.g. Hi-C) (Dixon et al., 2012; Hu et al., 2013; Ryba et al., 2010). Several studies have revealed that the cancer mutational landscape is closely associated with replication timing by demonstrating that CNV boundaries tend to have similar replication timing and gains and losses distribute differentially with respect to replication timing (De and Michor, 2011; Liu et al., 2013; Schuster-Bockler and Lehner, 2012). These studies share one important limitation: the replication timing maps were not generated from the same cell types in which the CNVs were analyzed. This limitation is significant because cellular identity is closely coupled to genome organization; in fact, only half of the genome has stable replication timing across all cell types, with the remainder being organized in a highly cell type-specific manner (Hansen et al., 2010; Hiratani et al., 2010; Hiratani et al., 2008). Thus, when an unmatched replication timing profile is used, up to 50% the genome may not be accounted for during the analysis due to the cell type-specificity of genome organization.
How cell fate change and its associated genome reorganization are related to genome variation has not been examined. We overcome the limitation of unmatched replication timing maps by generating replication timing profiles of both iPSCs and their parental fibroblasts in order to explore the link between the CNV landscape and genome reorganization due to cell fate change. We show that nuclear reprogramming results in dramatic replication timing reorganization which influences the observed CNV landscape in iPSCs.
RESULTS
CNVs reside within replication timing domains
Primary dermal fibroblasts from one healthy volunteer (WT) were reprogrammed using retroviral transduction of the standard Yamanaka factors to generate multiple iPSC lines (Takahashi et al., 2007) (Figure S1). Replication timing profiles were generated from an iPSC line and the parental fibroblasts. Newly replicated DNA from early and late S-phase was differentially labeled and hybridized to a whole-genome oligonucleotide microarray. The ratio of the abundance of each probe in the early vs. late fractions generates a replication profile, which reveals clear demarcation between megabase-sized regions of coordinated replication called replication domains (Figure 1A). Computational segmentation algorithms can be used to define the replication domains (Ryba et al., 2011). The iPSC profile we generated is identical to previously published profiles of both hESCs and iPSCs, reflecting human pluripotent stem cell-specific genome organization (Figure S1D) (Ryba et al., 2010).
Figure 1. iPSC CNVs reside within replication timing domains and are distributed non-randomly.
(A) Newly replicated DNA from early and late S-phase are differentially labeled and hybridized to a whole-genome oligonucleotide microarray to produce a replication timing profile. Loess smoothed replication timing ratio (filled gray) of chromosome 1 first 50Mb is shown as an example. Segmentation algorithms are used to define replication domains (black segments) as regions with similar replication timing ratio. Timing transition regions (TTRs) are defined as genomic regions with significant rate of replication timing ratio change (purple). TTRs cover ~1/3 of the genome. CNVs (green segments) are plotted above of the replication profile and they can fall either entirely within individual domains or cross boundaries and involve more than one domain. A CNV may or may not involve a TTR.
(B) Permutation analysis demonstrates CNVs are significantly more likely to reside within individual replication domains than expected by chance in both iPSCs and parental fibroblasts. Boxes display the percentages obtained from 1000 simulations and red diamonds represent the actual percentages.
(C) TTRs are not enriched within CNVs. The actual amount (black) of TTR and non-TTR regions affected by CNVs, compared to expected numbers (gray) based on random distribution, are shown with p-values (Fisher’s exact test) above each plot. See also Figure S1.
To map the corresponding genome variation landscape of iPSCs, we analyzed the fibroblast and low passage (p6) iPSC lines using high-resolution Affymetrix SNP Array 6.0 (Tables S1 and S2). CNVs calls were validated using quantitative PCR with overall validation rate of 91% (Supplementary Text). Raw data from the replication timing profiles also independently validated all the homozygous losses detected by SNP arrays in the same cell lines (Figure S1E) (Ryba et al., 2012). Most CNVs of iPSCs were not detected in the parental fibroblasts and are heretofore referred to as ‘iPSC-manifested’ (Table S1 and S2). iPSC-manifested CNVs in low passage iPSCs could represent clonal expansion of low-frequency genetic aberrations in the tissue of origin or de novo mutations during the reprogramming process (Liang and Zhang, 2013).
We found that while two thirds of CNVs detected in iPSCs involve genic regions (including exons and introns), the affected genes do not fall into any clear functional groups by Gene Ontology analysis (Table S3). The CNV affected genes did not show any particular relationship to pluripotency, nor did we find any enrichment of tumor suppression genes or oncogenes. Our data are similar to other published reports that failed to demonstrate consistent functional consequences due to genomic aberrations in iPSCs (Amps et al., 2011; Hussein et al., 2011; Martins-Taylor et al., 2011).
We mapped the iPSC CNVs to the newly generated replication timing profiles and found that more than 95% of CNVs are contained within individual replication domains, irrespective of whether the iPSC or the parental fibroblast profile was used. Permutation analysis using size-controlled random CNV lists demonstrates that the detected CNVs are significantly more likely to reside completely within individual replication domains and do not span domain boundaries than would be expected to occur by chance (Figure 1B).
Timing transition regions (TTRs), defined by genomic regions with a significant rate of replication timing ratio change, have different replication kinetics compared to center of the replication domain. TTRs likely represent sequences vulnerable to DNA damage (Labib and Hodgson, 2007; Watanabe et al., 2002; Watanabe et al., 2004). However, we did not find an enrichment of TTR sequence in CNV affected genomic regions in iPSCs (Figure 1C).
Impact of replication timing dynamics on the CNV landscape
Reprogramming and ESC differentiation involves global changes in chromosome structure that include temporal reorganization of replication domains and spatial reorganization of the genome; in fact, one half of the genome changes organization at some stage during development (Hiratani et al., 2008; Orkin and Hochedlinger, 2011; Ryba et al., 2010). Comparison of fibroblast and iPSC replication timing profiles revealed that during reprogramming 40% of the genome changes replication timing, as defined by a greater than 0.5 difference in the early/late ratio between fibroblast and iPSC profiles (Figures 2A). This cutoff corresponds to the amount of time (about 1 hour) required to complete a replication factory (Leonhardt et al., 2000) and is the same cutoff for significance for transcription (D.M. Gilbert unpublished data). This change is nearly double in magnitude compared to the change observed when ESCs are differentiated into NPCs (Hiratani et al., 2008), reflecting the extensive genome reorganization during nuclear reprogramming.
Figure 2. Reorganization of replication timing program during reprogramming affects CNV distribution.
(A) Comparison between replication timing profiles of iPSCs (black) and parental fibroblasts (gray) of chr 7 120–143 Mb shows changes during reprogramming. Pink: regions changing to earlier replication; greater than +0.5 difference in replication timing ratio between fibroblast and iPSC. Blue: regions changing to later replication; greater than −0.5 difference in replication timing ratio between fibroblast and iPSC.
(B) Kernel density plots showing the replication timing distribution of CNV-affected regions in fibroblasts (left) and in iPSCs (right). CNV gains have earlier replication timing in iPSCs than fibroblasts. CNV losses are late replicating in both iPSCs and fibroblasts. p-values from the Kolmogorov-Smirnov (K-S) tests between fibroblast and iPSC distribution are shown.
(C) Replication timing values of CNV gains (top) or losses (bottom) were determined using the profiles of both fibroblast and iPSC and plotted as a scatter plot with density smoothed genome-wide replication timing values (RTV) of over 620000 probes serving as background. The gray central region represents ~60% of the genome with no change in replication timing defined as iPSC RTV - fibroblast RTV < +/− 0.5. Regions with greater than +0.5 change (iPSC RTV – fibroblast RTV) are classified as “earlier in iPSC” (red) and those with less than −0.5 change (iPSC RTV – fibroblast RTV) are classified as “later in iPSCs” (blue). The bar graphs depict the distribution of CNV gains or losses compared to the genome-wide distribution of RTVs among these regions. Upward pointing arrow heads indicate statistically significant enrichment of CNVs in the region, while downward pointing arrow heads indicate statistically significant depletion of CNVs compared to the genome distribution in corresponding regions based on binomial test. Number of arrow heads correlates with p-value significance (see Figure S2).
To test how genome reorganization influences the genome variation landscape, we determined the fibroblast and iPSC replication timing values of genomic loci affected by iPSC-manifested CNVs using cell type-specific replication timing profiles. The distribution of replication timing values of CNV gain loci is statistically different in fibroblasts and iPSCs; they are predominantly late replicating in fibroblast, but early replicating in iPSCs. In contrast, loci affected by CNV losses are generally late replicating in both cell types. (Figure 2B).
To better understand the dynamics of the CNV landscape, a genome-wide representation of replication timing dynamics during reprogramming was generated by plotting fibroblast versus iPSC replication timing values for all 600K probes on the array (Figure 2C). In this plot, the gray diagonal region represents genomic loci with no change in replication timing during reprogramming, defined as a less than 0.5 difference in replication timing ratio between fibroblast and iPSCs. The portions of the genome that replicate either earlier (red) or later (blue) in iPSCs than in fibroblasts fall outside the diagonal. When the replication timing values of CNVs were plotted on this genome background, we show that CNV gains are enriched in the genome compartment that changes to earlier replication during reprogramming. This scatter plot presentation reveals the replication timing dynamics underlying the distribution shift of gains seen in Figure 2B. In contrast, the majority of losses reside in late replicating regions in both fibroblasts and iPSCs (Figure 2C), consistent with the absence of a shift in the CNV loss distribution (Figure 2B). Together these findings further support the strong relationship between replication timing and genome variation by demonstrating that the distribution of CNVs observed in low passage iPSCs is influenced by replication timing reorganization associated with cell fate change.
Human iPSC genomic variation has been studied using a variety of approaches. Hussein et al. and Laurent et al. generated iPSC using the same retroviral factors as we did here, but while Hussein et al. also used the Affymetrix 6.0 SNP array, Laurent et al. used the Illumina Omniquad v.1 SNP array (Hussein et al., 2011; Laurent et al., 2011). More recently, Abyzov et al. used whole genome sequencing to identify CNVs in retroviral reprogrammed iPSC lines (Abyzov et al., 2012). None of these studies considered the role of higher order chromatin organization on genomic variation and their datasets serve as de facto controls to exclude potential lab-, CNV detection platform-, or parental fibroblast-specific effects in our experiments. Therefore, we analyzed these datasets relative to our newly generated replication timing profiles. To maintain consistency with our experimental approach, we limited the analysis to data sets of low passage iPSCs generated from dermal fibroblasts. An overall similar trend was seen using datasets from different laboratories: CNV distribution is non-random, with gains predominating in regions changing to earlier replication in iPSCs (Figure S2B).
Non-random, differential CNV distribution is a generalized feature of reprogramming
To exclude the possibility that the non-random replication timing distribution of the CNVs is due to a direct effect of the retroviral vectors (RV) used for reprogramming, we applied our analysis to iPSCs generated using other reprogramming methods. To this end, we generated iPSCs using episomal (EP) factors that do not integrate into the genome. In low passage EP iPSC, like RV iPSCs, CNV gains are significantly enriched in regions of the genome whose replication timing changed to earlier during reprogramming. Again, we found that the majority of CNV losses occur at regions that do not significantly change replication timing during reprogramming (Figure 3A and S3).
Figure 3. Non-random, differential CNV distribution is a generalized feature of reprogramming.
(A) Replication timing dynamics of CNVs from episomal reprogrammed iPSCs. iPSC-manifested CNVs in low passage iPSCs are presented in the same format as in Figure 2D. Matched BJ fibroblast replication timing profile is used. p-values in Figure S3A.
(B) Kernel density plots show the replication timing distribution of CNV gains and losses detected in iPSCs derived using retroviral (RV, purple), piggyBac transposon (PB, blue), or episomal (EP, green) vectors. RV and PB iPSCs were derived from the same parental lines (Hussein et al. 2011). EP iPSCs were reprogrammed from human foreskin fibroblast line BJ. Gains enrich in regions that change to earlier replication during reprogramming in all cases. (p-values from K-S tests are shown in Figure S3B.)
Next, we analyzed Hussein et al.’s data on sister iPSC lines derived from the same parental fibroblast line (HFF) using either retroviruses or piggyBac transposons (PB). Distribution plots similar to Figure 2B from multiple datasets were directly compared to each other (Figure 3B and S3). Gains and losses from these PB and RV iPSCs had a similar non-random distribution with respect to replication timing dynamics during reprogramming: gains were likely to have earlier replication in iPSCs compared to fibroblasts, while losses maintain late replication. Together, these findings demonstrate that the trend for non-random, differential CNV gain and loss accumulation as a function of change in replication timing is a generalized feature of reprogramming.
DISCUSSION
Improved mapping and understanding of the higher-order genome organization is providing a clearer picture of how genome functions, such as gene expression and genome integrity, are related to its spatial organization (Cavalli and Misteli, 2013). A static association between genome organization and genome aberration has been described, but how the mutagenic landscape is affected by changes in genome organization has not been studied. Here, by leveraging reprogramming to alter cell fate and analyzing the genome aberration landscape before and after using cell-type matched genome-wide replication timing profiles, we show that the CNV landscape of iPSCs is spatially constrained by chromatin organization, and the distribution of CNVs is influenced by changes in replication timing due to reprogramming.
Some studies have reported that reprogramming results in genetic aberrations such as point mutations and CNVs in human iPSCs (Gore et al., 2011; Hussein et al., 2011; Laurent et al., 2011; Martins-Taylor et al., 2011; Mayshar et al., 2010). However, more recently others have demonstrated that the majority of iPSC CNVs pre-existed in cells within the parental population (Abyzov et al., 2012; Cheng et al., 2012; Quinlan et al., 2011). Consistent with these reports, we found shared iPSC-manifested CNVs among the iPSC clones in each of our reprogramming experiments, albeit at low frequency. Interestingly, we also found that about 50% iPSC-manifested iPSC CNVs have overlap with known CNVs in the Database of Genomic Variation (Macdonald et al., 2013) and these likely represent previously existing low frequency CNVs in the starting cell population.
Ultimately, the accumulation of genomic changes in any genome will result from a combination of mutation rates, repair rates and selection. In a completely stochastic process, akin to genetic drift, CNV gains and losses would be distributed randomly. Our results show that this is not the case, because CNVs are non-randomly distributed in iPSCs as a function of replication timing change. iPSC-manifested gains predominate in loci that change to earlier replication during reprogramming, whereas losses occur in loci that remain late replicating. The mechanism and timing of this relationship is not immediately apparent but it is conceivable that both pre-existing and de novo CNVs are subject to the influence of the extensive genome reorganization (Figure 4).
Figure 4. The distribution of genomic variations in human iPSCs is related to replication timing reorganization during reprogramming.
Individual cells in the starting population have a unique set of pre-existing background CNVs (CNV gains: circled +, CNV losses: circled −), which can be captured by clonal nature of reprogramming. IPSC-manifested mutations may also lead to de novo CNVs that occur during the reprogramming process (non-circled +, −). Pre-existing and de novo mutations may contribute to reprogramming fitness. The extensive genomic and epigenomic changes associated with reprogramming can in part be reflected by changes in replication timing (color scheme: gray – no change during reprogramming, red – earlier in iPSCs, blue – later in iPSCs). Early and late replicating domains in iPSCs are differentially affected by copy number gains and losses, during reprogramming. Specifically, CNV gains accumulate in regions of the genome that change to early replication during the reprogramming process. In contrast, CNV losses tend be late replicating in both fibroblasts and iPSCs. This specific CNV distribution, like individual mutations, contributes to reprogramming fitness.
Switching to earlier replication involves epigenetic resetting that results in increased active chromatin modifications, such as histone acetylation (Hiratani et al., 2008; Schwaiger et al., 2009) and an overall open chromatin state. These regions contain genes essential to the cell’s new functions, and are therefore more likely to be under increased selection during reprogramming. While this functional selection is plausible, it is difficult to reconcile with the inability to link CNVs in iPSCs to gene expression changes. Neither our gene expression analysis, nor the published literature, has consistently identified gene expression changes that correlate with genomic aberration to prove this mechanism is dominant (Amps et al., 2011; Hussein et al., 2011; Martins-Taylor et al., 2011). In fact, in a recent analysis of 125 human pluripotent stem cell lines, the most commonly identified aberration was a minimal amplicon at 20q11.21 (affecting >20% of lines), yet there was no correlation between the presence of this amplicon and the expression of the genes it contains (Amps et al., 2011).
Variable efficiency of DDR pathways in different chromatin compartments could contribute to the differential distribution of gains and losses. It is possible that DNA damage occurring in early S phase is more accurately repaired due to higher fidelity repair mechanisms, including copying from the correct template (Chang and Cimprich, 2009). Not only is DDR more efficient in open chromatin, but also heterochromatin appears to retard repair processes and could potentially lead to accumulation of deletions in the late replicating compartment (Goodarzi et al., 2008; Murga et al., 2007; Rube et al., 2011). Another mechanism to explain differential distribution of de novo mutations is that replication timing reorganization could adversely affect the interaction between replication and transcription machineries, leading to replication and transcription collisions, and consequently to DSBs and CNVs (Helmrich et al., 2013).
Interestingly, we noticed subtle differences in CNV occurrence among the three reprogramming methods. In contrast to RV iPSCs, PB and EP iPSCs overall had significantly more CNV gains compared to losses (Figure 3B), suggesting there may be a differential impact of distinct reprogramming methods on different genomic compartments and the iPSC CNV landscape. This finding warrants further investigation and is likely intertwined in the complex interplay of various other internal and external genomic stresses, the cell’s DNA damage response (DDR) network, and selective pressure exerted by the cell’s environment.
In sum, genome-wide analysis of iPSC CNVs using the replication timing profiles of both fibroblast and iPSCs demonstrates an important functional association between genome variations and replication timing reorganization resulting from reprogramming. We have focused on CNVs and replication timing during reprogramming, but in the future, genomic aberrations identified by other techniques (e.g. whole genome sequencing) as well as epigenomic changes can be similarly mapped to dynamic genome organization landscapes. Understanding the link between genome organization and function will be enhanced by improved biochemical techniques and computational approaches to allow higher throughput and improved resolution (Misteli, 2012; Wang et al., 2012). Because iPSC technology readily allows the manipulation of cell fate change, it will serve as a powerful experimental platform to further study the relationship between genome organization and function.
EXPERIMENTAL PROCEDURES
Generation, characterization and genome-wide analyses of iPSCs
WT fibroblasts hFib2 were reprogrammed with retroviral vectors as described (Park et al., 2008). For episomal vector mediated reprogramming, vectors were obtained from Addgene (pCXLE-hUL, pCXLE-hSK,pCXLE-hOCT3/4) and introduced into BJ fibroblasts by electroporation and reprogrammed colonies picked and expanded. Immunofluorescent staining of pluripotency markers Oct4, Nanog, SSEA-4, Tra-1–60, Tra-1–81 and teratoma formation assays demonstrated the pluripotency of all the established iPSC lines. Genome-wide expression profiling with Affymetrix Human Genome U133 Plus 2.0 microarrays and CNV analysis with Affymetric SNP 6.0 arrays were performed according to manufacturer’s instructions at the Genotyping and Microarray Center of Coriell Institute for Medical Research. Replication timing profiling was performed and analyzed as Ryba et al., 2011. All microarray data have been deposited with the NCBI Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo/) under accession number GSE46848.
Computational analyses
To statistically analyze if CNVs fall within replication timing domains (Figure 1B), we performed permutation analysis. For each simulation, we generated a new list of CNVs by assuming uniform distribution of CNV start positions along chromosomes (omitting centromeric regions) while keeping their size distribution. For each new list, we mapped to the replication domain profile and obtain the percent of new CNVs falling within domains. We repeated the permutation 1,000 times. The permutation p-value is defined as the number of permutations yielding a higher percentage than the actual experimental result. In Figures 2 and 3, the Kolmogorov-Smirnov (K-S) test was used to detect differences between distributions. Binomial tests were used to assess the statistical significance of CNV enrichment in the genomic regions. The null hypothesis is random distribution, i.e. no CNV gain or loss enrichment in the regions of interest.
Supplementary Material
Highlights.
Reprogramming results in extensive replication timing reorganization.
iPSC gains and losses differentially affect early and late replicating domains.
iPSC gains enrich in regions that change to earlier replication during reprogramming.
ACKNOWLEDGEMENTS
We thank R. Zhao, S. Agarwal, P. Cahan, and S. Sunyaev for critical reading and discussion of the manuscript; and T. Ryba for computational help. This work was supported by NICHD grant R00HD061981 to P.H.L., NIH grant 5R01GM080625 to J.S.L, R21CA161666 and P01GM085354 D.M.G. J.J.C. and H.L. are supported by Howard Hughes Medical Institute, SysCODE (Systems-based Consortium for Organ Design & Engineering), and NIH grant RL1DE019021. H.L. is supported by Mayo Clinic Center for Individualized Medicine and Mayo Clinic Center for Regenerative Medicine.
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
REFERENCES
- Abyzov A, Mariani J, Palejev D, Zhang Y, Haney MS, Tomasini L, Ferrandino AF, Rosenberg Belmaker LA, Szekely A, Wilson M, et al. Somatic copy number mosaicism in human skin revealed by induced pluripotent stem cells. Nature. 2012;492:438–442. doi: 10.1038/nature11629. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Amps K, Andrews PW, Anyfantis G, Armstrong L, Avery S, Baharvand H, Baker J, Baker D, Munoz MB, Beil S, et al. Screening ethnically diverse human embryonic stem cells identifies a chromosome 20 minimal amplicon conferring growth advantage. Nature Biotechnology. 2011;29:1132–1144. doi: 10.1038/nbt.2051. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Apostolou E, Ferrari F, Walsh RM, Bar-Nur O, Stadtfeld M, Cheloufi S, Stuart HT, Polo JM, Ohsumi TK, Borowsky ML, et al. Genome-wide Chromatin Interactions of the Nanog Locus in Pluripotency, Differentiation, and Reprogramming. Cell Stem Cell. 2013 doi: 10.1016/j.stem.2013.04.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cavalli G, Misteli T. Functional implications of genome topology. Nat Struct Mol Biol. 2013;20:290–299. doi: 10.1038/nsmb.2474. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chang DJ, Cimprich KA. DNA damage tolerance: when it's OK to make mistakes. Nat Chem Biol. 2009;5:82–90. doi: 10.1038/nchembio.139. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cheng L, Hansen NF, Zhao L, Du Y, Zou C, Donovan FX, Chou BK, Zhou G, Li S, Dowey SN, et al. Low incidence of DNA sequence variation in human induced pluripotent stem cells generated by nonintegrating plasmid expression. Cell Stem Cell. 2012;10:337–344. doi: 10.1016/j.stem.2012.01.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- De S, Michor F. DNA replication timing and long-range DNA interactions predict mutational landscapes of cancer genomes. Nat Biotechnol. 2011;29:1103–1108. doi: 10.1038/nbt.2030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dixon JR, Selvaraj S, Yue F, Kim A, Li Y, Shen Y, Hu M, Liu JS, Ren B. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature. 2012;485:376–380. doi: 10.1038/nature11082. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fudenberg G, Getz G, Meyerson M, Mirny LA. High order chromatin architecture shapes the landscape of chromosomal alterations in cancer. Nat Biotechnol. 2011;29:1109–1113. doi: 10.1038/nbt.2049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gilbert DM, Takebayashi SI, Ryba T, Lu J, Pope BD, Wilson KA, Hiratani I. Space and time in the nucleus: developmental control of replication timing and chromosome architecture. Cold Spring Harb Symp Quant Biol. 2010;75:143–153. doi: 10.1101/sqb.2010.75.011. [DOI] [PubMed] [Google Scholar]
- Goodarzi AA, Noon AT, Deckbar D, Ziv Y, Shiloh Y, Lobrich M, Jeggo PA. ATM signaling facilitates repair of DNA double-strand breaks associated with heterochromatin. Mol Cell. 2008;31:167–177. doi: 10.1016/j.molcel.2008.05.017. [DOI] [PubMed] [Google Scholar]
- Gore A, Li Z, Fung HL, Young JE, Agarwal S, Antosiewicz-Bourget J, Canto I, Giorgetti A, Israel MA, Kiskinis E, et al. Somatic coding mutations in human induced pluripotent stem cells. Nature. 2011;471:63–67. doi: 10.1038/nature09805. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hansen RS, Thomas S, Sandstrom R, Canfield TK, Thurman RE, Weaver M, Dorschner MO, Gartler SM, Stamatoyannopoulos JA. Sequencing newly replicated DNA reveals widespread plasticity in human replication timing. Proc Natl Acad Sci U S A. 2010;107:139–144. doi: 10.1073/pnas.0912402107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Helmrich A, Ballarino M, Nudler E, Tora L. Transcription-replication encounters, consequences and genomic instability. Nat Struct Mol Biol. 2013;20:412–418. doi: 10.1038/nsmb.2543. [DOI] [PubMed] [Google Scholar]
- Hiratani I, Ryba T, Itoh M, Rathjen J, Kulik M, Papp B, Fussner E, Bazett-Jones DP, Plath K, Dalton S, et al. Genome-wide dynamics of replication timing revealed by in vitro models of mouse embryogenesis. Genome Res. 2010;20:155–169. doi: 10.1101/gr.099796.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hiratani I, Ryba T, Itoh M, Yokochi T, Schwaiger M, Chang CW, Lyou Y, Townes TM, Schubeler D, Gilbert DM. Global reorganization of replication domains during embryonic stem cell differentiation. PLoS Biol. 2008;6:e245. doi: 10.1371/journal.pbio.0060245. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hu M, Deng K, Qin Z, Dixon J, Selvaraj S, Fang J, Ren B, Liu JS. Bayesian inference of spatial organizations of chromosomes. PLoS Comput Biol. 2013;9:e1002893. doi: 10.1371/journal.pcbi.1002893. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hussein SM, Batada NN, Vuoristo S, Ching RW, Autio R, Narva E, Ng S, Sourour M, Hamalainen R, Olsson C, et al. Copy number variation and selection during reprogramming to pluripotency. Nature. 2011;471:58–62. doi: 10.1038/nature09871. [DOI] [PubMed] [Google Scholar]
- Koren A, Polak P, Nemesh J, Michaelson JJ, Sebat J, Sunyaev SR, McCarroll SA. Differential relationship of DNA replication timing to different forms of human mutation and variation. Am J Hum Genet. 2012;91:1033–1040. doi: 10.1016/j.ajhg.2012.10.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Labib K, Hodgson B. Replication fork barriers: pausing for a break or stalling for time? EMBO Rep. 2007;8:346–353. doi: 10.1038/sj.embor.7400940. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Laurent LC, Ulitsky I, Slavin I, Tran H, Schork A, Morey R, Lynch C, Harness JV, Lee S, Barrero MJ, et al. Dynamic changes in the copy number of pluripotency and cell proliferation genes in human ESCs and iPSCs during reprogramming and time in culture. Cell Stem Cell. 2011;8:106–118. doi: 10.1016/j.stem.2010.12.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Leonhardt H, Rahn HP, Weinzierl P, Sporbert A, Cremer T, Zink D, Cardoso MC. Dynamics of DNA replication factories in living cells. The Journal of cell biology. 2000;149:271–280. doi: 10.1083/jcb.149.2.271. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liang G, Zhang Y. Genetic and Epigenetic Variations in iPSCs: Potential Causes and Implications for Application. Cell Stem Cell. 2013;13:149–159. doi: 10.1016/j.stem.2013.07.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu L, De S, Michor F. DNA replication timing and higher-order nuclear organization determine single-nucleotide substitution patterns in cancer genomes. Nat Commun. 2013;4:1502. doi: 10.1038/ncomms2502. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Macdonald JR, Ziman R, Yuen RK, Feuk L, Scherer SW. The database of genomic variants: a curated collection of structural variation in the human genome. Nucleic Acids Res. 2013 doi: 10.1093/nar/gkt958. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martins-Taylor K, Nisler BS, Taapken SM, Compton T, Crandall L, Montgomery KD, Lalande M, Xu RH. Recurrent copy number variations in human induced pluripotent stem cells. Nat Biotechnol. 2011;29:488–491. doi: 10.1038/nbt.1890. [DOI] [PubMed] [Google Scholar]
- Mayshar Y, Ben-David U, Lavon N, Biancotti JC, Yakir B, Clark AT, Plath K, Lowry WE, Benvenisty N. Identification and classification of chromosomal aberrations in human induced pluripotent stem cells. Cell Stem Cell. 2010;7:521–531. doi: 10.1016/j.stem.2010.07.017. [DOI] [PubMed] [Google Scholar]
- Misteli T. Parallel genome universes. Nat Biotechnol. 2012;30:55–56. doi: 10.1038/nbt.2085. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Murga M, Jaco I, Fan Y, Soria R, Martinez-Pastor B, Cuadrado M, Yang SM, Blasco MA, Skoultchi AI, Fernandez-Capetillo O. Global chromatin compaction limits the strength of the DNA damage response. J Cell Biol. 2007;178:1101–1108. doi: 10.1083/jcb.200704140. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Orkin SH, Hochedlinger K. Chromatin connections to pluripotency and cellular reprogramming. Cell. 2011;145:835–850. doi: 10.1016/j.cell.2011.05.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Park IH, Lerou PH, Zhao R, Huo H, Daley GQ. Generation of human-induced pluripotent stem cells. Nat Protoc. 2008;3:1180–1186. doi: 10.1038/nprot.2008.92. [DOI] [PubMed] [Google Scholar]
- Phillips-Cremins JE, Sauria ME, Sanyal A, Gerasimova TI, Lajoie BR, Bell JS, Ong CT, Hookway TA, Guo C, Sun Y, et al. Architectural Protein Subclasses Shape 3D Organization of Genomes during Lineage Commitment. Cell. 2013 doi: 10.1016/j.cell.2013.04.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Quinlan AR, Boland MJ, Leibowitz ML, Shumilina S, Pehrson SM, Baldwin KK, Hall IM. Genome Sequencing of Mouse Induced Pluripotent Stem Cells Reveals Retroelement Stability and Infrequent DNA Rearrangement during Reprogramming. Cell Stem Cell. 2011;9:366–373. doi: 10.1016/j.stem.2011.07.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rube CE, Lorat Y, Schuler N, Schanz S, Wennemuth G, Rube C. DNA repair in the context of chromatin: new molecular insights by the nanoscale detection of DNA repair complexes using transmission electron microscopy. DNA Repair (Amst) 2011;10:427–437. doi: 10.1016/j.dnarep.2011.01.012. [DOI] [PubMed] [Google Scholar]
- Ryba T, Battaglia D, Chang BH, Shirley JW, Buckley Q, Pope BD, Devidas M, Druker BJ, Gilbert DM. Abnormal developmental control of replication-timing domains in pediatric acute lymphoblastic leukemia. Genome Res. 2012;22:1833–1844. doi: 10.1101/gr.138511.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ryba T, Battaglia D, Pope BD, Hiratani I, Gilbert DM. Genome-scale analysis of replication timing: from bench to bioinformatics. Nat Protoc. 2011;6:870–895. doi: 10.1038/nprot.2011.328. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ryba T, Hiratani I, Lu J, Itoh M, Kulik M, Zhang J, Schulz TC, Robins AJ, Dalton S, Gilbert DM. Evolutionarily conserved replication timing profiles predict long-range chromatin interactions and distinguish closely related cell types. Genome Res. 2010;20:761–770. doi: 10.1101/gr.099655.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schuster-Bockler B, Lehner B. Chromatin organization is a major influence on regional mutation rates in human cancer cells. Nature. 2012 doi: 10.1038/nature11273. [DOI] [PubMed] [Google Scholar]
- Schwaiger M, Stadler MB, Bell O, Kohler H, Oakeley EJ, Schubeler D. Chromatin state marks cell-type- and gender-specific replication of the Drosophila genome. Genes Dev. 2009;23:589–601. doi: 10.1101/gad.511809. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Takahashi K, Tanabe K, Ohnuki M, Narita M, Ichisaka T, Tomoda K, Yamanaka S. Induction of pluripotent stem cells from adult human fibroblasts by defined factors. Cell. 2007;131:861–872. doi: 10.1016/j.cell.2007.11.019. [DOI] [PubMed] [Google Scholar]
- Wang P, Zhang W, Yang J, Qu J, Liu GH. Higher-order genomic organization in pluripotent stem cells. Protein Cell. 2012 doi: 10.1007/s13238-012-2806-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Watanabe A, Yamada Y, Yamanaka S. Epigenetic regulation in pluripotent stem cells: a key to breaking the epigenetic barrier. Philos Trans R Soc Lond B Biol Sci. 2013;368:20120292. doi: 10.1098/rstb.2012.0292. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Watanabe Y, Fujiyama A, Ichiba Y, Hattori M, Yada T, Sakaki Y, Ikemura T. Chromosome-wide assessment of replication timing for human chromosomes 11q and 21q: disease-related genes in timing-switch regions. Hum Mol Genet. 2002;11:13–21. doi: 10.1093/hmg/11.1.13. [DOI] [PubMed] [Google Scholar]
- Watanabe Y, Ikemura T, Sugimura H. Amplicons on human chromosome 11q are located in the early/late-switch regions of replication timing. Genomics. 2004;84:796–805. doi: 10.1016/j.ygeno.2004.08.001. [DOI] [PubMed] [Google Scholar]
- Wei Z, Gao F, Kim S, Yang H, Lyu J, An W, Wang K, Lu W. Klf4 organizes long-range chromosomal interactions with the oct4 locus in reprogramming and pluripotency. Cell Stem Cell. 2013;13:36–47. doi: 10.1016/j.stem.2013.05.010. [DOI] [PubMed] [Google Scholar]
- Zhang H, Jiao W, Sun L, Fan J, Chen M, Wang H, Xu X, Shen A, Li T, Niu B, et al. Intrachromosomal looping is required for activation of endogenous pluripotency genes during reprogramming. Cell Stem Cell. 2013;13:30–35. doi: 10.1016/j.stem.2013.05.012. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.




