Abstract
Comparative genomic studies in primates have the potential to reveal the genetic and mechanistic basis for human specific traits. These studies may also help us better understand inter-species phenotypic differences that are clinically relevant. Unfortunately, the obvious limitation on sample collection and experimentation in humans and non-human apes severely restrict our ability to perform dynamic comparative studies in primates. Induced pluripotent stem cells (iPSCs), and their corresponding differentiated cells, may provide a suitable alternative system for dynamic comparative studies. Yet, to effectively use iPSCs and differentiated cells for comparative studies, one must characterize the extent to which these systems faithfully represent biological processes in primary tissues. To do so, we compared gene expression data from primary adult heart tissue and iPSC-derived cardiomyocytes from multiple human and chimpanzee individuals. We determined that gene expression in cultured cardiomyocytes from both human and chimpanzee is most similar to that of adult hearts compared to other adult tissues. Using a comparative framework, we found that 50% of gene regulatory differences between human and chimpanzee hearts are also observed between species in cultured cardiomyocytes; conversely, inter-species regulatory differences seen in cardiomyocytes are found significantly more often in hearts than in other primary tissues. Our work provides a detailed description of the utility and limitation of differentiated cardiomyocytes as a system for comparative functional genomic studies in primates.
Introduction
Comparative studies of humans and non-human apes are extremely restricted because we only have access to a few types of cell lines and to a limited collection of frozen tissues1. In order to gain true insight into regulatory processes that underlie inter-species variation in complex phenotypes, we must have access to faithful model systems for a wide range of tissues and cell types. To address this challenge, we previously established a panel of iPSCs from human and chimpanzee fibroblasts2–4. We can use this comparative iPSC panel to derive multiple cell types representative of the three germ layers. For example, we recently differentiated the human and chimpanzee iPSCs into definitive endoderm cells3 to study conservation in gene expression trajectories during early development. Our hope is that employing iPSC-based models from humans and chimpanzees will provide researchers with a dynamic and flexible system for comparative functional genomic studies in a large number of cell types.
Towards this goal, the purpose of the current study is to evaluate how well inter-species gene expression differences in heart are recapitulated in iPSC-derived cardiomyocytes. This effort is not unique; quite a few previous studies focused on characterizing similarities and differences between pluripotent stem cell derived cell types and their fetal and adult tissue counterparts in both human and mouse5–14. Generally, results from these studies have demonstrated that the derived cell types are most equivalent to fetal tissues rather than to the corresponding adult tissues. A few studies specifically explored protocol properties that may result in more mature derived cells5,15–21.
That said, these published works do not specifically address properties pertaining to the utility of differentiated cardiomyocytes in the context of a comparative study in primates. First, nearly all published studies were conducted using relatively few individuals (three or fewer), such that the observation of high similarity of gene expression patterns between cultured cells and primary tissue may be explained by lack of statistical power. Second, previous studies did not consider their observations in the broader context of other tissues or other species, so it is challenging to benchmark the observation of what is claimed to be ‘small or ‘large’ regulatory differences between tissues and cultured cells. Different protocols and batch effects make it difficult to perform meta-analysis of existing data to effectively address this issue. Finally, to date, no study that focused on the fidelity of differentiated cells included samples from chimpanzees.
To address these gaps, we performed a comparative study that was specifically designed to allow us to effectively compare gene expression data between cultured cardiomyocytes and primary hearts from humans and chimpanzees. Our study was also designed to allow us to benchmark the results against other primary tissues and across these two species. A key finding from previous studies pertaining to the fidelity of iPSC-derived cell types is the importance of cellular maturation after terminal differentiation. While the initial steps of cardiomyocyte differentiation are fairly well established, there remains some debate in the field as to the best method of cellular maturation22,23. Common strategies include: treatment with triiodothyronine (T3)24–26, electrical stimulation21 and temporal maturation in culture25. The fidelity of cardiomyocytes to primary adult heart tissue is likely to increase with cellular maturation. Thus, to test the effects of temporal maturation and treatment with T3, we compared gene expression profiles from human and chimpanzee iPSC-derived cardiomyocytes at day 15, day 27 (with and without T3 treatment) post induction of differentiation with data generated from primary heart tissue from both species.
Results
The primary goal of this work is not to test a particular hypothesis, but to develop an understanding of the degree to which iPSC derived cardiomyocytes can serve as a model system for comparative functional genomic studies in primates. To do so, we used matched panels of 9 human and 10 chimpanzee iPSC lines; a subset of these iPSCs was previously described2–4. Standard functional characterizations of iPSC lines, quality controls, and metadata for the entire panel of samples are provided in Supplemental Figs 1–3 and Table 1.
We differentiated the iPSC lines into cardiomyocytes using a recently developed protocol27–29 with minor modifications (Methods). The same protocol was used for both the human and the chimpanzee iPSC lines. All of the cardiomyocyte cultures started beating 6–8 days post induction of differentiation. On days 20–27, a subset of cultures was treated with T3 (methods), which was shown to aid in the maturation of iPSC-derived cardiomyocytes20,24,26. We assessed the purity of the iPSC-derived cardiomyocytes using flow cytometry (Methods) for cardiac troponin I (TNNI3) and cardiac troponin T (TNNT2). Purity information for all samples is provided in Supplemental Table 1. Representative movies demonstrating beating cardiomyocytes at different time periods: day 15, day 27 with T3, day 27 without T3, and calcium transients imaging are provided as supplemental files 4–69. By day 15, the purity of our samples was comparable to that obtained by other studies with an average purity of 68% of cells being TNNT2/TNNI3 double positive across all 19 individuals; Supplemental Table 1. As part of the goal of this study we sought to estimate the impact of target cell purity on gene expression variance, thus we choose to sequence RNA from a subset of samples that had less than 50% cardiomyocytes. Importantly, we found no significant difference in sample purity across species (Supplemental Fig. 4).
We performed the cardiomyocyte differentiations in two batches, iPSCs from 4 of the human and 6 of the chimpanzee individuals were differentiated in both batches; thus, our study contains technical replicates of differentiation for a subset of samples. In the first experimental batch, we collected RNA on day 27 from samples that were treated with T3. From the samples that were differentiated in the second batch, we collected RNA from the iPSCs, from differentiated cardiomyocytes on day 15, and again on day 27, from samples that were either treated or not treated with T3. To benchmark the gene expression data from the iPSC and iPSC-derived cardiomyocytes, we also collected RNA from post-mortem flash frozen heart tissues samples from 21 chimpanzees and 11 humans (Methods). Our overall study design (Supplemental Fig. 4) affords us the flexibility to choose to analyze different subsets of samples that are balanced with respect to different technical and biological factors. We can thus effectively compare within and between species gene expression data from primary tissue and iPSC-derived cardiomyocytes that were harvested on days 15 and 27, with or without the treatment T3.
We examined and recorded the quality of the RNA samples from both tissues and cultures (Supplemental Table 1, Fig. 5), and sequenced each sample to an average coverage of 49.4 million total reads. We mapped the sequencing reads to an updated two-way human-chimpanzee orthologous exon reference set (Methods) and estimated relative gene expression levels using edgeR30. Throughout the differentiation and sample processing steps, we have recorded a large number of biological and technical properties (sample metadata available in Supplemental Table 1). As a first step of our analysis, we determined the potential impact of different properties of the study design and sample metadata on the observed gene expression levels (using variance partitioning; Methods). Most of the variation in our data can be explained by biological factors (species, day, and individual). Most study design properties explain only a modest amount of variation, but as expected, technical differentiation batch is associated with substantial amount of variation, as is sample purity (Fig. 1). Importantly, neither sample purity nor technical batch are associated with species (Supplemental Fig. 6).
Which differentiated cardiomyocytes most resemble primary hearts?
Overall, we collected data from 110 samples (19 iPSCs, 59 differentiated cardiomyocytes, and 32 primary heart samples). To examine global trends in gene expression levels, we normalized data across all samples (using TMM; Methods), and visualized the data using principal component analysis (PCA; Fig. 2A). We found that the major source of variation in the data is correlated with cell type; PC1 and PC2, which explain 36% and 22% of the variance, respectively, are highly associated with cell type (P < 10−16). The next major source of variation, as expected, is species (which is strongly associated with PC3; P < 10−16).
We performed unsupervised hierarchical agglomerative clustering on the correlation matrix of gene expression data, and found that the major trends seen in the PCA are recapitulated (Fig. 2B). We noted that data from iPSC-derived cardiomyocytes clusters by species first, then by maturation day or treatment.
We focused on the differences between iPSC-derived cardiomyocytes at days 15 and 27, with and without T3 treatment. For this analysis, we considered within-species gene expression patterns. Using the data from each species independently, we asked which cardiomyocyte cultures are most similar to adult heart tissue with respect to their gene expression profiles. We first focused on temporal maturation (the untreated samples at days 15 and 27) and found that average pairwise correlation in gene expression data is higher between untreated day 27 cardiomyocytes and hearts than between day 15 cardiomyocytes and hearts, regardless of species (Fig. 3A, paired t-test; P < 10−16).
This result indicates that temporal maturation increases the similarity between cardiomyocytes and adult heart tissue for both species. We proceeded to compare gene expression data between heart tissues and cardiomyocytes cultures at day 27 that were treated or not treated with T3. We found that gene expression data from samples treated with T3 are more similar to that of primary heart tissues, regardless of species (Fig. 3A, P < 10−4). We compared the maturation effects across species and found that, at day 15, human cardiomyocytes are more similar to human adult heart tissue than chimpanzee cardiomyocytes are to chimpanzee adult hearts (Fig. 3B, P < 10−7). However, by day 27, we did not observe any significant difference across species in this comparison (P = 0.13).
Which genes are differentially regulated between hearts and cultured cardiomyocytes?
Based on our observations we concluded that as the cells mature, the fidelity of iPSC-derived cardiomyocytes as a model for primary hearts increases in both species. Moreover, T3 treatment helps to mature iPSC-derived cardiomyocytes, altering their gene expression such that it is more similar to that of adult tissues than temporal maturation alone. Yet, even T3 treated samples at day 27 are clearly distinct from heart samples (Figs 2 and 3). We thus characterized the specific regulatory difference between heart tissues and iPSC-derived cardiomyocytes.
To do so, we considered a subset of day 15 and T3 treated day 27 cardiomyocyte samples from 7 humans and 7 chimpanzees, all with purity above 50% (Supplemental Fig. 7). We used a linear model framework to perform a combined analysis of all data from these samples, as well as from the corresponding iPSCs, and from the primary heart tissues (Methods). We focused on genes that were classified as differentially expressed between cell/tissue types in both humans and chimpanzees (Methods). At an FDR of 5% we classified 3,446 differentially expressed genes (of 13,878 orthologous genes) between samples at day 27 (T3 treated) and heart tissue, and 4,115 differentially expressed genes between samples at day 15 and heart tissue (Supplemental Table 2). For comparison, using the same approach, we classified nearly 8,000 differentially expressed genes between iPSCs and cardiomyocytes at either day 15 or day 27.
To gain insight into the processes that are differentially regulated between primary hearts and iPSC-derived cardiomyocytes, we considered GO functional annotations31,32. We classified the differentially expressed genes between hearts and cardiomyocytes to (i) those that are not expressed at all in hearts, (ii) or in cardiomyocytes, and those that expressed in both tissues but are more highly expressed in (iii) heart or (iv) cardiomyocytes. Among genes that are expressed in hearts but not in cardiomyocytes, we found an almost exclusive enrichment for biological processes relating to immune responses and inflammation. In turn, among genes that are expressed in cardiomyocytes but not in hearts, we found an enrichment of genes involved in cell cycle regulation. Among genes expressed in both cardiomyocytes and hearts, we found that genes involved in transcription regulation tend to be expressed at higher levels in cardiomyocytes and genes involved in metabolic processes specifically related to lipid metabolism tend to be expressed at higher levels in hearts (Fig. 4; complete enrichment results in Supplemental Table 3; all reported enrichments associated with FDR < 0.05). Remarkably, when we consider all of the enrichment results with FDR < 0.05, we are able to account for 67% of genes that are classified as differentially expressed between hearts and iPSC-derived cardiomyocytes.
Up to this point, we considered gene expression patterns within species. To specifically examine iPSC-derived cardiomyocytes as a model system for comparative genomic studies, we proceeded to compare inter-species regulatory differences in either cell cultures or their corresponding primary tissues. We used a framework of linear modeling as above, but this time focused on the species-by-cell-type interaction effect. To account for incomplete power when we classified genes as differentially expressed between species in any given cell or tissue type, we used a relaxed statistical cutoff for secondary observations. Namely, when a gene was classified as differentially expressed between species in one cell or tissue type using a stringent statistical cutoff (FDR of 5%), we used a relaxed statistical cutoff to classify the same gene as differentially expressed between species in the other cell or tissue type (nominal P < 0.05; Methods).
Using this approach, we found that roughly 50% of genes that are classified as differentially expressed between human and chimpanzee in heart tissues are also differentially expressed between cultured cardiomyocytes of the two species, regardless of whether we considered cardiomyocytes that were harvested at day 15 or day 27 (Fig. 5). This result is robust with respect to the degree of purity of the cardiomyocyte cultures (tested using all iPSC derived cardiomyocyte samples spanning a purity range of 40–70%; Supplemental Fig. 8, Methods) and statistical stringency (Supplemental Table 4). When we considered genes that are classified as differentially expressed between the species either in hearts or in cardiomyocytes (Methods), we found enrichment of similar functional terms when we considered either the day 27 cardiomyocytes or the heart tissue samples (Supplemental Table 5). Specifically, we found enrichment in terms relating to Carboxylic acid metabolism, cellular adhesion and xenobiotic metabolic processes. When we considered genes that are classified as differentially expressed between the species in both hearts and in cardiomyocytes, we found an enrichment of terms relating to multiple different metabolic processes, including monocarboxylic acid metabolic process, oxidation-reduction, and oxoacid metabolism (Supplemental Table 5, Fig. 9, all reported enrichments associated with FDR < 0.05).
Cardiomyocytes are more similar to hearts than to other primary tissues
Our observations thus far indicate that while much of the heart regulatory divergence between species can be captured by using iPSC-derived cardiomyocytes, there are also thousands of genes whose expression differs between hearts and cultured cardiomyocytes. To provide context for these observations, we used data from multiple publicly available RNA-seq studies, which include samples from four different adult tissues from both humans and chimpanzees, as well as from human fetal tissues33–47. We also included data from human and chimpanzee iPSCs, human ESCs, and human day 31 iPSC derived cardiomyocytes2,34,41,48. We processed all public data and our own newly collected data (from a total of 231 human samples and 111 chimpanzee samples) using a common analysis pipeline and a joint statistical model for the combined data (Methods).
We first focused on similarities and differences between tissues and cell types. We performed PCA using the combined data from all 342 samples, which suggested (based on visual inspection) separation of the samples by cell and tissue type in the first several principal components (Fig. 6A; PC1 and PC2 are both highly associated with cell type or tissue type; P < 10−16). We only observed an association with species when we considered the loading on PC6 (P <10−16). We next performed unsupervised hierarchical agglomerative clustering using the correlation matrix of the combined gene expression data (Fig. 6B). The samples clustered by cell and tissue type followed by species (with the exception of public data from one chimpanzee heart sample from36). Notably, we found that samples from iPSC-derived cardiomyocytes are consistently more similar to samples from fetal heart tissue, followed by samples from adult heart tissues than any other non-heart tissues (Fig. 6B). These observations are consistent with previous findings that iPSC-derived cardiomyocytes are most similar to first trimester fetal hearts7.
We next evaluated the ability of iPSC-derived cardiomyocytes to capture regulatory divergence in hearts in the context of comparative data sets from other tissues. To do so, we used RNA-seq data collected from four individuals from each species from four different tissues: heart, lung, liver and kidney (GEO accession GSE112356). We classified genes as differentially expressed between human and chimpanzee in each tissue and in day 27 cardiomyocytes using a common analysis pipeline (Methods). To account for incomplete power when classifying genes as differentially expressed between human and chimpanzee in any given cell or tissue type, we again allowed for a more relaxed statistical cutoff for secondary observations. As expected, a large number of genes show inter-species differences in gene expression level across all tissues while different subsets of genes are differentially expressed between the species in only a single tissue (Supplemental Fig. 10). Crucially, genes that are differentially expressed between human and chimpanzee in cardiomyocytes are much more likely to also be classified as differentially expressed between the species in hearts than in any other tissue (Fig. 7).
Discussion
The development of iPSC based model systems have the potential to transform the field of comparative primate genomics, assuming that differentiated cells can recapitulate biological processes that occur in primary cell types. Quite a few studies focused on gene regulatory variation in in vitro differentiated cells have already been published (including by our group), but a systematic assessment of the fidelity of these differentiated cells with respect to primary primate tissue has not been performed. The current study was therefore designed to specifically assess the degree to which differentiated cardiomyocytes can serve as a model system in which to study regulatory differences in human and chimpanzee hearts.
We debated which cell type to use for this first exploration of the fidelity of differentiated cells. Many different cell types are accessible using iPSC differentiation protocols27,49–51. Despite the seemingly many options, in most cell types, individual variation still remains a current barrier to generate consistently pure cultures of target cell types28,52–56. In contrast, cardiomyocyte differentiation protocols are more robust with respect to inter-individual variation28. This outcome is partially due to the dominance of a single pathway (WNT signaling) in resolving developmental bifurcations in favor of cardiomyocyte generation in vitro27,57. With the addition of a metabolic purification step, differentiations to cardiomyocytes tend to be more consistent at generating target cells and with higher purity than would be achievable for other cell types. An additional advantage of utilizing a cardiac lineage for benchmarking is the relatively homogenous nature of cardiac muscle (e.g., the composition of the left ventricle is upwards of 40% cardiomyocytes58). With these factors in mind, differentiations of iPSCs to cardiomyocyte provide an optimal framework to begin characterizing the similarity of iPSC-derived cell types to primary tissue in primates.
We did not expect differentiated cardiomyocytes to be identical to hearts. That would not be a realistic expectation. We hoped that differentiated cardiomyocytes would be the most similar to hearts than to any other tissue, and we hoped to shed light on the opportunities and limitations associated with the use of differentiated cardiomyocytes to study hearts. Our observations confirmed that gene expression data from differentiated cardiomyocytes are more similar to data from hearts than any other tissue we tested, but we also found thousands of differentially expressed genes between hearts and differentiated cardiomyocytes. The majority of these differentially expressed genes, however, may not point to inherent limitations of differentiated cardiomyocytes as a model, but rather they help us better understand the model properties.
Indeed, 67% of differentially expressed genes between hearts and differentiated cardiomyocytes may be explained, not by artifacts specific to the model, but by expected biological differences between in vitro cultured relatively pure cells and a complex primary adult tissue. While most myocyte populations in the adult heart no longer proliferate and have a restricted ability to reenter cell cycle59,60, the differentiated cardiomyocytes, which resemble embryonic heart cells more than adult cells, likely retain the ability to reenter cell cycle similar to their fetal and preadolescent counterparts61,62. It thus may be expected that genes with functions related to transcription regulation and those involved in cell cycle regulation (there is a high overlap between these categories) would be more highly expressed in young differentiated cardiomyocyte cultures than in adult hearts. Additionally, the iPSC differentiation protocol utilized herein produces primarily cardiomyocyte populations27, while the hearts contain a complex composition of cells, including atrial and ventricular cardiomyocytes, nodal cells, cardiac fibroblasts, endothelial cells, blood cells and resident immune cells. It is therefore reasonable that genes involved in immune responses and inflammation are highly expressed in hearts compared with differentiated cardiomyocytes. Finally, it has been previously reported that the metabolic state of hearts and cultured cardiomyocytes are different5. Consistent with that report, we found that genes involved in lipid metabolism are highly expressed in hearts relative to differentiated cardiomyocytes, an expected observation given that our cell cultures were not supplemented with lipids. Thus, the majority of the thousands of differentially expressed gene between primary hearts and cardiomyocytes are not necessarily indicative of inherent limitations of the in vitro model. Rather, many of these regulatory differences point to biological (cell composition) and environmental (lipids) differences between cell culture and primary tissue.
When we specifically considered inter-species comparisons of gene expression levels using either cardiomyocytes or hearts, we found similar patterns. Namely, while most genes are classified as differentially expressed between humans and chimpanzee in both differentiated cardiomyocytes and hearts, thousands of genes are differentially expressed between the species in either hearts or cardiomyocytes. Yet, the discordant inter-species patterns in cardiomyocytes and hearts mostly involve genes with functions related to metabolic processes, regulation of gene expression, and cell cycle. As we discussed above, there are likely biological and environmental explanations to these patterns.
We thus conclude that iPSC-derived cardiomyocytes are a useful model with which to conduct comparative studies of gene regulation. We expect that iPSC-derived cardiomyocytes will be mainly used to comparatively study dynamic gene regulatory processes, such as during differentiation, when frozen heart samples – even when available – are unsuitable. The model is not perfect, but our study indicates that regulatory patterns observed in cultured cardiomyocytes cannot be mistaken as anything other than a representation of the biological processes that occur in primary hearts. With that in mind, we recommend that the data from our study, which are available in unprocessed and processed forms, should be used to evaluate the likelihood that future observations based on cultured cardiomyocytes faithfully represent regulatory patterns in primary heart tissues.
Methods
Ethics Statement
Human skin punch biopsies for fibroblast samples for generation of iPSC lines were collected under at the University of Chicago from participants who gave written informed consent. All protocols were approved by the University of Chicago Institutional Review Board (IRB) protocol 11–0524. All chimpanzee biopsies collected for fibroblast samples for generation of iPSC lines were collected by Yerkes Primate Center under protocol 006-12 in full accordance with IACUC protocols. All protocols were approved by the Yerkes Primate Center. All methods were carried out in accordance with relevant guidelines and regulations.
iPSC panel
In this study, we include 8 chimpanzee iPSC lines and 6 human lines from previously described panels2–4. We also include 2 chimpanzee iPSC lines and 3 human lines not previously described (characterizations for all lines used in the paper are provided in Supplemental Figs 1–3). All iPSC lines are matched for cell type of origin, reprogramming method, and culture conditions. Feeder free iPSC cultures were initially maintained on Growth Factor Reduced Matrigel using Essential 8 Medium (E8) as previously described2. After 10 passages in E8, all cell lines were transitioned to a 50/50% ratio of iDEAL/E8 feeder free medium that was prepared in house as specified previously63. Cell culture was conducted at 37 °C, 5% CO2, and atmospheric O2.
Cardiomyocyte differentiation
Cardiomyocytes were generated following recently published protocols27–29 with minimal modification. At 12 hours prior to initiating differentiation, iPSC lines at 70–90% confluence were seeded to achieve a starting density of 150,000–250,000 cells/cm2. Differentiations were initiated by removing stem cell maintenance media and adding RPMI base media (with HEPES LifeTechnologies# 22400105) supplemented with 6 μM CHIR9902 (LClabs# C-6556), 0.5 mg/mL BSA (Sigma-Aldrich# A2153), 0.213 mg/mL ascorbic acid (Santa Cruz Bio# sc-228390), 0.3 μg/mL sodium selenite (Sigma-Aldrich# S5261), 5 μg/mL Holo-transferrin (Sigma-Aldrich# T3705), 1 μg/mL Linolenic acid (Sigma-Aldrich# L2376), 1 μg/mL Linoleic acid (Sigma-Aldrich# 1012) and 1x Pen/Strep (Days 0–1 media). After 48 hours, the media was changed to RPMI base media (with HEPES LifeTechnologies# 22400105) supplemented with 2 μM Wnt-c59 (Tocris# 5148), 0.5 mg/mL BSA (Sigma-Aldrich# A2153), 0.213 mg/mL ascorbic acid (Santa Cruz Bio# sc-228390 and 1x Pen/Strep (Days 2–3 media). After another 48 hours, the media was changed to RPMI base (with HEPES LifeTechnologies# 22400105) supplemented 0.5 mg/mL BSA (Sigma-Aldrich# A2153), 0.213 mg/mL ascorbic acid (Santa Cruz Bio# sc-228390) and 1x Pen/Strep. Media was changed every 48 hours until day 10 (total of 2 additional media changes). At day 10 the media was switched to RPMI media (no glucose, Cellgro#10-043-CV) supplemented with 5 mM Sodium D/L Lactate (Sigma-Aldrich# L4263), 0.5 mg/mL BSA (Sigma-Aldrich# A2153), 0.213 mg/mL ascorbic acid (Santa Cruz Bio# sc-228390) and 1x Pen/Strep, media was changed every 48 hours (total of 2 additional media changes). At day 15, cells were dissociated using TrypLE and split to a density of 0.4 × 106 cells/cm2 onto Matrigel coated dishes. Cells were replated in RPMI media (no glucose, Cellgro#10-043-CV) supplemented with 5% FBS, 10 mM D-galactose (Sigma-Aldrich# G5388), 1 mM Sodium Pyruvate (LifeTechnologies# 11360070), 1X NEAA (LifeTechnologies# 11140076), 5 mM HEPES (LifeTechnologies# 15630080), 0.5 mg/mL (Sigma-Aldrich# A2153), 0.213 mg/mL ascorbic acid (Santa Cruz Bio# sc-228390) and 1x Pen/Strep (Galactose media). After 24 hours, the media was changed to Galactose media without FBS. Media was changed every 48 hours (total of 1 additional media change). At day 20 media was changed to Galactose media supplemented with 3 ng/mL Triiodothyronine (T3, Sigma-Aldrich# T6397) for T3(+) cardiomyocyte cultures. This concentration of T3 was used to mimic the peak in serum T3 observed upon birth that coincides with a rapid burst of cardiomyocyte differentiation and heart growth64. Media was changed every 48 hours until cells were harvested at day 27 (total of 3 additional media changes). For the first two media changes (Days 0 and 2) cultures were supplied with an excess of media (0.499 mL/cm2), all other days’ media was added at (0.2495 mL/cm2). For a subset of day 27 iPSC-derived cardiomyocytes calcium transients imaging was collected as previously described2. Briefly, iPSC-derived cardiomyocytes were treated 5 μM Fluo-4 AM (F-14217, Life Technologies) for 15 min. After washing cultures once, calcium flux was recorded using the GFP channel of an AMG EVOS FL microscope.
Determining iPSC-derived cardiomyocyte purity using flow cytometry
Day 15 cells were dissociated as described above, and aliquots were pulled off for flow analysis prior to replating. Day 27 iPSC-derived cardiomyocytes were dissociated by 10 minute incubations with TrypLE supplemented with 0.5 U/ml Liberase TH. Dissociated cells were centrifuged at 200 × g for 5 minutes at 4 °C and washed with PBS. Cells were fixed and stained as specified previously27,28. Briefly, 0.5–1 million cells were fixed using a 1% PFA solution. Cells were fixed at 4 °C for 30 minutes before washing once using FACS buffer (autoMACS Running Buffer, Miltenyi Biotech). Fixed cells were permeabilized by incubating with cold 90% methanol for 30 minutes at 4 °C. Fixed, permeabilized cells were washed 2x using FACS buffer prior to staining. For immunostaining, 150,000 cells were transferred to BRAND lipoGrade 96 well immunostaining plates and centrifuged at 200 × g for 5 minutes at 4 °C. Cells were rinsed in FACS buffer then resuspended in the staining solution. A single master mix containing 0.5% BSA/0.1% Triton X-100, PE-labeled TNNT2 (Bdbio clone 13–11) and Alexa Flour 647 labeled-TNNI3 (BDbio clone C5) All antibodies were used at the manufacturer-recommended dilution. Cells were stained for 1 hour at 4 °C and subsequently washed 3x with a PBS solution containing 0.5% BSA/0.1% Triton X-100. Stained cells were resuspended after the final spin in 150 μL FACS buffer containing 5 μM Vybrant DyeCycle Violet Stain (Thermofisher #V35003) for acquisition on a BD LSR II flow cytometer. After data acquisition, we used the program FlowJo (https://www.flowjo.com/) to determine compensation scaling. To do so, we used data from single stained compensation beads (Life Technologies) that were stained and collected in parallel. Live, intact, single cells were gated based on FSC and SSC channels. In order to determine the purity of differentiated cardiomyocytes measured the proportion of TNNI3/TNNT2 double positive cells live cells to all live cells within a sample. To gate the dual positive iPSC-derived cardiomyocytes, we first defined the region of negative/nonspecific signal using multiple biological negative controls: iPSC and fibroblast lines stained in parallel). The fluorescence signal produced by these samples was used to define the region corresponding to no signal or nonspecific binding and the required fluorescence signal to be determined as specific binding or positive. Positive fractions were comparable in fluorescence intensity to the maximum intensity for each channel determined by the compensation beads (which will maximally bind the antibody). Pseudocolor plots showing gating for all samples is provided in Supplemental Figs 11–13, sample purities are provided in Supplemental Table 1.
Isolation of RNA and DNA
To isolate RNA and DNA from cardiomyocytes and iPSC lines, we first aspirated old cell culture media, then rinsed culture wells using PBS. After aspirating PBS, 300 μL of DNA/RNA lysis buffer was added directly to culture plates to lyse cells on the plate. The entire lysate was transferred to a 1 mL tube and immediately frozen for extraction with other samples in balanced batches. We extracted the RNA using the ZR-Duet DNA/RNA MiniPrep kit (Zymo) with the addition of an on-column DNAse I treatment step prior to RNA elution.
Post-mortem human heart tissues were provided by the National Disease Research Interchange (NDRI). Chimpanzee post-mortem heart tissues were provided by Yerkes primate center, the Southwest Foundation for Biomedical Research, and the New Iberia Research Center, MD Anderson Cancer Center. For lysis of heart tissue, frozen tissue from the left ventricle was broken down into smaller chunks while working on dry ice to prevent thawing, and subsequently weighed in 1.5 mL tubes to determine weight of tissue. For each mg of tissue, we added 8 μL of 1X RNA/DNA shield buffer. Tissues were homogenized in RNA/DNA shield using sterile plastic pestles. To further homogenize tissues and release RNA and DNA, tissues were digested with Proteinase K for 30 minutes at 55 °C. Digested, homogenized tissues were centrifuged for 2 minutes to pellet debris, the volume of supernatant was measured and was transferred to fresh tubes. An equal volume of DNA/RNA lysis buffer was added and the solution was mixed prior to freezing at −80 °C. Frozen samples were thawed and extracted in parallel using the same protocol as used for cells.
Sequencing library preparation and RNA sequencing
We used non-strand specific, polyA capture to generate RNA-seq libraries according to the Illumina TruSeq protocol. To estimate the RNA concentration and quality, we used the Agilent 2100 Bioanalyzer. We added barcoded adaptors (Illumina TruSeq RNA Sample Preparation Kit v2) and sequenced the 100 base pair single-end RNA-seq libraries on the Illumina HiSeq 4000 at the Functional Genomics Core at University of Chicago on two flow cells. We used FastQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/) to confirm that the reads were high quality.
Mapping of RNA-seq data, orthologous exons, read count transformation and normalization
We mapped human reads to the hg38 genome and chimpanzee reads to panTro5 using HISAT265 using the default parameters. We kept on only reads that mapped uniquely. To prevent biases in expression level estimates due to differences in mRNA transcript size and the relatively poor annotation of the chimpanzee genome and differences in mappability between species, we only kept reads that mapped to a list of orthologous metaexons. As the previous version of this metaexon database was generated using older versions of the genomes, we generated an updated orthoexon database based on the most current genome builds (see methods below for details). Gene expression levels were quantified using the featureCounts function in SubRead (version 1.5.166, We performed all downstream processing and analysis steps in R unless otherwise stated.
We used a common analysis pipeline to process RNA sequencing data collected from the current study and also from the downloaded public datasets. We transformed raw gene counts into log2-transformed counts per million (CPM) using edgeR30. To filter for the lowly expressed genes, we kept only genes with an average expression level of log2(CPM) > 2 in at least one cell type67. For the remaining genes, we normalized the read counts using the weighted trimmed mean of M-values algorithm (TMM)67 to account for between-sample differences in the read counts at the extremes of the distribution and calculated the TMM-normalized log2-transformed CPM. After TMM-normalization we then performed a cyclic loess normalization with the function normalizeCyclicLoess from the R/Bioconductor package limma68,69. To account for gene length differences between species we calculated normalized log2-transformed RPKM values by using the function rpkm with normalized library sizes from the package edgeR30. We measured the “gene lengths” as the sum of the lengths of the orthologous exons.
To estimate the total gene expression variation attributed to the different properties of study design and sample metadata in our study, we used a linear mixed model implemented in variancePartition70. We modeled all effects as fixed effects with the exception of individual which was modeled as a random effect.
Updating orthologous exons
As part of the work contained in this paper, we updated our previously generated exon database71 with slight modification. We describe the modifications as follows. First, exon definitions were initially obtained from Ensembl release 88 (downloaded March 2017 containing 744,360 exons across 63,898 genes). This initial set of exon definitions was condensed to remove redundancies (haplotypes, patches and alternate exon entries resulting from different source annotations) and exons from genes that were fully overlapping. Regions of exons containing overlaps belonging to different genes were removed; exons with at least 10 bp of sequence not contained within the overlapping region were retained by removing only the region of overlap. This resulted in the division up of some exons into one or more smaller unique exons. Exons for the same genes with overlaps of at least 1 bp were collapsed into single metaexons containing all overlapping exons. Human fasta sequences (for the reduced set of 320,753 meta exons across 56,479 genes) were then extracted from the genome (Ensembl GRCh38.p10). We then used BLAT (BLAT V. 3572) to identify the orthologous exon sequences in the chimpanzee genome (panTro5). All hits with indels larger than 25 bp were removed (using a function blatOutIndelIdent, from https://bitbucket.org/ee_reh_neh/orthoexon), the fasta sequences for the hits with the highest sequence identity were extracted from the genome (using Bedtools73). The chimpanzee sequences were then blatted against the human genome and back to the chimpanzee genome. Queries that did not return the original blat locations (for either chimpanzee or human) were removed, as were entries where multiple possible mappings occurred with higher than 90% sequence identity. Finally, different exons mapping to the same location (different locations in human, but same location in chimpanzee) were removed. This updated human-chimpanzee orthologous exon database resulted in 254,172 metaexons across 44,125 genes and is provided in Supplemental Files 1–3.
Linear modeling, differential expression and gene enrichment analyses
Differential expression analysis was performed using an empirical Bayes approach to linear modeling in genome wide expression data implemented in the R package limma74,75. To apply this linear modeling approach to RNA-seq read counts, we calculated weights that account for the mean-variance relationship of the count data using the function voom from the limma package76.
In analyses with both tissue and iPSC-derived cardiomyocytes, we cannot correct for purity differences, thus, we chose iPSC-derived cardiomyocytes with the highest purity (Supplemental Table 1). Since RNA quality varied greatly between tissues and derived cells (Supplemental Table 1, Fig. 5), we modeled RNA quality as a fixed effect using RIN scores. For all pairwise differential expression comparisons, the species, cell type, RNA quality (RIN), and a species-by-cell type interaction were modeled as fixed effects, and individual as a random effect. We used contrast tests in limma to find genes that were differentially expressed between tissues and cells for each species. For each pairwise differentially expressed (DE) test, we corrected for multiple testing with the Benjamini & Hochberg false discovery rate77 and genes with an FDR-adjusted P values < 0.05 were considered DE.
To identify genes as shared or specific to one species when examining DE genes between different cell or tissue types while accounting for incomplete power to detect overlaps, we first identified DE genes between the different cell or tissue types at an FDR of 5% in each species separately. Using this list of significant DE genes, we then looked in the other species, but classified genes as DE with a more relaxed cut off using a nominal P value of 0.05. Genes identified at an FDR of 5% in one species but not significant at a nominal P value of less than 0.05 in the other species were determined to be specific to one species. Genes that were significant in one species at FDR of 5% and significant at a nominal P value of less than 0.05 in the other species were designated as shared. This same approach was also taken when looking at interspecies DE genes in heart tissue or iPSC-derived cardiomyocytes.
When examining the overlap of interspecies DE genes in multiple different tissues, data for each tissue or cell type was modeled separately, as all data could not be modeled together due to differences in sequencing technology (50 bp Illumina GAIIx vs 100 bp Illumina HiSeq. 4000). For this analysis, we only included terms for species and RNA quality (RIN score). To identify shared interspecies DE genes across different tissues and cell types while accounting for incomplete power to detect overlaps, we used the same approach as detailed above.
To identify significantly enriched GO biological process terms for genes DE between day 27 cardiomyocytes and heart tissue, we used the compareCluster function implemented in the R package clusterProfiler (version 3.4.478). We used all expressed genes as our background for enrichment and only considered GO terms containing more than three annotated genes and less than 3000 genes. When considering all interspecies differentially expressed genes in heart tissues or day 27 samples, we found no significantly enriched GO terms. Results in the main text were derived from GO analysis using genes with an absolute log2-fold change between human and chimpanzee higher than 1.5.
Determining how purity differences affect estimates of interspecies DE gene overlaps
Since target cell type heterogeneity may vary within tissues and cannot be easily assessed, we were interested in the effect of purity on our ability to detect interspecies tissue specific DE patterns using iPSC-derived cardiomyocytes. As we are unable to physically alter (or effectively estimate) the composition of post mortem tissues, we instead used different combinations of T3 treated day 27 cardiomyocyte samples to generate 277 different average purity scenarios (Supplemental Fig. 14). With each combination of samples, we repeated the same analysis as described in the main text to determine how many interspecies DE genes identified in heart tissue could be recapitulated using iPSC-derived cardiomyocytes.
Electronic supplementary material
Acknowledgements
We thank Michelle Ward, Joyce Hsiao and other members of the Gilad lab for helpful discussions and comments on the manuscript. This work was supported by NIH grants GM120167, HL139447 as well as the Yerkes National Primate Research Center Base Grant ORIP/OD P51OD011132. The computational resources were provided by the University of Chicago Research Computing Center. BJP was supported by the Training in Emerging Multidisciplinary Approaches to Mental Health and Disease (T32MH020065). LEB was supported by the National Science Foundation Graduate Research Fellowship (DGE-1144082) and by the Genetics and Gene Regulation Training Grant (T32 GM07197). JR was supported by the Marie Curie fellowship (IOF 273290). The content presented in this article is solely the responsibility of the authors and the funding bodies had no role in the design of the study and collection, analysis, interpretation of data, and in writing the manuscript.
Author Contributions
Y.G. and B.J.P. conceived of the study and designed the experiments. B.J.P. performed the experiments and analyzed the results. C.C. prepared the sequencing libraries. J.R. extracted RNA from human and chimpanzee tissues for data provided in GEO accession GSE112356. B.J.P., L.E.B. and Y.G. wrote the paper. All authors read and approved the final manuscript.
Data Availability and Resource Sharing
Gene expression (RNA-seq) data are available at GEO under accession number GSE110471. All human and chimpanzee iPSCs are available upon request without restriction or limitation.
Competing Interests
The authors declare no competing interests.
Footnotes
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Bryan J. Pavlovic, Email: bjp@uchicago.edu
Yoav Gilad, Email: gilad@uchicago.edu.
Electronic supplementary material
Supplementary information accompanies this paper at 10.1038/s41598-018-33478-9.
References
- 1.Romero IG, Ruvinsky I, Gilad Y. Comparative studies of gene expression and the evolution of gene regulation. Nat Rev Genet. 2012;13:505–516. doi: 10.1038/nrg3229. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Gallego Romero I, et al. A panel of induced pluripotent stem cells from chimpanzees: a resource for comparative functional genomics. Elife. 2015;4:e07103. doi: 10.7554/eLife.07103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Blake, L. E. et al. A Comparative Study Of Endoderm Differentiation In Humans And Chimpanzees. bioRxiv, 10.1101/135442 (2017). [DOI] [PMC free article] [PubMed]
- 4.Burrows CK, et al. Genetic Variation, Not Cell Type of Origin, Underlies the Majority of Identifiable Regulatory Differences in iPSCs. PLoS Genet. 2016;12:e1005793. doi: 10.1371/journal.pgen.1005793. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Uosaki H, et al. Transcriptional Landscape of Cardiomyocyte Maturation. Cell Rep. 2015;13:1705–1716. doi: 10.1016/j.celrep.2015.10.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Li G, et al. Transcriptomic Profiling Maps Anatomically Patterned Subpopulations among Single Embryonic Cardiac Cells. Dev Cell. 2016;39:491–507. doi: 10.1016/j.devcel.2016.10.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.van den Berg CW, et al. Transcriptome of human foetal heart compared with cardiomyocytes from pluripotent stem cells. Development. 2015;142:3231–3238. doi: 10.1242/dev.123810. [DOI] [PubMed] [Google Scholar]
- 8.Bell CC, et al. Transcriptional, Functional, and Mechanistic Comparisons of Stem Cell-Derived Hepatocytes, HepaRG Cells, and Three-Dimensional Human Hepatocyte Spheroids as Predictive In Vitro Systems for Drug-Induced Liver Injury. Drug Metab Dispos. 2017;45:419–429. doi: 10.1124/dmd.116.074369. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Piccini I, Rao J, Seebohm G, Greber B. Human pluripotent stem cell-derived cardiomyocytes: Genome-wide expression profiling of long-term in vitro maturation in comparison to human heart tissue. Genom Data. 2015;4:69–72. doi: 10.1016/j.gdata.2015.03.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Uosaki H, Taguchi YH. Comparative Gene Expression Analysis of Mouse and Human Cardiac Maturation. Genomics Proteomics Bioinformatics. 2016;14:207–215. doi: 10.1016/j.gpb.2016.04.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Handel AE, et al. Assessing similarity to primary tissue and cortical layer identity in induced pluripotent stem cell-derived cortical neurons through single-cell transcriptomics. Hum Mol Genet. 2016;25:989–1000. doi: 10.1093/hmg/ddv637. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Shinozawa T, et al. Gene expression profiling of functional murine embryonic stem cell-derived cardiomyocytes and comparison with adult heart: profiling of murine ESC-derived cardiomyocytes. J Biomol Screen. 2009;14:239–245. doi: 10.1177/1087057108330112. [DOI] [PubMed] [Google Scholar]
- 13.Xia N, et al. Transcriptional comparison of human induced and primary midbrain dopaminergic neurons. Sci Rep. 2016;6:20270. doi: 10.1038/srep20270. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Roessler R, et al. Detailed analysis of the genetic and epigenetic signatures of iPSC-derived mesodiencephalic dopaminergic neurons. Stem Cell Reports. 2014;2:520–533. doi: 10.1016/j.stemcr.2014.03.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Shan J, et al. Identification of small molecules for human hepatocyte expansion and iPS differentiation. Nat Chem Biol. 2013;9:514–520. doi: 10.1038/nchembio.1270. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Zhao D, et al. Promotion of the efficient metabolic maturation of human pluripotent stem cell-derived hepatocytes by correcting specification defects. Cell Res. 2013;23:157–161. doi: 10.1038/cr.2012.144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Ogawa S, et al. Three-dimensional culture and cAMP signaling promote the maturation of human pluripotent stem cell-derived hepatocytes. Development. 2013;140:3285–3296. doi: 10.1242/dev.090266. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Youssef AA, et al. The Promise and Challenge of Induced Pluripotent Stem Cells for Cardiovascular Applications. JACC Basic Transl Sci. 2016;1:510–523. doi: 10.1016/j.jacbts.2016.06.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Ellen Kreipke R, Wang Y, Miklas JW, Mathieu J, Ruohola-Baker H. Metabolic remodeling in early development and cardiomyocyte maturation. Semin Cell Dev Biol. 2016;52:84–92. doi: 10.1016/j.semcdb.2016.02.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Lin B, et al. Culture in Glucose-Depleted Medium Supplemented with Fatty Acid and 3,3′,5-Triiodo-l-Thyronine Facilitates Purification and Maturation of Human Pluripotent Stem Cell-Derived Cardiomyocytes. Front Endocrinol (Lausanne) 2017;8:253. doi: 10.3389/fendo.2017.00253. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Nunes SS, et al. Biowire: a platform for maturation of human pluripotent stem cell-derived cardiomyocytes. Nat Methods. 2013;10:781–787. doi: 10.1038/nmeth.2524. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Veerman CC, et al. Immaturity of human stem-cell-derived cardiomyocytes in culture: fatal flaw or soluble problem? Stem Cells Dev. 2015;24:1035–1052. doi: 10.1089/scd.2014.0533. [DOI] [PubMed] [Google Scholar]
- 23.Kolanowski TJ, Antos CL, Guan K. Making human cardiomyocytes up to date: Derivation, maturation state and perspectives. Int J Cardiol. 2017;241:379–386. doi: 10.1016/j.ijcard.2017.03.099. [DOI] [PubMed] [Google Scholar]
- 24.Lee YK, et al. Triiodothyronine promotes cardiac differentiation and maturation of embryonic stem cells via the classical genomic pathway. Mol Endocrinol. 2010;24:1728–1736. doi: 10.1210/me.2010-0032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Ivashchenko CY, et al. Human-induced pluripotent stem cell-derived cardiomyocytes exhibit temporal changes in phenotype. Am J Physiol Heart Circ Physiol. 2013;305:H913–922. doi: 10.1152/ajpheart.00819.2012. [DOI] [PubMed] [Google Scholar]
- 26.Yang X, et al. Tri-iodo-l-thyronine promotes the maturation of human cardiomyocytes-derived from induced pluripotent stem cells. J Mol Cell Cardiol. 2014;72:296–304. doi: 10.1016/j.yjmcc.2014.04.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Burridge PW, et al. Chemically defined generation of human cardiomyocytes. Nat Methods. 2014;11:855–860. doi: 10.1038/nmeth.2999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Burridge, P. W., Holmstrom, A. & Wu, J. C. Chemically Defined Culture and Cardiomyocyte Differentiation of Human Pluripotent Stem Cells. Curr Protoc Hum Genet 87, 21 23 21–15, 10.1002/0471142905.hg2103s87 (2015). [DOI] [PMC free article] [PubMed]
- 29.Rana P, Anson B, Engle S, Will Y. Characterization of human-induced pluripotent stem cell-derived cardiomyocytes: bioenergetics and utilization in safety screening. Toxicol Sci. 2012;130:117–131. doi: 10.1093/toxsci/kfs233. [DOI] [PubMed] [Google Scholar]
- 30.Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139–140. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.The Gene Ontology, C. Expansion of the Gene Ontology knowledgebase and resources. Nucleic Acids Res. 2017;45:D331–D338. doi: 10.1093/nar/gkw1108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Ashburner M, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000;25:25–29. doi: 10.1038/75556. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Brawand D, et al. The evolution of gene expression levels in mammalian organs. Nature. 2011;478:343–348. doi: 10.1038/nature10532. [DOI] [PubMed] [Google Scholar]
- 34.Marchetto MCN, et al. Differential L1 regulation in pluripotent stem cells of humans and apes. Nature. 2013;503:525–529. doi: 10.1038/nature12686. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Lin S, et al. Comparison of the transcriptional landscapes between human and mouse tissues. Proc Natl Acad Sci USA. 2014;111:17224–17229. doi: 10.1073/pnas.1413624111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Peng X, et al. Tissue-specific transcriptome sequencing analysis expands the non-human primate reference transcriptome resource (NHPRTR) Nucleic Acids Res. 2015;43:D737–742. doi: 10.1093/nar/gku1110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Ruiz-Orera J, et al. Origins of De Novo Genes in Human and Chimpanzee. PLoS Genet. 2015;11:e1005721. doi: 10.1371/journal.pgen.1005721. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Schultz MD, et al. Human body epigenome maps reveal noncanonical DNA methylation variation. Nature. 2015;523:212–216. doi: 10.1038/nature14465. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Uhlen M, et al. Proteomics. Tissue-based map of the human proteome. Science. 2015;347:1260419. doi: 10.1126/science.1260419. [DOI] [PubMed] [Google Scholar]
- 40.Williams Z, et al. Discovery and Characterization of piRNAs in the Human Fetal Ovary. Cell Rep. 2015;13:854–863. doi: 10.1016/j.celrep.2015.09.030. [DOI] [PubMed] [Google Scholar]
- 41.Wu J, et al. An alternative pluripotent state confers interspecies chimaeric competency. Nature. 2015;521:316–321. doi: 10.1038/nature14413. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Barrette AM, et al. Antiinflammatory Effects of Budesonide in Human Fetal Lung. Am J Respir Cell Mol Biol. 2016;55:623–632. doi: 10.1165/rcmb.2016-0068OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Qin J, et al. Connexin 32-mediated cell-cell communication is essential for hepatic differentiation from human embryonic stem cells. Sci Rep. 2016;6:37388. doi: 10.1038/srep37388. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Wang Y, et al. Conversion of Human Gastric Epithelial Cells to Multipotent Endodermal Progenitors using Defined Small Molecules. Cell Stem Cell. 2016;19:449–461. doi: 10.1016/j.stem.2016.06.006. [DOI] [PubMed] [Google Scholar]
- 45.Yan L, et al. Epigenomic Landscape of Human Fetal Brain, Heart, and Liver. J Biol Chem. 2016;291:4386–4398. doi: 10.1074/jbc.M115.672931. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.He C, et al. Systematic Characterization of Long Noncoding RNAs Reveals the Contrasting Coordination of Cis- and Trans-Molecular Regulation in Human Fetal and Adult Hearts. Circ Cardiovasc Genet. 2016;9:110–118. doi: 10.1161/CIRCGENETICS.115.001264. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Bernstein BE, et al. TheNIH Roadmap Epigenomics Mapping Consortium. Nat Biotechnol. 2010;28:1045–1048. doi: 10.1038/nbt1010-1045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Banovich NE, et al. Impact of regulatory variation across human iPSCs and differentiated cells. Genome Res. 2018;28:122–131. doi: 10.1101/gr.224436.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Clevers H. Modeling Development and Disease with Organoids. Cell. 2016;165:1586–1597. doi: 10.1016/j.cell.2016.05.082. [DOI] [PubMed] [Google Scholar]
- 50.Murry CE, Keller G. Differentiation of embryonic stem cells to clinically relevant populations: lessons from embryonic development. Cell. 2008;132:661–680. doi: 10.1016/j.cell.2008.02.008. [DOI] [PubMed] [Google Scholar]
- 51.Loh KM, et al. Efficient endoderm induction from human pluripotent stem cells by logically directing signals controlling lineage bifurcations. Cell Stem Cell. 2014;14:237–252. doi: 10.1016/j.stem.2013.12.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Wernig M, Gotz M, Eto K. Overcoming iPSC Obstacles. Cell Stem Cell. 2016;19:291–292. doi: 10.1016/j.stem.2016.08.018. [DOI] [Google Scholar]
- 53.Osafune K, et al. Marked differences in differentiation propensity among human embryonic stem cell lines. Nat Biotechnol. 2008;26:313–315. doi: 10.1038/nbt1383. [DOI] [PubMed] [Google Scholar]
- 54.Kyttala A, et al. Genetic Variability Overrides the Impact of Parental Cell Type and Determines iPSC Differentiation Potential. Stem Cell Reports. 2016;6:200–212. doi: 10.1016/j.stemcr.2015.12.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Nazareth EJ, et al. High-throughput fingerprinting of human pluripotent stem cell fate responses and lineage bias. Nat Methods. 2013;10:1225–1231. doi: 10.1038/nmeth.2684. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Kattman SJ, et al. Stage-specific optimization of activin/nodal and BMP signaling promotes cardiac differentiation of mouse and human pluripotent stem cell lines. Cell Stem Cell. 2011;8:228–240. doi: 10.1016/j.stem.2010.12.008. [DOI] [PubMed] [Google Scholar]
- 57.Rao J, et al. Stepwise Clearance of Repressive Roadblocks Drives Cardiac Induction in Human ESCs. Cell Stem Cell. 2016;18:341–353. doi: 10.1016/j.stem.2015.11.019. [DOI] [PubMed] [Google Scholar]
- 58.Zhou P, Pu WT. Recounting Cardiac Cellular Composition. Circ Res. 2016;118:368–370. doi: 10.1161/CIRCRESAHA.116.308139. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Foglia MJ, Poss KD. Building and re-building the heart by cardiomyocyte proliferation. Development. 2016;143:729–740. doi: 10.1242/dev.132910. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Mohamed Tamer M.A., Ang Yen-Sin, Radzinsky Ethan, Zhou Ping, Huang Yu, Elfenbein Arye, Foley Amy, Magnitsky Sergey, Srivastava Deepak. Regulation of Cell Cycle to Stimulate Adult Cardiomyocyte Proliferation and Cardiac Regeneration. Cell. 2018;173(1):104-116.e12. doi: 10.1016/j.cell.2018.02.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Soonpaa MH, et al. Cardiomyocyte Cell-Cycle Activity during Preadolescence. Cell. 2015;163:781–782. doi: 10.1016/j.cell.2015.10.037. [DOI] [PubMed] [Google Scholar]
- 62.Naqvi N, et al. A proliferative burst during preadolescence establishes the final cardiomyocyte number. Cell. 2014;157:795–807. doi: 10.1016/j.cell.2014.03.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Marinho PA, Chailangkarn T, Muotri AR. Systematic optimization of human pluripotent stem cells media using Design of Experiments. Sci Rep. 2015;5:9834. doi: 10.1038/srep09834. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Li M, et al. Thyroid hormone action in postnatal heart development. Stem Cell Res. 2014;13:582–591. doi: 10.1016/j.scr.2014.07.001. [DOI] [PubMed] [Google Scholar]
- 65.Kim D, Langmead B, Salzberg SL. HISAT: a fast spliced aligner with low memory requirements. Nat Methods. 2015;12:357–360. doi: 10.1038/nmeth.3317. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Liao Y, Smyth GK, Shi W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. 2014;30:923–930. doi: 10.1093/bioinformatics/btt656. [DOI] [PubMed] [Google Scholar]
- 67.Robinson MD, Oshlack A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biol. 2010;11:R25. doi: 10.1186/gb-2010-11-3-r25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Ritchie M, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Research. 2015;43:e47. doi: 10.1093/nar/gkv1007,. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Ballman K, Grill D, Oberg A, Therneau T. Faster cyclic loess: normalizing RNA arrays via linear models. Bioinformatics. 2004;20:2778–2786. doi: 10.1093/bioinformatics/bth327. [DOI] [PubMed] [Google Scholar]
- 70.Hoffman GE, Schadt EE. variancePartition: interpreting drivers of variation in complex gene expression studies. BMC Bioinformatics. 2016;17:483. doi: 10.1186/s12859-016-1323-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Blekhman, R. A database of orthologous exons in primates for comparative analysis of RNA-seq data. Nature Precedings, 10.1038/npre.2012.5360.1034, doi:hdl:10101/npre.2012.7054.1 (2012).
- 72.Kent WJ. BLAT–the BLAST-like alignment tool. Genome Res. 2002;12:656–664. doi: 10.1101/gr.229202. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841–842. doi: 10.1093/bioinformatics/btq033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Smyth Gordon K. Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments. Statistical Applications in Genetics and Molecular Biology. 2004;3(1):1–25. doi: 10.2202/1544-6115.1027. [DOI] [PubMed] [Google Scholar]
- 75.Smyth GK, et al. Use of within-array replicate spots for assessing differential expression in microarray experiments. Bioinformatics. 2005;21:2067–2075. doi: 10.1093/bioinformatics/bti270. [DOI] [PubMed] [Google Scholar]
- 76.Law CW, Chen Y, Shi W, Smyth GK. voom: Precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biol. 2014;15:R29. doi: 10.1186/gb-2014-15-2-r29. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Benjamini Y, Hochberg Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society. Series B (Methodological) 1995;57:289–300. [Google Scholar]
- 78.Yu G, Wang LG, Han Y, He Q. Y. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 2012;16:284–287. doi: 10.1089/omi.2011.0118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Lex A, Gehlenborg N, Strobelt H, Vuillemot R, Pfister H. UpSet: Visualization of Intersecting Sets. IEEE Trans Vis Comput Graph. 2014;20:1983–1992. doi: 10.1109/TVCG.2014.2346248. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Gene expression (RNA-seq) data are available at GEO under accession number GSE110471. All human and chimpanzee iPSCs are available upon request without restriction or limitation.