Abstract
Human induced pluripotent stem cells (hiPSCs) generate multiple clones with inherent heterogeneity, leading to variations in their differentiation capacity. Previous studies have primarily addressed line-to-line variations in differentiation capacity, leaving a gap in the comprehensive understanding of clonal heterogeneity. Here, we aimed to profile the heterogeneity of hiPSC clones and identify predictive biomarkers for cardiomyocyte (CM) differentiation capacity by integrating transcriptomic, epigenomic, endogenous retroelement, and protein kinase phosphorylation profiles. We generated multiple clones from a single donor and validated that these clones exhibited comparable levels of pluripotency markers. The clones were classified into two groups based on their differentiation efficiency to CMs—productive clone (PC) and non-productive clone (NPC). We performed RNA sequencing (RNA-seq) and assay for transposase-accessible chromatin with sequencing (ATAC-seq). NPC was enriched in vasculogenesis and cell adhesion, accompanied by elevated levels of phosphorylated ERK1/2. Conversely, PC exhibited enrichment in embryonic organ development and transcription factor activation, accompanied by increased chromatin accessibility near transcription start site (TSS) regions. Integrative analysis of RNA-seq and ATAC-seq revealed 14 candidate genes correlated with cardiac differentiation potential. Notably, TEK and SDR42E1 were upregulated in NPC. Our integrative profiles enhance the understanding of clonal heterogeneity and highlight two novel biomarkers associated with CM differentiation. This insight may facilitate the identification of suboptimal hiPSC clones, thereby mitigating adverse outcomes in clinical applications.
Graphical Abstract
Supplementary Information
The online version contains supplementary material available at 10.1007/s00018-024-05493-9.
Keywords: Induced pluripotent stem cells, Clonal variation, Cardiomyocyte differentiation efficiency, Integrative analysis, Predictive biomarker
Introduction
Human induced pluripotent stem cells (hiPSCs) are widely used as in vitro models for rare diseases and cell therapies, offering insights into individual genetic backgrounds and the potential for differentiation into diverse cell types. Especially, hiPSC-derived cardiomyocytes (CMs) hold remarkable promise for drug development and drug toxicity testing [1, 2]. The reprogramming of somatic cells involves various methods, such as those based on lentiviruses, episome, and Sendai viruses, resulting in the formation of distinct colonies known as clones. Previous studies have highlighted the heterogeneity among these clones, including variability in epigenetic remodeling [3] and differentiation [4]. Studies by the Panopoulos group have revealed clone-specific effects of differentially methylated CpG sites, which affect the expression patterns of genes and key long non-coding RNAs associated with pluripotent stem cell function [5, 6]. Pluripotency gene expression and cell morphology [7] were empirically used to select high-quality clones, but these criteria may not ensure high differentiation capacity. Several studies have attempted to differentiate multiple hiPSC clones to identify those suitable for clinical applications [4, 8]. However, because differentiation rates are typically assessed post-differentiation, there is a growing need to evaluate the differentiation capacity of undifferentiated hiPSCs to minimize resource expenditure on suboptimal clones. To the best of our knowledge, few efforts have been made to profile the clonal heterogeneity, which emphasizes the need to identify predictive biomarkers to control cell fate determination at the undifferentiated stage.
Variations in differentiation outcomes have been reported based on factors such as reprogramming methods, cell origin, and molecular status [9–11]. For example, our previous paper investigated the cardiac differentiation potential of hiPSCs derived from various reprogramming methods, with each line including two clones [9]. Their findings indicated that the most significant differences in differentiation efficiency were associated with the reprogramming methods rather than the clonal variation. Other previous studies have also predominantly focused on parental cell types or line-to-line variation [10, 12, 13], which may not adequately explain the impact of clonal heterogeneity within hiPSC from a single donor.
Here, we aimed to investigate gene expression patterns and chromatin accessibility landscapes in hiPSC clones by integrating RNA sequencing (RNA-seq), assay for transposase-accessible chromatin by sequencing (ATAC-seq), and phospho-kinase array analysis. Our results unveiled distinct molecular signatures among clones with different differentiation potentials. We identified TEK and SDR42E1 as novel predictive biomarkers for selecting optimal hiPSC clones, providing comprehensive insights into predicting CM differentiation.
Materials and method
Blood collection and ethics statement
Human blood samples were obtained from healthy individuals. Ethical approval for the use of peripheral blood mononuclear cells (PBMCs) and hiPSCs was obtained from the Institutional Review Board of Sungkyunkwan University (IRB no. 2019-11-016-002 and 2020-01-020-001) and Samsung Medical Center (IRB no. 2016-11-025-015). Our study was conducted in accordance with the principles outlined in the Declaration of Helsinki, following the acquisition of written consent. For the reprogramming, PBMCs were obtained using the CPT mononuclear cell preparation tube (BD, 362761) and the provided protocol.
Reprogramming and cell culture
Human PBMCs were cultured and expanded in StemPro-34 SFM (Gibco, Waltham, MA, USA) supplemented with StemPro-34 Nutrient Supplement, 0.1 × penicillin–streptomycin (Gibco), 100 ng/mL stem cell factor (Peprotech, Cranbury, NJ, USA), 100 ng/mL FMS-related tyrosine kinase 3 ligand (FLT3L, Peprotech), 20 ng/mL interleukin-3 (IL-3, Peprotech), and 20 ng/mL interleukin-6 (IL-6, Peprotech). Reprogramming was performed by following the feeder-free protocol of the CytoTune™-iPS 2.0 Sendai Reprogramming Kit (Invitrogen, Carlsbad, CA, USA). The generated human iPSCs were maintained in TeSR™-E8™ medium (Stemcell Technologies, Vancouver, Canada) on Matrigel (Corning Inc., Corning, NY, USA)-coated plates in a 5% CO2 incubator at 37 °C. All clones were cultured using EDTA dissociation (Invitrogen) following published protocols.
Karyotype analysis
hiPSCs were cultured in T25 flasks coated with Matrigel in TeSR™-E8™ medium for 3 days. The karyotype analysis was conducted to standard protocols for cell harvest and chromosomal Giemsa (G)-banding in GenDix Co., Ltd.(Seoul, Korea).
Direct cardiac differentiation of hiPSCs
A previously published protocol involving the modulation of the canonical Wnt signaling pathway was used for hiPSC-CM differentiation [14]. At day 0, 4–6 µM CHIR99021 (Tocris, Bristol, UK) in RPMI1640, supplemented with B27 minus insulin (Gibco), was administered. On days 1–2, 1 mL B27 was added to the insulin differentiation medium to expose the cells to a gradient concentration of CHIR99021. On day 3, after aspiration of the previous medium, 5 µM endo-IWR1 (Tocris) in B27 minus insulin differentiation media was administered. On days 5 and 7, the B27 minus insulin differentiation medium was administered after aspiration of the previous medium. During days 9–15, after aspiration of the previous medium, RPMI1640 supplemented with B27 (Gibco) was administered. To compare the differentiation efficiencies of the clones, we did not proceed with metabolic purification on days 11–15. At day 15, beating cardiomyocytes were treated with TrypLE Select Enzyme 10 × without phenol red (Gibco) and incubated at 37 °C for 10–30 min. Single-cell suspensions were transferred to a 15 mL conical tube containing RPMI1640, and the cells were centrifuged at 300 × g for 5 min. After aspiration of the supernatant, the cell pellet was resuspended in RPMI1640 supplemented with B27 containing 10% Knock Out Serum Replacement (Gibco) and 10 µM Y-27632 (Biogems, Westlake Village, CA, USA) for further analysis.
Flow cytometry
Cells were fixed in 4% paraformaldehyde (Thermo Fisher Scientific, Waltham, MA, USA) for 15 min at room temperature, washed once with washing buffer (1% fetal bovine serum/phosphate-buffered saline [PBS]), and centrifuged at 300 × g for 5 min. The cells were permeabilized with 0.1% Triton X-100 in PBS for 15 min at room temperature, washed twice with washing buffer, and centrifuged at 300 × g for 5 min. Incubation with a diluted primary antibody (1:20 dilution in 3% bovine serum albumin (BSA)/PBS) was performed for 1 h on ice in the dark. The cells were washed twice with washing buffer and centrifuged at 300 × g for 5 min at 4 °C. Cell pellets were resuspended in 500 µL of 3% BSA/PBS and filtered through a cell strainer tube (Falcon, Corning, Inc.). Samples were analyzed using a FACS Aria3 SORP flow cytometer (Becton Dickinson, Franklin Lakes, NJ, USA) by BIORP of the Korea Basic Science Institute. For each experiment, 10,000 events were collected; all data were analyzed using the FlowJo software (Becton Dickinson).
Field potential measurements using a Multielectrode array (MEA)
To coat the surface of CytoView MEA 24-well plates (Axion BioSystems, Atlanta, GA, USA), hESC-qualified Matrigel was diluted in RPMI1640 and applied to the wells. On day 15, hiPSC-derived CMs were seeded on Matrigel-coated plates. See supplementary materials online for detailed method.
Quantitative reverse transcription polymerase chain reaction (qRT-PCR)
Total RNA was extracted from all hiPSC clones using the easy-BLUE™ Total RNA Extraction Kit (iNtRON, Seongnam, Korea) for qRT-PCR. Reverse transcription of 1 µg total RNA was performed using Maxime™ RT Premix (iNtRON). qPCR was performed using KAPA SYBR® FAST Master Mix 2× Universal (KAPA BIOSYSTEMS, Wilmington, MA, USA), a CFX86™ Real-Time System (Bio-Rad, Hercules, CA, USA) or a CFX Connect™ Real-Time System (Bio-Rad) according to the manufacturer’s instructions. Normalized expression levels for each gene (ΔΔCt) were calculated based on the levels of the housekeeping genes GAPDH or RNA18S1. The primer sequences are listed in the Supplementary Table 1.
Immunocytochemistry
Cells were seeded on Matrigel-coated cell chamber slide. Next day, cells were fixed in 4% paraformaldehyde (Thermo Fisher Scientific). Cells were seeded on Matrigel-coated cell chamber slide. Next day, cells were fixed in 4% paraformaldehyde (Thermo Fisher Scientific) for 15 min at room temperature, washed thrice with PBS, permeabilized with 0.1% Triton X-100/PBS for 15 min at room temperature, and washed thrice with PBS. The cells were blocked with 1% BSA in PBS for 1 h at room temperature on a shaker. Incubation with diluted primary antibody was carried out overnight at 4 °C. After 16 h, the cells were washed thrice for 10 min each with PBS at room temperature. Alexa-conjugated secondary antibodies were diluted 1:600 in blocking buffer and incubated with cells for 1 h and 30 min at room temperature on a shaker. After incubation, the cells were washed thrice with PBS for 10 min. Nuclei were counterstained with Hoechst stain (Gibco). Imaging was performed using a TCS SP8 confocal microscope (Leica, Wetzlar, Germany). The antibodies are listed in the Supplementary Table 2.
Protein extraction and immunoblotting
Cells were lysed in the PRO-PREP protein extraction solution (iNtRON) with 1× phosphatase inhibitor cocktail solution (GenDEPOT, Katy, TX, USA) after washing once with cold 1× PBS. Lysates were homogenized and centrifuged at 13,000 rpm at 4 °C for 20 min. Protein concentration was measured, and equal amounts of each protein sample were loaded onto a polyacrylamide gel and separated via sodium dodecyl sulfate-polyacrylamide gel electrophoresis. Cells were lysed in the PRO-PREP protein extraction solution (iNtRON) with 1× phosphatase inhibitor cocktail solution (GenDEPOT, Katy, TX, USA) after washing once with cold 1× PBS. Lysates were homogenized using ultrasonic homogenizers for 3 s at 12% amplitude and centrifuged at 13,000 rpm at 4 °C for 20 min. Protein concentration was measured on a spectrophotometer using the Bradford method. Equal amounts of each protein sample were loaded onto a polyacrylamide gel and separated via sodium dodecyl sulfate-polyacrylamide gel electrophoresis. The proteins were subsequently transferred onto polyvinylidene difluoride membranes (Millipore, Burlington, MA, USA) using a wet transfer system (Bio-Rad). The membranes were blocked with 5% skim milk in Tris-buffered saline with Tween 20 and incubated overnight at 4 °C with diluted primary antibodies. Horseradish peroxidase (HRP)-conjugated secondary antibodies were added in a species-dependent manner and incubated at room temperature for 1 h. The HRP signals were detected using AbSignal (Abclon, Seattle, WA, USA) or SuperSignal West Femto Maximum Sensitivity Substrate (Thermo Fisher Scientific) using a ChemiDoc Imaging system (Bio-Rad). Bands were quantified using the FIJI software. The antibodies are listed in the Supplementary Table 2.
Human phospho-kinase arrays
Cells were lysed in lysis buffer, containing 1 × protease inhibitor cocktail solution (GenDEPOT), after washing once with cold 1 × PBS, and phosphorylation analysis was performed according to the manufacturer’s instructions (R&D Systems, Minneapolis, MN, USA). Signals were detected using a ChemiDoc Imaging System (Bio-Rad), and the density of the dots was quantified using the Quick Spots Tool (Ideal Eyes Systems, Bountiful, UT, USA).
Sample preparation of bulk mRNA-sequencing and ATAC-sequencing
Cells were collected for bulk RNA-seq and ATAC-seq analysis. Briefly, hiPSCs were dissociated and collected in PBS. Cell counting was performed using a trypan blue solution (0.4%). For each hiPSC clone, 1.7 × 106 cells were used for ATAC-seq, and the remaining were used for RNA extraction. See supplementary materials online for detailed method.
Analysis of bulk mRNA-sequencing data
Paired-end reads were aligned to the Gencode GRCh37.p13 reference genome (https://www.gencodegenes.org) using STAR (v.2.7.3a) with the default settings. Gene counts were quantified using featureCounts based on the hg19 GENCODE annotation reference. DESeq2 was used for differential expression analysis and to obtain normalized counts. See supplementary materials online for detailed method.
Analysis of long terminal repeats (LTRs)
RepeatMasker (RMSK) from the UCSC Genome Table Browser of hg19 were utilized in this study. STAR was employed to align raw FASTQ files to hg19 rmsk files with specific multimapper parameters, including winAnchorMultimapNmax 200, outFilterMatchNminOverLread 0.66, outFilterMismatchNoverLmax 0.05, and outFilterMultimapNmax 100. TEs were quantified following an established RNA sequencing analysis protocol using featureCounts. DESeq2 was then applied to identify differentially expressed TEs with a significance threshold of Padj < 0.05 and log2|FC|> 1. To gain deeper insights into the TE characteristics, each TE count was initially treated as repeat and presented in a bar plot. Subsequently, a heatmap was generated to annotate the TEs, with TE counts serving as repeat values.
Analysis of bulk ATAC-sequencing data
FASTQ files were aligned with the human genome (GRCh37.p13), and peak calling was executed using the PEPATAC pipeline. Adapter sequences were removed using Skewer, reads were mapped to the GRCh37 build of the human genome using bowtie2, and reads were sorted and isolated using Samtools while removing duplicates with samblaster. MACS2 was applied for peak calling, extending the peak summits by 250 bp in both directions and filtering based on the ENCODE blacklist. See supplementary materials online for detailed method.
Statistical analysis
Graphic presentation and statistical analysis were performed using GraphPad Prism software v10.1.1 (GraphPad Software, San Diego, CA, USA), and results are presented as the mean per group ± standard deviation (SD) unless otherwise indicated. Between-group comparisons were performed by unpaired Student’s t-tests, Welch’s t-tests, one-way analysis of variance (ANOVA), or Brown-Forsythe and Welch ANOVA. For one-way ANOVA and Brown-Forsythe or Welch ANOVA, multiplicity-adjusted (Tukey or Dunnett T3) P values are reported throughout, with the threshold of alpha set at 0.05. The p-values are indicated in each figure for statistically significant comparisons (*, p < 0.05; **, p < 0.01; ***, p < 0.001; ****, p < 0.0001). The specific tests used for each figure and number of replicates are mentioned in the figure legends.
Results
Human iPSC clones reprogrammed from a single donor represented comparable levels of pluripotency marker expression and cell proliferation
We generated multiple clones from a single donor using Sendai virus to investigate the heterogeneity among hiPSC clones (Figs. 1A and S1A). Approximately 20 colonies were observed, and individual colonies were manually picked and subsequently expanded on Matrigel-coated culture plates. The resulting six clones (CL1-6: clone 1–6) formed stable colonies devoid of differentiated cells and displayed typical hiPSC morphology [7], including prominent nucleoli and distinct borders (Fig. 1B). All clones were maintained under the same conditions to minimize the effect of subculture processes. Immunofluorescence confirmed the expression of pluripotency-associated markers, including SOX2, SSEA4, and OCT4 in all clones (Figs. 1C and S1B). The protein levels of SOX2 and OCT4 were not significantly different among the clones (Fig. 1D, F, and G). In addition, the six clones exhibited similar proliferation rates (~ 60%) and normal karyotypes (Figs. 1E, H, and S1C). In summary, we obtained six hiPSC clones that expressed pluripotency markers and exhibited comparable growth rates.
Fig. 1.
Generation and characterization of human induced pluripotent stem cell (hiPSC) clones. A Experimental design for the evaluation of clonal heterogeneity in hiPSC. B Phase-contrast images of the clones. Scale bar = 50 μm. C Representative immunostaining images for SOX2 (green), SSEA4 (red), OCT4 (yellow), and Hoechst (blue) staining of the clones. Scale bar = 50 μm. D Representative immunoblotting images for OCT4, SOX2, and H3. E Representative images for 5-ethynyl-2′-deoxyuridine (EdU) incorporation assay. Scale bar = 50 μm. F–G Quantification of signal intensity of the immunoblotting for OCT4 (F) and SOX2 (G). Data show two technical replicates per group, and all values were normalized to H3. H Quantification of EdU + proliferating cells stained with Hoechst. Data show six technical replicates per group. All statistical analysis were performed using one-way analysis of variance (ANOVA) with Tukey’s multiple comparisons test (F, H) or Brown-Forsythe and Welch ANOVA with Dunnett’s multiple comparisons test (G). Data are presented as bar graphs and error bars with mean ± standard deviation
Differentiation capacity of cardiomyocytes varied among clones
To assess the differentiation potential of the clones, we induced cardiomyocytes (CMs) from six hiPSC clones following an established protocol [14]. All clones were simultaneously treated with the same concentration of CHIR99021 (5 µM), a GSK-3 inhibitor. The proportion of TNNT2 (cardiac troponin-T)-positive cells were compared via flow cytometry between day 12 to 15 of differentiation (Fig. 2A). CL1 and 3 consistently exhibited robust differentiation rates (> 60%), whereas CL2 showed a lower differentiation rate (< 50%). CL4, 5, and 6 showed variability among batches, thereby these clones were excluded in further analysis (Fig. 2B, C, and S2A). The differentiated cells derived from CL1 and 3 showed spontaneous beating by day 15 (white arrows in Fig. 2D), whereas CL2 showed a higher proportion of non-beating cells (Videos S1 and 2). Immunofluorescence confirmed the coexistence of CMs (TNNT2-positive) and non-CMs (TNNT2-negative) in CL2 (Figs. 2E and S2B). Using quantitative reverse transcription-polymerase chain reaction (qRT-PCR), we compared the expression levels of CM markers (TNNT2 and MYH7/6: myosin heavy chain 7/6) and a non-CM marker (VIM: vimentin) (Fig. 2F). VIM was upregulated in CL2, indicating the presence of a mixed proportion of CMs and non-CMs. To mitigate the potential effects of CHIR99021 concentration, differentiation was also induced at 4 and 6 μM. At 4 μM of CHIR99021, the results were consistent with previous observations, while differentiation did not occur in any of the clones at 6 μM of CHIR99021 (Fig. S3A–C and Videos S3–S5).
Fig. 2.
Evaluation of the differentiation efficiency and functionality of human induced pluripotent stem cell-derived cardiomyocytes (hiPSC-CMs). A Experimental scheme for the evaluation of cardiac differentiation capacity. B Flow cytometry of cardiac troponin T (TNNT2) expression in hiPSC-CMs showed that the clones had different efficiencies, based on which they were divided into two groups—PC (blue) and NPC (red). Three or four technical replicates per group are shown. C Representative flow cytometric histograms showing the TNNT2 + (%) cells of hiPSC-CMs when hiPSC clones (CL1- 3) were treated with 5 µM of CHIR99021. Gray peaks represent the non-stained control. D Phase-contrast images of hiPSC-CMs derived from PC (left, CL1) and NPC (right, CL2); beating CMs were represented with white arrows. Scale bar = 1000 μm. E Representative images of Immunostaining for TNNT2 (green) and Hoechst (blue) staining of hiPSC-CMs (day 15). F Quantitative reverse transcription-polymerase chain reaction (qRT-PCR) of CM (TNNT2, MYH7, and MYH6) and non-CM (VIM) markers in hiPSC-CMs. All values were normalized to GAPDH. Data show three or four technical replicates per group. G Field potential analysis of hiPSC-CMs using a multielectrode (MEA) system. Quantification of the beating rate, spike amplitude, and spike slope. Data represent three technical replicates per group. H Representative heatmap images of conduction map. The cardiac beating begins where the plot is blue (shortest delay from the beat origin) and ends where the plot is red (longest delay from the beat origin). All statistical analysis were performed using unpaired Student’s t-tests or Welch’s t-tests; *P < 0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001. Data are presented as bar graphs and error bars with mean ± standard deviation (B, G) or standard error of mean (F)
Functional assessment of CMs was performed using a multielectrode array system, and data were acquired using CMs at day 20, which was 5 days post-plating. We detected active beating of CMs in CL1 and 3, but not in CL2 (Fig. S3D), possibly due to the low proportion of CMs (Fig. 2B-F). There was no significant difference of beating rate between CL1 and 3, whereas CL3 exhibited an increase in spike amplitude and slope (Fig. 2G and S3D). The conduction map showed a delay in conduction, which was no significant difference of conduction velocity, maximum delay, and propagation consistency between CL1 and 3, with no observed arrhythmias (Fig. 2H and S3E). Recent studies have reported that hiPSC-derived CMs show conduction velocities ranging from 0.04 m/s to 1.0 m/s [15, 16], and CL1 and 3 showed the velocities within this range except for CL2. Overall, the hiPSC clones (CL1-3) were classified into two groups based on their differentiation capacity, with CL2 showing a low capacity, resulting in mixed proportions of CMs and non-CMs—CL1 and 3 as productive clone (PC) and CL2 as non-productive clone (NPC).
Phosphorylation state of ERK1/2 signaling pathway was elevated in NPC
Signaling pathways such as Wnt, BMP, and ERK play crucial roles in cardiac development [17, 18]. Recent studies have highlighted the effect of the phosphorylation status of signaling kinases in hiPSCs on their differentiation potential [19, 20]. To compare the signaling profiles of the hiPSC clones (PC and NPC), we conducted a human phospho-kinase array. We examined whether the phosphorylation levels of Wnt/β-catenin or GSK-3α/β differed between the two groups, because we used a GSK-3 inhibitor, a regulator of the Wnt/β -catenin pathway, to induce CM differentiation (Fig. 2A). However, the phosphorylation of GSK-3α/β (Ser21/Ser9) did not show significant differences among the clones (Fig. 3A and B). We also assessed the nuclear translocation of β-catenin using immunofluorescence, which revealed no discernible differences across all clones (Fig. S4A).
Fig. 3.
The phosphorylation state of ERK1/2 signaling pathway was elevated in non-productive clone (NPC). A Chemiluminescent images of human phosphokinase array revealed distinct phosphorylation levels of ERK1/2 (T202/Y204 and T185/Y187), eNOS (S1177), p38α (T180/Y182), and p53 (S46) among clones (CL1-3). B Mean pixel density of 37 kinase phosphorylation sites and two related total proteins, normalized to reference spots. Statistical analysis was performed using multiple unpaired Student’s t-tests, and the two-stage step-up method of Benjamini, Krieger, and Yekutieli was used for the correction of multiple comparisons; *q < 0.05. Data represent two technical replicates per clone (PC: CL1 and 3, NPC: CL2). C Volcano plot of multiple unpaired Student’s t-tests showing that four phosphorylation sites were significantly different between PC and NPC; the largest difference is highlighted with a pink box. D Representative immunoblotting images for ERK1/2, phosphorylated ERK1/2 (T202/Y204), and H3. T: threonine, Y: tyrosine, S: serine
The phosphorylation levels of ERK1/2 (Thr202/Tyr204 and T185/Y187), p38α (Thr180/Tyr182), and eNOS (Ser1177) were significantly elevated in NPC, whereas those of p53 (Ser46) were increased in PC (Fig. 3B and C). Of these pathways, previous studies have reported that ERK1/2 [18] and p38α [21, 22] pathways are involved in CMs differentiation. The difference in the phosphorylation level of ERK1/2 between PC and NPC was the most pronounced in this study (Fig. 3C), and we confirmed the increased phosphorylation of ERK1/2 in NPC using immunoblotting (Fig. 3D). Taken together, the level of phosphorylated ERK1/2 was elevated in NPC compared to that in PC, whereas no differences were observed in the regulation of the Wnt pathway.
Distinct transcriptomic profiles revealed clonal heterogeneity
Recent studies have reported that variations in the gene expression patterns of human pluripotent stem cells can affect their potential to differentiate into various cell lineages [23]. To analyze the transcriptomic profiles of the three clones (hiPSC #1 CL1-3), we performed bulk RNA-seq, followed by quality control of the raw read data (Fig. S5A). We compared our data with previously published data using principal component analysis (PCA) to ensure that all clones exhibited transcriptional signatures consistent with those of hiPSCs [24]. Our clones were closely clustered with hiPSCs, contrasting with other tissues, based on data from the Genotype-Tissue Expression (GTEx) portal (Fig. 4A).
Fig. 4.
Transcriptomic analysis of human induced pluripotent stem cell (hiPSC) clones using bulk RNA-sequencing. A Principal component analysis (PCA) plot showing that generated hiPSC clones were in a cluster of previous iPSC data and distant from data from other tissues, based on data from the Genotype-Tissue Expression [GTEx] portal. B Heatmap displayed differentially expressed genes (DEGs, log2|FC|> 1 and adjusted P-values (Padj) < 0.05) between PC and NPC. C, D Gene Ontology (GO) analysis of DEGs in the biological process (BP), molecular function (MF), and cellular component (CC) categories. The emapplots (C) and bar plots (D) showed significantly enriched GOs in the two groups. The blue and red dots and bars indicate upregulated GOs in PC and NPC, respectively. E Chord diagram illustrating inter-relationships between the genes related to the GOs identified in C. F The volcano plot depicted the DEGs shown in B, and the dashed lines indicated log2|FC|= 1 and Padj = 0.05. The top genes identified in E are marked with green dots
To investigate the molecular signatures that affect clonal heterogeneity, especially the differentiation capacity of CMs, we identified 416 differentially expressed genes (DEGs) between the two groups. Among these, 240 genes were upregulated in PCs, whereas 176 were upregulated in NPC (log2|FC|> 1, adjusted P-value [Padj] < 0.05) (Fig. 4B). The majority of the DEGs were annotated as protein-coding genes (90%, Fig. S5B). Gene Ontology (GO) analysis revealed that PC exhibited enrichment in the development of embryonic organs, including the heart and kidney, along with DNA-binding transcription activator activity (Fig. 4C, D, and S5C). In contrast, NPC was enriched for muscle cell development, vasculogenesis, transmembrane receptor protein tyrosine kinase activity, and actin filaments. Both groups exhibited significant enrichment of the ERK1/2 signaling pathway. Furthermore, gene set enrichment analysis (GSEA) indicated the enrichment of signaling pathways regulating the pluripotency of stem cells in PC, whereas NPC was enriched for cell adhesion molecules and vascular smooth muscle contraction (Fig. S5E and F). To figure out which genes are involved in clone-specific GO terms and the ERK1/2 signaling (Figs. 3A–D, 4C, D, S4C, and D), we created a chord plot showing the relationships between DEGs and enriched pathways (Fig. 4E and F). For example, NPC showed upregulated levels of the transmembrane protein receptor tyrosine kinases TIE1 and TEK (also known as TIE2), which are key regulators of vasculature development (e.g., endothelial cells) [25, 26] and ERK1/2 activation [27]. In contrast, PC showed upregulated levels of FOXC1, a member of the FOX family involved in embryonic tissue development [28] and differentiation of early CMs [29]. These findings highlight the transcriptomic profiles between PC and NPC with varying differentiation potential for CMs.
Dysregulation of endogenous retroelements modulated nearby gene transcription in clones
Endogenous retroelements (EREs), comprising long terminal repeats (LTRs) and non-LTR transposons (Fig. 5A), constitute almost half of the human genome [30]. Previous studies have highlighted the effect of ERE expression patterns in pluripotent stem cells on various characteristics of hiPSCs, including cell morphology, pluripotency, and lineage commitment [31–33]. To examine the expression patterns of EREs among the clones, we mapped transcript reads onto copies of transposable elements (TEs) (Fig. 5B). The total read counts of TE contents were quantified for each clone (4.6 × 106, 4.7 × 106 and 4.7 × 106, respectively), showing consistent patterns across all clones (Fig. 5C). Differentially expressed TEs were identified between PC and NPC (log2|FC|> 1 and Padj < 0.05), with LTRs showing the highest proportions (CL1: 83.9%, CL2: 86.6%, and CL3: 84.9%). Notably, NPC exhibited an increase in the total number of significant copies of TEs when compared with PC (Fig. 5D). As copies of repeat classes are distributed across the human chromosomes [34], we compared the overall and locus-specific expression of LTRs to identify differentially expressed subtypes and their chromosomal locations. We detected 15 significant LTR subtypes (log2|FC|> 1 and Padj < 0.05) between PC and NPC. Of these, 5 subtypes (HERVK-int, HERVK11-int, LTR88c, THE1D-int, and MER52C) were highly expressed in PC, while 10 subtypes (HERVI-int, MER52C, HERV4_I-int, MER51B, LTR88b, etc.) were highly expressed in NPC (Fig. 5E). Additionally, 103 ERE loci were significantly dysregulated, with 49 of these loci belonging to LTRs from five subfamilies—ERV1 (63.3%), ERVK (2%), ERVL (8.2%), ERVL-MaLR (24.5%), and gypsy (2%) (Table S3).
Fig. 5.
Assessment of variations in expression levels of endogenous retroelements (EREs). A Classification of human transposable elements (TEs). B Experimental scheme for the analysis of TEs. C Bar plot showing the proportion of total read counts of TE contents identified in RNA-seq. D Bar plot showing the proportion of differentially expressed TEs (log2|FC|> 1 and Padj < 0.05), indicating that the long terminal repeats (LTRs) showed the highest proportion. Normalized counts of the differentially expressed LTRs are presented. E Heatmap showed the expression levels of significant LTR subtypes (e.g., LTR88c, HERVI-int, and HERV4_I-int) between PC and NPC. F Heatmap displayed the expression of genes located in proximity to the identified EREs in E. The genes corresponding to DEGs are marked with *. (G-H) Loci of the genes (ANKRD2, CLEC2A, OC90, and HHLA1) were identified in the University of California, Santa Cruz (UCSC) browsers and visualized in Integrative Genomics Viewer (IGV) tracks. Two examples of dysregulated MER77 and HERVI-int integrants and expression level of the closest genes: OC90 and HHLA1 for MER77 on chromosome 8 (G), and ANKRRD2 and CLEC2A for HERVI-int on chromosome 1 and 12, respectively (H)
Given that EREs act as cis-regulatory elements to regulate the expression of neighboring genes and generate chimeric transcripts [35], we investigated differentially expressed loci near gene transcripts. We identified 21 genes adjacent to these loci (Fig. 5F and Table S4), of which 8 genes (ANKRD45, LYST, CLEC2A, RAB17, WFDC3, CEP85L, HHLA1, and OC90) are protein-coding. Among these, 5 genes (CLEC2A, RAB17, CEP85L, HHLA1, and OC90) were identified as DEGs (Fig. 4B). Consistent with previous findings [32, 34, 35], the upregulation of HHLA1 and OC90 was affected by nearby ERE loci (Fig. 5G). Unexpectedly, no reads were mapped to exons 1–4 of CLEC2A, suggesting a new transcript potentially transcribed from LTR10B1 (chr12: 10,055260–10,055,764) and HERVI-int (chr12: 10,055,800–10,059,965) (Fig. 5H and Table S4). We observed a differentially expressed ERE loci near ANKRD45, which was not identified as a DEGs (chr1: 173,607,756–173,611,588). Collectively, our analysis of EREs revealed that profiles of EREs, especially LTRs, may contribute to clonal heterogeneity by regulating nearby genes and generating new transcripts.
Clones revealed differential chromatin accessibility and susceptibility to transcription factors
The reprogramming of hiPSCs entails global remodeling of the epigenetic state [36], where epigenetic variations, such as DNA methylation and chromatin state, play pivotal roles in the differentiation potential of hiPSCs [37]. To detect genomic chromatin accessibility between PC and NPC, we performed an ATAC-seq. An average of 9.6 × 106 fragments per sample was obtained, with a consistent distribution of peaks across genomic regions in all clones (Fig. 6A). In total, 3.1 × 105, 3.0 × 105, and 3.6 × 105 open chromatin peaks were identified in CL1, 2, and 3, respectively (Fig. 6B). PC exhibited elevated chromatin accessibility in the vicinity of transcription start site (TSS) regions (TSS ± 3 kb) compared to NPC, indicating a predisposition for active transcription (Fig. 6C). We identified 2869 differentially accessible regions (DARs) by comparing ATAC-seq peak tag counts. Among these, 2142 regions were detected in PC (log2|FC|> 0.5 and P < 0.05), whereas 727 regions were detected in NPC (Fig. 6D). Next, we examined the genomic locations of DARs, and the distal regions (> 10 kb from the TSS) accounted for 43% and 37% in PC and NPC (Fig. 6E), respectively. The promoter-proximal regions (TSS ± 3 kb) accounted for 23% and 18% of DARs in PC and NPC. Although it has been reported that the chromatin states in distal regions can affect gene expression [38, 39], we focused on how DARs in promoter-proximal regions affect gene expression status.
Fig. 6.
Different chromatin statuses of human induced pluripotent stem cell (hiPSC) clones were identified using ATAC-sequencing. A Quality control of ATAC-seq. Stacked bar charts showed the distribution of peaks across genomic regions. B The circle plot of genome-wide chromatin accessibility identified in ATAC-seq. C The enriched heatmap displayed the normalized score of ATAC-seq peaks within ± 3 kb of the transcription start site (TSS). D Heatmap of differentially accessible regions (DARs, log2|FC|> 0.5 and P < 0.05) between PC and NPC. E Pie charts illustrated the distribution of DARs across genomic regions. F The volcano plot depicted the genes annotated as promoter in (E); n = 454 for PC (red) and n = 128 for NPC (blue). The dashed lines indicated log2|FC|> 0.5 and P < 0.05. G Gene Ontology (GO) analysis of the genes shown in F. H Loci of the genes (TEK and PDGFRB) were identified in University of California, Santa Cruz (UCSC) browsers and visualized in Integrative Genomics Viewer (IGV) tracks. I Heatmap displayed gene expression levels related to identified GOs (GO:1,905,772, 0007507) in G. The genes corresponding to DEGs are marked with *
We identified 454 and 128 genes with increased open chromatin around their promoters in PC and NPC, respectively (Fig. 6F). For example, chromatin accessibility (TSS ± 3 kb) of DPPA2, a key factor in resetting the epigenome to a pluripotent state during reprogramming [40], was increased in PC. We performed GO analysis to investigate the biological processes associated with the genes harboring DARs. We found that DAR-associated genes in PC were enriched in processes related to positive regulation of DNA/Nucleic acid-templated transcription (Fig. 6G), consistent with findings from our RNA-seq analysis (Fig. 4C). Despite the limited differentiation potential of NPC towards CMs, DAR-associated genes in NPC as well as PC were enriched in heart development (Fig. 6G). Specifically, 11 genes in NPC were related to heart development (GO: 0007507) (Fig. 6H and I). The expression levels of PDGFRB and TEK, which are implicated in endothelial cell differentiation [41] and atrioventricular canal development [42], were significantly upregulated in NPC. Consequently, this increased expression in NPC, driven by DARs in promoter-proximal regions, may impede CM differentiation, resulting in non-CMs with increased expression of VIM.
Next, we performed HOMER (Hypergeometric Optimization of motif EnRichment) de novo motif analysis using DARs to identify clone-specific transcription factor (TF) motifs. The ETV4 and GATA5 motifs were the top motifs in PC, whereas the TEAD2 and FOSL2 motifs were found in NPC (Fig. S6A). The ETV4-encoding ETS-domain-containing TF family plays a role in the maintenance of pluripotency in stem cells [43], and some GATA factors (GATA4, GATA5, and GATA6) are expressed in the precardiac mesoderm during heart development [44]. Although we further associated enriched motifs with the mRNA expression levels of TF families that could bind these motifs, we did not find a correlation between the motifs and the expression levels of their family members at the undifferentiated stage (Fig. S6B). Nevertheless, our results showed that the clones presented different susceptibilities to specific TFs, and they could regulate gene expression during cardiac differentiation, similar to the GATA family. Collectively, our epigenomic profile suggests that variability in the epigenetic state may influence the changes in gene expression, thereby contributing to clonal heterogeneity.
TEK and SDR42E1 were identified as marker genes for clonal heterogeneity of hiPSC-CM differentiation
Recent studies have proposed reliable marker genes based on comprehensive analysis of transcriptional and epigenetic modifications [12, 13, 39]. To identify candidate genes that could predict cardiac differentiation, we assigned chromatin accessibility at the promoter-proximal regions to the nearest genes in the hg19 reference genome. By overlapping DEGs and DARs between the two groups, we identified a total of 56 genes with accessible regions that were significantly differentially expressed between PC and NPC (Fig. 7A). From these, we selected 14 candidate markers for PC (DPPA2, FOXK1, FOXO4, GBX2, HEY2, NANOG, PTCH1, PDGFα, and UTF1) and NPC (TEK, PDGFRβ, SDR42E1, TXNRD2, and JPH4) based on their basal expression levels in hiPSCs (Fig. 7A and B).
Fig. 7.
TEK and SDR42E1 were identified as marker genes for predicting cardiac differentiation capacity. A Venn diagram showed the overlap between DEGs and DARs identified via RNA-seq and ATAC-seq peaks, specifically in protein-coding genes. The positively correlated genes were colored in red (upregulated in PC) or blue (NPC), while the negatively correlated genes were colored in green (opposite). B Representative images of ATAC-seq (left) and normalized counts obtained from RNA-seq (right) for FOXK1 and TEK; upregulated in PC (blue) and NPC (red). The bar graphs for normalized counts represent two biological replicates (PC) and one biological replicate (NPC). C Experimental scheme for the validation of candidate genes. D Flow cytometry of TNNT2 expression in hiPSC-CMs at day 15 after differentiation. Three (hiPSC #2) or five (hiPSC #3) technical replicates per group are shown. E Immunostaining for TNNT2 (green) and Hoechst (blue) staining of hiPSC-CMs. (F) qRT-PCR of TEK and SDR42E1 in hiPSC clones (#2 and #3). All values were normalized to RNA18S1. Data show three technical replicates per group. All statistical analysis were performed using unpaired Student’s t-tests; *P < 0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001. Data are presented as bar graphs and error bars with mean ± standard deviation (D) or standard error of mean (F)
To further refine the candidate genes, we included two additional hiPSC lines derived from healthy individuals, hiPSC #2 and #3. Each line contained two clones (CL1 and CL2) that were differentiated into CMs (Fig. 7C). The proportion of TNNT2-positive cells was assessed via flow cytometry. In hiPSC #2, CL1 showed a higher percentage of TNNT2-positive cells compared to CL2 when treated with 6 μM of CHIR99021, while both clones of hiPSC #3 showed no significant differences between each other (Fig. 7D, E). As expected, the expression of CM markers (TNNT2 and MYH7) was increased in CMs derived from #2 CL1, whereas hiPSC #2 CL2 showed increased expression of the non-CM marker VIM (Fig. S7A). Based on these data, we categorized the hiPSC lines into two groups: those with differing differentiation capacities within clones (hiPSC #2) and those with similar capacities (hiPSC #3).
In hiPSC #2, the non-productive markers TEK and SDR42E1 exhibited increased expression (4.3- and 23-fold increases, respectively) in hiPSC #2 CL2 (Fig. 7F). The expression of another non-productive marker, PDGFRB, was significantly increased in hiPSC #2 CL1, which did not align with the expression pattern observed in hiPSC #1. In contrast, all productive markers showed no significant differences between the two clones (Fig. S7B). These data indicated that most of the candidate genes, except for TEK and SDR42E1, could be false-positive markers and showed no consistent predisposition toward cardiac differentiation potential. On the other hand, in the clones of hiPSC #3, which had similar differentiation capacities, the expression of TEK and SDR42E1 was not significantly different between the two clones, confirming the correlation between TEK and SDR42E1 expression and differentiation capacities (Figs. 7F, S7B).
The angiopoietin-1/TEK axis plays a crucial role in vascular development, including the development of endothelial and muscle cells [25, 26, 45]. SDR42E1, a member of the short-chain dehydrogenase/reductase (SDR) family, is associated with multiple cellular processes, including NAD(P)-dependent oxidoreductase activity [46]. Although the direct effects of the SDR family on stem cell development have not been reported, the family is known to be involved in the metabolism of various compounds, including retinoids [47] and lipids, which sustain pluripotency and modulate myocardial cell differentiation [19, 48, 49]. In summary, our transcriptome and epigenome profiling data suggest that TEK and SDR42E1 serve as predictive biomarkers implicated in the cardiac differentiation of hiPSC clones.
Discussion
Human iPSCs represent a potential source for regenerative medicine and disease modeling, and stem cell therapy using hiPSC-CMs holds promise for clinical applicability [50]. For clinical applications, maintaining consistently high purity and yield across batches is important. Therefore, researchers have worked to optimize these aspects. For example, empirical efforts have been made for the selection of hiPSC clones expressing high levels of pluripotency marker genes [7] or the optimization of the CHIR99021 concentration [51]. However, to the best of our knowledge, few references serve as standards for the clonal selection. The efficient selection of clones with high differentiation capacity at an undifferentiated stage may significantly contribute to this field, and thereby predictive markers for clonal heterogeneity need to be investigated. In the present study, we established clonal heterogeneity profiles and identified TEK and SDR42E1 as predictive biomarkers of cardiac differentiation.
All the six generated clones expressed pluripotency markers (OCT4, SOX2, and SSEA-1) and exhibited similar proliferation rates. However, we observed that CL2 consistently exhibited a lower CM differentiation efficiency than the other clones. Moreover, the expression of the three germ layers showed similar patterns upon comparing the average expression levels of the marker genes using RNA-seq. These results suggest that conventional evaluation is not enough for predicting the differentiation potential of hiPSC clones and highlight the need for alternative indicators of clonal heterogeneity. We analyzed RNA-seq, ATAC-seq, endogenous retroelement, and phosphokinase array to identify the biomarkers required to minimize the likelihood of false positives. We identified 56 DEGs with an open chromatin state in TSS-proximal regions. By including two additional hiPSC lines, we validated our results using qRT-PCR and identified TEK and SDR42E1 as novel predictive biomarkers. Furthermore, there were no significant differences of OCT4 and SOX2 expression levels between two clones of hiPSC #2, suggesting comparable levels of pluripotency. Our data demonstrate that TEK and SDR42E1 were upregulated in the clones with reduced CM differentiation capacity. TEK, also known as TIE2, is a transmembrane receptor tyrosine kinase and a known factor affecting vascular development via angiopoietin-1 (Ang1). Ang1-TEK reportedly forms a complex with the extracellular matrix (ECM) [27], and NPC showed enrichment of cell–cell contacts, such as actin filaments and basement filaments. We found that NPC showed upregulated phosphorylation levels of ERK1/2 and eNOS, which are downstream signaling factors of ECM-anchored TEK [52]. SDR42E1, a member of the short-chain dehydrogenase/reductase family, is involved in steroid biosynthesis; a previous study reported that mutations in the SDR42E1 gene are associated with oculocutaneous genital syndrome [53]. However, its role in pluripotent stem cells has not yet been reported. Taken together, our findings indicate that the expression patterns of TEK and SDR42E1 serve as markers for the differentiation potential of hiPSC clones toward CMs, establishing them as new references for clonal heterogeneity.
Although previous studies have reported the molecular signatures of hiPSCs that regulate their differentiation capacity (e.g., CMs), those studies focused on variations in individual hiPSCs. For example, the modulatory effects of SALL3 [12] and CXCL4/PF4 [13] on the differentiation lineage in various hiPSCs have been demonstrated, indicating that SALL3 expression levels correlate with ectoderm differentiation, and PF4 levels correlate with mesoderm differentiation. Although these markers reflected variations affecting the cardiac differentiation propensity, we observed an increase in SALL3 expression levels in PC, whereas PF4 levels exhibited no significant differences among the clones (data not shown).
Given that high expression of TEs is a characteristic of naïve pluripotency [54], we investigated whether PC and NPC exhibit features of primed or naïve pluripotency at the transcriptional level. We compared our RNA-seq data with human primed and naïve ESCs (hESCs) from previously published datasets [55]. PCA and heatmap analysis confirmed that the expression patterns of naïve signature genes, including SUSD2, TFCP2L1, KLF5, and KLF4, in hiPSC clones were distinct from those observed in both primed and naïve ESCs (Fig. S8A and B). Furthermore, our epigenome profile data showed that 71% (1,384 of 1,948) and 67% (449 of 668) of the total DARs were distal regions (> 10 kb from the TSS) in PC and NPC, respectively. Some studies have reported that the chromatin state of distal regions as well as regions proximal to the TSS induces gene expression variability [38]. Similarly, we identified specific subtypes of EREs, such as HERVI-int, which were dysregulated among clones and were located across the genome. Previous studies have reported that copies of EREs act as distal enhancers [56]. Although it is not clear how clone-specific EREs in intergenic regions regulate gene expression, we identified several clone-specific intergenic regions, suggesting a possible role of open distal regions in gene regulation.
Our study has some limitations. Firstly, the number of clones analyzed for integrative analysis is restricted due to reprogramming efficiency. Distinguishing clones into PC and NPC categories is challenging due to batch-to-batch variability, potentially complicating statistical analysis and leading to false positives. However, additional hiPSCs recruited for candidate gene validation consistently showed increased expression of TEK and SDR42E1 in NPC. Secondly, the role of biomarkers in cardiac differentiation of hiPSCs remains unclear. Our results from bulk RNA-seq and ATAC-seq of undifferentiated cells do not elucidate expression pattern of biomarkers during cardiac differentiation. Therefore, future studies should investigate the molecular mechanisms to understand the roles of biomarkers during developmental stages. Thirdly, regulatory mechanisms governing TEK and SDR42E1 expression are not fully understood. Epigenetic remodeling occurred during reprogramming [35], and our results showed higher chromatin accessibility and enriched pathways such as RNA transcription factor in PC than NPC. It suggests potential effects on nearby gene transcription. However, we focused on the gene expression patterns explaining the clonal heterogeneity in fully reprogrammed cells, lacking precise mechanisms regulating the expression of TEK and SDR42E1. This underscores the importance of addressing this complex question in future studies.
In conclusion, by integrating transcriptomic and epigenomic profiles, we identified TEK and SDR42E1 as novel predictive biomarkers of clonal heterogeneity for cardiac differentiation. These findings provide a comprehensive understanding of the molecular signatures underlying the cardiac differentiation capacity of hiPSC clones.
Supplementary Information
Below is the link to the electronic supplementary material.
Acknowledgements
The authors acknowledge the Sungkyunkwan University Cooperative Center for Research Facilities for providing services and facilities. Scientific illustrations were created using BioRender and Prism 10.
Abbreviations
- hiPSCs
Human induced pluripotent stem cells
- ATAC-seq
Assay for transposase-accessible chromatin using sequencing
- PC
Productive clone
- NPC
Non-productive clone
- CMs
Cardiomyocytes
- CL1,2,3,4,5,6
Clone1,2,3,4,5,6
- DEGs
Differentially expressed genes
- DARs
Differentially accessible regions
- PBMCs
Peripheral blood mononuclear cells
- TEs
Transposable elements
- EREs
Endogenous retroelements
- LTRs
Long terminal repeats
- TEK
TEK receptor tyrosine kinase
- SDR42E1
Short chain dehydrogenase/reductase family 42E member 1
- ERK1/2
Extracellular signal-regulated kinase 1/2
- TNNT2
Cardiac Troponin T
- HERV
Human endogenous retrovirus
Author contributions
Conceptualization, J.Y., J.S., and J.L.; methodology, J.Y. and J.S.; software, S.J.; validation, J.Y. and J.S.; formal analysis, J.Y., J.S., and S.J.; investigation, J.Y., J.S., J.J., and S.H.; resources, J.Y., J.S., J.J., K.L., and H.R.J.; data curation, J.Y., J.S., and S.J.; writing—original draft, J.Y.; writing—review & editing—J.Y., J.S., S.J., and J.L.; visualization, J.Y., J.S., and S.J.; supervision, J.L.; project administration, J.L.; funding acquisition, J.L. All authors have read and commented on the manuscript.
Funding
This work was supported by the Bio&Medical Technology Development Program of the National Research Foundation (NRF) funded by the Korean government (MSIT) [grant numbers 2020M3A9E4037847, 2020M3A9E4037902]; and a Korean Fund for Regenerative Medicine (KFRM) grant funded by the Korean government (Ministry of Science and ICT, Ministry of Health and Welfare) [Grant Number 22A0302L1].
Data availability
The datasets are available in the article, in its online supplementary material, and data repositories (GEO accession No. GSE260767, GSE260883).
Declarations
Conflict of interest
All authors declare to have no conflict of interest related to this manuscript.
Consent for publication
We confirmed that consent for publication was received from all individuals.
Ethical approval
Ethical approval for the use of PBMCs and hiPSCs in this study was obtained from the Institutional Review Board of Sungkyunkwan University (IRB no. 2019-11-016-002 and 2020-01-020-001) and Samsung Medical Center (IRB no. 2016-11-025-015). Our study was conducted in accordance with the principles outlined in the Declaration of Helsinki, following the acquisition of written consent.
Footnotes
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Jihye Yun, Jaemin So and Seunghee Jeong contributed equally.
References
- 1.Burridge PW, Li YF, Matsa E, Wu H, Ong S-G, Sharma A, Holmström A, Chang AC, Coronado MJ, Ebert AD, Knowles JW, Telli ML, Witteles RM, Blau HM, Bernstein D, Altman RB, Wu JC (2016) Human induced pluripotent stem cell–derived cardiomyocytes recapitulate the predilection of breast cancer patients to doxorubicin-induced cardiotoxicity. Nat Med 22:547–556. 10.1038/nm.4087 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Sharma A, Burridge PW, Mckeithan WL, Serrano R, Shukla P, Sayed N, Churko JM, Kitani T, Wu H, Holmström A, Matsa E, Zhang Y, Kumar A, Fan AC, Del Álamo JC, Wu SM, Moslehi JJ, Mercola M, Wu JC (2017) High-throughput screening of tyrosine kinase inhibitor cardiotoxicity with human induced pluripotent stem cells. Sci Transl Med. 10.1126/scitranslmed.aaf2584 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Panopoulos AD, Smith EN, Arias AD, Shepard PJ, Hishida Y, Modesto V, Diffenderfer KE, Conner C, Biggs W, Sandoval E, D’Antonio-Chronowska A, Berggren WT, Izpisua Belmonte JC, Frazer KA (2017) Aberrant DNA methylation in human iPSCs associates with MYC-binding motifs in a clone-specific manner independent of genetics. Cell Stem Cell 20:505-517.e506. 10.1016/j.stem.2017.03.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Sharma R, Khristov V, Rising A, Jha BS, Dejene R, Hotaling N, Li Y, Stoddard J, Stankewicz C, Wan Q, Zhang C, Campos MM, Miyagishima KJ, Mcgaughey D, Villasmil R, Mattapallil M, Stanzel B, Qian H, Wong W, Chase L, Charles S, Mcgill T, Miller S, Maminishkis A, Amaral J, Bharti K (2019) Clinical-grade stem cell–derived retinal pigment epithelium patch rescues retinal degeneration in rodents and pigs. Sci Transl Med 11:eat5580. 10.1126/scitranslmed.aat5580 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Carey W, Bryce MS, Hanna H, Jacob FA, Dina BY, Kim J, Ganz K, Steine J, Eveline CP, John CP, Menno WG, Gao Q, Jaenisch R, (2011) Reprogramming factor stoichiometry influences the epigenetic state and biological properties of induced pluripotent stem cells. Cell Stem Cell 9:588–598. 10.1016/j.stem.2011.11.003 [DOI] [PubMed] [Google Scholar]
- 6.Benetatos L, Vartholomatos G, Hatzimichael E (2014) DLK1-DIO3 imprinted cluster in induced pluripotency: landscape in the mist. Cell Mol Life Sci 71:4421–4430. 10.1007/s00018-014-1698-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Pantazis CB, Yang A, Lara E, Mcdonough JA, Blauwendraat C, Peng L, Oguro H, Kanaujiya J, Zou J, Sebesta D, Pratt G, Cross E, Blockwick J, Buxton P, Kinner-Bibeau L, Medura C, Tompkins C, Hughes S, Santiana M, Faghri F, Nalls MA, Vitale D, Ballard S, Qi YA, Ramos DM, Anderson KM, Stadler J, Narayan P, Papademetriou J, Reilly L, Nelson MP, Aggarwal S, Rosen LU, Kirwan P, Pisupati V, Coon SL, Scholz SW, Priebe T, Öttl M, Dong J, Meijer M, Janssen LJM, Lourenco VS, Van Der Kant R, Crusius D, Paquet D, Raulin A-C, Bu G, Held A, Wainger BJ, Gabriele RMC, Casey JM, Wray S, Abu-Bonsrah D, Parish CL, Beccari MS, Cleveland DW, Li E, Rose IVL, Kampmann M, Calatayud Aristoy C, Verstreken P, Heinrich L, Chen MY, Schüle B, Dou D, Holzbaur ELF, Zanellati MC, Basundra R, Deshmukh M, Cohen S, Khanna R, Raman M, Nevin ZS, Matia M, Van Lent J, Timmerman V, Conklin BR, Johnson Chase K, Zhang K, Funes S, Bosco DA, Erlebach L, Welzer M, Kronenberg-Versteeg D, Lyu G, Arenas E, Coccia E, Sarrafha L, Ahfeldt T, Marioni JC, Skarnes WC, Cookson MR, Ward ME, Merkle FT (2022) A reference human induced pluripotent stem cell line for large-scale collaborative studies. Cell Stem Cell 29:1685-1702.e1622. 10.1016/j.stem.2022.11.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Garreta E, Sanchez S, Lajara J, Montserrat N, Belmonte JCI (2018) Roadblocks in the path of iPSC to the clinic. Curr Transplant Rep 5:14–18. 10.1007/s40472-018-0177-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Churko JM, Lee J, Ameen M, Gu M, Venkatasubramanian M, Diecke S, Sallam K, Im H, Wang G, Gold JD, Salomonis N, Snyder MP, Wu JC (2017) Transcriptomic and epigenomic differences in human induced pluripotent stem cells generated from six reprogramming methods. Nat Biomed Eng 1:826–837. 10.1038/s41551-017-0141-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Kwon EM, Connelly JP, Hansen NF, Donovan FX, Winkler T, Davis BW, Alkadi H, Chandrasekharappa SC, Dunbar CE, Mullikin JC, Liu P (2017) iPSCs and fibroblast subclones from the same fibroblast population contain comparable levels of sequence variations. Proc Natl Acad Sci U S A 114:1964–1969. 10.1073/pnas.1616035114 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Kyttälä A, Moraghebi R, Valensisi C, Kettunen J, Andrus C, Pasumarthy K, Kalyan NM, Nishimura K, Ohtaka M, Weltner J, Handel V, Ben PO, Sinisalo J, Jalanko A, Hawkins D, R., Woods N-B, Otonkoski T, Trokovic R, (2016) Genetic variability overrides the impact of parental cell type and determines iPSC differentiation potential. Stem Cell Rep 6:200–212. 10.1016/j.stemcr.2015.12.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Kuroda T, Yasuda S, Tachi S, Matsuyama S, Kusakawa S, Tano K, Miura T, Matsuyama A, Sato Y (2019) SALL3 expression balance underlies lineage biases in human induced pluripotent stem cell differentiation. Nat Commun 10:2175. 10.1038/s41467-019-09511-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Ohashi F, Miyagawa S, Yasuda S, Miura T, Kuroda T, Itoh M, Kawaji H, Ito E, Yoshida S, Saito A, Sameshima T, Kawai J, Sawa Y, Sato Y (2019) CXCL4/PF4 is a predictive biomarker of cardiac differentiation potential of human induced pluripotent stem cells. Sci Rep 9:4638. 10.1038/s41598-019-40915-w [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Lee J, Termglinchan V, Diecke S, Itzhaki I, Lam CK, Garg P, Lau E, Greenhaw M, Seeger T, Wu H, Zhang JZ, Chen X, Gil IP, Ameen M, Sallam K, Rhee JW, Churko JM, Chaudhary R, Chour T, Wang PJ, Snyder MP, Chang HY, Karakikes I, Wu JC (2019) Activation of PDGF pathway links LMNA mutation to dilated cardiomyopathy. Nature 572:335–340. 10.1038/s41586-019-1406-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Kadota S, Minami I, Morone N, Heuser JE, Agladze K, Nakatsuji N (2013) Development of a reentrant arrhythmia model in human pluripotent stem cell-derived cardiac cell sheets. Eur Heart J 34:1147–1156. 10.1093/eurheartj/ehs418 [DOI] [PubMed] [Google Scholar]
- 16.Yang X, Pabon L, Murry CE (2014) Engineering adolescence. Circ Res 114:511–523. 10.1161/circresaha.114.300558 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Paige SL, Plonowska K, Xu A, Wu SM (2015) Molecular regulation of cardiomyocyte differentiation. Circ Res 116:341–353. 10.1161/circresaha.116.302752 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Yang J, Boerm M, Mccarty M, Bucana C, Fidler IJ, Z J Yang Y, Su B, (2000) Mekk3 is essential for early embryonic cardiovascular development. Nat Genet 24:309–313. 10.1038/73550 [DOI] [PubMed] [Google Scholar]
- 19.Sharma A, Zhang Y, Buikema JW, Serpooshan V, Chirikian O, Kosaric N, Churko JM, Dzilic E, Shieh A, Burridge PW, Wu JC, Wu SM (2018) Stage-specific effects of bioactive lipids on human iPSC cardiac differentiation and cardiomyocyte proliferation. Sci Rep. 10.1038/s41598-018-24954-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Uosaki H, Magadum A, Seo K, Fukushima H, Takeuchi A, Nakagawa Y, Moyes KW, Narazaki G, Kuwahara K, Laflamme M, Matsuoka S, Nakatsuji N, Nakao K, Kwon C, Kass DA, Engel FB, Yamashita JK (2013) Identification of chemicals inducing cardiomyocyte proliferation in developmental stage-specific manner with pluripotent stem cells. Circ Cardiovasc Genet 6:624–633. 10.1161/circgenetics.113.000330 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Gaur M, Ritner C, Sievers R, Pedersen A, Prasad M, Bernstein HS, Yeghiazarians Y (2010) Timed inhibition of p38MAPK directs accelerated differentiation of human embryonic stem cells into cardiomyocytes. Cytotherapy 12:807–817. 10.3109/14653249.2010.491821 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Graichen R, Xu X, Braam SR, Balakrishnan T, Norfiza S, Sieh S, Soo SY, Tham SC, Mummery C, Colman A, Zweigerdt R, Davidson BP (2008) Enhanced cardiomyogenesis of human embryonic stem cells by a small molecular inhibitor of p38 MAPK. Differentiation 76:357–370. 10.1111/j.1432-0436.2007.00236.x [DOI] [PubMed] [Google Scholar]
- 23.Sun C, Zhang J, Zheng D, Wang J, Yang H, Zhang X (2018) Transcriptome variations among human embryonic stem cell lines are associated with their differentiation propensity. PLoS ONE 13:e0192625. 10.1371/journal.pone.0192625 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Carcamo-Orive I, Hoffman GE, Cundiff P, Beckmann ND, D’Souza SL, Knowles JW, Patel A, Hendry C, Papatsenko D, Abbasi F, Reaven GM, Whalen S, Lee P, Shahbazi M, Henrion MYR, Zhu K, Wang S, Roussos P, Schadt EE, Pandey G, Chang R, Quertermous T, Lemischka I (2017) Analysis of transcriptional variability in a large human iPSC library reveals genetic and non-genetic determinants of heterogeneity. Cell Stem Cell 20:518-532.e519. 10.1016/j.stem.2016.11.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Dube A, Akbarali Y, Sato TN, Libermann TA, Oettgen P (1999) Role of the Ets transcription factors in the regulation of the vascular-specific Tie2 gene. Circ Res 84:1177–1185. 10.1161/01.res.84.10.1177 [DOI] [PubMed] [Google Scholar]
- 26.Takakura N, Huang X-L, Naruse T, Hamaguchi I, Dumont DJ, Yancopoulos GD, Suda T (1998) Critical role of the Tie2 endothelial cell receptor in the development of definitive hematopoiesis. Immunity 9:677–686. 10.1016/s1074-7613(00)80665-2 [DOI] [PubMed] [Google Scholar]
- 27.Fukuhara S, Sako K, Noda K, Nagao K, Miura K, Mochizuki N (2009) Tie2 is tied at the cell-cell contacts and to extracellular matrix by angiopoietin-1. Exp Mol Med 41:133. 10.3858/emm.2009.41.3.016 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Han B, Bhowmick N, Qu Y, Chung S, Giuliano AE, Cui X (2017) FOXC1: an emerging marker and therapeutic target for cancer. Oncogene 36:3957–3963. 10.1038/onc.2017.48 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Lambers E, Arnone B, Fatima A, Qin G, Wasserstrom JA, Kume T (2016) Foxc1 regulates early cardiomyogenesis and functional properties of embryonic stem cell derived cardiomyocytes. Stem Cells 34:1487–1500. 10.1002/stem.2301 [DOI] [PubMed] [Google Scholar]
- 30.Medstrand P, Van De Lagemaat LN, Mager DL (2002) Retroelement distributions in the human genome: variations associated with age and proximity to genes. Genome Res 12:1483–1495. 10.1101/gr.388902 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Wang T, Medynets M, Johnson KR, Doucet-O’Hare TT, Disanza B, Li W, Xu Y, Bagnell A, Tyagi R, Sampson K, Malik N, Steiner J, Hadegan A, Kowalak J, O’Malley J, Maric D, Nath A (2020) Regulation of stem cell function and neuronal differentiation by HERV-K via mTOR pathway. Proc Natl Acad Sci 117:17842–17853. 10.1073/pnas.2002427117 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Koyanagi-Aoi M, Ohnuki M, Takahashi K, Okita K, Noma H, Sawamura Y, Teramoto I, Narita M, Sato Y, Ichisaka T, Amano N, Watanabe A, Morizane A, Yamada Y, Sato T, Takahashi J, Yamanaka S (2013) Differentiation-defective phenotypes revealed by large-scale analyses of human pluripotent stem cells. Proc Natl Acad Sci 110:20569–20574. 10.1073/pnas.1319061110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Santoni FA, Guerra J, Luban J (2012) HERV-H RNA is abundant in human embryonic stem cells and a precise marker for pluripotency. Retrovirology 9:111. 10.1186/1742-4690-9-111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Kowalski PE, Freeman JD, Mager DL (1999) Intergenic splicing between a HERV-H endogenous retrovirus and two adjacent human genes. Genomics 57:371–379. 10.1006/geno.1999.5787 [DOI] [PubMed] [Google Scholar]
- 35.Friedli M, Turelli P, Kapopoulou A, Rauwel B, Castro-Díaz N, Rowe HM, Ecco G, Unzu C, Planet E, Lombardo A, Mangeat B, Wildhaber BE, Naldini L, Trono D (2014) Loss of transcriptional control over endogenous retroelements during reprogramming to pluripotency. Genome Res 24:1251–1259. 10.1101/gr.172809.114 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Liang G, Zhang Y (2013) Embryonic stem cell and induced pluripotent stem cell: an epigenetic perspective. Cell Res 23:49–69. 10.1038/cr.2012.175 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Nishizawa M, Chonabayashi K, Nomura M, Tanaka A, Nakamura M, Inagaki A, Nishikawa M, Takei I, Oishi A, Tanabe K, Ohnuki M, Yokota H, Koyanagi-Aoi M, Okita K, Watanabe A, Takaori-Kondo A, Yamanaka S, Yoshida Y (2016) Epigenetic variation between human induced pluripotent stem cell lines is an indicator of differentiation capacity. Cell Stem Cell 19:341–354. 10.1016/j.stem.2016.06.019 [DOI] [PubMed] [Google Scholar]
- 38.Bulger M, Groudine M (2011) Functional and mechanistic diversity of distal transcription enhancers. Cell 144:327–339. 10.1016/j.cell.2011.01.024 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Wang X, Yan J, Shen B, Wei G (2021) Integrated chromatin accessibility and transcriptome landscapes of doxorubicin-resistant breast cancer cells. Front Cell Dev Biol 9:708066. 10.3389/fcell.2021.708066 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Hernandez C, Wang Z, Ramazanov B, Tang Y, Mehta S, Dambrot C, Lee Y-W, Tessema K, Kumar I, Astudillo M, Neubert TA, Guo S, Ivanova NB (2018) Dppa2/4 facilitate epigenetic remodeling during reprogramming to pluripotency. Cell Stem Cell 23:396-411.e398. 10.1016/j.stem.2018.08.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.De Val S, Black BL (2009) Transcriptional control of endothelial cell development. Dev Cell 16:180–195. 10.1016/j.devcel.2009.01.014 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Munshi NV (2012) Gene regulatory networks in cardiac conduction system development. Circ Res 110:1525–1537. 10.1161/circresaha.111.260026 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Akagi T, Kuure S, Uranishi K, Koide H, Costantini F, Yokota T (2015) ETS-related transcription factors ETV4 and ETV5 Are involved in proliferation and induction of differentiation-associated genes in embryonic stem (ES) cells. J Biol Chem 290:22460–22473. 10.1074/jbc.m115.675595 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Ang YS, Rivas RN, Ribeiro AJS, Srivas R, Rivera J, Stone NR, Pratt K, Mohamed TMA, Fu JD, Spencer CI, Tippens ND, Li M, Narasimha A, Radzinsky E, Moon-Grady AJ, Yu H, Pruitt BL, Snyder MP, Srivastava D (2016) Disease model of GATA4 mutation reveals transcription factor cooperativity in human cardiogenesis. Cell 167(1734–1749):e1722. 10.1016/j.cell.2016.11.033 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Joo HJ, Kim H, Park S-W, Cho H-J, Kim H-S, Lim D-S, Chung H-M, Kim I, Han Y-M, Koh GY (2011) Angiopoietin-1 promotes endothelial differentiation from embryonic stem cells and induced pluripotent stem cells. Blood 118:2094–2104. 10.1182/blood-2010-12-323907 [DOI] [PubMed] [Google Scholar]
- 46.Kavanagh KL, Jörnvall H, Persson B, Oppermann U (2008) Medium- and short-chain dehydrogenase/reductase gene and protein families. Cell Mol Life Sci 65:3895–3906. 10.1007/s00018-008-8588-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Duester G (2000) Families of retinoid dehydrogenases regulating vitamin A function. Eur J Biochem 267:4315–4324. 10.1046/j.1432-1327.2000.01497.x [DOI] [PubMed] [Google Scholar]
- 48.Zhang Q, Jiang J, Han P, Yuan Q, Zhang J, Zhang X, Xu Y, Cao H, Meng Q, Chen L, Tian T, Wang X, Li P, Hescheler J, Ji G, Ma Y (2011) Direct differentiation of atrial and ventricular myocytes from human embryonic stem cells by alternating retinoid signals. Cell Res 21:579–587. 10.1038/cr.2010.163 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.De Angelis MT, Parrotta EI, Santamaria G, Cuda G (2018) Short-term retinoic acid treatment sustains pluripotency and suppresses differentiation of human induced pluripotent stem cells. Cell Death Disease. 10.1038/s41419-017-0028-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Zhang H, Xue Y, Pan T, Zhu X, Chong H, Xu C, Fan F, Cao H, Zhang B, Pan J, Zhou Q, Yang G, Wang J, Wang DJ (2022) Epicardial injection of allogeneic human-induced-pluripotent stem cell-derived cardiomyocytes in patients with advanced heart failure: protocol for a phase I/IIa dose-escalation clinical trial. BMJ Open 12:e056264. 10.1136/bmjopen-2021-056264 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Shafa M, Yang F, Fellner T, Rao MS, Baghbaderani BA (2018) Human-induced pluripotent stem cells manufactured using a current good manufacturing practice-compliant process differentiate into clinically relevant cells from three germ layers. Front Med. 10.3389/fmed.2018.00069 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Huang H, Bhat A, Woodnutt G, Lappe R (2010) Targeting the ANGPT–TIE2 pathway in malignancy. Nat Rev Cancer 10:575–585. 10.1038/nrc2894 [DOI] [PubMed] [Google Scholar]
- 53.Bouhouche A, Albaroudi N, El Alaoui MA, Askander O, Habbadi Z, El Hassani A, Iraqi H, El Fahime E, Belmekki M (2021) Identification of the novel SDR42E1 gene that affects steroid biosynthesis associated with the oculocutaneous genital syndrome. Exp Eye Res 209:108671. 10.1016/j.exer.2021.108671 [DOI] [PubMed] [Google Scholar]
- 54.Theunissen TW, Friedli M, He Y, Planet E, O’Neil RC, Markoulaki S, Pontis J, Wang H, Iouranova A, Imbeault M, Duc J, Cohen MA, Wert KJ, Castanon R, Zhang Z, Huang Y, Nery JR, Drotar J, Lungjangwa T, Trono D, Ecker JR, Jaenisch R (2016) Molecular criteria for defining the naive human pluripotent state. Cell Stem Cell 19:502–515. 10.1016/j.stem.2016.06.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Bi Y, Tu Z, Zhou J, Zhu X, Wang H, Gao S, Wang Y (2022) Cell fate roadmap of human primed-to-naive transition reveals preimplantation cell lineage signatures. Nat Commun 13:3147. 10.1038/s41467-022-30924-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Nishihara H (2019) Retrotransposons spread potential cis-regulatory elements during mammary gland evolution. Nucleic Acids Res. 10.1093/nar/gkz1003 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets are available in the article, in its online supplementary material, and data repositories (GEO accession No. GSE260767, GSE260883).








