Mitochondrial DNA integrity and metabolome profile are preserved in the human induced pluripotent stem cell reference line KOLF2.1J

Jochen Dobner; Thach Nguyen; Andreas Dunkel; Alessandro Prigione; Jean Krutmann; Andrea Rossi

doi:10.1016/j.stemcr.2024.01.009

. 2024 Feb 22;19(3):343–350. doi: 10.1016/j.stemcr.2024.01.009

Mitochondrial DNA integrity and metabolome profile are preserved in the human induced pluripotent stem cell reference line KOLF2.1J

Jochen Dobner ^1,^∗, Thach Nguyen ¹, Andreas Dunkel ², Alessandro Prigione ³, Jean Krutmann ^1,⁴, Andrea Rossi ^1,^5,^∗∗

PMCID: PMC10937150 PMID: 38402620

Summary

Quality control of human induced pluripotent stem cells (iPSCs) is critical to ensure reproducibility of research. Recently, KOLF2.1J was characterized and published as a male iPSC reference line to study neurological disorders. Emerging evidence suggests potential negative effects of mtDNA mutations, but its integrity was not analyzed in the original publication. To assess mtDNA integrity, we conducted a targeted mtDNA analysis followed by untargeted metabolomics analysis. We found that KOLF2.1J mtDNA integrity was intact at the time of publication and is still preserved in the commercially distributed cell line. In addition, the basal KOLF2.1J metabolome profile was similar to that of the two commercially available iPSC lines IMR90 and iPSC12, but clearly distinct from an in-house-generated ERCC6^R683X/R683X iPSC line modeling Cockayne syndrome. Conclusively, we validate KOLF2.1J as a reference iPSC line, and encourage scientists to conduct mtDNA analysis and unbiased metabolomics whenever feasible.

Keywords: mtDNA integrity, mtDNA analysis, KOLF2.1J, iPSCs quality control, iPSC reference line, Oxford Nanopore sequencing, Cockayne syndrome, ERCC6

Highlights

•
The mtDNA of the KOLF2.1J reference line is intact
•
The metabolomic signature of the KOLF2.1J reference line is inconspicuous
•
We recommend assessing mtDNA integrity of iPSC lines
•
ERCC6^R683X/R683X Cockayne syndrome model line shows an altered metabolomic signature

Rossi and colleagues analyze the mtDNA and metabolomic signature of the KOLF2.1J reference line using various sequencing techniques and untargeted metabolomics. The KOLF2.1J line displays intact mtDNA and an inconspicuous metabolomic profile. In contrast, a genetically modified Cockayne syndrome iPSC model line shows an altered metabolomic signature.

Introduction

Human induced pluripotent stem cells (iPSCs) have revolutionized basic research and regenerative medicine (Hirschi et al., 2014). Analysis of their genomic integrity is important to guarantee reproducibility (Rossi et al., 2022; Volpato and Webber, 2020). The male-derived iPSC line KOLF2.1J was recently suggested as a reference for Alzheimer disease modeling (Pantazis et al., 2022). The authors performed deep genotyping, ruling out the presence of detrimental nuclear genome variants (Pantazis et al., 2022). Although several studies reported associations of mtDNA mutations of human iPSCs that could affect stability, functionality, and differentiation potential (Prigione et al., 2011; Wei et al., 2021), mtDNA integrity was originally not assessed.

Mitochondria play key roles in the maintenance of stemness and differentiation (Lisowski et al., 2018). Specific mtDNA single nucleotide variants (SNVs) are disease associated (Chinnery and Gomez-Duran, 2018). SNVs affecting all mtDNA copies (homoplasmy) or only a fraction (heteroplasmy) can cause diseases in ∼0.02% of all individuals. Different mtDNA profiles also likely contribute to the observed heterogeneity among iPSC clones (Deuse et al., 2019; Hämäläinen, 2016; Perales-Clemente et al., 2016; Prigione et al., 2011; Wei et al., 2021). Given this impact, mtDNA analysis is recommended to guarantee the overall integrity of cell lines (Rossi et al., 2022).

Consequently, we decided to assess KOLF2.1J mtDNA integrity at the time of publication and in the distributed commercially available clones. We further performed untargeted metabolomics to rule out aberrant metabolism of the globally distributed cell line.

Results

mtDNA integrity analysis

First, we analyzed whole-genome sequencing (WGS) data previously generated (Pantazis et al., 2022) (Figure 1, left). Second, we used targeted (Figure 1, right) and WGS Oxford Nanopore Technologies (ONT) long-read sequencing of KOLF2.1J cultured in our laboratory.

Overview of the 2 workflows used to analyze mtDNA integrity of the KOLF2.1J reference cell line

(Left) WGS data were obtained from a publicly available repository. After rigorous data preprocessing, reads corresponding to mtDNA were extracted and realigned to a canonical and a shifted mtDNA reference genome. Finally, SNV and indel mutation analysis was performed using Mutect2 and Mutserve. (Right) For analysis using long-reads sequencing, cells were lysed and DNA extracted. Afterward, mtDNA was enriched by performing 9 overlapping PCR reactions, and a sequencing library was prepared. Long-read sequencing data were generated using a minION M1kC with a Flongle adaptor. Raw data were basecalled and analyzed using Mutserve.

SNVs and insertion/deletion mutations (indels) of short-read WGS data were analyzed using Mutserve (Weissensteiner et al., 2021) and Mutect2 (Laricchia et al., 2022). This is challenging because low mutation frequencies require clear distinction from contamination, sequencing errors, and erratic alignment to nuclear sequences of mitochondrial origin (Albayrak et al., 2016; Li et al., 2012; Maude et al., 2019; Wei et al., 2022). The circular mtDNA further complicates the alignment at the artificial breakage point to the revised Cambridge Reference Sequence (rCRS, GenBank: NC_012920). To account for these features, we used rigorous sequencing data processing based on the Mutect2 mitochondria workflow (Laricchia et al., 2022). Briefly, we mapped the WGS data to the GRCh38 reference genome using the Burrows-Wheeler alignment (Li and Durbin, 2009), filtered duplicates, extracted mtDNA reads, and mapped a reversed FASTQ file derived from this processing to the rCRS while accounting for the artificial reference breakage point. To ensure the absence of residual genomic DNA, we used haplocheck (Weissensteiner et al., 2021). Indel calling was performed using Mutect2 and variant calling using Mutect2 and Mutserve at a variant calling threshold of 0.01. Variants were checked for entries in the Mitomap disease variant database (www.mitomap.org). Two homoplasmic indels were residing in short tandem repeats (m310T>TC, m513G>GCACA) and thus classified as irrelevant. Other regions prone for sequencing errors (Laricchia et al., 2022) were also neglected for analysis.

Overall, 38 (Mutect2) and 39 (Mutserve) variants were detected (Figure 2A; Table S1). The vast majority of variants (Mutect2: 33; Mutserve: 34) were annotated as known polymorphisms. Of the remaining 5, 2 were annotated as synonymous and 2 as benign variants. One unknown variant was located at position m3650 in the coding sequence of the MT-ND1 gene. This variant was found to be synonymous with no predicted impact on translation.

Analysis of KOLF2.1J mtDNA integrity

(A) SNV detection of the mtDNA of the iPSC line KOLF2.1J using Mutserve. The x axis represents the mtDNA nucleotide position. The y axis represents the heteroplasmy level. Of the identified 39 SNVs, 34 were known common polymorphisms (green dots). Of the 5 additional SNVs (pink dots), 4 were described as not disease associated. These were located in *MT-ATP6*, *MT-ND3*, *MT-ND4*, and *MT-ND5*. The last SNV was unknown and was located in *MT-ND1.* Manual inspection revealed the variant to be synonymous. This analysis was performed on WGS data obtained from a public repository.

(B) SNV detection of the mtDNA of the iPSC line KOLF2.1J using Mutserve. Of the identified 42 SNVs, 37 were known common polymorphisms (green dots). Of the 5 additional SNVs (pink dots), 4 were described as not disease associated. These 4 SNVs were located in *MT-ATP6*, *MT-ND3*, *MT-ND4*, and *MT-ND5*. The last SNV was located in an artifact-prone poly C stretch and subsequently identified as a false hit. This analysis was performed on sequencing data obtained by in-house long-read ONT sequencing based on a custom pipeline.

Next, we purchased the KOLF2.1J cell line and performed mtDNA analysis using both ONT WGS and targeted mtDNA sequencing to determine whether mtDNA integrity remains intact in distributed cell lines. SNVs were detected using Mutserve at a variant calling threshold of 0.05 (targeted) or 0.01 (WGS). This approach identified 37 SNVs for the ONT WGS data (Figure S1; Table S2) and 42 SNVs for the targeted approach (Figure 2B; Table S3) in line with the original WGS data. Of these 42, 37 were annotated as known polymorphisms. Of the remaining 5, 4 were already detected in the original WGS dataset. The remaining 1 resided in an artifact-prone poly C stretch and was a false positive hit. Read phasing of long ONT reads on the variants called with high-quality WGS data revealed the presence of only 1 clear haplotype because the vast majority of detected variants were homozygous (data not shown).

Metabolome profile analysis

To corroborate these findings, we assessed the basal metabolome profile of KOLF2.1J and compared it to established iPSCs (IMR90, iPSC12) and an in-house-generated Cockayne syndrome disease model (ERCC6^R683X/R683X) iPSC line (Sanchez-Roman et al., 2018). Notably, principal-component analysis (PCA) analysis revealed a cluster of iPSC12, IMR90, and KOLF2.1J that was clearly distinguishable from that of the ERCC6^R683X/R683X line (Figure 3A). This was also reflected by the high number of significantly differently abundant metabolites detected for ERCC6^R683X/R683X compared to the other lines (Figure 3B), whereas comparison of KOLF2.1J with iPSC12 and IMR90 revealed only a low number of differently abundant metabolites (Figure S2). The majority of identified metabolites accounting for the differences of ERCC6^R683X/R683X compared to the other iPSCs was of the superclass organic acids (Figure 3C), including amino acids (Figure 3D), in line with altered cellular metabolism in Cockayne syndrome patients (Sanchez-Roman et al., 2018). Accordingly, we found the amino acids ornithine, proline, and glycine (Figure 3E) to be less abundant in the ERCC6^R683X/R683X disease line compared to the isogenic iPSC12, the IMR90, and the KOLF2.1J reference line. Although the definition of a “healthy” basal iPSC metabolome is beyond the scope of the present study, these results may serve as an indication.

Analysis of untargeted metabolomics data

(A) PCA indicates similarity between analyzed cell lines. Although all of the commercially available human iPSC lines (KOLF2.1J, iPSC12, IMR90) cluster together, the disease iPSC line ERCC6^R683X/R683X is found in a different cluster strongly separated on the first principal component.

(B) Volcano plot showing differentially abundant metabolite features in the basal metabolome of the 3 commercially available iPSC lines (KOLF2.1J, iPSC12, IMR90) compared to the disease cell line ERCC6^R683X/R683X.

(C) Organic acids make up the majority of the detected metabolites.

(D) Amino acids are differentially abundant in ERCC6^R683X/R683X compared to healthy iPSC lines.

(E) Detailed view on selected amino acids with lower abundance in ERCC6^R683X/R683X compared to healthy iPSC lines. Comparison of differentially abundant metabolites in (B) indicates high similarity of metabolomes between the 3 commercially available established iPSC lines compared to a disease model iPSC line.

Discussion

By combining short- and long-read WGS and targeted ONT sequencing data analysis, we performed an in-depth characterization of the KOLF2.1J mtDNA integrity at the time of publication and in the cell line in use in our laboratory. We found no detrimental disease-associated mtDNA SNVs or indels. Although very low yet potentially detrimental heteroplasmy may exist under the detection limit, the high (short-read) WGS coverage depth of ∼300× for most mtDNA positions gives us high confidence in our analysis. Further confidence is provided by our targeted ONT sequencing analysis. Both approaches showed equal performance, supporting the notion that expensive WGS data generation is not strictly necessary when focusing on mtDNA in specific use cases. Furthermore, untargeted metabolomic analysis of KOLF2.1J revealed that the abundance of detected metabolites was in line with regularly used iPSC lines and distinct from that of a disease iPSC line with altered metabolism. Although our experimental setup is not suited to define a “healthy” iPSC profile, the presented data can serve as a starting point to raise awareness of the value of metabolomics data in the stem cell field (Table S4). In summary, these data support the usability of KOLF2.1J as a male-derived reference human iPSC line with respect to mtDNA integrity and metabolism.

These findings highlight that to ensure the absence of disease-related mtDNA mutations, researchers should monitor the mtDNA integrity of iPSC lines whenever feasible (Rossi et al., 2022). Since WGS data generation yields data on mtDNA as a by-product, no additional major effort is necessary to assess mtDNA integrity in such cases. Nevertheless, monitoring mtDNA remains a nontrivial task demanding careful evaluation. The emphasis for future endeavors should be on developing user-friendly software that allows for a cost-effective evaluation of mtDNA integrity. This will facilitate the inclusion of mtDNA analysis as part of the standard quality control measures for newly produced or engineered iPSCs. To some extent, we tackled this issue by conducting a cost-effective experiment using ONT sequencing to examine the mtDNA. It must be noted, however, that variant calling using sequencing data from long-read ONT sequencing needs to be performed carefully. Here, when using FASTQ files directly from the sequencing device (high accuracy mode), Mutserve detected a m625G>A transition located in the tRNA MT-TF at a heteroplasmy level of >0.9. Manual inspection of the binary alignment map (BAM) file actually indicated a 0.5/0.5 G/A alignment. One contributing factor to this problem is the suboptimal performance of Mutserve for noisy sequencing data. In light of the documented detrimental effects of this mutation on cellular fitness (Sudo et al., 2011), we conducted an analysis of m625 using Sanger sequencing (Figure 4A). In contrast to the sequence derived from ONT sequencing, we found no alteration at the position of interest (Figure 4B). This highlights the necessity for a pipeline facilitating precise variant detection via cost-effective long-read ONT sequencing specifically designed for users who possess minimal knowledge of sequence analysis.

Analysis of m625 by Sanger sequencing

(A) To analyze the suspicious position m625, the respective part of the mtDNA was amplified by PCR. Afterward, the PCR product was purified and sent for Sanger sequencing using 2 internal primers.

(B) Sanger sequencing chromatograms using forward (top) and reverse (bottom) primers of KOLF2.1J surrounding the potentially detrimental mutation. Inspection of the chromatograms revealed that the sequence is not altered and the variant calling failed at this position. This warrants careful analysis of mitochondrial DNA mutations. Red arrowheads point to the positions of interest.

Collectively, we extend an invitation to scientists to explore the possibility of mtDNA integrity analysis to assess iPSC quality, particularly when WGS data are accessible, or by using cost-effective ONT sequencing devices. Based on our experience, the targeted ONT sequencing described in this paper yields reliable results down to a heteroplasmy level of 0.05. To correctly represent such a population within a 90% confidence interval, we recommend >6,000 reads for a given position. For homoplasmic variants, as little as 40 reads are sufficient to correctly represent their frequency. This number is likely to further decrease with new releases of the ONT sequencing chemistry. Although read phasing did not yield additional value in the case of the KOLF2.1J line, in general, we recommend performing such an analysis to monitor cosegregation of potentially harmful variants.

In conclusion, the KOLF2.1J iPSC line displayed intact mtDNA integrity at the time of publication, still maintains mtDNA integrity and inconspicuous metabolism, and can thus indeed be used as a reference iPSC line.

Experimental procedures

Resource availability

Further requests should be directed to the corresponding authors jochen.dobner@iuf-duesseldorf.de and andrea.rossi@iuf-duesseldorf.de.

Materials availability

The ERCC6^R683X/R683X iPSC line can be obtained via material transfer agreement.

Data and code availability

Details on the code to reproduce the results can be found at Mendeley Data (https://doi.org/10.17632/n4mccs2zfg.2).

Original short-read WGS data analysis

WGS data were downloaded from the Alzheimer’s Disease Workbench (https://www.alzheimersdata.org/ad-workbench) and preprocessed according to the guidelines of the genome analysis toolkit (GATK; Van der Auwera et al., 2013). Reads from FASTQ files were aligned to the GRCh38 reference genome using bwa mem -K 100000000 -p -v 3 -t 24. Duplicates were marked (gatk MarkDuplicates), and reads mapped to the mtDNA extracted (gatk PrintReads) using the read filters MateOnSameContigOrNoMappedMateReadFilter, MateUnmappedAndUnmappedReadFilter, and NotDuplicateReadFilter. The aligned BAM file was reverted to FASTQ via generation of an unaligned BAM file (gatk RevertSam, gatk SamToFastq), and realigned to either the reference mtDNA genome or a mtDNA reference genome shifted to move the breakage point to the middle of the sequence (bwa mem -K 100000000 -p -v 3 -t 24). Afterward, aligned BAM files were merged with unaligned BAM files (gatk MergeBamAlignment), duplicates filtered (gatk MarkDuplicates and PrintReads) with NotDuplicateReadFilter, and variants called with Mutect2 (gatk Mutect2) on either the control region spanning the breakage point (8,025–9,144 in the shifted reference mtDNA) or the remaining sequence (576–16,024 in the original reference mtDNA). Variant calls from the shifted reference were then lifted over to the correct position numbering using R (R Foundation for Statistical Computing) and the results were merged (gatk MergeVcfs). Finally, calls were filtered (gatk FilterMutectCalls) using -mitochondria mode true -max-alt-allele-count 2 -min-allele-fraction 0.01 and variants left aligned and trimmed to improve visualization (gatk LeftAlignAndTrimVariants). For Mutserve, the bwa-aligned BAM file was used after filtering out duplicates and extracting mtDNA-aligned reads. SNV detection was carried out using default settings and a detection threshold of 0.01. Afterward, the data were processed and visualized using a custom R script.

Targeted long-read ONT sequencing analysis

After total DNA isolation, we enriched for mtDNA by PCR to generate nine overlapping amplicons (Ramos et al., 2009; Zascavage et al., 2019) and sequenced using an ONT minION M1kC device with a Flongle adaptor, as previously described (Nguyen et al., 2022) (ONT). Briefly, pooled PCR products were used to prepare the sequencing library using the Sequencing by Ligation Kit (SQK-LSK109 and NBD104 native barcoding kit) according to the manufacturer’s instructions (ONT). The final library was loaded at a total of 15 fmol on a Flongle flow cell and sequenced for 24 h. Postsequencing, FAST5 files were basecalled with Guppy basecaller (ONT) and the following settings: --min_qscore 10 --require_barcodes_both_ends --enable_trim_barcodes --barcode_kits "EXP-NBD104" -c dna_r9.4.1_450bps_sup.cfg. FASTQ files were concatenated into single FASTQ files and aligned to the rCRS using minimap2 (Li, 2018). Variant calling was performed with Mutserve at a threshold level of 0.05. Afterward, data were processed using a custom R script and visualized using ggplot2 (Wickham, 2016). Sanger sequencing was performed on a PCR amplicon (forward primer: 5′-CTGTATCCGACATCTGGTTCCT-3′, reverse primer: 5′-GTTTAGCTCAGAGCGGTCAAGT-3′) by Microsynth using internal custom forward and reverse primers (forward primer: 5′-CCCTAACACCAGCCTAACCA-3′, reverse primer: 5′-AGGGTGAACTCACTGGAACG-3′).

Untargeted metabolomics

Untargeted metabolomics was conducted using small adaptations of an established protocol for the analysis of regulatory myeloid immune cells by ultra-high performance liquid chromatography-time-of-flight-mass spectrometry (Baumann et al., 2020). Details regarding this specialized experiment can be found in the supplemental information.

Acknowledgments

We thank Kira Frye for excellent technical assistance. We thank Melanie Köhler and Veronika Somoza for fruitful discussions on the project. We thank Haribaskar Ramachandran for the genome editing experiments. We thank Mark Cookson and Caroline Pantazis for sharing the original WGS data.

The IUF is funded by the federal and state governments, the Ministry of Culture and Science of North Rhine-Westphalia (MKW) and the Federal Ministry of Education and Research. We are grateful for the support from the Deutsche Forschungsgemeinschaft (DFG) (RO 5380/1-1, PR1527/5-1, and PR1527/6-1), the European Joint Programme for Rare Diseases and German Federal Ministry of Education and Research (01GM2002A), the Leigh Syndrome Consortium and AFM Telethon (AR-25179), and the Leibniz Competition (SAW) Cooperative Excellence project (K246/2019).

Author contributions

Conceptualization, J.D. and A.R. Methodology, J.D., T.N., and A.D. Investigation, J.D., and A.D. Formal analysis, J.D. and A.D. Data curation, J.D., T.N., and A.D. Writing – original draft, J.D. and A.R. Writing – review & editing, T.N., A.D., A.P., and J.K. Visualization, J.D. and A.D. Resources, J.K. and A.R. Funding acquisition, A.P., J.K., and A.R. Project administration, J.D. and A.R. Supervision, A.R.

Declaration of interests

The authors declare no competing interests.

Published: February 22, 2024

Footnotes

Supplemental information can be found online at https://doi.org/10.1016/j.stemcr.2024.01.009.

Contributor Information

Jochen Dobner, Email: jochen.dobner@iuf-duesseldorf.de.

Andrea Rossi, Email: andrea.rossi@iuf-duesseldorf.de.

Supplemental information

Document S1. Supplemental experimental procedures and Figures S1 and S2

mmc1.pdf^{(307.6KB, pdf)}

Table S1. Identified and annotated SNVs of short-read WGS, related to Figure 2

SNV information on the KOLF2.1J reference line derived from short-read WGS. Detected with Mutect2 mitochondria mode and Mutserve.

mmc2.xlsx^{(22.3KB, xlsx)}

Table S2. Identified and annotated SNVs of long-read nanopore WGS, related to the mtDNA analysis Results section

SNV information on the KOLF2.1J reference line derived from long-read WGS. Detected with Mutserve.

mmc3.xlsx^{(18.8KB, xlsx)}

Table S3. Identified and annotated SNVs of targeted long-read nanopore amplicon sequencing, related to the mtDNA analysis Results section

SNV information on the KOLF2.1J reference line derived from targeted long-read mtDNA sequencing. Detected with Mutserve.

mmc4.xlsx^{(19KB, xlsx)}

Table S4. Metabolites with significantly higher abundance for KOLF2.1J/iPSC12/IMR90 compared to ERCC6R683X/R683X, related to Figure 3

mmc5.xlsx^{(1,016KB, xlsx)}

Document S2. Article plus supplemental information

mmc6.pdf^{(4.3MB, pdf)}

References

Albayrak L., Khanipov K., Pimenova M., Golovko G., Rojas M., Pavlidis I., Chumakov S., Aguilar G., Chávez A., Widger W.R., Fofanov Y. The ability of human nuclear DNA to cause false positive low-abundance heteroplasmy calls varies across the mitochondrial genome. BMC Genom. 2016;17:1017. doi: 10.1186/s12864-016-3375-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Baumann T., Dunkel A., Schmid C., Schmitt S., Hiltensperger M., Lohr K., Laketa V., Donakonda S., Ahting U., Lorenz-Depiereux B., et al. Regulatory myeloid cells paralyze T cells through cell–cell transfer of the metabolite methylglyoxal. Nat. Immunol. 2020;21:555–566. doi: 10.1038/s41590-020-0666-9. [DOI] [PubMed] [Google Scholar]
Chinnery P.F., Gomez-Duran A. Oldies but Goldies mtDNA Population Variants and Neurodegenerative Diseases. Front. Neurosci. 2018;12 doi: 10.3389/fnins.2018.00682. [DOI] [PMC free article] [PubMed] [Google Scholar]
Deuse T., Hu X., Agbor-Enoh S., Koch M., Spitzer M.H., Gravina A., Alawi M., Marishta A., Peters B., Kosaloglu-Yalcin Z., et al. De novo mutations in mitochondrial DNA of iPSCs produce immunogenic neoepitopes in mice and humans. Nat. Biotechnol. 2019;37:1137–1144. doi: 10.1038/s41587-019-0227-7. [DOI] [PubMed] [Google Scholar]
Hämäläinen R.H. Mitochondrial DNA mutations in iPS cells: mtDNA integrity as standard iPSC selection criteria? EMBO J. 2016;35:1960–1962. doi: 10.15252/embj.201695185. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hirschi K.K., Li S., Roy K. Induced Pluripotent Stem Cells for Regenerative Medicine. Annu. Rev. Biomed. Eng. 2014;16:277–294. doi: 10.1146/annurev-bioeng-071813-105108. [DOI] [PMC free article] [PubMed] [Google Scholar]
Laricchia K.M., Lake N.J., Watts N.A., Shand M., Haessly A., Gauthier L., Benjamin D., Banks E., Soto J., Garimella K., et al. Mitochondrial DNA variation across 56,434 individuals in gnomAD. Genome Res. 2022;32:569–582. doi: 10.1101/gr.276013.121. [DOI] [PMC free article] [PubMed] [Google Scholar]
Li H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. 2018;34:3094–3100. doi: 10.1093/bioinformatics/bty191. [DOI] [PMC free article] [PubMed] [Google Scholar]
Li H., Durbin R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
Li M., Schroeder R., Ko A., Stoneking M. Fidelity of capture-enrichment for mtDNA genome sequencing: influence of NUMTs. Nucleic Acids Res. 2012;40:e137. doi: 10.1093/nar/gks499. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lisowski P., Kannan P., Mlody B., Prigione A. Mitochondria and the dynamic control of stem cell homeostasis. EMBO Rep. 2018;19 doi: 10.15252/embr.201745432. [DOI] [PMC free article] [PubMed] [Google Scholar]
Maude H., Davidson M., Charitakis N., Diaz L., Bowers W.H.T., Gradovich E., Andrew T., Huntley D. NUMT Confounding Biases Mitochondrial Heteroplasmy Calls in Favor of the Reference Allele. Front. Cell Dev. Biol. 2019;7 doi: 10.3389/fcell.2019.00201. [DOI] [PMC free article] [PubMed] [Google Scholar]
Nguyen T., Ramachandran H., Martins S., Krutmann J., Rossi A. Identification of genome edited cells using CRISPRnano. Nucleic Acids Res. 2022;50:W199–W203. doi: 10.1093/nar/gkac440. [DOI] [PMC free article] [PubMed] [Google Scholar]
Pantazis C.B., Yang A., Lara E., McDonough J.A., Blauwendraat C., Peng L., Oguro H., Kanaujiya J., Zou J., Sebesta D., et al. A reference human induced pluripotent stem cell line for large-scale collaborative studies. Cell Stem Cell. 2022;29:1685–1702.e22. doi: 10.1016/j.stem.2022.11.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
Perales-Clemente E., Cook A.N., Evans J.M., Roellinger S., Secreto F., Emmanuele V., Oglesbee D., Mootha V.K., Hirano M., Schon E.A., et al. Natural underlying mtDNA heteroplasmy as a potential source of intra-person hiPSC variability. EMBO J. 2016;35:1979–1990. doi: 10.15252/embj.201694892. [DOI] [PMC free article] [PubMed] [Google Scholar]
Prigione A., Lichtner B., Kuhl H., Struys E.A., Wamelink M., Lehrach H., Ralser M., Timmermann B., Adjaye J. Human Induced Pluripotent Stem Cells Harbor Homoplasmic and Heteroplasmic Mitochondrial DNA Mutations While Maintaining Human Embryonic Stem Cell–like Metabolic Reprogramming. Stem Cell. 2011;29:1338–1348. doi: 10.1002/stem.683. [DOI] [PubMed] [Google Scholar]
Ramos A., Santos C., Alvarez L., Nogués R., Aluja M.P. Human mitochondrial DNA complete amplification and sequencing: A new validated primer set that prevents nuclear DNA sequences of mitochondrial origin co-amplification. ELECTROPHORESIS. 2009;30:1587–1593. doi: 10.1002/elps.200800601. [DOI] [PubMed] [Google Scholar]
Rossi A., Lickfett S., Martins S., Prigione A. A call for consensus guidelines on monitoring the integrity of nuclear and mitochondrial genomes in human pluripotent stem cells. Stem Cell Rep. 2022;17:707–710. doi: 10.1016/j.stemcr.2022.01.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
Sanchez-Roman I., Lautrup S., Aamann M.D., Neilan E.G., Østergaard J.R., Stevnsner T. Two Cockayne Syndrome patients with a novel splice site mutation – clinical and metabolic analyses. Mech. Ageing Dev. 2018;175:7–16. doi: 10.1016/j.mad.2018.06.001. [DOI] [PubMed] [Google Scholar]
Sudo A., Takeichi N., Hosoki K., Saitoh S. Successful cochlear implantation in a patient with mitochondrial hearing loss and m.625G>A transition. J. Laryngol. Otol. 2011;125:1282–1285. doi: 10.1017/S0022215111002453. [DOI] [PubMed] [Google Scholar]
Van der Auwera G.A., Carneiro M.O., Hartl C., Poplin R., del Angel G., Levy-Moonshine A., Jordan T., Shakir K., Roazen D., Thibault J., et al. From FastQ Data to High-Confidence Variant Calls: The Genome Analysis Toolkit Best Practices Pipeline. Curr. Protoc. Bioinformatics. 2013;43:11.10.1–11.10.33. doi: 10.1002/0471250953.bi1110s43. [DOI] [PMC free article] [PubMed] [Google Scholar]
Volpato V., Webber C. Addressing variability in iPSC-derived models of human disease: guidelines to promote reproducibility. Dis. Model. Mech. 2020;13 doi: 10.1242/dmm.042317. [DOI] [PMC free article] [PubMed] [Google Scholar]
Weissensteiner H., Forer L., Fendt L., Kheirkhah A., Salas A., Kronenberg F., Schoenherr S. Contamination detection in sequencing studies using the mitochondrial phylogeny. Genome Res. 2021;31:309–316. doi: 10.1101/gr.256545.119. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wei W., Gaffney D.J., Chinnery P.F. Cell reprogramming shapes the mitochondrial DNA landscape. Nat. Commun. 2021;12:5241. doi: 10.1038/s41467-021-25482-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wei W., Schon K.R., Elgar G., Orioli A., Tanguy M., Giess A., Tischkowitz M., Caulfield M.J., Chinnery P.F. Nuclear-embedded mitochondrial DNA sequences in 66,083 human genomes. Nature. 2022;611:105–114. doi: 10.1038/s41586-022-05288-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wickham H. Springer-Verlag New York; 2016. ggplot2: Elegant Graphics for Data Analysis. [Google Scholar]
Zascavage R.R., Hall C.L., Thorson K., Mahmoud M., Sedlazeck F.J., Planz J.V. Approaches to Whole Mitochondrial Genome Sequencing on the Oxford Nanopore MinION. Curr. Protoc. Hum. Genet. 2019;104:e94. doi: 10.1002/cphg.94. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. Supplemental experimental procedures and Figures S1 and S2

mmc1.pdf^{(307.6KB, pdf)}

Table S1. Identified and annotated SNVs of short-read WGS, related to Figure 2

SNV information on the KOLF2.1J reference line derived from short-read WGS. Detected with Mutect2 mitochondria mode and Mutserve.

mmc2.xlsx^{(22.3KB, xlsx)}

Table S2. Identified and annotated SNVs of long-read nanopore WGS, related to the mtDNA analysis Results section

SNV information on the KOLF2.1J reference line derived from long-read WGS. Detected with Mutserve.

mmc3.xlsx^{(18.8KB, xlsx)}

Table S3. Identified and annotated SNVs of targeted long-read nanopore amplicon sequencing, related to the mtDNA analysis Results section

SNV information on the KOLF2.1J reference line derived from targeted long-read mtDNA sequencing. Detected with Mutserve.

mmc4.xlsx^{(19KB, xlsx)}

Table S4. Metabolites with significantly higher abundance for KOLF2.1J/iPSC12/IMR90 compared to ERCC6R683X/R683X, related to Figure 3

mmc5.xlsx^{(1,016KB, xlsx)}

Document S2. Article plus supplemental information

mmc6.pdf^{(4.3MB, pdf)}

Data Availability Statement

Details on the code to reproduce the results can be found at Mendeley Data (https://doi.org/10.17632/n4mccs2zfg.2).

[bib1] Albayrak L., Khanipov K., Pimenova M., Golovko G., Rojas M., Pavlidis I., Chumakov S., Aguilar G., Chávez A., Widger W.R., Fofanov Y. The ability of human nuclear DNA to cause false positive low-abundance heteroplasmy calls varies across the mitochondrial genome. BMC Genom. 2016;17:1017. doi: 10.1186/s12864-016-3375-x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib2] Baumann T., Dunkel A., Schmid C., Schmitt S., Hiltensperger M., Lohr K., Laketa V., Donakonda S., Ahting U., Lorenz-Depiereux B., et al. Regulatory myeloid cells paralyze T cells through cell–cell transfer of the metabolite methylglyoxal. Nat. Immunol. 2020;21:555–566. doi: 10.1038/s41590-020-0666-9. [DOI] [PubMed] [Google Scholar]

[bib3] Chinnery P.F., Gomez-Duran A. Oldies but Goldies mtDNA Population Variants and Neurodegenerative Diseases. Front. Neurosci. 2018;12 doi: 10.3389/fnins.2018.00682. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib4] Deuse T., Hu X., Agbor-Enoh S., Koch M., Spitzer M.H., Gravina A., Alawi M., Marishta A., Peters B., Kosaloglu-Yalcin Z., et al. De novo mutations in mitochondrial DNA of iPSCs produce immunogenic neoepitopes in mice and humans. Nat. Biotechnol. 2019;37:1137–1144. doi: 10.1038/s41587-019-0227-7. [DOI] [PubMed] [Google Scholar]

[bib5] Hämäläinen R.H. Mitochondrial DNA mutations in iPS cells: mtDNA integrity as standard iPSC selection criteria? EMBO J. 2016;35:1960–1962. doi: 10.15252/embj.201695185. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib6] Hirschi K.K., Li S., Roy K. Induced Pluripotent Stem Cells for Regenerative Medicine. Annu. Rev. Biomed. Eng. 2014;16:277–294. doi: 10.1146/annurev-bioeng-071813-105108. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib7] Laricchia K.M., Lake N.J., Watts N.A., Shand M., Haessly A., Gauthier L., Benjamin D., Banks E., Soto J., Garimella K., et al. Mitochondrial DNA variation across 56,434 individuals in gnomAD. Genome Res. 2022;32:569–582. doi: 10.1101/gr.276013.121. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib8] Li H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. 2018;34:3094–3100. doi: 10.1093/bioinformatics/bty191. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib9] Li H., Durbin R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib10] Li M., Schroeder R., Ko A., Stoneking M. Fidelity of capture-enrichment for mtDNA genome sequencing: influence of NUMTs. Nucleic Acids Res. 2012;40:e137. doi: 10.1093/nar/gks499. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib11] Lisowski P., Kannan P., Mlody B., Prigione A. Mitochondria and the dynamic control of stem cell homeostasis. EMBO Rep. 2018;19 doi: 10.15252/embr.201745432. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib12] Maude H., Davidson M., Charitakis N., Diaz L., Bowers W.H.T., Gradovich E., Andrew T., Huntley D. NUMT Confounding Biases Mitochondrial Heteroplasmy Calls in Favor of the Reference Allele. Front. Cell Dev. Biol. 2019;7 doi: 10.3389/fcell.2019.00201. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib13] Nguyen T., Ramachandran H., Martins S., Krutmann J., Rossi A. Identification of genome edited cells using CRISPRnano. Nucleic Acids Res. 2022;50:W199–W203. doi: 10.1093/nar/gkac440. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib14] Pantazis C.B., Yang A., Lara E., McDonough J.A., Blauwendraat C., Peng L., Oguro H., Kanaujiya J., Zou J., Sebesta D., et al. A reference human induced pluripotent stem cell line for large-scale collaborative studies. Cell Stem Cell. 2022;29:1685–1702.e22. doi: 10.1016/j.stem.2022.11.004. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib15] Perales-Clemente E., Cook A.N., Evans J.M., Roellinger S., Secreto F., Emmanuele V., Oglesbee D., Mootha V.K., Hirano M., Schon E.A., et al. Natural underlying mtDNA heteroplasmy as a potential source of intra-person hiPSC variability. EMBO J. 2016;35:1979–1990. doi: 10.15252/embj.201694892. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib16] Prigione A., Lichtner B., Kuhl H., Struys E.A., Wamelink M., Lehrach H., Ralser M., Timmermann B., Adjaye J. Human Induced Pluripotent Stem Cells Harbor Homoplasmic and Heteroplasmic Mitochondrial DNA Mutations While Maintaining Human Embryonic Stem Cell–like Metabolic Reprogramming. Stem Cell. 2011;29:1338–1348. doi: 10.1002/stem.683. [DOI] [PubMed] [Google Scholar]

[bib17] Ramos A., Santos C., Alvarez L., Nogués R., Aluja M.P. Human mitochondrial DNA complete amplification and sequencing: A new validated primer set that prevents nuclear DNA sequences of mitochondrial origin co-amplification. ELECTROPHORESIS. 2009;30:1587–1593. doi: 10.1002/elps.200800601. [DOI] [PubMed] [Google Scholar]

[bib18] Rossi A., Lickfett S., Martins S., Prigione A. A call for consensus guidelines on monitoring the integrity of nuclear and mitochondrial genomes in human pluripotent stem cells. Stem Cell Rep. 2022;17:707–710. doi: 10.1016/j.stemcr.2022.01.019. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib19] Sanchez-Roman I., Lautrup S., Aamann M.D., Neilan E.G., Østergaard J.R., Stevnsner T. Two Cockayne Syndrome patients with a novel splice site mutation – clinical and metabolic analyses. Mech. Ageing Dev. 2018;175:7–16. doi: 10.1016/j.mad.2018.06.001. [DOI] [PubMed] [Google Scholar]

[bib20] Sudo A., Takeichi N., Hosoki K., Saitoh S. Successful cochlear implantation in a patient with mitochondrial hearing loss and m.625G>A transition. J. Laryngol. Otol. 2011;125:1282–1285. doi: 10.1017/S0022215111002453. [DOI] [PubMed] [Google Scholar]

[bib21] Van der Auwera G.A., Carneiro M.O., Hartl C., Poplin R., del Angel G., Levy-Moonshine A., Jordan T., Shakir K., Roazen D., Thibault J., et al. From FastQ Data to High-Confidence Variant Calls: The Genome Analysis Toolkit Best Practices Pipeline. Curr. Protoc. Bioinformatics. 2013;43:11.10.1–11.10.33. doi: 10.1002/0471250953.bi1110s43. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib22] Volpato V., Webber C. Addressing variability in iPSC-derived models of human disease: guidelines to promote reproducibility. Dis. Model. Mech. 2020;13 doi: 10.1242/dmm.042317. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib23] Weissensteiner H., Forer L., Fendt L., Kheirkhah A., Salas A., Kronenberg F., Schoenherr S. Contamination detection in sequencing studies using the mitochondrial phylogeny. Genome Res. 2021;31:309–316. doi: 10.1101/gr.256545.119. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib24] Wei W., Gaffney D.J., Chinnery P.F. Cell reprogramming shapes the mitochondrial DNA landscape. Nat. Commun. 2021;12:5241. doi: 10.1038/s41467-021-25482-x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib25] Wei W., Schon K.R., Elgar G., Orioli A., Tanguy M., Giess A., Tischkowitz M., Caulfield M.J., Chinnery P.F. Nuclear-embedded mitochondrial DNA sequences in 66,083 human genomes. Nature. 2022;611:105–114. doi: 10.1038/s41586-022-05288-7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib26] Wickham H. Springer-Verlag New York; 2016. ggplot2: Elegant Graphics for Data Analysis. [Google Scholar]

[bib27] Zascavage R.R., Hall C.L., Thorson K., Mahmoud M., Sedlazeck F.J., Planz J.V. Approaches to Whole Mitochondrial Genome Sequencing on the Oxford Nanopore MinION. Curr. Protoc. Hum. Genet. 2019;104:e94. doi: 10.1002/cphg.94. [DOI] [PubMed] [Google Scholar]

PERMALINK

Mitochondrial DNA integrity and metabolome profile are preserved in the human induced pluripotent stem cell reference line KOLF2.1J

Jochen Dobner

Thach Nguyen

Andreas Dunkel

Alessandro Prigione

Jean Krutmann

Andrea Rossi

Summary

Highlights

Introduction

Results

mtDNA integrity analysis

Figure 1.

Figure 2.

Metabolome profile analysis

Figure 3.

Discussion

Figure 4.

Experimental procedures

Resource availability

Materials availability

Data and code availability

Original short-read WGS data analysis

Targeted long-read ONT sequencing analysis

Untargeted metabolomics

Acknowledgments

Author contributions

Declaration of interests

Footnotes

Contributor Information

Supplemental information

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases