Skip to main content
Journal of Assisted Reproduction and Genetics logoLink to Journal of Assisted Reproduction and Genetics
. 2023 Jul 13;40(9):2185–2196. doi: 10.1007/s10815-023-02880-2

Modeling-based prediction tools for preimplantation genetic testing of mitochondrial DNA diseases: estimating symptomatic thresholds, risk, and chance of success

Dongmei Ji 1,2,3,#, Ning Zhang 1,4,5,#, Weiwei Zou 1,2,3,#, Zhikang Zhang 1,2,5, Jordan Lee Marley 6, Zhuoli Liu 7, Chunmei Liang 1,2,3, Lingchao Shen 1,2,5, Yajing Liu 1,2,3, Dan Liang 1,2,3, Tianhong Su 8,9,, Yinan Du 4,, Yunxia Cao 1,2,3,
PMCID: PMC10440331  PMID: 37439868

Abstract

Purpose

Preimplantation genetic testing (PGT) has become a reliable tool for preventing the germline transmission of mitochondrial DNA (mtDNA) variants. However, procedures are not standardized across mtDNA variants. In this study, we aim to estimate symptomatic thresholds, risk, and chance of success for PGT for mtDNA pathogenic variant carriers.

Methods

We performed a systematic analysis of heteroplasmy data including 455 individuals from 187 familial pedigrees with the common m.3243A>G, m.8344A>G, or m.8993T>G pathogenic variants. We applied binary logistic regression for estimating symptomatic thresholds of heteroplasmy, simplified Sewell-Wright formula and Kimura equations for predicting the risk of disease transmission, and binomial distribution for predicting minimum oocyte numbers.

Results

We estimated the symptomatic thresholds of m.8993T>G and m.8344A>G as 29.86% and 16.15%, respectively. We could not determine a threshold for m.3243A>G. We established models for mothers harboring common and rare mtDNA pathogenic variants to predict the risk of disease transmission and the number of oocytes required to produce an embryo with sufficiently low variant load. In addition, we provide a table allowing the prediction of transmission risk and the minimum required oocytes for PGT patients with different variant levels.

Conclusion

We have established models that can determine the symptomatic thresholds of common mtDNA pathogenic variants. We also constructed universal models applicable to nearly all mtDNA pathogenic variants which can predict risk and minimum numbers for PGT patients. These models have advanced our understanding of mtDNA disease pathogenesis and will enable more effective prevention of disease transmission using PGT.

Supplementary Information

The online version contains supplementary material available at 10.1007/s10815-023-02880-2.

Keywords: Mitochondrial diseases, Mitochondrial DNA variants, PGT, Genetic counseling, Heteroplasmy

Introduction

Mitochondrial diseases are among the most common genetic disorders with an estimated incidence of 1:4300 [1]. Some mitochondrial diseases are caused by maternally inherited variants in mitochondrial DNA (mtDNA). Pathogenic mtDNA variants are estimated to be present in 0.5% of the population; and to date, over 300 variants have been reported to result in devastating neurological diseases such as MELAS and Leigh syndrome [25]. As mtDNA is maternally inherited, females carrying mtDNA variants have a risk of transmitting pathogenic variants to their offspring. Nowadays, some reproductive methods that reduce the risk of transmitting mtDNA variants have already been developed such as prenatal testing, preimplantation genetic testing (PGT), and mitochondrial donation [6]. It is worth noting that various options will be appropriate for different individuals, and determining which option is the most appropriate is not always straightforward and necessitates contributions from both mitochondrial and fertility specialists.

Pathogenic variants in mtDNA can exist alongside wild-type mtDNA molecules within cells (heteroplasmy), and variant load can be measured as a percentage of total mtDNA. For most mtDNA pathogenic variants, patients develop symptoms when variant load surpasses a specific threshold, and severity of disease correlates with variant load thereafter [7, 8]. There is considerable variation of variant load across oocytes and offspring of a woman due to a bottleneck in mtDNA inheritance, making the transmission of mtDNA diseases unpredictable [911]. Preimplantation genetic testing (PGT) allows the determination of variant load in early embryos and the selection of optimal embryos for implantation, reducing the risk of disease transmission. However, to date, there are no standardized PGT procedures for reducing the risk of germline transmission of different mtDNA variants. Specifically, only limited criteria have been established for selecting eligible embryos and determining the number of oocytes required to ensure acquisition of a suitable embryo.

Here, we developed integrative models that enabled us to predict the probability of carriers with different mtDNA variants being symptomatic. We also used these models to estimate symptomatic thresholds for the same variants. In addition, we established standardized tools that predict risk of disease transmission and the number of oocytes needed for retrieval in PGT to reduce this risk. These models can be adapted to numerous mtDNA variants.

Materials and methods

Establishment of common mtDNA variant databases (m.3243A>G, m.8344A>G, m.8993T>G)

m.3243A>G, m.8344A>G, m.8993T>G are the three most common pathogenic mtDNA variants and thus were selected for modeling. A mitochondrial pedigree database was established prior to the development of prediction models. Mitochondrial pedigrees were derived from Mitomap (https://www.mitomap.org/MITOMAP) and the Reproductive Medicine Center, the First Affiliated Hospital of Anhui Medical University.

Quantitative analysis of the mtDNA variant load

DNA was extracted using Magnetic Universal Genomic DNA Kit (Tiangen, China). Whole mtDNA was amplified via PCR using REPLI-g Single-Cell Kit (Qiagen, Germany) using primers 2120F GGACACTAGGAAAAAACCTTGTAGAGAGAG and 2119R AAAGAGCTGTTCCTCTTTGGACTAACA, and isolated via gel extraction. Amplified DNA was sheared to fragments of 200 bp and repaired using a DNA end repairing agent (NGS Fast DNA Library Prep Set for Illumina, Joy Orient, China). Products were blunt-end-ligated to adapters using T4 DNA ligase (NGS Fast DNA Library Prep Set for Illumina, Joy Orient, China) and amplified by four to six rounds of ligation-mediated PCR, followed by magnetic bead purification (NGS Fast DNA Library Prep Set for Illumina, Joy Orient, China). NGS libraries were prepared and sequenced on the Novaseq6000 platform (Illumina, USA), using 150-bp-long paired-end reads. Quality control was performed and low-quality sequences were removed. Sequences of mtDNA were aligned to the Cambridge reference sequence NC_012920 using BWA. Variants were filtered based on genotype quality and read depth, and reliable variants were referenced against the MITOMAP database.

Classification and selection of pedigrees

All pedigrees were divided into 3 categories: familial, de novo, and uninformative. Pedigrees were considered “familial” if family history was positive (based on clinical symptoms and the presence of mtDNA variants in maternal relatives). Pedigrees were considered “de novo” if the family history was negative, and mtDNA variant levels of a proband’s mother and all tested maternal relatives were 0%. The remaining pedigrees were considered “uninformative.”

Uninformative pedigrees were excluded from the database. Within familial and de novo pedigrees, all reported mtDNA variants were analyzed using the Mitomap database. Their pathogenicity was predicted by MitoTIP (for rRNA/tRNA variants) [12] and HmtVar (for coding and non-coding/control region variants) [13]. Only those with a status of pathogenicity “confirmed” or “reported,” and with prediction of “pathogenic” or “probably pathogenic,” were included in the analysis. Furthermore, variants without relevant pedigree information, variants not associated with neuromuscular diseases, and cancer-related variants were excluded. Finally, a total of 47 RNA-related variants and 27 protein-related variants were included. Detailed inclusion and exclusion processes were presented in Fig. S1. mtDNA variant levels of muscle samples from symptomatic individuals carrying the above variants were collected. A total of 269 familial and 97 de novo mtDNA variants were used in the analysis.

D’Agostino and Pearson test and QQ plots showed that neither data set was consistent with a normal distribution (Fig. S2A, Table S1). The mtDNA variant levels of patients with clinical symptoms were higher in familial pedigrees compared with de novo pedigrees (P < 0.05) (Fig. S2B and S2C). As a result, only familial pedigrees were included in the analysis.

Correction and processing of variant load data

mtDNA variant load of blood and muscle was averaged for each patient before analysis. For m.8344A>G and m.8993T>G, the mtDNA variant levels in blood and muscle are usually similar and are relatively constant over time [1416]. For m.3243A>G, as the mtDNA variant load in the blood of carriers normally decreases over time, blood data were age corrected (Eq. 1) [14, 17, 18]. To avoid over-correcting, only those lower than 95% after age correction were included in the analysis. In consideration of the limitations of detection sensitivity, in the case where a mother was an mtDNA variant carrier with clinical symptoms and her offspring had very low heteroplasmy (close to 0%), variant load was also included in the analysis as 1%.

Age-adjustedbloodlevel=Bloodheteroplasmy/0.977age+12 1

All variant carriers were divided into two groups, “symptomatic” or “asymptomatic,” based on the occurrence of disease. Carriers with mild symptom were considered “symptomatic” unless the symptoms were extremely common in the patient’s age group (such as dementia in patients over 80).

Predicting the probability of symptomatic disease and estimating symptomatic thresholds

Binary logistic regression was used for predicting the probability of being clinically symptomatic based on variant load (Eq. 2). Mean mtDNA variant load value from blood and muscle was the independent variable (x) with a probability of being symptomatic as the dependent variable (y). The model included parameters 0 and 1, controlling position and the shape of the function, respectively (Eq. 2). According to Bayes’ theorem, the proportion of symptomatic individuals in pedigrees (prior probability of being symptomatic) was required to determine the probability of a carrier being symptomatic based on variant load. The prior probability of being symptomatic was used for weighting data, which was estimated by the ratio of symptomatic siblings (excluding probands) to total siblings. Data from the probands were excluded to minimize the ascertainment bias [7].

Lny1-y=β0+β1x 2

Using the symptomatic probability prediction model and setting a 95% or higher chance of being asymptomatic as the cut-off point, the corresponding mtDNA variant load was determined as the symptomatic threshold for an mtDNA variant (s).

Predicting the risk of disease transmission

Due to the positive correlation between symptomatic risk and variant load, a distribution model of mtDNA variant load across offspring was first constructed to predict risk. The simplified Sewell-Wright formula (Eqs. 3 and 4) [19] and Kimura equations (Eqs. 57) [20] were used to establish the distribution model. The simplified Sewell-Wright formula is a function of four parameters: p0 (variant load of the mother), t (number of generations), Neff (effective population size), and V (variance of mtDNA variant load across oocytes/offspring). p0 and V (both range from 0 to 1) were substituted into the formula to calculate b (bottleneck parameter) (Eq. 3). p0 and b were then substituted into Kimura equations (Eqs. 57) and the distribution of mtDNA variant load calculated. Numerical calculation methods were as previously reported Wonnapinij et al. [20, 21].

V=p01-p01-e-t/Neff=p01-p01-b 3
b=e-t/Neff 4
f(0)=1-p0+i=12i+1p01-p0-1iF1-i,i+2,2,1-p0bii+1/2 5
(x)=i=1ii+12i+1p01-p0F1-i,i+2,2,xF1-i,i+2,2,p0bii+1/2 6
f(1)=p0+i=12i+1p01-p0-1iF1-i,i+2,2,p0bii+1/2 7

Based on the symptomatic threshold s for specific mtDNA variants and the distribution model, the risk of disease transmission (the cumulative probability of offspring carrying variant load between s and 100%) was calculated.

Establishment of a model to predict the minimum oocytes for retrieval

Suitable embryos for transfer were defined as those with a probability higher than 95% of carrying a variant load below s. Based on the risk of disease transmission (p), assuming that X number of oocytes are required to ensure acquisition of at least A suitable embryos, and that k is the proportion of all the oocytes (including mature and immature oocytes) retrieved during the oocyte collection procedure that can develop into a transplantable embryo (the transplantable embryo means it is considered to have the potential to result in a healthy pregnancy and live birth), an oocyte retrieval prediction model was developed using the binomial distribution (Eqs. 8 and 9).

i=0A-1Ckxipi1-pkX-i<0.05 8

In particular, when A = 1, the model can be simplified as

X>log1-p0.05k 9

Statistics

GraphPad Prism 8 (GraphPad Software Inc., La Jolla, CA, USA) and MATLAB R2021b (The MathWorks, Inc., Natick, MA, USA) were used for statistical modeling and plotting. Data consistent with normal distribution are presented as mean ± standard deviation (SD), otherwise as median and interquartile range. Mann–Whitney tests were used to compare heteroplasmy data. All data were calculated with 95% CIs.

Results

Characteristics of the database

We first divided individuals carrying m.8993T>G, m.8344A>G, or m.3243A>G into 3 datasets, each including symptomatic and asymptomatic carriers, and analyzed their heteroplasmy in blood and muscle samples. D’Agostino and Pearson test and QQ plots showed that none of the datasets were consistent with a normal distribution (Fig. 1A and Table S2). The variant load of symptomatic m.8993T>G carriers was higher than that of symptomatic carriers of m.8344A>G and m.3243A>G (median = 89%, IQR = 78–95% vs. median = 80%, IQR = 64–90% and median = 70%, IQR = 55–81%, respectively, P < 0.001) (Fig. 1B and 1C), suggesting that the symptomatic threshold of m.8993T>G carriers is higher than that of m.8344A>G and m.3243A>G carriers.

Fig. 1.

Fig. 1

The distribution of variant load of m.8993T>G, m.8344A>G, and m.3243A>G in the symptomatic individuals (sym.) in familial pedigrees. A Quantile-quantile (QQ) plots showing distribution of variant load in symptomatic carriers of m.8993T>G (blue), m.8344A>G (orange), and m.3243A>G (purple). B The distribution of variant load of symptomatic individuals for each variant (minimum, 25% quartile, median, 75% quartile, maximum). C Cumulative (cum) number of symptomatic individuals relative to all carriers for each variant with variant load equal to or lower than a certain level. *** P < 0.001

The distribution of m.8993T>G variant load in the asymptomatic individuals was consistent with normal distribution, while the distribution of variant load in the symptomatic individuals was not (Fig. S3A and Table S3). The variant load of symptomatic individuals was higher than that of asymptomatic individuals (median = 89%, IQR= 78–95% vs. median = 50%, IQR=40–77%, P < 0.001) (Fig. S3B and S3C). Analysis of m.8344A>G carriers indicated that variant load was not normally distributed in asymptomatic nor symptomatic individuals (Fig. S4A and Table S4). The variant load of symptomatic individuals was higher than that of asymptomatic individuals (median = 80%, IQR = 64–90% vs. median = 28%, IQR = 1–59%, P < 0.001) (Fig. S4B and S4C). The distribution of m.3243A>G variant load was also not consistent with normal distribution in asymptomatic nor symptomatic individuals (Fig. S5A and Table S5). The variant load of symptomatic individuals was higher than that of asymptomatic individuals (median = 70%, IQR = 55–81% vs. median = 42%, IQR = 20–63%, P < 0.001) (Fig. S5B and S5C).

Predicting the probability of carriers being symptomatic and estimating symptomatic thresholds for common mtDNA variants

After pre-processing, our database included the heteroplasmy data of 455 individuals from 187 familial pedigrees (Tables S6–S8), encompassing 87 datapoints from 42 m.8993T>G pedigrees (35 clinically asymptomatic and 52 symptomatic), 152 from 37 m.8344A>G pedigrees (88 asymptomatic and 64 symptomatic), and 216 from 108 m.3243A>G pedigrees (85 asymptomatic and 131 symptomatic).

Based on our m.8993T>G data, we calculated the prior probability of carriers being symptomatic as 0.56, 0 as − 5.451 (95%, CI − 5.816 to − 5.097), and 1 as 8.395 (95%, CI 7.920 to 8.885). Using the equation Lny1-y=-5.451+8.395x, we estimated the symptomatic threshold as 29.86% for m.8993T>G (Fig. 2A). The area under the ROC curve (AUC) of our model was 0.847 (95% CI 0.834 to 0.860), indicating our model was a good fit to the data (Fig. 2B).

Fig. 2.

Fig. 2

Prediction model of the symptomatic threshold for common mtDNA variants. A Estimated probability of being symptomatic (95% CIs) for specific variant load of m.8993T>G. Carriers with variant load under 29.86% are predicted to be clinically symptomatic by our model (dashed line). B The ROC curve of the m.8993T>G threshold prediction model, with an AUC of 0.847. C Estimated probability of being symptomatic (95% CIs) for specific variant load of m.8344A>G. Carriers with variant load under 16.15% are predicted to be clinically symptomatic by our model (dashed line). D The ROC curve of the m.8344A>G threshold prediction model, with an AUC of 0.867. E Estimated probability of being symptomatic (95% CIs) for specific variant load of m.3243A>G. Carriers with variant load of 0% were predicted to have a probability of 15.5% of being symptomatic. F The ROC curve of the m.3243A>G threshold prediction model, with an AUC of 0.761. A, C, E x-axis represents the variant load of samples and the y-axis the probability of an individual being symptomatic. Outer lines represent CIs. The x-axis and the y-axis are the false positive rate and true positive rate of the prediction model, respectively

For m.8344A>G, we calculated the general prior probability of being symptomatic as 0.37, 0 as − 3.827 (95%, CI − 4.006 to − 3.654), and 1 as 5.463 (95%, CI 5.205 to 5.728). The equation Lny1-y=-3.827+5.463x suggested carriers with a variant load over 16.15% would be symptomatic (Fig. 2C). In ROC analysis, the AUC of our model was 0.867 (95% CI 0.859 to 0.876), suggesting an excellent fit to the data (Fig. 2D).

For m.3243A>G, we calculated the general prior probability of being symptomatic as 0.54, 0 as − 1.696 (95% CI, − 1.806 to − 1.588), and 1 as 4.213 (95% CI, 4.027 to 4.402). However, we failed to determine a threshold for m.3243A>G using the equation Lny1-y=-1.696+4.213x. According to our formula, when variant load was extremely low (close to 0%), the probability of being symptomatic was still 15.50% (Fig. 2E). In ROC analysis, the AUC of the model was 0.761 (95%, CI 0.751 to 0.770), suggesting a suboptimal fit to the data (Fig. 2F). In conclusion, our models presented a better fit for m.8993T>G and m.8344A>G and allowed estimation of symptomatic thresholds.

Applying our models to predict the risk of disease transmission and required number of oocytes

To demonstrate the application of our models, we used them to predict the risk of disease transmission and the number of oocytes required for PGT using data published by Chinnery’s group [22]. Brown et al. studied the distribution of variant load across 82 individual primary oocytes from a patient carrying m.3243A>G [22]. This dataset was previously used for verifying the Kimura distribution in human oocytes [20]. Our prediction model presented an excellent fit to the data (Eqs. 37). The symptomatic threshold of m.3243A>G is generally considered between 15 and 30% [23]. As we were unable to determine a reliable threshold of m.3243A>G using our data, we set an empirical threshold of 15%. In this instance we determined p0 to be 12.64%, the mean heteroplasmy of the patient’s oocytes. This value is more reliable than using the variant load of patients’ blood or muscle [19], but the variant load of oocytes is difficult to measure in normal circumstances because the oocytes need to be retrieved surgically. We substituted these parameters into our distribution model (Eqs. 37) and calculated the patients’ cumulative probability of having a symptomatic child (p) as 65.53% (Fig. 3).

Fig. 3.

Fig. 3

The probability (y-axis) of a child having certain levels of m.3243A>G (x-axis) according to our prediction model. The left dashed line represents the symptomatic threshold of m.3243A>G (15%). The right dashed line represents offspring with 100% variant load (fixed). The cumulative probability between the two dashed lines is the risk of offspring being symptomatic

According to data we collected from our intracytoplasmic sperm injection and PGT patients, the proportion of all the oocytes retrieved during the oocyte collection procedure that can develop into a transplantable embryo (k) is 49.3% (Fig. S6). We substituted k and p into our model of oocyte retrieval and found that to obtain at least one embryo with variant load below the symptomatic threshold, the number of oocytes required for retrieval must be more than 14.28 (Eqs. 8 and 9). This indicates that a patient carrying the m.3243A>G variant at an average of 12.64% in oocytes needs to provide minimally 15 oocytes to obtain an embryo with sufficiently low variant load for implantation. Our model can similarly be applied using patient’s blood or muscle variant load.

Universal models to predict transmission risk and minimum oocyte number

As shown above, for common variants, we could take the advantage of deep pedigree data to build precise models for the prediction of symptomatic thresholds, risk of disease transmission, and necessary oocyte retrieval numbers. However, it is unrealistic to establish individual models for every rare variant with limited pedigree data. We therefore programmed two universal models for predicting the risk of disease transmission and requisite oocyte retrieval numbers that can be applied to any heteroplasmic mtDNA pathogenic variants, based on variant load in muscles and/or blood of patients. We set the universal threshold value s as 18% and the bottleneck parameter b as 0.66, as these values fit multiple datasets of different mtDNA variants and are applicable to most variants [21, 23, 24].

We modeled the data of three patients from our clinic with rare pathogenic mtDNA variants (P1: m.9185T>G; P2: m.3697G>A; P3: m.10191A>G) (detailed information in Fig. 4A). We predicted the probability of patients’ offspring inheriting different variant levels, risk of disease transmission without intervention, and the minimum oocytes needed. We determined (P1) probability of disease transmission is 61.08%, and the minimum number of oocytes necessary for PGT is 13; (P2) probability of disease transmission is 84.64%, and the minimum number of oocytes necessary for PGT is 37; (P3) probability of disease transmission is 28.55%, and the minimum number of oocytes necessary for PGT is 5 (Fig. 4B4D).

Fig. 4.

Fig. 4

Prediction of risk of disease transmission and minimum number of oocytes required for PGT using our universal prediction model. A Rare pathogenic mtDNA variants carried by patients and mean mtDNA variant load in blood and muscle. B The probability of a patient 1 having offspring with different variant levels. C The probability of a patient 2 having offspring with different variant loads. D The probability of a patient 3 having offspring with different variant loads. Dashed lines indicate the universal value of pathogenicity threshold (18%). E Assuming that every mtDNA variant follows the same role of transmitting pattern through the bottleneck, for patients with certain variant load (1–60%) of any mtDNA variant, risk of disease transmission, and minimum number of oocytes required for PGT were modeled

In addition, we calculated the risk of disease transmission and the minimum number of oocytes required for patients carrying any mtDNA variant across a wide range of variant levels (1–60%, Fig. 4E). For patients with over 60% variant load, data were not presented due to high likelihood of high variant load and thus low chance of obtaining a suitable embryo.

Discussion

For women carrying heteroplasmic mtDNA variants, it is difficult to predict the risk of having a symptomatic child due to a bottleneck in mtDNA inheritance and ambiguous symptomatic thresholds of variant load [11]. PGT enables the measurement of mtDNA variant load in early embryos and reduces the risk of disease transmission. In this study, we have established integrative models that allow us to predict the probability of being clinically symptomatic based on variant load in blood/muscle, and estimated the symptomatic thresholds for m.8993T>G and m.8344A>G. In addition, we have constructed universal models that allow us to predict the risk of disease transmission and the number of oocytes requisite for PGT. These models can be applied to generate standardized PGT guidelines.

Identifying oocytes with the lowest variant load is crucial to successfully conduct embryo transfer during PGT. Mitochondrial and fertility specialists must strike a balance between variant load and the quality of embryos/blastocysts, as well as the likelihood of successful pregnancy upon implantation. However, the cut-off point is unclear for most mtDNA variants, potentially increasing difficulties of genetic counseling and risking disease transmission [25]. Using the collective data of 455 individuals from 187 familial pedigrees, we established models for the three most common pathogenic mtDNA variants (m.3243A>G, m.8344A>G, and m.8993T>G) to predict the probability of a carrier showing symptoms based on muscle and/or blood heteroplasmy levels, and estimated the symptomatic thresholds of these variants.

For most pathogenic mtDNA variants, variant load in skeletal muscle and blood correlates with the severity of clinical presentations, m.8993T>G being one such example [8, 26, 27]. White et al. indicated that the symptomatic cut-off point of m.8993T>G is 60% [27]; however, this prediction can be further refined. A critical difference between our study and that of White et al. lies in the method of categorizing the clinical condition. White et al. categorized symptoms into “mild” and “severe” and did not include the asymptomatic carriers, who would likely carry low variant load. Therefore, the proportion of high variant load carriers in the cohort is higher than those in the wider population, compromising the estimation of the symptomatic threshold. This may result in an overestimation of the threshold. In contrast, we categorized carriers by “symptomatic” and “asymptomatic,” which improved the precision of the threshold prediction. Our results further define a symptomatic threshold and advance our understanding of phenotype development.

Reports have shown a correlation between clinical features and the variant load of m.8344A>G, but no stringent symptomatic threshold has been determined [17]. Chinnery et al. examined 55 symptomatic m.8344A>G patients, none of whom carried variant load below 50% [17], suggesting an ambiguous threshold of below 50%. van de Glind et al. reported an m.8344A>G carrier with variant load of 15%. The patients’ physical examination and laboratory evaluation were normal and they presented with only muscle weakness, raising questions regarding diagnosis [28]. Here we report a more precise symptomatic threshold for m.8344A>G of 16.15% for the first time.

For m.3243A>G, a threshold could not be estimated. Our data and other studies have shown that the m.3243A>G phenotype is not simply correlated to variant load. Recently, factors affecting the phenotypes of m.3243A>G variant have been reviewed and we concluded that mtDNA copy number, nuclear genetic factors, and sex difference also play roles in determining phenotypes among m.3243A>G patients [29]. It is worth noting that individuals might still show symptoms even if their variant load is extremely low (close to 0%) [8, 30]. For example, Damian et al. reported that a 66-year-old patient whose variant load was 5% showed mild neural deafness [31]. In addition, some symptoms such as diabetes, stroke, and myopathy can also derive from other common age-related diseases and a large number of such pedigrees have been reported [3234], which may confound analysis of disease phenotype. Currently, the threshold of m.3243A>G in clinical practice is considered between 15 and 30% [23, 35, 36]. Given the limitations of the reasons discussed above, precisely defining this threshold requires further investigation.

It is widely accepted that the segregation of mtDNA across oocytes is mostly determined by random genetic drift [22, 36, 37]. The Kimura distribution model of mtDNA inheritance, which can explain the variation in variant load across offspring, was established based on this theory [20, 24, 37]. Using a simplified Sewell-Wright formula, Kimura distribution, and symptomatic thresholds, a risk of disease transmission prediction model can be calculated using variant load of the mother and the variant variance of her oocytes/offspring. Using binomial distribution, the proportion of in vitro fertilized oocytes that can develop into a blastocyst, and the calculated risk of disease transmission, we constructed a model for predicting the minimum number of oocytes required to that ensure at least one embryo with variant load below the symptomatic threshold can be collected. Importantly, these models are less reliable if the calculation of the key parameter V (variance of mtDNA variant load across oocytes/offspring) is based on less than 20 samples [21], thus are unsuitable for rare variants. We therefore established universal models that can be applied to uncommon pathogenic variants, where only a patient’s variant load is needed for predictions. The model is constructed based on the assumption that each mtDNA variant follows the same transmission pattern through the bottleneck. We acknowledge that this is a big assumption. Indeed, different mtDNA variants do exhibit varying genetic bottlenecks and even individuals with the same mtDNA variant may possess varying genetic bottlenecks [15, 19]. In addition, our models do not incorporate age adjustment for k. With age there is a decrease in the proportion of in vitro fertilized oocytes that can develop into suitable embryo for transfer. Our models and the prediction table could serve as an initial reference guideline for providing genetic counseling to women harboring pathogenic mtDNA variants. Importantly, they need to be interpreted with caution. Mitochondrial and fertility experts may consult our guidelines and offer personalized advisory opinions, taking into account the specific characteristics of various mtDNA variants, to ensure accurate and reliable data for a given carrier of mtDNA variants.

In addition, we provide a table for clinicians to conveniently calculate risk of disease transmission and minimum number of oocytes required in such cases. For patients carrying a variant load exceeding 60%, data were excluded owing to the higher probability of harboring high variant load and a consequent lower likelihood of obtaining an appropriate embryo. For those carriers with high risk of transmitting mitochondrial disease, PGT should be assessed carefully. They may face increased time and financial burden, and require multiple PGT cycles to get a suitable embryo for transfer. It also highlights the crucial input of both mitochondrial and fertility experts when selecting an embryo for transfer. For those specialists, it is essential to set realistic expectations for mitochondrial disease patients and furnish them with the necessary information to make informed decisions about the most suitable reproductive option for them. In addition, our results also emphasize the significance of having early discussions on reproductive options for women carrying mtDNA variants. These constraints including the minimum number of oocytes may render many of these mtDNA variant carriers ineligible for PGT, necessitating consideration of alternative reproductive options such as mitochondrial donation. Mitochondrial donation can replace defective mitochondria (contain mutated mtDNA) with healthy mitochondria (contain normal mtDNA), which has been approved by UK in 2015 and Australia in 2022 [38, 39]. Recently, first baby using mitochondrial donation therapy in the UK has been born, with a safe and ethical manner [40]. Although these are still early stage for mitochondrial donation therapy, a bright future for such applications can be foreseen.

Our models will contribute to genetic counseling of women carrying pathogenic mtDNA variants, and further the development of PGT guidelines for clinicians. Importantly, any clinical judgment must be conducted based on adequate genetic counseling for different types of mtDNA variants and varying loads, where mitochondrial and fertility experts play a key role. At the NHS Newcastle Fertility Centre (UK), Dziadek et al. have established an innovative clinical service, providing a model pathway to ensure the provision of the best care and advice for mitochondrial patients [41]. In the future, we will enroll more mtDNA variant carriers to verify our models with further case studies. Recently, we successfully blocked germline transmission of pathogenic variants using PGT in China and the first baby has been born. Our data have been submitted to a peer reviewed journal and are going through scientific peer review. In addition, we have established the largest mitochondrial patient cohort in China and are trying to construct precise prediction models for different mtDNA variants using machine learning. With the accumulation of data, the improvement of modeling tools and the development of artificial intelligence, prediction models will become more precise and personalized.

Supplementary information

ESM 1 (1.1MB, pdf)

(PDF 1106 kb)

Acknowledgements

The authors thank Yuzhou Gao, Haoyu He, Yang Yang, Yuechen Wang, Pei Hong, Chongwu Xu, and Xuefeng Tian for collecting the data and helpful discussion.

Author contribution

Conceptualization, D.J., N.Z., and W.Z.; data curation, D.J., N.Z., W.Z., Z.Z, J.L.M., Z.L., and C.L.; formal analysis, D.J., N.Z., W.Z., Z.Z, J.L.M., Z.L., C.L., L.S., Y.L., and D.L.; funding acquisition, D.J., N.Z., T.S., Y.D., and Y.C.; investigation, D.J., N.Z., W.Z., Z.Z, J.L.M., Z.L., C.L., L.S., Y.L., and D.L.; methodology, D.J., N.Z., W.Z., T.S., Y.D., and Y.C.; project administration, T.S., Y.D., and Y.C.; resources, T.S., Y.D., and Y.C.; software, D.J., N.Z., and W.Z.; supervision, T.S., Y.D., and Y.C.; validation, D.J., N.Z., W.Z., T.S., Y.D., and Y.C.; visualization, D.J., N.Z., W.Z., L.S., Y.L., and D.L., writing—original draft, D.J., N.Z., and W.Z.; writing—review and editing, J.L.M., T.S., Y.D., and Y.C. All authors read and approved the final version of the manuscript.

Funding

The National Natural Science Foundation of China (Grant No: U20A20350 to Y.C., Grant No: 81971455 to D.J., Grant No: 82202043 to T.S., Grant No: 81871216 to Y.C., and Grant No: 31701162 to Y.D.), the National Key Research and Development Program of China (Grant No: 2021YFC2700901 to Y.D., D.J., W.Z., and D.L.), Shanghai Science and Technology Innovation Action Plan-Shanghai Sailing Program funded by Science and Technology Commission of Shanghai Municipality (Grant No: 22YF1431600 to T.S.), and the National College Students’ Innovation and Entrepreneurship Training Program (Grant No: 202210366004 to D.J. and N.Z.). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Data availability

All relevant data are within the manuscript and the supplementary materials. All raw data is publicly available at Mendeley Data: Ji, Dongmei; Zhang, Ning; Zou, Weiwei; Zhang, Zhikang; Marley, Jordan; Liu, Zhuoli; Liang, Chunmei; Shen, Lingchao; Liu, Yajing; Liang, Dan; Su, Tianhong; Du, Yinan; Cao, Yunxia (2022), “Data of Modeling-based prediction tools for preimplantation genetic testing of mitochondrial DNA diseases,” Mendeley Data, V1, doi: 10.17632/ynx4mmhbxr.1 (https://data.mendeley.com/datasets/ynx4mmhbxr).

Declarations

Ethics approval

The study was approved by the Ethics Committee of the First Affiliated Hospital of Anhui Medical University (PJ2020-08-15).

Consent to participate

Informed consent was obtained from all individual participants included in the study.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Dongmei Ji, Ning Zhang and Weiwei Zou contributed equally to this work.

Contributor Information

Tianhong Su, Email: sutianhong@renji.com.

Yinan Du, Email: duyinannan@126.com.

Yunxia Cao, Email: caoyunxia5972@ahmu.edu.cn.

References

  • 1.Gorman GS, Schaefer AM, Ng Y, Gomez N, Blakely EL, Alston CL, et al. Prevalence of nuclear and mitochondrial DNA mutations related to adult mitochondrial disease. Ann Neurol. 2015;77(5):753–759. doi: 10.1002/ana.24362. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Elliott HR, Samuels DC, Eden JA, Relton CL, Chinnery PF. Pathogenic mitochondrial DNA mutations are common in the general population. Am J Hum Genet. 2008;83(2):254–260. doi: 10.1016/j.ajhg.2008.07.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Gal A, Balicza P, Weaver D, Naghdi S, Joseph SK, Várnai P, et al. MSTO1 is a cytoplasmic pro-mitochondrial fusion protein, whose mutation induces myopathy and ataxia in humans. EMBO Mol Med. 2017;9(7):967–984. doi: 10.15252/emmm.201607058. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Alston CL, Rocha MC, Lax NZ, Turnbull DM, Taylor RW. The genetics and pathology of mitochondrial disease. J Pathol. 2017;241(2):236–250. doi: 10.1002/path.4809. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Lake NJ, Compton AG, Rahman S, Thorburn DR. Leigh syndrome: one disorder, more than 75 monogenic causes. Ann Neurol. 2016;79(2):190–203. doi: 10.1002/ana.24551. [DOI] [PubMed] [Google Scholar]
  • 6.Craven L, Tang M-X, Gorman GS, De Sutter P, Heindryckx B. Novel reproductive technologies to prevent mitochondrial disease. Hum Reprod Update. 2017;23(5):501–519. doi: 10.1093/humupd/dmx018. [DOI] [PubMed] [Google Scholar]
  • 7.Wilson IJ, Carling PJ, Alston CL, Floros VI, Pyle A, Hudson G, et al. Mitochondrial DNA sequence characteristics modulate the size of the genetic bottleneck. Hum Mol Genet. 2016;25(5):1031–1041. doi: 10.1093/hmg/ddv626. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Jacobs LJAM, de Wert G, Geraedts JPM, de Coo IFM, Smeets HJM. The transmission of OXPHOS disease and methods to prevent this. Hum Reprod Update. 2006;12(2):119–136. doi: 10.1093/humupd/dmi042. [DOI] [PubMed] [Google Scholar]
  • 9.Floros VI, Pyle A, Dietmann S, Wei W, Tang WCW, Irie N, et al. Segregation of mitochondrial DNA heteroplasmy through a developmental genetic bottleneck in human embryos. Nat Cell Biol. 2018;20(2):144–151. doi: 10.1038/s41556-017-0017-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Sallevelt SCEH, Dreesen JCFM, Drüsedau M, Spierts S, Coonen E, van Tienen FHJ, et al. Preimplantation genetic diagnosis in mitochondrial DNA disorders: challenge and success. J Med Genet. 2013;50(2):125–132. doi: 10.1136/jmedgenet-2012-101172. [DOI] [PubMed] [Google Scholar]
  • 11.Stewart JB, Chinnery PF. The dynamics of mitochondrial DNA heteroplasmy: implications for human health and disease. Nat Rev Genet. 2015;16(9):530–542. doi: 10.1038/nrg3966. [DOI] [PubMed] [Google Scholar]
  • 12.Sonney S, Leipzig J, Lott MT, Zhang S, Procaccio V, Wallace DC, et al. Predicting the pathogenicity of novel variants in mitochondrial tRNA with MitoTIP. PLoS Comput Biol. 2017;13(12):e1005867. doi: 10.1371/journal.pcbi.1005867. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Preste R, Vitale O, Clima R, Gasparre G, Attimonelli M. HmtVar: a new resource for human mitochondrial variations and pathogenicity data. Nucleic Acids Res. 2019;47(D1):D1202–D1D10. doi: 10.1093/nar/gky1024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Rajasimha HK, Chinnery PF, Samuels DC. Selection against pathogenic mtDNA mutations in a stem cell population leads to the loss of the 3243A-->G mutation in blood. Am J Hum Genet. 2008;82(2):333–343. doi: 10.1016/j.ajhg.2007.10.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Zhang H, Burr SP, Chinnery PF. The mitochondrial DNA genetic bottleneck: inheritance and beyond. Essays Biochem. 2018;62(3):225–234. doi: 10.1042/EBC20170096. [DOI] [PubMed] [Google Scholar]
  • 16.van den Ameele J, Li AYZ, Ma H, Chinnery PF. Mitochondrial heteroplasmy beyond the oocyte bottleneck. Semin Cell Dev Biol. 2020;97:156–166. doi: 10.1016/j.semcdb.2019.10.001. [DOI] [PubMed] [Google Scholar]
  • 17.Chinnery PF, Howell N, Lightowlers RN, Turnbull DM. Molecular pathology of MELAS and MERRF. The relationship between mutation load and clinical phenotypes. Brain. 1997;120(Pt 10):1713–1721. doi: 10.1093/brain/120.10.1713. [DOI] [PubMed] [Google Scholar]
  • 18.Rahman S, Poulton J, Marchington D, Suomalainen A. Decrease of 3243 A-->G mtDNA mutation from blood in MELAS syndrome: a longitudinal study. Am J Hum Genet. 2001;68(1):238–240. doi: 10.1086/316930. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Johnston IG. Varied mechanisms and models for the varying mitochondrial bottleneck. Front Cell Dev Biol. 2019;7:294. doi: 10.3389/fcell.2019.00294. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Wonnapinij P, Chinnery PF, Samuels DC. The distribution of mitochondrial DNA heteroplasmy due to random genetic drift. Am J Hum Genet. 2008;83(5):582–593. doi: 10.1016/j.ajhg.2008.10.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Wonnapinij P, Chinnery PF, Samuels DC. Previous estimates of mitochondrial DNA mutation level variance did not account for sampling error: comparing the mtDNA genetic bottleneck in mice and humans. Am J Hum Genet. 2010;86(4):540–550. doi: 10.1016/j.ajhg.2010.02.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Brown DT, Samuels DC, Michael EM, Turnbull DM, Chinnery PF. Random genetic drift determines the level of mutant mtDNA in human primary oocytes. Am J Hum Genet. 2001;68(2):533–536. doi: 10.1086/318190. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Hellebrekers DMEI, Wolfe R, Hendrickx ATM, de Coo IFM, de Die CE, Geraedts JPM, et al. PGD and heteroplasmic mitochondrial DNA point mutations: a systematic review estimating the chance of healthy offspring. Hum Reprod Update. 2012;18(4):341–349. doi: 10.1093/humupd/dms008. [DOI] [PubMed] [Google Scholar]
  • 24.Samuels DC, Wonnapinij P, Chinnery PF. Preventing the transmission of pathogenic mitochondrial DNA mutations: can we achieve long-term benefits from germ-line gene transfer? Hum Reprod. 2013;28(3):554–559. doi: 10.1093/humrep/des439. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Smeets HJM, Sallevelt SCEH, Dreesen JCFM, de Die-Smulders CEM, de Coo IFM. Preventing the transmission of mitochondrial DNA disorders using prenatal or preimplantation genetic diagnosis. Ann N Y Acad Sci. 2015;1350:29–36. doi: 10.1111/nyas.12866. [DOI] [PubMed] [Google Scholar]
  • 26.Dahl HH, Thorburn DR, White SL. Towards reliable prenatal diagnosis of mtDNA point mutations: studies of nt8993 mutations in oocytes, fetal tissues, children and adults. Hum Reprod. 2000;15(Suppl 2):246–255. doi: 10.1093/humrep/15.suppl_2.246. [DOI] [PubMed] [Google Scholar]
  • 27.White SL, Collins VR, Wolfe R, Cleary MA, Shanske S, DiMauro S, et al. Genetic counseling and prenatal diagnosis for the mitochondrial DNA mutations at nucleotide 8993. Am J Hum Genet. 1999;65(2):474–482. doi: 10.1086/302488. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.van de Glind G, de Vries M, Rodenburg R, Hol F, Smeitink J, Morava E. Resting muscle pain as the first clinical symptom in children carrying the MTTK A8344G mutation. Eur J Paediatr Neurol. 2007;11(4):243–246. doi: 10.1016/j.ejpn.2007.01.004. [DOI] [PubMed] [Google Scholar]
  • 29.Li D, Liang C, Zhang T, Marley JL, Zou W, Lian M, et al. Pathogenic mitochondrial DNA 3243A>G mutation: from genetics to phenotype. Front Genet. 2022;13:951185. doi: 10.3389/fgene.2022.951185. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Poulton J, Finsterer J, Yu-Wai-Man P. Genetic counselling for maternally inherited mitochondrial disorders. Mol Diagn Ther. 2017;21(4):419–429. doi: 10.1007/s40291-017-0279-7. [DOI] [PubMed] [Google Scholar]
  • 31.Damian MS, Seibel P, Schachenmayr W, Reichmann H, Dorndorf W. VACTERL with the mitochondrial np 3243 point mutation. Am J Med Genet. 1996;62(4):398–403. doi: 10.1002/(SICI)1096-8628(19960424)62:4<398::AID-AJMG13>3.0.CO;2-J. [DOI] [PubMed] [Google Scholar]
  • 32.Hammans SR, Sweeney MG, Hanna MG, Brockington M, Morgan-Hughes JA, Harding AE. The mitochondrial DNA transfer RNALeu(UUR) A-->G(3243) mutation. A clinical and genetic study. Brain. 1995;118(Pt 3):721–734. doi: 10.1093/brain/118.3.721. [DOI] [PubMed] [Google Scholar]
  • 33.Olsson C, Zethelius B, Lagerström-Fermér M, Asplund J, Berne C, Landegren U. Level of heteroplasmy for the mitochondrial mutation A3243G correlates with age at onset of diabetes and deafness. Hum Mutat. 1998;12(1):52–58. doi: 10.1002/(SICI)1098-1004(1998)12:1<52::AID-HUMU8>3.0.CO;2-K. [DOI] [PubMed] [Google Scholar]
  • 34.Ciafaloni E, Ricci E, Shanske S, Moraes CT, Silvestri G, Hirano M, et al. MELAS: clinical features, biochemistry, and molecular genetics. Ann Neurol. 1992;31(4):391–398. doi: 10.1002/ana.410310408. [DOI] [PubMed] [Google Scholar]
  • 35.Pickett SJ, Blain A, Ng YS, Wilson IJ, Taylor RW, McFarland R, et al. Mitochondrial donation - which women could benefit? N Engl J Med. 2019;380(20):1971–1972. doi: 10.1056/NEJMc1808565. [DOI] [PubMed] [Google Scholar]
  • 36.Monnot S, Gigarel N, Samuels DC, Burlet P, Hesters L, Frydman N, et al. Segregation of mtDNA throughout human embryofetal development: m.3243A>G as a model system. Hum Mutat. 2011;32(1):116–125. doi: 10.1002/humu.21417. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Otten ABC, Sallevelt SCEH, Carling PJ, Dreesen JCFM, Drüsedau M, Spierts S, et al. Mutation-specific effects in germline transmission of pathogenic mtDNA variants. Hum Reprod. 2018;33(7):1331–1341. doi: 10.1093/humrep/dey114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Dyer C. UK is set to allow mitochondrial donation after MPs vote in favour. BMJ. 2015;350:h657. doi: 10.1136/bmj.h657. [DOI] [PubMed] [Google Scholar]
  • 39.Dziadek MA, Sue CM. Mitochondrial donation: is Australia ready? Med J Aust. 2022;216(3):118–121. doi: 10.5694/mja2.51309. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Wise J. First baby born in the UK using mitochondrial donation therapy. BMJ. 2023;381:1091. doi: 10.1136/bmj.p1091. [DOI] [PubMed] [Google Scholar]
  • 41.Gorman GS, McFarland R, Stewart J, Feeney C, Turnbull DM. Mitochondrial donation: from test tube to clinic. Lancet. 2018;392(10154):1191–1192. doi: 10.1016/S0140-6736(18)31868-3. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

ESM 1 (1.1MB, pdf)

(PDF 1106 kb)

Data Availability Statement

All relevant data are within the manuscript and the supplementary materials. All raw data is publicly available at Mendeley Data: Ji, Dongmei; Zhang, Ning; Zou, Weiwei; Zhang, Zhikang; Marley, Jordan; Liu, Zhuoli; Liang, Chunmei; Shen, Lingchao; Liu, Yajing; Liang, Dan; Su, Tianhong; Du, Yinan; Cao, Yunxia (2022), “Data of Modeling-based prediction tools for preimplantation genetic testing of mitochondrial DNA diseases,” Mendeley Data, V1, doi: 10.17632/ynx4mmhbxr.1 (https://data.mendeley.com/datasets/ynx4mmhbxr).


Articles from Journal of Assisted Reproduction and Genetics are provided here courtesy of Springer Science+Business Media, LLC

RESOURCES