Abstract
Controlled ovarian stimulation is tailored to the patient based on clinical parameters but estimating the number of retrieved metaphase II (MII) oocytes is a challenge. Here, we have developed a model that takes advantage of the patient’s genetic and clinical characteristics simultaneously for predicting the stimulation outcome. Sequence variants in reproduction-related genes identified by next-generation sequencing were matched to groups of various MII oocyte counts using ranking, correspondence analysis, and self-organizing map methods. The gradient boosting machine technique was used to train models on a clinical dataset of 8,574 or a clinical-genetic dataset of 516 ovarian stimulations. The clinical-genetic model predicted the number of MII oocytes better than that based on clinical data. Anti-Müllerian hormone level and antral follicle count were the two most important predictors while a genetic feature consisting of sequence variants in the GDF9, LHCGR, FSHB, ESR1, and ESR2 genes was the third. The combined contribution of genetic features important for the prediction was over one-third of that revealed for anti-Müllerian hormone. Predictions of our clinical-genetic model accurately matched individuals’ actual outcomes preventing over- or underestimation. The genetic data upgrades the personalized prediction of ovarian stimulation outcomes, thus improving the in vitro fertilization procedure.
Author summary
Infertility is a condition that leads to the failure of natural conception. It affects more than 186 million people worldwide. Because in vitro fertilization (IVF) is an effective infertility treatment, optimizing the steps in the process is essential to best assist those trying to conceive. The IVF process begins with ovarian stimulation during which the woman takes ovary-stimulating hormones, i.e., gonadotropins, to produce a certain number of viable, fertilization-ready egg cells, but predicting the number of egg cells collected after such stimulation is difficult. These predictions are usually based on the patient’s clinical parameters and depend on the physician’s experience, making them highly subjective. Here, we used machine learning to identify features that physicians could adopt to predict the number of egg cells obtained after ovarian stimulation. We found that clinical parameters (anti-Müllerian hormone level and antral follicle count), as well as genetic characteristics (variants in reproduction-related genes—GDF9, LHCGR, FSHB, ESR1, and ESR2), are important features that increase the accuracy of such predictions. Our predictive model has been designed to help physicians tailor the protocols used for ovarian stimulation on a case-by-case basis, thereby improving the safety and efficiency of the IVF process.
Introduction
Infertility affects over 186 million people worldwide and results from a combination of genetic, environmental, and endocrine factors [1]. Clinical parameters used to evaluate the cause of female infertility include endocrine indicators such as follicle-stimulating hormone (FSH), estradiol (E2), inhibin-B, and anti-Müllerian hormone (AMH), as well as ovarian parameters such as ovarian volume, ovarian vascularity indices, or ovarian reserve, and genetic factors, such as karyotype abnormalities or primary ciliary dyskinesia [2–5].
One of the most effective infertility treatments is in vitro fertilization (IVF) and optimization of this technology is essential to support people who cannot conceive naturally. Many male and female factors, including genetic factors, may affect the success of IVF, so reliably predicting its outcome is a challenge. Predictions often depend on the experience of a clinician which makes them highly subjective. Therefore, clinical parameters such as the woman’s age, body mass index (BMI), cause and duration of infertility, and markers of the ovarian reserve such as AMH, FSH, and antral follicle count (AFC) are taken into account in decision-making about IVF treatment [6]. A comprehensive determination of the relationship between these interacting features and the outcome of ovarian stimulation is necessary. Artificial intelligence, machine learning, and deep learning were implemented to develop models used to classify ovarian response [7], select embryos [8], and predict the outcome of embryo implantation or the chances of pregnancy [9]. Fed with patient parameters and IVF cycle-specific variables, predictive models are proving to offer solutions for patients at various stages of the IVF process with significant confidence.
The ovarian response to controlled gonadotropin stimulation and the retrieval of multiple, high-quality oocytes is critical for successful embryo formation, selection, and transfer during the IVF process [10]. The number of oocytes retrieved can increase the probability of obtaining a live birth from an embryo transfer [11]. The dose of gonadotropin and the pool of recruitable follicles are the variables that most affect the number of oocytes collected. A very low dose of gonadotropin is associated with poor ovarian response; raising the dose boosts the number of growing follicles and the oocyte yield, however, very high doses of gonadotropin in women with normal ovarian reserve increase the risk of hyper-response [12]. It seems tempting, therefore, to focus on the woman’s clinical and genetic characteristics to adjust the type or dose of drugs used in ovarian stimulation to obtain an optimal number of oocytes.
The objective of this study was to identify potent predictors of ovarian response to gonadotropin stimulation by assessing the effect of clinical characteristics, cycle-specific parameters, and reproduction-related gene sequence variants on the number of metaphase II (MII) oocytes retrieved. We also aimed to develop and validate a machine-learning model using clinical and genetic data to predict this measure in a cohort of gonadotropin-stimulated patients and oocyte donors.
Materials and methods
Ethics statement
The study was conducted according to the guidelines of the Declaration of Helsinki and approved by the Ethics Committee of the Regional Medical Chamber of Gdańsk (protocol code KB-23/20, date of approval 27th October 2020). Written informed consent was obtained from all individuals involved in the study. All individual-level data, including clinical data, were de-identified.
Study population
Data for the study was collected between November 2014 and February 2021 at INVICTA Fertility Clinics (Bydgoszcz, Gdańsk, Gdynia, Słupsk, Warszawa, Wrocław; Poland). The study population consisted of 6,043 women (9,090 IVF processes) diagnosed with infertility undergoing controlled ovarian stimulation with menotropin (Menopur; Ferring GmbH, Kiel, Germany), follitropin delta (Rekovelle; Ferring GmbH, Kiel, Germany), or follitropin alfa (Gonal F; Merck Serono S.p.A., Modugno, Italy). Exclusion criteria were: stimulation protocols with other gonadotropins, AMH levels above 15 ng/ml, or undetected.
DNA extraction
Genomic DNA was isolated from whole blood or urogenital swabs using the MagNA Pure 96 IVD instrument and the MagNA Pure 96 DNA and Viral NA Small Volume Kit (Roche, Basel, Switzerland). DNA concentration was quantified with the Qubit 2.0 fluorometer (Life Technologies, Rockville, MD, USA) and Qubit dsDNA HS Assay Kit (ThermoFisher Scientific, Waltham, MA, USA). Isolates were stored at −80°C.
Genotyping
Sequence variants were identified by next-generation sequencing (NGS). Targeted libraries were prepared from 10 ng of gDNA using the Ion AmpliSeq Library Kit 2.0 (ThermoFisher Scientific, Waltham, MA, USA) and Ion AmpliSeq Made-to-Order panel (primer set version 7.05 based on the hg38 reference genome including 325 amplicons in three primer pools, covering approximately 73,000 bases in 121 exons with 100-bp exon padding of 14 genes of interest). Sequencing was conducted using the Ion Personal Genome Machine (PGM) (Life Technologies, Carlsbad, CA, USA) and Ion PGM Hi-Q View Sequencing Kit (ThermoFisher Scientific, Waltham, MA, USA). Data were analyzed with Ion Torrent Suite Server software version 5.12.2 (ThermoFisher Scientific, Waltham, MA, USA). Technical details are provided in S1 Appendix.
Machine learning for a predictive model
A machine learning model was based on the gradient boosting machine (GBM) technique [13] implemented by the LightGBM framework [14]. The GBM has a built-in ability to handle missing data and in this study, data from the previous stimulation were not available for the first-time-stimulated patients. To keep all observations in the modeling, we decided to use GBM. As GBM parameters, 100 decision trees with five leaves and a maximum depth of 16 were selected. The training was performed with l2 loss function and gradient-boosted decision trees (GBDT) using the k-fold cross-validation method with five folds.
Error metrics used to validate model performance included the root mean square error (RMSE), the mean absolute error (MAE), and the mean absolute percentage error (MAPE) (defined in S2 Appendix). The SHapley Additive exPlanations (SHAP) library [15] was used to explain model predictions.
Statistical tests
The Kolmogorov–Smirnov and the Mann–Whitney U tests were used to determine differences in the distribution of the number of MII oocytes among patients carrying the reference or alternative allele of sequence variants (hypotheses defined in S2 Appendix). The Pearson correlation coefficient (r), along with the two-sided p-value, was determined to investigate the linear association between clinical variables and the number of MII oocytes. The significance level was set to α = 0.05.
Ranking methods
Boruta [16] with the random forests (RF) algorithm [17] and Boruta-SHAP [18] with the GBM algorithm were used to rank sequence variants. Each sequence variant was considered a separate feature with the alternative allele denoted as 1; the number of MII oocytes was considered a target feature. A rating was assigned to variants according to their importance in predicting the number of MII oocytes.
Correspondence analysis
Correspondence analysis (CA) [19] was used to find sequence variants characteristic of patient groups. Inertia covered I ∈ [0,1] was used to assess the goodness of the model’s representation of the dataset, where 1 is a perfect representation. The identified genetic feature was defined as:
(1) |
Generation of a self-organizing map
A self-organizing map (SOM) [20] was applied to detect the most frequent sequence variants in patient groups. The model was trained with the following parameters: size 6×6 neurons, hexagonal topology, Gaussian neighboring function, sigma = 1.5, lr = 0.7, 100,000 train iterations, and Manhattan distance metric. The best matching unit (BMU), defined as a neuron with the lowest distance to observation was assigned for each IVF process. The quantization error (QE) metric (defined in S2 Appendix) was used to validate SOM performance. The identified genetic feature was defined as:
(2) |
A standardized version of the feature was used for training:
(3) |
Haplotype construction
Each chromosome was analyzed separately using Haploview software [21] with confidence intervals, the four gamete rule, and solid spine of linkage disequilibrium (LD) as block generation algorithms. Uniform Manifold Approximation and Projection (UMAP) [22] was used to reduce the number of variants in haplotypes. The K-means algorithm was used to cluster the observations and separate groups of similar patients. Observations were classified based on the genetic data using a decision tree [23]. Variants in the decision nodes between the root node and leaves created reduced haplotype defined as:
(4) |
Results
A preliminary predictive model based on clinical data
The study population was divided into two groups according to the availability of genetic data: Group 1, without genetic data, included 5,779 patients and 8,574 IVF processes; Group 2, with genetic data, included 264 patients and 516 IVF processes. All women were between 18 and 46 years old, had regular 26–32-day menstrual cycles, were undergoing their first or second IVF cycle, and exhibited no signs of androgenicity, endometriosis, or any chronic diseases. Baseline clinical characteristics are presented in Table 1. Clinical data, hormonal test results, data on previous IVF processes and basic characteristics of women in Group 1 (S1 Table) were used to construct the preliminary model to determine the most important features for predicting the number of MII oocytes retrieved after ovarian stimulation. The RMSE metric calculated for the trivial model, defined as the expected number of MII oocytes equal to the population average, was 4.75 oocytes (the model predicts the number of MII oocytes ± 2.375 oocytes); it was considered a benchmark value. Features were then added to the dataset in the iterative process and a model was trained in each iteration. If the model’s RMSE was reduced by adding a new feature, the feature was considered important for the prediction (Fig 1A). The final set of clinical features selected using this approach consisted of AMH, AFC on the first day of stimulation (follicles ≤10 mm), age, number of MII oocytes in the previous pick-up, number of cumulus-denuded oocytes in the previous pick-up, and polycystic ovary syndrome (PCOS).
Table 1. Baseline clinical characteristics of study groups.
Characteristic | Group 1 without genetic data Mean ± SD | Group 2 with genetic data Mean ± SD |
---|---|---|
No. of women | 5,779 | 264 |
No. of IVF processes | 8,574 | 516 |
Woman’s age (years) | 34.51 ± 4.54 | 34.55 ± 4.20 |
BMI (kg/m2) | 23.42 ± 4.62 | 23.15 ± 4.11 |
Stimulation days | 8.79 ± 4.36 | 9.77 ± 3.10 |
FSH (mIU/mL) | 6.46 ± 2.83 | 7.25 ± 3.53 |
LH (mIU/mL) | 6.23 ± 4.63 | 6.10 ± 4.63 |
E2 (pg/mL) | 18.62 ± 18.50 | 16.62 ± 16.73 |
AMH (ng/mL)a | 2.98 ± 2.42 | 3.38 ± 3.67 |
No. of follicles (≤ 10 mm) on the first day of stimulation | 14.63 ± 8.99 | 13.41 ± 10.24 |
No. of cumulus-denuded oocytes retrieved in previous stimulation | 8.26 ± 5.26 | 7.55 ± 6.15 |
No. of MII oocytes retrieved in previous stimulation | 6.17 ± 4.10 | 5.85 ± 4.59 |
No. of cumulus-denuded oocytes retrieved | 9.14 ± 5.82 | 7.67 ± 6.08 |
No. of MII oocytes retrieved | 6.96 ± 4.64 | 5.92 ± 4.75 |
Cumulative gonadotropin dose (IU) | 2,109.76 ± 826.62 | 2,347.36 ± 795.42 |
a) observations with an AMH level above 15ng/mL were removed from the database.
Based on these features, two versions of a machine learning model predicting the number of MII oocytes were trained. The first model was trained on the Group 1 dataset and achieved the precision metrics RMSE = 3.80, MAE = 2.85, and MAPE = 0.65. The second model was trained on Group 2 dataset using only selected clinical features (the model is referred to as the clinical model). To assess the impact of genetic data, we compared error metrics for models trained on the same group of patients (Group 2 without genetic data vs. Group 2 with genetic data).
The clinical model achieved better precision metrics (RMSE = 3.53, MAE = 2.58, and MAPE = 0.71) compared to the benchmark value. Feature importance ranking and the effect on prediction for the clinical model are presented in Fig 1B and 1C. The AMH is the most important feature for the prediction and its higher levels result in a higher number of MII oocytes, although the variance is still high. The second most important predictor is the AFC on the first day of stimulation. The third, although much less important feature, is the number of MII oocytes retrieved in the previous stimulation. Although the patient’s age is a less important feature, we observed that the AMH level decreases with age (r = −0.17, p < 0.01) but the rate differs between patients. The number of MII oocytes correlates with AMH (r = 0.64, p < 0.01) better than with age (r = −0.18, p < 0.01), and therefore the model prefers to follow correlations with AMH.
Sequence variants important for predicting the number of MII oocytes
To improve the prediction, we decided to include data on sequence variants in reproduction-related genes in our analysis. Fourteen genes were selected such as AMH, AMHR2, AR, BMP15, ESR1, ESR2, FSHB, FSHR, GDF9, LHB, LHCGR, PRL, PRLR, and SOX3, annotating to the Gene Ontology (GO) categories such as reproductive structure development, hormone-mediated signaling pathway, ovulation cycle process, signaling receptor binding or signaling receptor regulator activity (S2 Table).
A total of 544 sequence variants were identified in Group 2 by genotyping (S3 Table). Our approach to finding variants important for predicting the number of MII oocytes consisted of pre-selection with statistical tests or ranking methods, and identification of haplotypes or variant combinations important for the prediction followed by checking the performance of a model trained with clinical and genetic data (Fig 2A).
Kolmogorov–Smirnov and Mann–Whitney U tests revealed eight sequence variants that result in a statistically higher number of MII oocytes and 18 sequence variants that result in a statistically lower number of MII oocytes (S4 and S5 Tables). However, when individually included in the modeling dataset, these variants showed little or no improvement in the model’s error metrics. Next, the pre-selection of variants important for the prediction was performed with Boruta and Boruta-SHAP ranking methods on a dataset of all variants; the number of MII oocytes was a target feature. The Boruta was also applied to a dataset of variants and AMH levels. Variants were ranked according to their importance for the prediction with the final ranking created by re-sorting the combined score. The top 20 variants were selected for further analysis (S6 Table).
Combinations of sequence variants important for predicting the number of MII oocytes
The sequence variants selected by ranking methods were used to search for combinations of variants associated with high numbers of MII oocytes. According to the number of MII oocytes, IVF processes were categorized as MII <2, (2,4], (4,7], (7,11], and 11< (groups contained 144, 101, 122, 85, and 64 IVF processes, respectively). Correspondence analysis and self-organizing map algorithms were used to determine the combinations characteristic of patients with the lowest and highest oocyte numbers (<2 and >11 MII groups).
In correspondence analysis, the more the sequence variant is characteristic of a given MII group, the closer to the group it is located in the graph (S1 Fig). Based on this analysis, we defined a new genetic feature, referred to as IV-CA, consisting of three alternative variants in LHCGR rs11887058, PRLR rs112461, and ESR1 rs2207396, located closest to the group with above 11 MII oocytes retrieved (Table 2).
Table 2. Genetic features identified as important for the prediction of the number of MII oocytes after ovarian stimulation.
Genetic feature | Methoda | Gene | dbSNP IDb | Sequence variant | Allele frequencyc |
---|---|---|---|---|---|
IV-CA | CA | LHCGR | rs11887058 | NC_000002.12:g.48729336C>T | T = 0.428 |
PRLR | rs112461 | NC_000005.10:g.35063190A>T | T = 0.242 | ||
ESR1 | rs2207396 | NC_000006.12:g.152061247G>A | A = 0.356 | ||
IV8-6 | SOM | GDF9 | rs11739194 | NC_000005.10:g.132865538T>C | C = 0.515 |
rs17166294 | NC_000005.10:g.132866205T>C | C = 0.288 | |||
LHCGR | rs11887058 | NC_000002.12:g.48729336C>T | T = 0.428 | ||
FSHB | rs676349 | NC_000011.10:g.30234435A>G | G = 0.523 | ||
ESR1 | rs2273207 | NC_000006.12:g.152061190A>G | G = 0.117 | ||
ESR2 | rs928554 | NC_000014.9:g.64227477C>T | T = 0.731 | ||
IV22-2 | Haplotype | FSHR | rs80111020 | NC_000002.12:g.48962060 = | A = 0.909 |
rs6166 | NC_000002.12:g.48962782C>T | T = 0.701 | |||
IV41-8 | Haplotype | PRLR | rs387032 | NC_000005.10:g.35061629 = | T = 0.845 |
rs401694 | NC_000005.10:g.35062516 = | C = 0.765 | |||
rs112461 | NC_000005.10:g.35063190 = | A = 0.758 | |||
rs1057828 | NC_000005.10:g.35064413 = | C = 0.939 | |||
rs56251626 | NC_000005.10:g.35064922 = | C = 0.936 | |||
rs62355478 | NC_000005.10:g.35065548 = | C = 0.992 | |||
rs78373811 | NC_000005.10:g.35068146 = | G = 0.913 | |||
rs186609463 | NC_000005.10:g.35069864 = | G = 0.985 | |||
IV16-3 | Haplotype | GDF9 | rs75061517 | NC_000005.10:g.132866082 = | T = 0.689 |
rs17166294 | NC_000005.10:g.132866205 = | T = 0.712 | |||
rs30177 | NC_000005.10:g.132866719C>G | G = 0.924 |
a—features were determined using correspondence analysis (CA), self-organizing map (SOM) algorithm, and haplotype analysis followed by a reduction of the number of variants
b—dbSNP Reference SNP number according to the National Center for Biotechnology Information (NCBI) dbSNP database (https://www.ncbi.nlm.nih.gov/snp/, accessed: 2022-04-05)
c—allele frequencies were calculated for Group 2 consisting of 264 women.
The SOM algorithm detected the most frequent sequence variants in neurons, where the given MII group was over-represented (S2 Fig). The 10 most characteristic variants, with the biggest percentage difference between the frequency of alternative variants in a node and the percentage share in the entire population, were found for GDF9 rs11739194, ESR2 rs928554, ESR1 rs2077647, GDF9 rs17166294, FSHB rs676349, ESR1 rs2207396, LHCGR rs62137532, ESR1 rs2273206, LHCGR rs11887058, and ESR1 rs2273207. Of all variant combinations tested in terms of the effect on the predictive model, a set of six variants reduced the model’s RMSE metric and comprised one genetic feature important for predicting the number of MII oocytes, further referred to as IV8-6 (Table 2).
The IV-CA and IV8-6 features are non-binary: feature values for modeling are determined from the number of alternative alleles among the variants comprising the feature.
Haplotypes important for predicting the number of MII oocytes
We decided to test whether the sequence variants identified by genotyping could be arranged into haplotypes relevant for predicting the number of MII oocytes. Each of the 311 potential haplotypes generated in Haploview software was analyzed for the effect on the RMSE metric of the predictive model. Three haplotypes, generated by the 4 Gamete Rule and consisting of variants in the FSHR, PRLR, and GDF9 genes (S7 Table) showed the greatest impact on the prediction. Identified haplotypes increase the number of MII oocytes retrieved.
Haplotypes were composed of a relatively large number of variants, thus, we used the UMAP grouping algorithm and a decision tree to identify a subset of variants in each haplotype, which is sufficient to distinguish patient groups (S3 Fig). Reduced haplotypes, referred to as genetic features IV22-2, IV41-8, and IV16-3 (Table 2) occurred in the study population with a frequency of 0.58, 0.65, and 0.17, respectively. Haplotypes are considered binary features: if any detected variant is different from the variants comprising the haplotype, the feature value for modeling is set to 0.
A predictive model based on clinical data and genetic features
To validate the contribution of genetic features in predicting the number of MII oocytes, a total of 127 combinations of genetic features IV-CA, IV8-6, IV22-2, IV41-8, and IV16-3 were used in addition to clinical data in an iterative process. The addition of any genetic feature decreased the model’s RMSE metrics compared to the clinical model (S8 Table). The best combination of genetic features included the SOM-identified IV8-6 feature and haplotypes IV41-8 and IV22-2, for which the RMSE metric was reduced by 0.18 oocytes, which corresponds to 5% of the error metric.
The final clinical-genetic predictive model, trained on the dataset of selected clinical and genetic features, achieved lower error metrics (RMSE = 3.35, MAE = 2.48, and MAPE = 0.68) compared to the clinical model. Still, the AMH was the first, and AFC on the first day of stimulation was the second most important feature for the prediction (Fig 2B). The combined contribution of all three genetic features appeared to be over one-third of that revealed for AMH. The IV8-6 feature ranked third in terms of importance with the impact on prediction close to that of AFC. The two haplotypes had a lower impact on the prediction (Fig 2B), but each haplotype increased the number of MII oocytes (Fig 2C). Interestingly, we found a correlation between the genetic feature IV8-6 and the AMH level; with an increased AMH, the effect of the feature on the number of MII oocytes is more apparent (S4 Fig). The highest values in the IV8-6 feature decrease the predicted number of MII oocytes while feature values around 0 tend to increase the predictions (especially for patients with high AMH). The lowest IV8-6 values do not affect the prediction.
Interpretation of personalized predictions
Patient 1 is a 37-year-old woman with AMH = 1.27 ng/ml, AFC (below 10 mm) on the first day of the stimulation = 9, with no PCOS and no data on previous stimulation results. Haplotype IV22-2 was present while haplotype IV41-8 was absent. Four sequence variants had alternative alleles for the IV8-6 feature—GDF9 rs11739194 (T>C), LHCGR rs11887058 (C>T), FSHB rs676349 (A>G), ESR2 rs928554 (C>T). The patient grew three MII oocytes after stimulation while our clinical-genetic model predicted 3.75 MII oocytes (Fig 3A). A relatively low AMH lowered the prediction by 1.27. The next two most impactful features were IV8-6 and IV41-8, which lowered the prediction by 0.7 and 0.22, respectively. The AFC and the haplotype IV22-2 increased the prediction by 0.21 and 0.13, respectively. Age had a very low effect compared to other features. The prediction was higher by 0.75 compared to the actual number of MII oocytes retrieved. The genetic data had a cumulative impact of −0.79, therefore, the prediction would be twice as inaccurate without this information.
Patient 2 is a 34-year-old woman with AMH = 13.78 ng/ml, with no PCOS, and no data on AFC (below 10 mm) on the first day of the stimulation nor previous stimulation results. Haplotypes IV22-2 and IV41-8 were present. Two of the sequence variants had alternative alleles for feature IV8-6—LHCGR rs11887058 (C>T) and ESR2 rs928554 (C>T). The patient grew 13 MII oocytes after stimulation while our clinical-genetic model predicted 13.26 MII oocytes (Fig 3B). A relatively high AMH increased the prediction by 5.57. The next two most impactful features were IV8-6 and IV41-8, which increased the prediction by 1.25 and 0.31, respectively. Age and haplotype IV22-2 increased the prediction by 0.19 and 0.14, respectively. The prediction was higher by 0.26 than the actual number of MII oocytes retrieved. Genetic data had a cumulative impact of 1.7, therefore, the prediction would be an underestimation without this information.
Discussion
Here, we identified sequence variants with a significant impact on the number of MII oocytes retrieved after ovarian stimulation and improved the precision of a model for predicting this measure by including data on genetic variation in reproduction-related genes.
The risk of poor or excessive ovarian response after controlled stimulation remains a serious problem in reproductive medicine. Although the poor response (< 4 oocytes retrieved) occurs in 5 to 35% of women [24], higher doses of gonadotropins to retrieve more oocytes can lead to ovarian hyperstimulation syndrome [25]. Predictors providing a more accurate estimate of a woman’s ovarian response are being sought to optimize individual stimulation protocol before a given IVF cycle [10]. Since the low number of MII oocytes reduces the rates of fertilization, embryo development, pregnancy, and live birth [26], we chose this parameter as an outcome measure of ovarian stimulation.
AFC and AMH are used to predict ovarian response to gonadotropin stimulation most often [6]. Both correlate strongly with each other and the number of oocytes retrieved [27] but a discordance between AFC and AMH is observed in one in five patients [28] making the prediction less accurate. The woman’s age, levels of FSH and AMH, AFC, gonadotropin dose, or type of stimulation protocol increase the accuracy of personalized predictions [29]. According to our study, the number of MII oocytes in the previous pick-up, and the number of cumulus-denuded oocytes in the previous pick-up are also strong predictors of the ovarian stimulation outcome. Combining these features in a single model ensures its high accuracy. Although studies have suggested that PCOS also affects the number of retrieved oocytes [30], our study did not identify this condition as relevant for prediction. This discrepancy may be attributed to the low percentage of patients diagnosed with PCOS in our study population. PCOS also tends to correlate more strongly with the number of total oocytes than the mature MII oocytes retrieved [30] and here we chose the latter as the target feature.
Genetic variation makes a noteworthy contribution to ovarian response. Single nucleotide polymorphisms have been identified in genes with key roles in oogenesis, folliculogenesis, and female reproduction, such as in estrogen receptor 2 (ESR2), follicle-stimulating hormone receptor (FSHR), FSH β-chain (FSHB), luteinizing hormone β-chain (LHB), LH/choriogonadotropin receptor (LHCGR), growth differentiation factor-9 (GDF9), anti-Müllerian hormone (AMH), and AMH type II receptor (AMHR2) genes [31–34]. However, the usefulness of sequence variants in clinical practice is limited due to inconclusive results from population-based studies [31,35]. Here, we identified several variants in genes such as ESR1, ESR2, FSHB, FSHR, GDF9, LHCGR, and PRLR correlating with the number of MII oocytes retrieved after ovarian stimulation; adding these variants to our model improved its predictive potential. Important variants were mainly found in genes encoding hormone receptors, for which the association with response to gonadotropins or oocyte maturation has already been confirmed in animal models. For example, mutations found in the LHCGR-encoding gene correlate with the empty follicle syndrome resulting in no oocytes retrieved during IVF [36]. Estrogen receptors—Erα and Erβ (encoded by ESR1 and ESR2 genes, respectively), are required for controlling oocyte meiotic resumption [37]. Mice lacking either FSHR or LHCGR gonadotropin receptors show impaired ovarian follicle growth and antrum formation and fail to develop preovulatory follicles (reviewed in [38]). The benefit of prolactin on bovine oocyte developmental capacity was shown to be mediated by cumulus cells containing prolactin receptors [39].
Simultaneous analysis of clinical data and sequence variants in a machine learning model may be weakened by the overlapping contribution of information from genetic and clinical data since the latter is derived from the former; at the same time, clinical data shows a stronger correlation with the observed phenotype. Indeed, the sequence variants directly correlating with the number of MII oocytes did not increase the model’s effectiveness. Rather than creating a model based on single variants, our approach focused on combinations of variants that were implemented in the model as genetic features and improved the prediction metrics compared to previous studies [40]. Interestingly, the average effect of genetic features on the prediction is higher than the effect of IVF protocol choice or PCOS presence. Genetic status is even more advantageous for the IV8-6 genetic feature, in comparison to stimulation data available from previous IVF cycles. Unlike the singly considered sequence variants, the genetic features showed a high occurrence in the population—haplotypes IV22-2 and IV41-8 were found in 58% and 65% of patients, respectively, and an average of 2.6 variants were found within the IV8-6 feature. As a result, many patients could benefit from the application of our clinical-genetic model.
The strength of our study lies in the number of methods used to select sequence variants associated with the number of MII oocytes and the use of multi-variant genetic features instead of single variants for modeling which increased the predictive potential of genetic data. In contrast to others [10,40], our approach was also strengthened by including data on retrospective stimulations. However, our study is limited by the lack of data on expression levels or structure-function predictions for variant proteins. A prospective clinical study on a large group of participants will also be needed to determine the concordance between our model’s predictions and actual stimulation results and verify the performance of our model in real-life.
In conclusion, genetic features increase the accuracy of predicting the number of MII oocytes retrieved after ovarian stimulation. The encouragement of a clinical-genetic model utilization will be realized by, firstly, incorporation in existing genetic test packages in flat-rate treatment plans without increasing prices, and secondly, by propagation to patients of the information of substantive add-on value enriching the standards of clinical procedures. When applied to clinical practice, we believe our model will improve personalized counseling and facilitate decision-making regarding the setting of gonadotropin doses to improve the safety and efficiency of stimulation protocols for IVF.
Supporting information
Data Availability
The raw sequencing data generated for this study has been deposited in the European Nucleotide Archive (ENA) at EMBL-EBI under accession number PRJEB54062 (https://www.ebi.ac.uk/ena/browser/view/PRJEB54062). The source code is available at the GitHub repository (https://github.com/CBR-Invicta/GPS)—notebooks contain results and plots generated using the dataset. INVICTA Fertility Clinics do not allow public disclosure of patient data used in this study. In case of additional questions, please contact the INVICTA Research and Development Center (cbr@invicta.pl).
Funding Statement
The research was co-financed by the European Regional Development Fund under the Pomorskie Voivodeship Regional Operational Programme for 2014-2020 as part of the project: The Development and Implementation of a New Method for Diagnosing Fertility Disorders of Genetic Origin Based on Next-generation High-throughput Sequencing. Co-financing agreement No. RPPM.01.01.01-22-0060/17. The funders had no role in study design, data collection or analysis, the decision to publish, or the preparation of the manuscript.
References
- 1.Vander Borght M, Wyns C. Fertility and infertility: Definition and epidemiology. Clin Biochem. 2018;62: 2–10. doi: 10.1016/j.clinbiochem.2018.03.012 [DOI] [PubMed] [Google Scholar]
- 2.Vanaken GJ, Bassinet L, Boon M, Mani R, Honoré I, Papon J-F, et al. Infertility in an adult cohort with primary ciliary dyskinesia: phenotype–gene association. European Respiratory Journal. 2017;50: 1700314. doi: 10.1183/13993003.00314-2017 [DOI] [PubMed] [Google Scholar]
- 3.Wen J, Huang K, Du X, Zhang H, Ding T, Zhang C, et al. Can Inhibin B Reflect Ovarian Reserve of Healthy Reproductive Age Women Effectively? Front Endocrinol (Lausanne). 2021;12: 343. doi: 10.3389/fendo.2021.626534 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Walker MH, Tobler KJ. Female Infertility. StatPearls. StatPearls Publishing; 2022. Available from: http://www.ncbi.nlm.nih.gov/pubmed/32310493. [PubMed] [Google Scholar]
- 5.Yahaya TO, Oladele EO, Anyebe D, Obi C, Bunza MDA, Sulaiman R, et al. Chromosomal abnormalities predisposing to infertility, testing, and management: a narrative review. Bull Natl Res Cent. 2021;45: 65. doi: 10.1186/s42269-021-00523-z [DOI] [Google Scholar]
- 6.Vural B, Cakiroglu Y, Vural F, Filiz S. Hormonal and functional biomarkers in ovarian response. Arch Gynecol Obstet. 2014;289: 1355–1361. doi: 10.1007/s00404-013-3132-1 [DOI] [PubMed] [Google Scholar]
- 7.Yan S, Jin W, Ding J, Yin T, Zhang Y, Yang J. Machine-intelligence for developing a potent signature to predict ovarian response to tailor assisted reproduction technology. Aging. 2021;13: 17137–17154. doi: 10.18632/aging.203032 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Xi Q, Yang Q, Wang M, Huang B, Zhang B, Li Z, et al. Individualized embryo selection strategy developed by stacking machine learning model for better in vitro fertilization outcomes: an application study. Reproductive Biology and Endocrinology. 2021;19: 53. doi: 10.1186/s12958-021-00734-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Louis CM, Erwin A, Handayani N, Polim AA, Boediono A, Sini I. Review of computer vision application in in vitro fertilization: the application of deep learning-based computer vision technology in the world of IVF. J Assist Reprod Genet. 2021;38: 1627–1639. doi: 10.1007/s10815-021-02123-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Liu L, Shen F, Liang H, Yang Z, Yang J, Chen J. Machine Learning-Based Modeling of Ovarian Response and the Quantitative Evaluation of Comprehensive Impact Features. Diagnostics. 2022;12: 492. doi: 10.3390/diagnostics12020492 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Li HWR, Lee VCY, Lau EYL, Yeung WSB, Ho PC, Ng EHY. Role of Baseline Antral Follicle Count and Anti-Mullerian Hormone in Prediction of Cumulative Live Birth in the First In Vitro Fertilisation Cycle: A Retrospective Cohort Analysis. Lambalk CB, editor. PLoS One. 2013;8: e61095. doi: 10.1371/journal.pone.0061095 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Lensen SF, Wilkinson J, Mol BWJ, la Marca A, Torrance H, Broekmans FJ. Individualised gonadotropin dose selection using markers of ovarian reserve for women undergoing IVF/ICSI. Cochrane Database of Systematic Reviews. 2017;2017. doi: 10.1002/14651858.CD012693 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Friedman JH. Stochastic gradient boosting. Comput Stat Data Anal. 2002;38: 367–378. doi: 10.1016/S0167-9473(01)00065-2 [DOI] [Google Scholar]
- 14.Ke G, Meng Q, Finley T, Wang T, Chen W, Ma W, et al. LightGBM: A Highly Efficient Gradient Boosting Decision Tree. In: Guyon I, Luxburg U von, Bengio S, Wallach H, Fergus R, Vishwanathan S, et al. editors. Advances in Neural Information Processing Systems. Curran Associates, Inc.; 2017. Available from: https://proceedings.neurips.cc/paper/2017/file/6449f44a102fde848669bdd9eb6b76fa-Paper.pdf. [Google Scholar]
- 15.Lundberg SM, Erion G, Chen H, DeGrave A, Prutkin JM, Nair B, et al. From local explanations to global understanding with explainable AI for trees. Nat Mach Intell. 2020;2: 56–67. doi: 10.1038/s42256-019-0138-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Kursa MB, Rudnicki WR. Feature Selection with the Boruta Package. J Stat Softw. 2010;36: 1–13. doi: 10.18637/jss.v036.i11 [DOI] [Google Scholar]
- 17.Breiman L. Random Forests. Mach Learn. 2001;45: 5–32. doi: 10.1023/A:1010933404324 [DOI] [Google Scholar]
- 18.Keany E. BorutaShap: A wrapper feature selection method which combines the Boruta feature selection algorithm with Shapley values. Zenodo. 2020. doi: 10.5281/zenodo.4247618 [DOI] [Google Scholar]
- 19.Tekaia F. Genome Data Exploration Using Correspondence Analysis. Bioinform Biol Insights. 2016;10: BBI.S39614. doi: 10.4137/BBI.S39614 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Kohonen T. The self-organizing map. Proceedings of the IEEE. 1990;78: 1464–1480. doi: 10.1109/5.58325 [DOI] [Google Scholar]
- 21.Barrett JC, Fry B, Maller J, Daly MJ. Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics. 2005;21: 263–265. doi: 10.1093/bioinformatics/bth457 [DOI] [PubMed] [Google Scholar]
- 22.McInnes L, Healy J, Saul N, Großberger L. UMAP: Uniform Manifold Approximation and Projection. J Open Source Softw. 2018;3: 861. doi: 10.21105/joss.00861 [DOI] [Google Scholar]
- 23.Quinlan JR. Induction of decision trees. Mach Learn. 1986;1: 81–106. doi: 10.1007/BF00116251 [DOI] [Google Scholar]
- 24.Maged AM, Amin A, Sherif NA, Askalani AN, Saadalla SS, Saad AH, et al. Comparative Study Between Ovarian Response Prediction Index Versus Anti-Müllerian Hormone, Antral Follicular Count as Predictors of Ovarian Response in Women Undergoing IVF/ICSI Cycles. Evidence Based Women’s Health Journal. 2021;11: 146–156. doi: 10.21608/EBWHJ.2019.17205.1020 [DOI] [Google Scholar]
- 25.Fiedler K, Ezcurra D. Predicting and preventing ovarian hyperstimulation syndrome (OHSS): the need for individualized not standardized treatment. Reproductive Biology and Endocrinology. 2012;10: 32. doi: 10.1186/1477-7827-10-32 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Parrella A, Irani M, Keating D, Chow S, Rosenwaks Z, Palermo GD. High proportion of immature oocytes in a cohort reduces fertilization, embryo development, pregnancy and live birth rates following ICSI. Reprod Biomed Online. 2019;39: 580–587. doi: 10.1016/j.rbmo.2019.06.005 [DOI] [PubMed] [Google Scholar]
- 27.Li HWR, Nelson SM. Clinical Application of AMH Measurement in Assisted Reproduction. Front Endocrinol (Lausanne). 2020;11. doi: 10.3389/fendo.2020.606744 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Zhang Y, Xu Y, Xue Q, Shang J, Yang X, Shan X, et al. Discordance between antral follicle counts and anti-Müllerian hormone levels in women undergoing in vitro fertilization. Reproductive Biology and Endocrinology. 2019;17: 51. doi: 10.1186/s12958-019-0497-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Ebid AHIM Motaleb SMA, Mostafa MI Soliman MMA. Novel nomogram-based integrated gonadotropin therapy individualization in in vitro fertilization/intracytoplasmic sperm injection: A modeling approach. Clin Exp Reprod Med. 2021/05/12. 2021;48: 163–173. doi: 10.5653/cerm.2020.03909 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Nikbakht R, Mohammadjafari R, Rajabalipour M, Moghadam MT. Evaluation of oocyte quality in Polycystic ovary syndrome patients undergoing ART cycles. Fertil Res Pract. 2021;7: 2. doi: 10.1186/s40738-020-00094-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Sindiani AM, Batiha O, Al-zoubi E, Khadrawi S, Alsoukhni G, Alkofahi A, et al. Association of single-nucleotide polymorphisms in the ESR2 and FSHR genes with poor ovarian response in infertile Jordanian women. Clin Exp Reprod Med. 2021;48: 69–79. doi: 10.5653/cerm.2020.03706 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Bilibio JP, Meireles AJC, de Conto E, Lorenzzoni PL, do Nascimento FC, da Cunha-Filho JS. GDF9 polymorphisms: influence on ovarian response in women undergoing controlled ovarian hyperstimulation. JBRA Assist Reprod. 2020;24: 447–453. doi: 10.5935/1518-0557.20200027 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Conforti A, Tüttelmann F, Alviggi C, Behre HM, Fischer R, Hu L, et al. Effect of Genetic Variants of Gonadotropins and Their Receptors on Ovarian Stimulation Outcomes: A Delphi Consensus. Front Endocrinol (Lausanne). 2022;12. doi: 10.3389/fendo.2021.797365 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Chen D, Zhu X, Wu J. Can polymorphisms of AMH/AMHR2 affect ovarian stimulation outcomes? A systematic review and meta-analysis. J Ovarian Res. 2020;13: 103. doi: 10.1186/s13048-020-00699-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.van Disseldorp J, Franke L, Eijkemans R, Broekmans F, Macklon N, Wijmenga C, et al. Genome-wide analysis shows no genomic predictors of ovarian response to stimulation by exogenous FSH for IVF. Reprod Biomed Online. 2011;22: 382–388. doi: 10.1016/j.rbmo.2010.12.006 [DOI] [PubMed] [Google Scholar]
- 36.Yuan P, He Z, Zheng L, Wang W, Li Y, Zhao H, et al. Genetic evidence of ‘genuine’ empty follicle syndrome: a novel effective mutation in the LHCGR gene and review of the literature. Human Reproduction. 2017;32: 944–953. doi: 10.1093/humrep/dex015 [DOI] [PubMed] [Google Scholar]
- 37.Liu W, Xin Q, Wang X, Wang S, Wang H, Zhang W, et al. Estrogen receptors in granulosa cells govern meiotic resumption of pre-ovulatory oocytes in mammals. Cell Death Dis. 2017;8: e2662–e2662. doi: 10.1038/cddis.2017.82 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Lee EB, Chakravarthi VP, Wolfe MW, Rumi MAK. ERβ Regulation of Gonadotropin Responses during Folliculogenesis. Int J Mol Sci. 2021;22: 10348. doi: 10.3390/ijms221910348 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Lebedeva IY, Singina GN, Volkova NA, Vejlsted M, Zinovieva NA, Schmidt M. Prolactin affects bovine oocytes through direct and cumulus-mediated pathways. Theriogenology. 2014;82: 1154–1164.e1. doi: 10.1016/j.theriogenology.2014.08.005 [DOI] [PubMed] [Google Scholar]
- 40.Abittan B, Karten J, Domney A, Shan W, Goldman RH. Predicting number of mature oocytes at retrieval: a machine-learning model for patient counseling. Fertil Steril. 2020;114: e154. doi: 10.1016/j.fertnstert.2020.08.444 [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The raw sequencing data generated for this study has been deposited in the European Nucleotide Archive (ENA) at EMBL-EBI under accession number PRJEB54062 (https://www.ebi.ac.uk/ena/browser/view/PRJEB54062). The source code is available at the GitHub repository (https://github.com/CBR-Invicta/GPS)—notebooks contain results and plots generated using the dataset. INVICTA Fertility Clinics do not allow public disclosure of patient data used in this study. In case of additional questions, please contact the INVICTA Research and Development Center (cbr@invicta.pl).