Summary
Preeclampsia is a complex disease of pregnancy whose physiopathology remains unclear. We developed machine-learning models for early prediction of preeclampsia (first 16 weeks of pregnancy) and over gestation by analyzing six omics datasets from a longitudinal cohort of pregnant women. For early pregnancy, a prediction model using nine urine metabolites had the highest accuracy and was validated on an independent cohort (area under the receiver-operating characteristic curve [AUC] = 0.88, 95% confidence interval [CI] [0.76, 0.99] cross-validated; AUC = 0.83, 95% CI [0.62,1] validated). Univariate analysis demonstrated statistical significance of identified metabolites. An integrated multiomics model further improved accuracy (AUC = 0.94). Several biological pathways were identified including tryptophan, caffeine, and arachidonic acid metabolisms. Integration with immune cytometry data suggested novel associations between immune and proteomic dynamics. While further validation in a larger population is necessary, these encouraging results can serve as a basis for a simple, early diagnostic test for preeclampsia.
Keywords: preeclampsia, machine learning, predictive modeling, multiomics, biomarkers
Highlights
-
•
Machine-learning models for prediction of preeclampsia were developed
-
•
Six omics datasets from a longitudinal cohort of pregnant women were analyzed
-
•
Prediction models from urine metabolome and from proteome had the best accuracy
-
•
The prediction model from urine metabolites was validated on an independent cohort
The bigger picture
The World Health Organization estimates that more than 800 women worldwide die from pregnancy-related causes every day. One of the main causes is a hypertensive disorder, preeclampsia, for which the only treatment is to deliver, often too early. Preeclampsia affects 3%–5% of pregnancies in the United States and up to 8% globally. Machine-learning analyses of high-dimensional multiomics data could potentially capture complex dynamics involved in the preeclampsia pathogenesis. We developed machine-learning models for early prediction of preeclampsia (first 16 weeks of pregnancy) and over gestation by analyzing six omics datasets from a longitudinal cohort of pregnant women. A prediction model using nine urine metabolites had high accuracy and was validated on an independent cohort. While encouraging, our results need to be validated on a larger cohort. If generalizable, our findings could lead to a simple prediction test for use in both developed and developing parts of the world.
Preeclampsia is one of the main complications of pregnancy, posing risk both to the mother and the baby. We developed machine-learning models for early prediction of preeclampsia (first 16 weeks of pregnancy) and over gestation by analyzing six omics datasets from a longitudinal cohort of pregnant women. If further validated, our findings could lead to a simple prediction test for use in both developed and developing parts of the world.
Introduction
The World Health Organization estimates that more than 800 women worldwide die from pregnancy-related causes every day, with the highest rates of maternal mortality and morbidity in low-income countries.1 One of the main causes is a hypertensive disorder of pregnancy, preeclampsia, for which the only treatment is to deliver, often too early. Preeclampsia affects 3%–5% of pregnancies in the United States and up to 8% of all pregnancies globally,1 and accounts for 10%–15% of maternal deaths2 and 15%–20% of preterm births.3
The pathophysiology of preeclampsia is complex and is thought to be caused in part by abnormal placentation as well as a woman’s genetic and immunologic predisposition.4 It is believed that abnormal placentation leads to a maternal inflammatory response.4 Placental ischemia, oxidative stress, and the presence of a maternal angiogenic imbalance are all characteristics of preeclampsia,5,6 leading to endothelial and end-organ damage, and in some cases to stroke and even death.
Specific biological processes involved in the development of preeclampsia are not yet completely understood. Early prediction of preeclampsia has remained a clinical challenge, owing to incompletely understood causes, various risk factors, and likely multiple pathogenic phenotypes of preeclampsia.7,8 The recent availability of high-throughput omics (e.g., genome, transcriptome, proteome, and metabolome) assays, where each can be performed on small sample volumes, has enabled joint analyses of the high-dimensional multidomain or “multiomics” data measured from the same biological sample.4,9,10 An integrated analysis may capture complex dynamics involved in the preeclampsia which could ultimately lead to novel therapeutic interventions. Furthermore, applying machine-learning methods capable of extracting the most predictive features from high-dimensional multiomics data could lead to more accurate predictive models, discovery of biomarkers, and improved early detection of women at risk for developing preeclampsia.
In this study, we performed a multiomics analysis of the transcriptome, proteome, metabolome, lipidome, and microbiome from a coordinated set of biospecimens collected longitudinally from normotensive and preeclamptic pregnant women; we then integrated immune system mass spectrometry features that were available for a subset of the women; and finally, we combined the multiomics data with the available clinical/demographics data and performed a joint analysis. Our goals were to: (1) build an early prediction model of preeclampsia; (2) develop a simple and interpretable predictive model based on a small number of biomarkers that can lead to the development of a diagnostic test; (3) compare prediction capabilities of different omics sets; (4) build an integrated multiomics predictive model of preeclampsia to identify a signature of preeclampsia; and (5) gain insights into pathways involved in the pathogenesis of preeclampsia.
Results
Study design and multiomics data collection
Our prospective study included 33 women in the discovery cohort (17 preeclamptic, 16 normotensive) and 16 women in the validation cohort (12 preeclamptic, 4 normotensive) (Figure 1A). The validation cohort was used to validate the metabolomics results. Among the preeclamptic women in the discovery cohort, severe and mild preeclampsia were observed in 10 and 7 women, respectively; early- and late-onset preeclampsia were observed in 5 and 12 women, respectively (Table S2). Maternal characteristics, demographics, and gestational ages at delivery are shown in Table S1. Both in discovery and validation cohorts there was a higher prevalence of chronic hypertension, high body mass index (BMI), and twin pregnancies—all known risks for preeclampsia—among preeclamptic compared with normotensive women (Table S1).
Blood, urine, and vaginal swabs were collected longitudinally at two or three time points during pregnancy: early, mid, and late (Figure 1). Across the gestation, we found no significant difference in sampling time between preeclamptic and normotensive groups (p > 0.74 first sample, p > 0.6 second sample, and p > 0.3 third sample; Wilcoxon rank-sum test). These samples were used for measurements of six omics assays: cell-free RNA (cfRNA)/transcriptome (plasma), proteome (plasma), metabolome (plasma and urine), lipidome (plasma), and microbiome (vaginal swab). In addition, immune-system-wide mass cytometry measurements of single cells were obtained on a subset of 19 women from the same cohorts (18 women from the discovery cohort and one woman from the validation cohort). The number of measurements differed markedly among omics datasets, with transcriptome containing the highest number of measurements (Figure S1A). In contrast, the number of principal components explaining 90% of the variance, which quantifies the internal correlation of a dataset, exhibited a smaller difference among datasets (Figure S1B). Thus, although the amount of data varied several orders of magnitude among datasets, their numbers of principal components were much more similar.
Prediction of preeclampsia in early pregnancy
From a clinical perspective, early prediction of preeclampsia, i.e., within the first 16 weeks of gestation, is of critical importance, as it would enable: early treatment of high-risk women (e.g., with low-dose aspirin11); closer monitoring of high-risk pregnancies; and the enrichment of preemptive interventional studies in women at risk for developing preeclampsia.12 Identifying a small number of specific biomarkers that are predictive of preeclampsia early in pregnancy could ultimately facilitate the development of a simple and affordable diagnostic test for both high-income and low- and middle-income countries. To this end, we developed an early prediction model for preeclampsia using only samples collected from each omics dataset during the first 16 weeks of pregnancy. To agnostically examine all the measurements in our high-dimensional data, we used Elastic Net (EN), a regularized regression machine-learning method (see experimental procedures). EN was chosen for its ability to extract, from high-dimensional data, a handful of the most predictive features that can predict an outcome with high accuracy.13 Performance was evaluated using the leave-one-out cross-validation method. Comparison of predictors demonstrated the highest performance of the urine metabolome predictive model (area under the receiver-operating characteristics curve [AUC] = 0.88, 95% confidence interval [CI] [0.76, 0.99]) followed by the proteome model (AUC = 0.87, 95% CI [0.75, 0.99]) and cfRNA models (AUC = 0.68, 95% CI [0.49, 0.87]) outperforming models from other omics (Figure 2A). Top identified proteins and genes are shown in Figure S7.
The heatmap of rank values of features selected by EN from all omics is shown in Figure 2B. Hierarchical clustering was used to separate preeclamptic from normotensive pregnancies.
We next focused on our top-performing model, which was trained from the urine metabolome dataset and consisted of nine metabolites, and evaluated its performance in the validation cohort. The model validated maintaining high performance with an AUC of 0.83 (95% CI [0.62, 1.0]) (Figure 3A), confirming identified metabolites (Figure 3B) as biomarkers of preeclampsia. Furthermore, p values obtained using a separate univariate analysis demonstrated the statistical significance of each of the identified metabolites (Figure 3B). The metabolites identified by EN as the biomarkers were dihydroxybenzoic acid, tetradecadiencarnitine, adenine, dihydroxyphenylglycol O-sulfate, methoxyhydroxyphenylethyleneglycol, and four uncharacterized molecules (C23H39NO19, C26H32F2O7, C15H25N3O8S, C12H27Sn) (Figure 3B).
Machine-learning modeling of preeclampsia over gestation
We next analyzed longitudinal data that included all samples taken during pregnancy (Figure 1) in order to capture the changes that occur due to preeclampsia during pregnancy and possibly gain insights into the development of preeclampsia. As in the early pregnancy analysis, multivariate models of preeclampsia were trained for each omics using EN (see experimental procedures). To investigate whether a joint analysis of different omics can offer further gains, predictions from separate omics models were integrated in a joint model using stacked regression (see experimental procedures). The performance of all models was evaluated using the leave-one-out cross-validation method. The integrated model exhibited the highest prediction accuracy (AUC = 0.91, 95% CI [0.85, 0.97]), outperforming predictions from each separate model in terms of the point estimate (Figure 4A). EN models from the proteome and urine metabolome exhibited high performance (AUC = 0.89, 95% CI [0.83, 0.95]; AUC = 0.87, 95% CI [0.80, 0.94], respectively) outperforming other omics data, the same trend we observed in the early pregnancy performance. As before, the urine metabolite model was validated in an independent cohort, with an AUC of 0.87 (95% CI [0.76, 0.99]) (Figure 4B), confirming identified metabolites as true biomarkers of preeclampsia.
Top urine metabolites included adenine, isovalerylglutamic acid, uric acid ribonucleoside, 1,5-anhydroglucitol, dehydroepiandrosterone, sialyllactose, Nε-acetyl-L-lysine, and nonanoylcarnitine. p values obtained using a separate univariate analysis show statistical significance of each of the identified metabolites (Figure 4C). One of the identified metabolites, uric acid ribonucleoside, is an end product in the same pathway as uric acid, whose increased concentration is typical of preeclampsia.14 As an end product, uric acid ribonucleoside is more likely to be a sensitive biomarker. Interestingly, the uric acid levels in our data did not discriminate between controls and preeclamptic patients.
A model using top-scoring plasma proteins achieved an AUC of 0.83 (95% CI [0.73, 0.92]) (Figure 4A). The most predictive plasma proteins selected by EN included leptin (LEP), vascular endothelial growth factor A (VEGFA), L-selectin (SELL), E-selectin (SELE), interleukin-24 (IL-24), IL-22, tyrosine-protein kinase transmembrane receptor (ROR1), C-X-C motif chemokine ligand 10 (CXCL10), and SPARC-like 1 (SPARCL1) (Figure 4D), thereby confirming some of the established or indicated proteins associated with preeclampsia15,16,17 (see Table S4 and discussion for further details), as well as establishing new associations. Also in this case, p values obtained using univariate analysis show that all proteins chosen by EN are statistically significant (Figure 4D).
We point out that, as expected, EN models varied slightly owing to variability of the chosen training set in each leave-one-out cross-validation step18 and therefore, the features chosen by EN varied slightly across cross-validations. We recorded the frequency of occurrence for every feature across all cross-validation steps (shown for the proteome model in Figure S2). Having high frequency of occurrence indicates that the feature is relevant for all or a majority of samples, i.e., it is more stable.18
Because both in discovery and validation cohorts the prevalence of known preeclampsia risks including chronic hypertension, high BMI, and twin pregnancies was higher among preeclamptic compared with normotensive women (Table S1), we next investigated whether our multiomics model captures mostly these differences. We calculated Spearman correlation between prediction model scores and clinical variables (Figure S3). The first-trimester blood pressure was also included in the analysis. The highest correlation between the model and clinical variables, and the only one that was statistically significant, was found to be between BMI and the model (p < 0.0086). However, even in this case the BMI did not fully correlate with the model (Spearman correlation = 0.63) confirming that our model does not just capture differences in BMI to distinguish between preeclamptic and normotensive women. We observed a low correlation between the model (row labeled “prediction” in Figure S3) with other clinical variables, indicating that the omics model is not just describing differences in the available clinical/demographic characteristics of women but is capturing biological differences.
The correlation network of all features chosen by EN models for each of the omics sets was plotted using t-distributed stochastic neighbor embedding (t-SNE), revealing multiomics interaction of analytes associated with preeclampsia (Figure 5). Edges between features indicate a Spearman correlation >0.55. As expected, features from one omics set tend to group together, with higher correlation among them. In addition, we observe that correlation exists among different omics datasets. Exploiting these correlations may be a plausible explanation as to why the integrated prediction model outperforms individual models as shown in Figure 3. Pathway enrichment analysis revealed that three protein clusters observed in the plot were associated with different pathways: (1) pathways related to immune response (bottom cluster); (2) pathways related to neurodevelopment (middle cluster); and; (3) pathways related to intracellular signal transduction (top cluster) (Figure 5). Three metabolic pathways were enriched: (1) steroid biosynthesis; (2) tryptophan metabolism, as in the case of univariate analysis (Figure 7); and (3) β-oxidation of very long chain fatty acids whose role in preeclampsia has been previously observed.
Finally, we compared the most predictive features as identified by EN in early pregnancy versus during gestation in terms of their significance (−log10(p values)). Plasma proteins are shown in Figure S5A and urine metabolites in Figure S5B. We observe that a large number of proteins identified by EN stay statistically significant in both cases, including LEP, SELL, CCL23, ROR1, IL1RAP, SELL, SELE, VEGFA, IGFBP1, and SPARCL1 (Figure S4A). Some of the proteins are significant over gestation but not early in pregnancy (e.g., APOB) possibly due to a smaller number of samples. A similar trend is observed for urine metabolites (Figure S4B). Also noteworthy is that because EN uses sparsity, a feature that is associated with preeclampsia may be excluded from the final model if that model already includes another feature highly correlated with the original one. This is especially true in scenarios with a large number of features and high-dimensional regime such as in our study. This effect is illustrated in Figure S5, showing the difference in chosen features for prediction models over gestation and in early pregnancy.
Single-cell characterization of the immune system
Preeclampsia is strongly associated with inflammation and aberrant maternal immune system adaptations during pregnancy.19 To assess immunity—which is complementary to pathways covered by proteins and metabolites—and connect differential abundances of plasma proteins and urine metabolites in preeclamptic pregnancies to biological changes, immune-system-wide mass cytometry measurements of single cells obtained in a subset of the same patient cohort were integrated with our plasma proteome and urine metabolome prediction models, as these two models had the best accuracy. Immune cell dynamics between first- and second-trimester blood samples obtained from high-dimensional mass cytometry were previously used to develop a prediction model of preeclampsia.20 We found that seven (out of eight) of the immune features reported by Han et al.20 correlated highly with the prediction based on our integrated algorithm (Spearman correlation p < 0.05) (Figure 6A, highlighted in orange), confirming the predictive value of immune cell features as well as plasma proteins and urine metabolites. To investigate whether this correlation between predictive features was biologically meaningful, we focused on the correlations of feature behavior between the eight earlier reported predictive immune features (Figure 6A) and the top 12 most informative plasma proteome features (Figure 4D) across pregnancy (Figure 6B). LEP and SELL levels were particularly strongly correlated with the eight immune cell features (Figure 6B). Interestingly, basal pSTAT5 signaling in T helper 1 (Th1) cells (CD4+Tbet+), the top immune feature to distinguish control from preeclamptic pregnancies,20 correlated with LEP levels in both control and preeclamptic patients. Uniquely in preeclamptic cases, LEP levels were correlated with basal pSTAT1 signaling in intermediate myeloid cells (intMCs) (Spearman correlation p = 0.002) and basal STAT5 signaling in myeloid dendritic cells (mDCs) (Spearman correlation p = 0.01). Moreover, SELL levels were uniquely correlated with immune features in preeclamptic pregnancies and not with controls, i.e., correlated with basal pNFkB and pSTAT1 signaling in cMCs, basal pSTAT5 signaling in Th1 cells and mDCs, and basal pMAPKAPK2 signaling in naive CD4 T cells. Preeclamptic pregnancies were not characterized by—in other words, had potentially lost—concerted proteome/immune behavior, which was prominently observed in healthy pregnancies, i.e., correlations of leptin with basal pP38 signaling in T regulatory (Treg) and T cell receptor γδ (TCRγδ) cells. These correlations exemplify the biological connection between responsiveness of immune cells and its plasma environment.
Relationship between clinical data and omics measurements
Clinical and demographics data contain maternal characteristics known to be associated with the risk of preeclampsia, e.g., preexisting hypertension, race, BMI, height, and gravida. We combined ten variables that were available in this dataset (Table S1) with the most predictive sets, (1) plasma proteome and (2) urine metabolome models, to better understand their mutual relationship. The ten clinical variables were included together with the top ten omics features, all combined in the single cross-validation step. Inclusion of clinical and demographics data improved the performance when combined with both the plasma proteome and the urine metabolome (urine metabolome AUC = 0.96, 95% CI [0.92, 0.99]; proteome AUC = 0.91, 95% CI [0.85, 0.97]) (Figure S6A). The most predictive clinical variables included maternal age, BMI, height, and preexisting hypertension. We observed several significant correlations (Spearman correlation p < 0.05) between clinical variables and plasma proteins/urine metabolites that were present only among preeclamptic women. These included: leptin with maternal BMI/weight, in agreement with existing literature;21 CCL23 with height; SELL with gravida (Figure S6B); maternal age with adenine—previously observed22—and maternal age with isovalerylglutamic acid (Figure S6C).
In addition, to compare the above performance with the performance of a baseline model, we trained EN only on the available maternal and pregnancy characteristics. The obtained model still had a good accuracy, AUC = 0.85, 95% CI [0.66, 1.0], but the performance, as expected, was lower compared with the combined model. The most predictive variables in the model were maternal hypertension, BMI, race, age, and number of babies in the current pregnancy. This is in line with results in our previous work that developed a machine-learning model for early prediction of preeclampsia from electronic health record data.23
Univariate analysis of preeclampsia pathogenesis from multiomics measurements
We next present univariate analysis with the Benjamini-Hochberg procedure that identifies all features significantly associated with preeclampsia. We further used these features to perform pathway enrichment analysis.
Over the course of pregnancy
Changes over gestation of 1,215 metabolic features among 8,718 were significantly associated with preeclampsia outcome (false discovery rate [FDR] < 0.05, linear mixed-effects [LME] model with Benjamini-Hochberg procedure). Pathway enrichment analysis using these urine metabolites identified the following pathways (p < 0.05) (Figure 7A): (1) tryptophan metabolism; (2) caffeine metabolism; (3) tyrosine metabolism; (4) steroid hormone biosynthesis; (5) pentose and glucuronate interconversions; (6) linoleic acid metabolism. The steroid hormone biosynthesis pathway plays an important role in pregnancy progression.24 Both the steroid hormone biosynthesis pathway and the caffeine metabolism pathways and caffeine metabolites have previously been associated with pregnancy,25,26 and tryptophan metabolism with preeclampsia.27 Metabolites in the steroid hormone biosynthesis pathway and in the caffeine metabolism pathway present in the data with high level of significance are respectively shown in Figures S8C and S8D. The comparison between enrichment factors and statistical significance of pathways enriched over gestation versus in early pregnancy is shown in Figure S22.
Among 1,305 proteins, 437 had changes that were significantly associated with preeclampsia outcome over gestation (FDR < 0.05, LME model with Benjamini-Hochberg procedure). The top 64 proteins at significance level (LME model) showed markedly different values between normotensive and preeclamptic women (Figure S9). Top proteins included IL-1 receptor accessory protein (IL1RAP) and SELL, both known to play a role in the immune response.28 Enriched pathways grouped into ten biological processes, the most prevalent being positive regulation of cellular processes (including biological, cellular, protein metabolic, immune system, and apoptotic processes among others) (46.4%) (Figure S10). In the cfRNA set, 306 features were significantly associated with preeclampsia outcome over gestation (FDR < 0.05, LME model with Benjamini-Hochberg procedure). Enriched pathways grouped into 11 biological processes, the most prevalent being RNA splicing (37.3%) (Figure S11). Top features included YOD1 (known to be related to developmental processes29), BIRC2, CEP63, and LCP1 (also previously implicated with preeclampsia30). A network of top proteome, transcriptome, and urine and plasma metabolome features is shown in Figure S12.
Early pregnancy
In early pregnancy, 497 out of 8,718 urine metabolic features had changes significantly associated with preeclampsia when compared with normotensive controls (FDR < 0.05, Wilcoxon signed-rank test with Benjamini-Hochberg procedure). Pathway enrichment analysis on these urine metabolites identified the following pathways (p < 0.05) (Figure 7B): (1) tyrosine metabolism; (2) lysine degradation; (3) tryptophan metabolism; (4) phenylalanine metabolism; (5) steroid hormone biosynthesis; (6) arachidonic acid metabolism; (7) phenylalanine, tyrosine, and tryptophan biosynthesis; (8) aminoacyl-tRNA biosynthesis. Arachidonic acid metabolism is a central regulator of the inflammatory response and has a known role in the pathogenesis of preeclampsia.31 Similarly, tryptophan metabolism has an important role in pregnancy, providing increased protein synthesis by the mother, fetal growth development; and serotonin for signaling pathways.32 Individual metabolites from these two pathways are shown in Figures S8A and S8B.
In the proteome set containing 1,305 proteins, three proteins—LEP, CCL23, and FAM3D—were significantly associated with preeclampsia outcome (FDR < 0.05, Wilcoxon signed-rank test with Benjamini-Hochberg procedure) identifying one significantly enriched pathway, negative regulation of glucagon secretion (Fisher’s exact test with Benjamini-Hochberg procedure, FDR < 0.05). The reason we did not adjust for covariates, specifically BMI—the only covariate with statistically significant correlation with the model predictions (Figure S3)—is that we wanted to capture the underlying biology including the mechanism under which the existing factors such as BMI are associated with preeclampsia. By adjusting for BMI, we would potentially remove pathways otherwise enriched and involved in preeclampsia.
Outlier analysis
We observed that a few women in our cohort were consistently misclassified by our prediction algorithm (Figure S13). A few normotensive control women resembled those with preeclampsia on a molecular level in some of the top predictive features across omics sets. Vice versa, there were some preeclamptic women whose top molecular features more closely resembled those of controls. Reexamination of the clinical charts revealed that one of the preeclamptic women, while clearly hypertensive, had proteinuria in the context of gross hematuria, obscuring whether proteinuria was related to preeclampsia. Therefore, she may have been misdiagnosed with preeclampsia but rather only had gestational hypertension. This highlights that the predictive model can pick up discrepancies within the clinical chart. For the other women whose clinical diagnosis held, this implies that their phenotypic features that classified them in either normotensive or preeclampsia group did not match their molecular phenotypes. Of interest, one preeclamptic woman, which the prediction classified as control, developed HELLP syndrome very late in gestation at 41 + 3 weeks. Therefore, if she had delivered closer to the due date she would have been considered a control. Thus, if others in the control group have a similar molecular phenotype, this may represent a late-onset preeclampsia related to placental aging in the post-term period.
Discussion
Recent omics studies of preeclampsia typically included up to two omics datasets.10,33,34 Our study presents an integrated analysis of six high-throughput omics datasets obtained on the same biological sample, containing more than 50,000 measurements per sample. This multiomics analysis enabled uniform comparison of omics sets and revealed improved predictive ability for preeclampsia status relative to individual biological modalities, and indications of biological processes associated with the disease across multiple modalities. The first part of the analysis focused on early prediction of preeclampsia with the goal of comparing the six omics and identifying the best biomarkers. We then used a multiomics approach to integrate six omics datasets into one integrated (stacked) prediction model. The multiomics analysis demonstrated that the plasma protein and urine metabolome had the highest accuracy, both early in pregnancy (Figure 2A) and over gestation (Figure 4A). For that reason, we followed this with a more targeted analysis of plasma proteins and urine metabolites that were identified as having the highest accuracy. Ultimately, our goal is to develop a simple diagnostic test with high accuracy, and these two omics datasets were identified as most promising from our analysis.
One of the main strengths of our study is that, in our cohort, biological samples were not only collected longitudinally from each woman, but each individual sample was also simultaneously measured for proteome, transcriptome, metabolome, lipidome, and vaginal swab for microbiome, thereby providing a unique opportunity to systematically study changes attributable to preeclampsia over gestation, and compare the capability of each of these omics sets to predict and characterize preeclampsia. All 50,000 measurements were used in the prediction algorithm to agnostically identify the best biomarkers of preeclampsia.
Among our six omics, urine metabolomic and plasma proteomic datasets demonstrated the highest prediction accuracies, both over gestation and early in pregnancy. A prediction model using a small number of urine metabolites provided high accuracy over gestation (AUC = 0.88, cross-validated) and early in pregnancy (AUC = 0.875, cross-validated). The prediction model was validated on an independent cohort (AUC = 0.83 in early pregnancy; AUC = 0.87 over gestation), confirming identified metabolites as true biomarkers. Univariate analysis demonstrated the statistical significance of these biomarkers.
The EN prediction model with plasma proteins achieved AUC of 0.83 over gestation and of 0.88 in early pregnancy. Several of the proteins identified by our model as the most predictive have previously been well established as biomarkers of preeclampsia (VEGFA,15 LEP,16 SELL,35 CXCL10,36 ROR1,37 and IL1RAP38), further validating our results. Some of the identified proteins have been previously indicated in individual studies but have not yet been confirmed (IL-24,17 HIPK3,39 and SPARCL140), and some have been identified for the first time (IL-22). While these biomarkers were previously examined in typically more targeted studies examining a single or small group of these proteins (e.g., IL-2417), our agnostic approach demonstrates that put together, their combined measurements result in an accurate model of preeclampsia. One of identified proteins in our model was VEGFA. Reduced levels of VEGFA have previously been described in preeclamptic pregnancies, owing to increased levels of placental soluble FMS-like tyrosine kinase-1 (sFLT-1), which validate our study.15,41,42 Among the other known biomarkers of preeclampsia—sFLT-1, pregnancy-associated plasma protein A (PAPP-A), placental growth factor (PIGF), and endoglin (ENG)—PIGF and PAPP-A were indeed significantly different between normotensive and preeclamptic women (Figure S15). The fact that ENG and sFLT-1 were not significant may be in part due to the small size of our cohort. In addition, we point out that sFLT-1 is a good predictor of preeclampsia later in pregnancy and once suggestive clinical features are observed,43 whereas the majority of our samples were taken before that point. Clinically, it is the sFLT-1/PIGF ratio that is used as a biomarker and not individual levels (Figure S14B). Throughout pregnancy PIGF levels are increasing until sFLT-1 levels start to increase, which is consistent with what is observed in our data (Figure S14) and suggests a variety of new hypotheses for testing. We did not include PP13 (galectin13) measurements, another known biomarker of preeclampsia.
Preeclampsia is accompanied by a dysregulated maternal immune adaptation to pregnancy, which is already detectable in early pregnancy.19,20 This aberrant signature was previously identified in women who developed preeclampsia later.20 Here we report that the intricate functional capacities of immune cells are coevolving with their environment throughout the course of pregnancy, showing that top informative immune feature levels are highly correlated with top informative plasma proteins. This interconnectedness supports both prediction approaches, confirming their individual usefulness while complementing the validity of each approach. The results highlight the known pathophysiology of preeclampsia and suggest novel associations between immunological and proteomic dynamics. In preeclamptic pregnancies, immune responses were uniquely correlated with levels of LEP and SELL.
LEP, known to be elevated in the plasma of preeclamptic women,16 is an immune regulatory hormone produced by adipose tissue and the placenta.16,44 LEP activates the JAK/STAT and MAPK pathways directly through binding to the leptin receptor expressed on leukocytes, and thereby modulates both innate and adaptive immune responses,45,46 including skewing of CD4 T cells toward Th1 polarization47 and inhibiting Treg proliferation.46 Accordingly, we observed that LEP levels in preeclamptic and control pregnancies correlated with STAT and MAPK pathway signaling both in innate and adaptive immune cells, suggesting that dysregulated leptin levels in preeclamptic pregnancies might contribute to the aberrant immune signature while, reciprocally, inflammation itself might enhance plasma leptin levels.44,45 Moreover, while in healthy pregnancies LEP levels correlated with p38 signaling in Treg and TCRγδ, this correlation was lost in preeclamptic pregnancies, suggesting that regulation of immune tolerance might be disrupted in preeclamptic pregnancies.
Furthermore, we reported decreased SELL levels in preeclamptic pregnancies that correlated with basal pSTAT, pNFkB, and pMAPKAP2 signaling in innate (mDC and cMC) and adaptive immune cells (Th1 and naive CD4 T cells). SELL is shed from leukocytes during activation and migration, and soluble L-selectin can be used as a surrogate marker for inflammation.48 Notably, a drop in soluble SELL levels is observed during sepsis.49 Previous studies reported conflicting results for circulating soluble SELL levels in preeclampsia,35,50,51 including low soluble SELL levels at 20 weeks of gestation, prior to onset of preeclampsia.50 Preeclampsia-associated enhanced ectodomain shedding of cell-adhesion molecules could be directly linked to changes in signaling responses in circulating immune cells by shedding-mediated activation of intracellular pathways.48 Alternatively, the correlation could reflect independent inflammatory mechanisms, as decreased levels of circulating SELL have been proposed to be due to its adsorption to luminal vascular ligands, which are upregulated by an activated endothelium, a feature of preeclampsia.6,50,52
The model from urine metabolites predicted preeclampsia with highest accuracy. Enrichment analysis identified discriminant biological pathways associated with preeclampsia when considering early and all time points. The steroid hormone biosynthesis pathway was significant (p < 0.05) in both models while arachidonic acid metabolism was significant in early pregnancy. Arachidonic acid is a precursor to a myriad of bioactive lipids including prostaglandins (PGs), prostacyclin, thromboxane, hydroperoxyeicosatetraenoic acid, leukotrienes, lipoxins, hypoxins, anandamide, and epoxyeicosatrienoic acids, which play key roles in inflammatory, vascular, and coagulation processes.53 As early as the 1960s the role of the eicosanoids in preeclampsia pathogenesis was proposed, and by the 1970s evidence supported that an increase in thromboxane (TXA2; produced by platelets) over prostacyclin (PGI2; produced by endothelium) associated with preeclampsia.54 This is one of the biological underpinnings for the use of low-dose aspirin for the prevention of preeclampsia. Mills et al.55 reported longitudinal measurements of the urinary metabolites of thromboxane and PGI2 throughout gestation. Although they did not find a significant increase in the urinary concentrations of TXA2, they did find a significant decrease in PGI2 as early as 13–16 weeks of gestation and a significant elevation in the ratio of thromboxane to PGI2 as early as 17–20 weeks of gestation in women destined to develop preeclampsia. While this PG imbalance is noted both prior to and at the time of clinical presentation (after 20 weeks), the fact that arachidonic acid metabolism was only observed in early pregnancy may explain why clinical studies note that low-dose aspirin initiation prior to 16 weeks is needed for significant prevention of preeclampsia.56
The tryptophan pathway was identified as highly associated with preeclampsia over gestation (Figure 7). Indoleamine-2,3-dioxygenase (IDO) is the first and rate-limiting enzyme in this pathway producing kynurenine, which then is converted into a number of bioactive metabolites. IDO is an intracellular enzyme produced by many cell types and while not secreted, impacts neighboring cells by tryptophan depletion and production of bioactive metabolites. The role of IDO in both normal and abnormal pregnancies, including preeclampsia, has been recently reviewed.57 IDO expression increases with pregnancy, and tryptophan depletion in the placenta inhibits T cell-mediated rejection of semi-allogeneic fetal tissues.58 Kynurenine is an endogenous ligand that activates the aryl hydrocarbon receptor (AhR).59 This activation skews the differentiation of T cells to immunosuppressive Tregs rather than proinflammatory Th17 cells after exposure to transforming growth factor β.60,61 Notably, kynurenic acid and xanthurenic acid, two metabolites of kynurenine, can also activate AhR signaling and may participate in immune regulation.62,63 Therefore, deficiency of IDO impacts Treg development. Notably, IDO knockout mice, when pregnant, develop a preeclampsia-like phenotype.64 The metabolic signal related to tryptophan metabolism in the model over gestation may be related to the immune signature of preeclampsia, highlighting the importance of immune alterations occurring in the later stages of preeclampsia. Caffeine metabolism was also identified as highly associated with preeclampsia over gestation. This pathway and caffeine metabolites have previously been associated with pregnancy.25,26
Models to predict preeclampsia early in pregnancy were previously based on maternal characteristics (demographics and medical history), followed by addition of uterine artery Doppler measurements and specific biomarkers.65,66,67,68,69,70 Levels of angiogenic and/or antiangiogenic proteins (PlGF, sFlt-1, and ENG), or their ratios, have been established as biomarkers with high prediction accuracy later in pregnancy.15,41,71 More recently, analysis of omics datasets have been successfully applied to identify various biomarkers related to preeclampsia.10,33,72 Most of these studies were based on measurements from one or at most two omics datasets, and often from samples taken only at one time point during pregnancy. Here we show that clinical and demographic characteristics (i.e., weight, height, race) were complementary to omics measurements and improved prediction models.
Another important problem would be to develop a more specific model to predict severe preeclampsia. Among existing models for prediction of preeclampsia based on maternal characteristics and specific biomarkers, there are fewer that predict specifically severe preeclampsia,73,74,75 and a prediction model from multiomics assays would be an important contribution to this literature. Given the small number of women with severe preeclampsia in our cohorts, we plan to address this topic in future studies by analyzing cohorts richer in this pregnancy outcome.
Our study is limited by a small sample size—a typical limitation when high-cost multiomics analysis is conducted—and consideration of a cohort from a single hospital. Another limitation comes from the fact that targeted assays (and untargeted assays that rely on a reference database) need to be carefully validated for the samples to which they are applied. For example, our targeted aptamer-based proteomics assay has been carefully validated in human plasma samples but cannot be readily applied to vaginal swabs without careful validation studies. Inherently to the machine-learning approach, developing a prediction model depends on the underlying sample distribution of the data used. Distribution shift, caused by differences among various cohorts, can impact the performance of a machine-learning algorithm.76 For this reason, we a took special care in obtaining our results by: (1) performing careful machine-learning analysis to avoid overfitting; (2) validating our model on an independent cohort; (3) demonstrating that features identified by machine learning are statistically significant when analyzed by a separate, univariate analysis; and (4) examining our prediction model in relation to a previously established model from immunological data. In this study, the mass cytometry data were not included in the multiomics prediction model because these data were not available for 14 out of 33 women. However, integrative analysis of the restricted set of common samples revealed important connections between our model and key immune features.
While encouraging, our results need to be validated on a larger, more diverse set of women. If the results prove generalizable, our findings demonstrating high predictive power from a small number of urine metabolites and proteins could lead to a simple prediction test based on a small number of urine metabolites, suitable for use in both developed and developing parts of the world.
Experimental procedures
Resource availability
Lead contact
Further information and requests for resources should be directed to and will be fulfilled by the lead contact, Ivana Marić (ivanam@stanford.edu).
Materials availability
This study did not generate new unique reagents.
Study design
We performed a longitudinal, prospective study of a cohort of pregnant women receiving routine ante- and postpartum care at the Lucile Packard Children’s Hospital at Stanford University, California, as previously described.20,77 Women were eligible for the study if they were at least 18 years of age and were in their first trimester of pregnancy. The study was approved by the Institutional Review Board of Stanford University (#21956), and all participants signed an informed consent form.
Peripheral blood samples (for mass cytometry analysis), plasma samples (for proteomic, transcriptomic [cfRNA], metabolomic, and lipidomic analyses), urine samples (for metabolomic analysis), and vaginal swabs (for microbiome analysis) were collected from each woman at two or three time points during pregnancy. Sample collection, their analyses, and quality assessment for some of them was previously described,9 and are presented in the supplemental information. The validation cohort included 16 women from the same hospital, for which longitudinal samples with only metabolomic analyses were available. Metabolomic analyses were performed following the same methodology as for the discovery cohort.
Definition of preeclampsia
Preeclampsia was defined using the American College of Obstetrics and Gynecology classification3 as follows: hypertension that develops after 20 weeks of gestation (systolic or diastolic blood pressure of 140 and/or 90 mmHg, respectively, measured on at least two occasions, 4 h to 1 week apart) and proteinuria (300 mg in a 24-h urine collection, a protein/creatinine ratio of at least 0.3 [each measured as mg/dL] or, if these were not readily available, a random urine specimen containing 1+ protein by dipstick). In the absence of proteinuria, preeclampsia was diagnosed if the presence of thrombocytopenia (platelet count less than 100,000/μL), impaired liver function (elevated blood levels of liver transaminases to twice the normal concentration), the new development of renal insufficiency (elevated serum creatinine greater than 1.1 mg/dL), pulmonary edema, or new-onset cerebral or visual disturbances. Early-onset and late-onset preeclampsia were distinguished based on whether diagnosis was before or after 34 weeks of gestation.
Machine-learning analyses
Prediction models for each omics dataset were developed for each omics set using an EN model.13 Given matrix of predictors (measurements) and a vector of responses , regression coefficients and an intercept term in the EN model are obtained by maximizing the likelihood, or equivalently minimizing the negative log likelihood together with and penalty:
(Equation 1) |
Logistic regression was used, for which the negative log likelihood evaluates to
For the high-dimensional setting () considered here, EN, which performs both shrinkage and automatic selection of predictors, can provide both high accuracy and facilitate interpretability. Prior to training a model, low-variance measurements from transcriptome and microbiome were filtered out. Other omics sets did not have near-zero variance measurements.
For integration of omics datasets (Figure 4), a nested (two-level) cross-validation approach was used to train predictive models to estimate the risk of preeclampsia (Figure S15). At the first level, the EN model was used as described above (Equation 1). At the second level, predictions of EN models were integrated using stacked regression.78,79,80 Specifically, to use EN models in the two-level approach, for each modality k, and data , a leave-one-out EN model, denoted , was repeatedly fitted and evaluated at patient . At the second level, stacked regression with non-negative coefficients14 was used, so that the regression coefficients of the final model were determined by
Note that the leave-one-out approach used in stacked regression has a purpose to form an unbiased linear combination of EN models.79 In contrast to the original stacking approach in which different prediction models fit on the same data are stacked, here, we use the same model (EN) but fit to different omics to obtain different estimators which are then stacked. A stacked regression model can be regarded as a special case of a two-layer neural network; its special construction provides for an easier interpretation.
We point out that the nested cross-validation is done where in each step of cross-validation, EN models for each omics set are first trained and then the stacked model is trained in the same step. After the stacked model is built, it is tested on the test patient who was left out in the outer cross-validation loop. Therefore, no leakage of information between training and test data occurred (see detailed flowchart in Figure S15). In addition, the manuscript is accompanied by data and source code to enable both independent reproduction of our results and evaluation of the machine-learning techniques used. Furthermore, tuning of two parameters of EN algorithm, λ,α shown in Equation 1 above was also performed without using the test set data: function cv.glmnet in R package glmnet was used for implementation on EN that internally performs a separate cross-validation using the training set to choose λ. The value of α was not optimized and it was set to α = 0.9.
One of our main goals was to identify a small subset of biomarkers that can predict preeclampsia with high accuracy and could thereby be used as a simple diagnostic test. For these reasons, performance of the refitted EN model for each omics set was next evaluated by treating the EN model as a model-selection procedure and performing a refitting step on the selected support (features) in the same cross-validation step.81 The refitted model is then tested on the test patient who was left out in the cross-validation loop (see detailed flowchart in Figure S16). It is known that -penalization used in EN performs excessive shrinkage of the large coefficients of the prediction model.82 Refitting can resolve this problem and obtain a model with a smaller number of features.
Finally, to investigate a possible gain from integration of available clinical and demographic characteristics, a prediction model that takes omics (from a specific multiomics set) and clinical and demographics variables as an input to an EN model was fit and evaluated.
To build the model over gestation, multiple (2–3) samples available from the same patient were treated as independent inputs to the algorithm. Once prediction scores were obtained for each sample, scores for a same patient were averaged into the final risk score for that patient. Performance was estimated using a leave-one-out cross-validation procedure, such that in each cross-validation step all measurements of one patient are left out from the training set and are used for testing. In addition, urine metabolome prediction models, with and without clinical/demographics variables, were validated on a separate validation cohort. This dataset was produced independently of the initial dataset and was only used once at the end. Specifically, a prediction model was trained and its parameters determined using the discovery cohort and was then tested only once on the validation cohort. The prediction accuracy of the model in terms of the area under receiver-operating characteristics curve was evaluated. t-SNE83 was used for network visualization in Figure 5. For network visualization in Figure S13, a k-nearest-neighbor graph (with ) was constructed between features. The network layout was computed with the LargeVis algorithm.84 The analysis was performed using R software (version 3.6.1).
Pathway enrichment analysis
Univariate analysis was performed to identify features with significant associations between each feature and the pregnancy outcome, both in early pregnancy (Wilcoxon signed-rank test) and over gestation (LME model). The Benjamini-Hochberg procedure was used to control the FDR.85 Metabolome pathway enrichment analysis on identified metabolites was performed using MetaboAnalyst.86 The hypergeometric test was used for over-representation analysis in MetaboAnalyst. Proteome pathway enrichment analysis was performed using GeneOntology87,88 and topology-based Gene Ontology scoring (topGo), an R software package. Circular Gene Ontology (CirGO) software for visualizing two-level hierarchically structured gene ontology terms89 was used to visualize proteome and transcriptome pathway enrichment.
Acknowledgments
This study was supported by the March of Dimes Prematurity Research Center at Stanford University School of Medicine, Stanford Maternal & Child Health Research Institute, the Christopher Hess Research Fund, the National Institutes of Health grants 1R01HL139844 and R35GM138353, Burroughs Wellcome Fund, the Alfred E. Mann Foundation, and grants from the Bill & Melinda Gates Foundation OPP1112382, OPP1113682, and INV037517. M.P.S. was supported by National Institutes of Health, grant 5RM1HG00773507. D.A.R. was supported by the Thomas C. and Joan M. Merigan Endowment at Stanford University and the Chan Zuckerberg Biohub Microbiome Initiative.
Author contributions
I.M. and N.A. designed machine-learning experiments and performed interpretation of results; I.M. performed computational analysis and wrote the paper; N.A. provided guidance and feedback and contributed to writing the paper; K.C. performed mass spectrometric metabolomic analysis and contributed to metabolomic pathway analysis and writing of the paper; M.N.M. performed cfRNA transcriptome analysis; I.A.S., D.F., and X.H. performed immunome analysis and contributed to writing the paper; A.T. performed part of the computational analysis; N.S. assisted with the computational analysis; R.J.W. supervised the study data collection and edited the manuscript; G.M.T. and M.E. performed MS metabolomic analysis; N.S., A.L.C., R.F., H.N., M.B., M.X., C.E., D.D.F., M.S.G., and A.C. helped with editing the manuscript; E.K.C. performed microbiome analysis; X.B.L., K.G.S., G.L.D., D.A.R., S.R.Q., and M.S.A. provided critical feedback on the study and edited the manuscript; D.A.R. was involved with the microbiome analysis; M.P.S. was involved in the MS metabolome analysis; V.D.W. wrote a part of the paper and contributed to interpretation of results; G.M.S. provided the study data, provided guidance and feedback, contributed to interpretation of results, and edited the manuscript; B.G. designed the immunome part of the analysis, performed interpretation of the results, and edited the manuscript; D.K.S. provided the study data, provided guidance, and supervised the research. All authors discussed results and contributed to the final manuscript.
Declaration of interests
S.R.Q. is a founder, consultant, and shareholder of Mirvie. M.N.M. is a shareholder of Mirvie. D.A.R. is a shareholder of Blue Willow Biologics, Cantata Bio, Evelo Biosciences, Karius, Proderm IQ, and Second Genome. D.A.R. is an advisor to Cantata Bio and Visby Medical. K.G.S. is paid advisor of Mission Biocapital and Infinant Health; equity holder and paid advisor of Avexegen; and equity holder of mProbe. M.P.S. is a co-founder and scientific advisor of Personalis, SensOmics, Qbio, January AI, Fodsel, Filtricine, Protos, RTHM, Iollo, Marble Therapeutics, and Mirvie, and a scientific advisor of Genapsys, Jupiter, Neuvivo, Swaza, and Mitrix.
Published: December 9, 2022
Footnotes
Supplemental information can be found online at https://doi.org/10.1016/j.patter.2022.100655.
Supplemental information
Data and code availability
Raw and processed untargeted metabolomics data were deposited to the Metabolomics Workbench with the following study IDs: ST001889 for plasma and ST001890 for urine. The Project DOI for these studies is https://doi.org/10.21228/M8WD84.
Our microbiome reads have been submitted to SRA. The BioProject accession is PRJNA752652.
https://dataview.ncbi.nlm.nih.gov/object/PRJNA752652?reviewer=aofjjbr2j556u6vckeolc1i2t2.
Transcriptome data are available at:
https://drive.google.com/file/d/12JXm30he5psipz6iCiIUtZxWsy-IwG08/view?usp=sharing.
The data that support the findings of this study have also been deposited on GitHub in the form of csv files at https://github.com/ivanam5/Multiomics_Preeclampsia. All data except for the clinical variables have been made available. Clinical variables cannot be shared due to the HIPAA constraints.
Code to reproduce main analyses in the manuscript is available on GitHub at https://github.com/ivanam5/Multiomics_Preeclampsia. R software is needed to run the code.
References
- 1.WHO. UNICEF. UNFPA. World Bank Group and the United Nations Population Division . World Health Organization; 2019. Maternal Mortality: Levels and Trends, 2000 to 2017. [Google Scholar]
- 2.Duley L. The global impact of pre-eclampsia and eclampsia. Semin. Perinatol. 2009;33:130–137. doi: 10.1053/j.semperi.2009.02.010. [DOI] [PubMed] [Google Scholar]
- 3.Jeyabalan A. Epidemiology of preeclampsia: impact of obesity. Nutr. Rev. 2013;71(Suppl 1):S18–S25. doi: 10.1111/nure.12055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Than N.G., Romero R., Tarca A.L., Kekesi K.A., Xu Y., Xu Z., Juhasz K., Bhatti G., Leavitt R.J., Gelencser Z., et al. Integrated systems biology approach identifies novel maternal and placental pathways of preeclampsia. Front. Immunol. 2018;9:1661. doi: 10.3389/fimmu.2018.01661. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Phipps E.A., Thadhani R., Benzing T., Karumanchi S.A. Pre-eclampsia: pathogenesis, novel diagnostics and therapies. Nat. Rev. Nephrol. 2019;15:275–289. doi: 10.1038/s41581-019-0119-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Chaiworapongsa T., Chaemsaithong P., Yeo L., Romero R. Pre-eclampsia part 1: current understanding of its pathophysiology. Nat. Rev. Nephrol. 2014;10:466–480. doi: 10.1038/nrneph.2014.102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Duckitt K., Harrington D. Risk factors for pre-eclampsia at antenatal booking: systematic review of controlled studies. BMJ. 2005;330:565. doi: 10.1136/bmj.38380.674340.E0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Tranquilli A.L., Brown M.A., Zeeman G.G., Dekker G., Sibai B.M. The definition of severe and early-onset preeclampsia. Statements from the international society for the study of hypertension in pregnancy (ISSHP) Pregnancy Hypertens. 2013;3:44–47. doi: 10.1016/j.preghy.2012.11.001. [DOI] [PubMed] [Google Scholar]
- 9.Ghaemi M.S., DiGiulio D.B., Contrepois K., Callahan B., Ngo T.T.M., Lee-McMullen B., Lehallier B., Robaczewska A., Mcilwain D., Rosenberg-Hasson Y., et al. Multiomics modeling of the immunome, transcriptome, microbiome, proteome and metabolome adaptations during human pregnancy. Bioinformatics. 2019;35:95–103. doi: 10.1093/bioinformatics/bty537. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Benny P.A., Alakwaa F.M., Schlueter R.J., Lassiter C.B., Garmire L.X. A review of omics approaches to study preeclampsia. Placenta. 2020;92:17–27. doi: 10.1016/j.placenta.2020.01.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Roberge S., Nicolaides K., Demers S., Hyett J., Chaillet N., Bujold E. The role of aspirin dose on the prevention of preeclampsia and fetal growth restriction: systematic review and meta-analysis. Am. J. Obstet. Gynecol. 2017;216:110–120.e6. doi: 10.1016/j.ajog.2016.09.076. [DOI] [PubMed] [Google Scholar]
- 12.American College of Obstetricians and Gynecologists. Task Force on Hypertension in Pregnancy Hypertension in pregnancy. Report of the American College of obstetricians and gynecologists’ task force on hypertension in pregnancy. Obstet. Gynecol. 2013;122:1122–1131. doi: 10.1097/01.AOG.0000437382.03963.88. [DOI] [PubMed] [Google Scholar]
- 13.Zou H., Hastie T. Regularization and variable selection via the elastic net. J. Roy. Stat. Soc. B. 2005;67:301–320. [Google Scholar]
- 14.Lam C., Lim K.-H., Kang D.-H., Karumanchi S.A. Uric acid and preeclampsia. Semin. Nephrol. 2005;25:56–60. doi: 10.1016/j.semnephrol.2004.09.009. [DOI] [PubMed] [Google Scholar]
- 15.Maynard S.E., Karumanchi S.A. Angiogenic factors and preeclampsia. Semin. Nephrol. 2011;31:33–46. doi: 10.1016/j.semnephrol.2010.10.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Taylor B.D., Ness R.B., Olsen J., Hougaard D.M., Skogstrand K., Roberts J.M., Haggerty C.L. Serum leptin measured in early pregnancy is higher in women with preeclampsia compared with normotensive pregnant women. Hypertension. 2015;65:594–599. doi: 10.1161/HYPERTENSIONAHA.114.03979. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Ma H.Y., Cu W., Sun Y.H., Chen X. MiRNA-203a-3p inhibits inflammatory response in preeclampsia through regulating IL24. Eur. Rev. Med. Pharmacol. Sci. 2020;24:5223–5230. doi: 10.26355/eurrev_202005_21304. [DOI] [PubMed] [Google Scholar]
- 18.Meinshausen N., Bühlmann P. Stability selection. J. Roy. Stat. Soc. B. 2010;72:417–473. [Google Scholar]
- 19.Redman C.W.G., Sargent I.L. Immunology of pre-eclampsia. Am. J. Reprod. Immunol. 2010;63:534–543. doi: 10.1111/j.1600-0897.2010.00831.x. [DOI] [PubMed] [Google Scholar]
- 20.Han X., Ghaemi M.S., Ando K., Peterson L.S., Ganio E.A., Tsai A.S., Gaudilliere D.K., Stelzer I.A., Einhaus J., Bertrand B., et al. Differential dynamics of the maternal immune system in healthy pregnancy and preeclampsia. Front. Immunol. 2019;10:1305. doi: 10.3389/fimmu.2019.01305. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Samolis S., Papastefanou I., Panagopoulos P., Galazios G., Kouskoukis A., Maroulis G. Relation between first trimester maternal serum leptin levels and body mass index in normotensive and pre-eclamptic pregnancies--role of leptin as a marker of pre-eclampsia: a prospective case-control study. Gynecol. Endocrinol. 2010;26:338–343. doi: 10.3109/09513590903511463. [DOI] [PubMed] [Google Scholar]
- 22.Hashimoto M., Shinozuka K., Bjur R.A., Westfall D.P., Hattori K., Masumura S. The effects of age on the release of adenine nucleosides and nucleotides from rat caudal artery. J. Physiol. (Lond.) 1995;489:841–848. doi: 10.1113/jphysiol.1995.sp021096. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Marić I., Tsur A., Aghaeepour N., Montanari A., Stevenson D.K., Shaw G.M., Winn V.D. Early prediction of preeclampsia via machine learning. Am. J. Obstet. Gynecol. MFM. 2020;2:100100. doi: 10.1016/j.ajogmf.2020.100100. [DOI] [PubMed] [Google Scholar]
- 24.Chatuphonprasert W., Jarukamjorn K., Ellinger I. Physiology and pathophysiology of steroid biosynthesis, transport and metabolism in the human placenta. Front. Pharmacol. 2018;9:1027. doi: 10.3389/fphar.2018.01027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Liang L., Rasmussen M.H., Piening B., Shen X., Chen S., Röst H., Snyder J.K., Tibshirani R., Skotte L., Lee N.C., et al. Metabolic dynamics and prediction of gestational age and time to delivery in pregnant women. Cell. 2020;181:1680–1692.e15. doi: 10.1016/j.cell.2020.05.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Handelman S.K., Romero R., Tarca A.L., Pacora P., Ingram B., Maymon E., Chaiworapongsa T., Hassan S.S., Erez O. The plasma metabolome of women in early pregnancy differs from that of non-pregnant women. PLoS One. 2019;14:e0224682. doi: 10.1371/journal.pone.0224682. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Nilsen R.M., Bjørke-Monsen A.L., Midttun O., Nygård O., Pedersen E.R., Ulvik A., Magnus P., Gjessing H.K., Vollset S.E., Ueland P.M. Maternal tryptophan and kynurenine pathway metabolites and risk of preeclampsia. Obstet. Gynecol. 2012;119:1243–1250. doi: 10.1097/AOG.0b013e318255004e. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Luppi P., Tse H., Lain K.Y., Markovic N., Piganelli J.D., DeLoia J.A. Preeclampsia activates circulating immune cells with engagement of the NF-kappaB pathway. Am. J. Reprod. Immunol. 2006;56:135–144. doi: 10.1111/j.1600-0897.2006.00386.x. [DOI] [PubMed] [Google Scholar]
- 29.Rumpf S., Jentsch S. Functional division of substrate processing cofactors of the ubiquitin-selective Cdc48 chaperone. Mol. Cell. 2006;21:261–269. doi: 10.1016/j.molcel.2005.12.014. [DOI] [PubMed] [Google Scholar]
- 30.Trifonova E.A., Gabidulina T.V., Ershov N.I., Serebrova V.N., Vorozhishcheva A.Y., Stepanov V.A. Analysis of the placental tissue transcriptome of normal and preeclampsia complicated pregnancies. Acta Naturae. 2014;6:71–83. [PMC free article] [PubMed] [Google Scholar]
- 31.Massobrio M., Alexander B.T., Bennett W.A., Khalil R.A. Arachidonic acid derivatives in the pathophysiology of pregnancy-induced hypertension. Clin. Exp. Hypertens. - Part B Hypertens. Pregnancy. 1988;7:43–55. [Google Scholar]
- 32.Badawy A.A.-B. Tryptophan metabolism, disposition and utilization in pregnancy. Biosci. Rep. 2015;35 doi: 10.1042/BSR20150197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Tarca A.L., Romero R., Benshalom-Tirosh N., Than N.G., Gudicha D.W., Done B., Pacora P., Chaiworapongsa T., Panaitescu B., Tirosh D., et al. The prediction of early preeclampsia: results from a longitudinal proteomics study. PLoS One. 2019;14:e0217273. doi: 10.1371/journal.pone.0217273. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Austdal M., Tangerås L.H., Skråstad R.B., Salvesen K., Austgulen R., Iversen A.C., Bathen T.F. First trimester urine and serum metabolomics for prediction of preeclampsia and gestational hypertension: a prospective screening study. Int. J. Mol. Sci. 2015;16:21520–21538. doi: 10.3390/ijms160921520. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Docheva N., Romero R., Chaemsaithong P., Tarca A.L., Bhatti G., Pacora P., Panaitescu B., Chaiyasit N., Chaiworapongsa T., Maymon E., et al. The profiles of soluble adhesion molecules in the “great obstetrical syndromes”. J. Matern. Fetal Neonatal Med. 2019;32:2113–2136. doi: 10.1080/14767058.2018.1427058. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Gotsch F., Romero R., Friel L., Kusanovic J.P., Espinoza J., Erez O., Than N.G., Mittal P., Edwin S., Yoon B.H., et al. CXCL10/IP-10: a missing link between inflammation and anti-angiogenesis in preeclampsia? J. Matern. Fetal Neonatal Med. 2007;20:777–792. doi: 10.1080/14767050701483298. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Chen J., Yue C., Xu J., Zhan Y., Zhao H., Li Y., Ye Y. Downregulation of receptor tyrosine kinase-like orphan receptor 1 in preeclampsia placenta inhibits human trophoblast cell proliferation, migration, and invasion by PI3K/AKT/mTOR pathway accommodation. Placenta. 2019;82:17–24. doi: 10.1016/j.placenta.2019.05.002. [DOI] [PubMed] [Google Scholar]
- 38.Wang N., Li R., Xue M. Potential regulatory network in the PSG10P/miR-19a-3p/IL1RAP pathway is possibly involved in preeclampsia pathogenesis. J. Cell Mol. Med. 2019;23:852–864. doi: 10.1111/jcmm.13985. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Zhang Y., Cao L., Jia J., Ye L., Wang Y., Zhou B., Zhou R. CircHIPK3 is decreased in preeclampsia and affects migration, invasion, proliferation, and tube formation of human trophoblast cells. Placenta. 2019;85:1–8. doi: 10.1016/j.placenta.2019.07.010. [DOI] [PubMed] [Google Scholar]
- 40.Løset M., Mundal S.B., Johnson M.P., Fenstad M.H., Freed K.A., Lian I.A., Eide I.P., Bjørge L., Blangero J., Moses E.K., Austgulen R. A transcriptional profile of the decidua in preeclampsia. Am. J. Obstet. Gynecol. 2011;204 doi: 10.1016/j.ajog.2010.08.043. 84.e1-27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Maynard S.E., Min J.Y., Merchan J., Lim K.H., Li J., Mondal S., Libermann T.A., Morgan J.P., Sellke F.W., Stillman I.E., et al. Excess placental soluble fms-like tyrosine kinase 1 (sFlt1) may contribute to endothelial dysfunction, hypertension, and proteinuria in preeclampsia. J. Clin. Invest. 2003;111:649–658. doi: 10.1172/JCI17189. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Rath G., Tripathi R. Angiogenic balance and diagnosis of pre-eclampsia: selecting the right VEGF receptor. J. Hum. Hypertens. 2012;26:207–210. doi: 10.1038/jhh.2011.89. [DOI] [PubMed] [Google Scholar]
- 43.Verlohren S., Herraiz I., Lapaire O., Schlembach D., Zeisler H., Calda P., Sabria J., Markfeld-Erol F., Galindo A., Schoofs K., et al. New gestational phase-specific cutoff values for the use of the soluble fms-like tyrosine kinase-1/placental growth factor ratio as a diagnostic test for preeclampsia. Hypertension. 2014;63:346–352. doi: 10.1161/HYPERTENSIONAHA.113.01787. [DOI] [PubMed] [Google Scholar]
- 44.Pérez-Pérez A., Toro A., Vilariño-García T., Maymó J., Guadix P., Dueñas J.L., Fernández-Sánchez M., Varone C., Sánchez-Margalet V. Leptin action in normal and pathological pregnancies. J. Cell Mol. Med. 2018;22:716–727. doi: 10.1111/jcmm.13369. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Naylor C., Petri W.A. Leptin regulation of immune responses. Trends Mol. Med. 2016;22:88–98. doi: 10.1016/j.molmed.2015.12.001. [DOI] [PubMed] [Google Scholar]
- 46.Abella V., Scotece M., Conde J., Pino J., Gonzalez-Gay M.A., Gómez-Reino J.J., Mera A., Lago F., Gómez R., Gualillo O., et al. Leptin in the interplay of inflammation, metabolism and immune system disorders. Nat. Rev. Rheumatol. 2017;13:100–109. doi: 10.1038/nrrheum.2016.209. [DOI] [PubMed] [Google Scholar]
- 47.Martín-Romero C., Santos-Alvarez J., Goberna R., Sánchez-Margalet V. Human leptin enhances activation and proliferation of human circulating T lymphocytes. Cell. Immunol. 2000;199:15–24. doi: 10.1006/cimm.1999.1594. [DOI] [PubMed] [Google Scholar]
- 48.Ivetic A., Hoskins Green H.L., Hart S. J. L-selectin: a major regulator of leukocyte adhesion, migration and signaling. Front. Immunol. 2019;10:1068. doi: 10.3389/fimmu.2019.01068. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Seidelin J.B., Nielsen O.H., Strøm J. Soluble L-selectin levels predict survival in sepsis. Intensive Care Med. 2002;28:1613–1618. doi: 10.1007/s00134-002-1501-5. [DOI] [PubMed] [Google Scholar]
- 50.Chavarría M.E., Lara-González L., García-Paleta Y., Vital-Reyes V.S., Reyes A. Adhesion molecules changes at 20 gestation weeks in pregnancies complicated by preeclampsia. Eur. J. Obstet. Gynecol. Reprod. Biol. 2008;137:157–164. doi: 10.1016/j.ejogrb.2007.06.014. [DOI] [PubMed] [Google Scholar]
- 51.Sabatier F., Bretelle F., D'ercole C., Boubli L., Sampol J., Dignat-George F. Neutrophil activation in preeclampsia and isolated intrauterine growth restriction. Am. J. Obstet. Gynecol. 2000;183:1558–1563. doi: 10.1067/mob.2000.108082. [DOI] [PubMed] [Google Scholar]
- 52.Rainer T.H. L-selectin in health and disease. Resuscitation. 2002;52:127–141. doi: 10.1016/s0300-9572(01)00444-0. [DOI] [PubMed] [Google Scholar]
- 53.Sonnweber T., Pizzini A., Nairz M., Weiss G., Tancevski I. Arachidonic acid metabolites in cardiovascular and metabolic diseases. Int. J. Mol. Sci. 2018;19:3285. doi: 10.3390/ijms19113285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Walsh S.W. Eicosanoids in preeclampsia. Prostaglandins Leukot. Essent. Fatty Acids. 2004;70:223–232. doi: 10.1016/j.plefa.2003.04.010. [DOI] [PubMed] [Google Scholar]
- 55.Mills J.L., DerSimonian R., Raymond E., Morrow J.D., Roberts L.J., 2nd, Clemens J.D., Hauth J.C., Catalano P., Sibai B., Curet L.B., Levine R.J. Prostacyclin and thromboxane changes predating clinical onset of preeclampsia: a multicenter prospective study. JAMA. 1999;282:356–362. doi: 10.1001/jama.282.4.356. [DOI] [PubMed] [Google Scholar]
- 56.Cui Y., Zhu B., Zheng F. Low-dose aspirin at ≤16 weeks of gestation for preventing preeclampsia and its maternal and neonatal adverse outcomes: a systematic review and meta-analysis. Exp. Ther. Med. 2018;15:4361–4369. doi: 10.3892/etm.2018.5972. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Chang R.-Q., Li D.-J., Li M.-Q. The role of indoleamine-2,3-dioxygenase in normal and pathological pregnancies. Am. J. Reprod. Immunol. 2018;79:e12786. doi: 10.1111/aji.12786. [DOI] [PubMed] [Google Scholar]
- 58.Munn D.H., Zhou M., Attwood J.T., Bondarev I., Conway S.J., Marshall B., Brown C., Mellor A.L. Prevention of allogeneic fetal rejection by tryptophan catabolism. Science. 1998;281:1191–1193. doi: 10.1126/science.281.5380.1191. [DOI] [PubMed] [Google Scholar]
- 59.DiNatale B.C., Murray I.A., Schroeder J.C., Flaveny C.A., Lahoti T.S., Laurenzana E.M., Omiecinski C.J., Perdew G.H. Kynurenic acid is a potent endogenous aryl hydrocarbon receptor ligand that synergistically induces interleukin-6 in the presence of inflammatory signaling. Toxicol. Sci. 2010;115:89–97. doi: 10.1093/toxsci/kfq024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Nguyen N.T., Kimura A., Nakahama T., Chinen I., Masuda K., Nohara K., Fujii-Kuriyama Y., Kishimoto T. Aryl hydrocarbon receptor negatively regulates dendritic cell immunogenicity via a kynurenine-dependent mechanism. Proc. Natl. Acad. Sci. USA. 2010;107:19961–19966. doi: 10.1073/pnas.1014465107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Mezrich J.D., Fechner J.H., Zhang X., Johnson B.P., Burlingham W.J., Bradfield C.A. An interaction between kynurenine and the aryl hydrocarbon receptor can generate regulatory T cells. J. Immunol. 2010;185:3190–3198. doi: 10.4049/jimmunol.0903670. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Jaronen M., Quintana F.J. Immunological relevance of the coevolution of Ido1 and AHR. Front. Immunol. 2014;5:521. doi: 10.3389/fimmu.2014.00521. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Fazio F., Lionetto L., Curto M., Iacovelli L., Copeland C.S., Neale S.A., Bruno V., Battaglia G., Salt T.E., Nicoletti F. Cinnabarinic acid and xanthurenic acid: two kynurenine metabolites that interact with metabotropic glutamate receptors. Neuropharmacology. 2017;112:365–372. doi: 10.1016/j.neuropharm.2016.06.020. [DOI] [PubMed] [Google Scholar]
- 64.Santillan M.K., Pelham C.J., Ketsawatsomkron P., Santillan D.A., Davis D.R., Devor E.J., Gibson-Corley K.N., Scroggins S.M., Grobe J.L., Yang B., et al. Pregnant mice lacking indoleamine 2,3-dioxygenase exhibit preeclampsia phenotypes. Physiol. Rep. 2015;3:e12257. doi: 10.14814/phy2.12257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Wright D., Syngelaki A., Akolekar R., Poon L.C., Nicolaides K.H. Competing risks model in screening for preeclampsia by maternal characteristics and medical history. Am. J. Obstet. Gynecol. 2015;213:62.e1–62.e10. doi: 10.1016/j.ajog.2015.02.018. [DOI] [PubMed] [Google Scholar]
- 66.Odibo A.O., Zhong Y., Goetzinger K.R., Odibo L., Bick J.L., Bower C.R., Nelson D.M. First-trimester placental protein 13, PAPP-A, uterine artery Doppler and maternal characteristics in the prediction of pre-eclampsia. Placenta. 2011;32:598–602. doi: 10.1016/j.placenta.2011.05.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Yu C.K.H., Smith G.C., Papageorghiou A.T., Cacho A.M., Nicolaides K.H. An integrated model for the prediction of preeclampsia using maternal factors and uterine artery Doppler velocimetry in unselected low-risk women. Am. J. Obstet. Gynecol. 2005;193:429–436. doi: 10.1016/j.ajog.2004.12.014. [DOI] [PubMed] [Google Scholar]
- 68.Audibert F., Boucoiran I., An N., Aleksandrov N., Delvin E., Bujold E., Rey E. Screening for preeclampsia using first-trimester serum markers and uterine artery Doppler in nulliparous women. Am. J. Obstet. Gynecol. 2010;203:383.e1–383.e8. doi: 10.1016/j.ajog.2010.06.014. [DOI] [PubMed] [Google Scholar]
- 69.Wright D., Wright A., Nicolaides K.H. The competing risk approach for prediction of preeclampsia. Am. J. Obstet. Gynecol. 2019;223:12–23.e7. doi: 10.1016/j.ajog.2019.11.1247. [DOI] [PubMed] [Google Scholar]
- 70.North R.A., McCowan L.M., Dekker G.A., Poston L., Chan E.H., Stewart A.W., Black M.A., Taylor R.S., Walker J.J., Baker P.N., Kenny L.C. Clinical risk prediction for pre-eclampsia in nulliparous women: development of model in international prospective cohort. BMJ. 2011;342:d1875. doi: 10.1136/bmj.d1875. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Parra-Cordero M., Rodrigo R., Barja P., Bosco C., Rencoret G., Sepúlveda-Martinez A., Quezada S. Prediction of early and late pre-eclampsia from maternal characteristics, uterine artery Doppler and markers of vasculogenesis during first trimester of pregnancy. Ultrasound Obstet. Gynecol. 2013;41:538–544. doi: 10.1002/uog.12264. [DOI] [PubMed] [Google Scholar]
- 72.Kelly R.S., Croteau-Chonka D.C., Dahlin A., Mirzakhani H., Wu A.C., Wan E.S., McGeachie M.J., Qiu W., Sordillo J.E., Al-Garawi A., et al. Integration of metabolomic and transcriptomic networks in pregnant women reveals biological pathways and predictive signatures associated with preeclampsia. Metabolomics. 2017;13 doi: 10.1007/s11306-016-1149-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.De Kat A.C., Hirst J., Woodward M., Kennedy S., Peters S.A. Prediction models for preeclampsia: a systematic review. Pregnancy Hypertens. 2019;16:48–66. doi: 10.1016/j.preghy.2019.03.005. [DOI] [PubMed] [Google Scholar]
- 74.Stamilio D.M., Sehdev H.M., Morgan M.A., Propert K., Macones G.A. Can antenatal clinical and biochemical markers predict the development of severe preeclampsia? Am. J. Obstet. Gynecol. 2000;182:589–594. doi: 10.1067/mob.2000.103890. [DOI] [PubMed] [Google Scholar]
- 75.Chaiworapongsa T., Romero R., Korzeniewski S.J., Kusanovic J.P., Soto E., Lam J., Dong Z., Than N.G., Yeo L., Hernandez-Andrade E., et al. Maternal plasma concentrations of angiogenic/antiangiogenic factors in the third trimester of pregnancy to identify the patient at risk for stillbirth at or near term and severe late preeclampsia. Am. J. Obstet. Gynecol. 2013;208:287.e1–287.e15. doi: 10.1016/j.ajog.2013.01.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Recht B., Roelofs R., Schmidt L., Shankar V. Do ImageNet classifiers generalize to ImageNet? arXiv. 2019 doi: 10.48550/arXiv.1902.10811. Preprint at. [DOI] [Google Scholar]
- 77.Aghaeepour N., Ganio E.A., Mcilwain D., Tsai A.S., Tingle M., Van Gassen S., Gaudilliere D.K., Baca Q., McNeil L., Okada R., et al. An immune clock of human pregnancy. Sci. Immunol. 2017;2:eaan2946. doi: 10.1126/sciimmunol.aan2946. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Breiman L. Stacked regressions. Mach. Learn. 1996;24:49–64. [Google Scholar]
- 79.Wolpert D.H. Stacked generalization. Neural Network. 1992;5:241–259. [Google Scholar]
- 80.Stone M. Cross-validatory choice and assessment of statistical predictions. J. Roy. Stat. Soc. B. 1974;36:111–133. [Google Scholar]
- 81.Chzhen E., Hebiri M., Salmon J. On Lasso refitting strategies. Bernoulli. 2019;25:3175–3200. [Google Scholar]
- 82.Hastie T., Tibshirani R., Wainwright M. Statistical Learning with Sparsity: The Lasso and Generalizations. Chapman and Hall/CRC; 2015. pp. 155–182. [DOI] [Google Scholar]
- 83.van der Maaten L. Visualizing high-dimensional data using t-SNE. J. Mach. Learn. Res. 2008 [Google Scholar]
- 84.Tang J., Liu J., Zhang M., Mei Q. Proceedings of the 25th International Conference on World Wide Web - WWW ’16. ACM Press; 2016. Visualizing large-scale and high-dimensional data; pp. 287–297. [DOI] [Google Scholar]
- 85.Benjamini Y., Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. Roy. Stat. Soc. B. 1995;57:289–300. [Google Scholar]
- 86.Pang Z., Chong J., Li S., Xia J. Metaboanalystr 3.0: toward an optimized workflow for global metabolomics. Metabolites. 2020;10:186. doi: 10.3390/metabo10050186. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Ashburner M., Ball C.A., Blake J.A., Botstein D., Butler H., Cherry J.M., Davis A.P., Dolinski K., Dwight S.S., Eppig J.T., et al. Gene Ontology: tool for the unification of biology. Nat. Genet. 2000;25:25–29. doi: 10.1038/75556. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.The Gene Ontology Consortium The gene ontology resource: 20 years and still GOing strong. Nucleic Acids Res. 2019;47:D330–D338. doi: 10.1093/nar/gky1055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Kuznetsova I., Lugmayr A., Siira S.J., Rackham O., Filipovska A. CirGO: an alternative circular way of visualising gene ontology terms. BMC Bioinf. 2019;20:84. doi: 10.1186/s12859-019-2671-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Raw and processed untargeted metabolomics data were deposited to the Metabolomics Workbench with the following study IDs: ST001889 for plasma and ST001890 for urine. The Project DOI for these studies is https://doi.org/10.21228/M8WD84.
Our microbiome reads have been submitted to SRA. The BioProject accession is PRJNA752652.
https://dataview.ncbi.nlm.nih.gov/object/PRJNA752652?reviewer=aofjjbr2j556u6vckeolc1i2t2.
Transcriptome data are available at:
https://drive.google.com/file/d/12JXm30he5psipz6iCiIUtZxWsy-IwG08/view?usp=sharing.
The data that support the findings of this study have also been deposited on GitHub in the form of csv files at https://github.com/ivanam5/Multiomics_Preeclampsia. All data except for the clinical variables have been made available. Clinical variables cannot be shared due to the HIPAA constraints.
Code to reproduce main analyses in the manuscript is available on GitHub at https://github.com/ivanam5/Multiomics_Preeclampsia. R software is needed to run the code.