Abstract
Right ventricular (RV) function is critical to prognosis in all forms of pulmonary hypertension. Here we perform molecular phenotyping of RV remodeling by transcriptome analysis of RV tissue obtained from 40 individuals, and two animal models of RV dysfunction of both sexes. Our unsupervised clustering analysis identified ‘early’ and ‘late’ subgroups within compensated and decompensated states, characterized by the expression of distinct signaling pathways, while fatty acid metabolism and estrogen response appeared to underlie sex-specific differences in RV adaptation. The circulating levels of several extracellular matrix proteins deregulated in decompensated RV subgroups were assessed in two independent cohorts of individuals with pulmonary arterial hypertension, revealing that NID1, C1QTNF1 and CRTAC1 predicted the development of a maladaptive RV state, as defined by magnetic resonance imaging parameters, and were associated with worse clinical outcomes. Our study provides a resource for subphenotyping RV states, identifying state-specific biomarkers, and potential therapeutic targets for RV dysfunction.
Subject terms: Cardiac hypertrophy, Predictive markers
Khassafi et al. identify molecular subgroups of right ventricular (RV) dysfunction using transcriptomic analysis of RV samples from individuals with compensated RV hypertrophy or decompensated RV failure and two rat models of RV dysfunction. Several extracellular matrix genes found to be deregulated in decompensated RV subgroups were validated at the protein level in two independent cohorts of individuals with pulmonary arterial hypertension, revealing their predictive biomarker potential in maladaptive RV development.
Main
RV dysfunction can occur as a consequence of a variety of underlying clinical conditions that cause pressure overload, volume overload or intrinsic cardiomyopathies1. A major pathological mechanism of RV failure with normal left heart function is increased afterload due to pulmonary hypertension (PH)2. As progressive RV dysfunction is associated with increased morbidity and mortality, accurate assessment and characterization of RV function and molecular phenotype in patients with PH is crucial in diagnosis and management of the disease1.
The normal RV structure consists of a compliant thin wall that accommodates substantial changes in volume (preload), and therefore an intrinsic capacity to adapt to progressive increases in afterload caused by gradual elevation in pulmonary arterial pressures as is typically observed in PH3. Persistently elevated afterload caused by pulmonary vascular remodeling activates adaptive RV alterations. However, the critical factors that mediate deterioration from adaptive to maladaptive hypertrophy and RV failure remain elusive. This is particularly important when adaptive remodeling fails to maintain the stroke volume (SV), which usually causes RV dilatation and failure4.
A deeper understanding of the molecular mechanisms governing adaptive responses and their transition to RV failure has recently been explored, by several omics analyses in human and animal models of RV dysfunction, which revealed molecular mechanisms associated with adverse RV remodeling5–14. Of note, Park et al. compared the RV transcriptome from two rat models of PH and identified the common signature of RV remodeling between monocrotaline (MCT) and Sugen5416/hypoxia (SuHx) rats and validated some of the epithelial-to-mesenchymal transition (EMT)-associated target genes in human RV tissue5. In addition, Kobayashi et al. investigated transcriptome and metabolome changes in SuHx rat models after chrysin treatment compared with control RV samples6. Moreover, several proteomic profiling studies of RV in various PH animal models have been generated7–10. Hindmarch et al. performed an integrative transcriptome and proteome analysis of the MCT rat model, in which they identified 410 proteins associated with the RV failure phenotype7. In addition, a human RV proteome was recently published by Boucherat et al., which showed the altered protein profile associated with RV remodeling in pulmonary arterial hypertension (PAH) by an integrative transcriptome and proteomics analysis and also validated a panel of proteins in animal models of PH11. Recent transcriptomic and proteomic studies of plasma from various human PAH cohorts have been performed12–14, in which various proteins in the participants’ blood were identified and their correlation with prognosis or survival in PAH has been demonstrated.
However, downstream mechanisms that drive adaptive remodeling to RV failure have not yet been investigated. The progression from initial stages into the uncoupled failing stage is a continuum in which the remodeling events overlap and smoothly transit throughout the stages15. This leads to the assumption that there may be different phases of adaptive and maladaptive remodeling. On the other hand, RV dysfunction needs clinically different treatments as the RV is supported differently depending on the stage of remodeling. Hence, it is important to develop accurate diagnostic tools that can assess RV disease severity and detect early maladaptive changes in RV function and structure16–19.
In this Article, we aim to (i) perform precise molecular phenotyping of RV remodeling by identifying subgroups within compensated and decompensated states of RV function in individuals with PH and animal models of RV dysfunction, (ii) investigate sex-specific transcriptional differences in RV remodeling, and (iii) evaluate whether a number of potential RV-relevant biomarkers could distinguish early- and late-decompensated states in individuals with PAH, and their impact on clinical features and prognosis.
Results
Transcriptomic analysis of the right ventricle in two rat models of pulmonary hypertension
To investigate the transcriptional comparison between compensated and decompensated RV tissues affected by increased afterload in PH, we first classified the pathophysiology of RV function into normal, compensated hypertrophy and decompensated failure, based on RV function and hemodynamic parameters, in two rodent models of PH and RV dysfunction, that is, in models induced by MCT (Extended Data Fig. 1a–f and Methods) and pulmonary artery banding (PAB) in rats (Extended Data Fig. 2a–f and Methods).
RNA sequencing (RNA-seq) was performed on RV tissues obtained from 30 rats, grouped into control, compensated and decompensated RV samples from MCT-induced PH rats (Fig. 1a). Principle component (PC) analysis demonstrated a clear separation between the three groups through PC1 (Fig. 1b). Analysis of differentially expressed genes (DEGs) revealed substantial transcriptional changes in compensated (1,383 genes) and decompensated (2,991 genes) RV samples, compared with control RV samples, as well as 1,159 genes differentially expressed between decompensated and compensated RV samples (Fig. 1c–e and Supplementary Data 1). The correlation between the samples based on their gene expression profile and top 50 significant DEGs for each pair of comparison shown in Supplementary Fig. 1. Pathway enrichment analysis revealed that the DEGs in compensated RV samples, compared to controls, were associated with the cell cycle, DNA replication initiation, response to interleukin-1 and cytokine-mediated signaling pathways, while genes associated with extracellular matrix (ECM) organization, the immune responses, cardiac conduction and different metabolic processes were differentially regulated in decompensated RV samples (Fig. 1f, g). Regarding the transition of adaptive RV to maladaptive RV failure, genes associated with response to hypoxia, ECM receptor interactions and cell adhesion were notably dysregulated in decompensated RV samples, compared with compensated RV samples (Fig. 1h and Supplementary Fig. 2).
Furthermore, RNA-seq analysis of 15 RV tissue samples obtained from PAB rats, including 5 samples in each control, compensated and decompensated groups, was performed (Fig. 2a). PC analysis separated samples from control, compensated and decompensated PAB RV samples while indicating a large heterogeneity in decompensated RV samples (Fig. 2b and Supplementary Fig. 3a). DEG analysis revealed dysregulation of 2,511 and 1,875 genes in compensated and decompensated RV groups respectively, compared with control, while lower DEGs (445) between compensated and decompensated RV samples suggest less distinct profiles of decompensation in PAB rats compared with MCT-treated rats (Fig. 2c–e and Supplementary Data 2). Correlation between the samples, the top 50 significant DEGs and enriched GO and KEGG pathways in all three contrasts are provided in Supplementary Figs. 3 and 4. Similarly to the MCT model, the enrichment analysis of DEGs of compensated RV samples revealed cell cycle, DNA replication initiation and increased response to interleukin-1 and interferon-γ. However, downregulation of a few other pathways such as amino acids, pyruvate and propanoate metabolism was also distinctly regulated in PAB-associated compensated RV samples (Fig. 2f). In contrast, ECM reorganization, nuclear factor (NF)-kB signaling pathway, various immune-related cytokines such as tumor necrosis factor (TNF), along with hypoxia and leukocyte recruitment were significantly increased in the the PAB rat decompensated phase, which was very similar to the dysregulated pattern observed in the MCT rat decompensated RV samples (Fig. 2g and Supplementary Data 2). DEGs in decompensated versus compensated RV samples in PAB rats were mostly associated with upregulation of the AMPK signaling pathway, cAMP catabolism and the PI3K–Akt signaling pathway, and downregulation of mitotic proliferation (Fig. 2h).
Transcriptomic analysis of human adaptive and maladaptive right ventricle
To investigate the underlying molecular signature of RV dysfunction in humans, RNA-seq was performed on 40 human RV tissues that were classified, by hemodynamic assessment and clinical symptoms20, into: control, compensated RV and decompensated RV states (Fig. 3a and Methods). A batch correction method was applied before any downstream analysis as the human RV samples were obtained in two different time points. PC analysis showed that RV samples were separated mostly based on the different RV states rather than different batches of sampling (Fig. 3b and Supplementary Fig. 5a). However, a substantial heterogeneity existed, especially within the samples. Differential expression analysis showed dysregulation of 2,027 genes in decompensated RV samples, compared with control RV samples, and 260 DEGs between decompensated and compensated RV samples. We did not detect any significant DEGs associated with compensated RV samples compared to controls, which could be due to the molecular heterogeneity within the normal and hypertrophic RV samples or the functional similarities between them. Enrichment analysis showed that DEGs in the decompensated RV samples were significantly associated with increased ECM and receptor interaction and cell adhesion as well as activation of cytokines such as TNF. In contrast, pathways associated with the tricarboxylic acid (TCA) cycle and cardiac muscle contraction were significantly downregulated in decompensated RV samples, while they were upregulated in the compensated RV state (Fig. 3c–f).
Finally, we looked for similarities between the DEG results of all datasets. As expected, we observed higher similarity between two rat models, in terms of decompensated versus control RV (1,286 genes), while in terms of similarity with human, more common genes were regulated between MCT-induced decompensated and human PAH decompensated RV samples as compared to PAB (522 versus 317 genes; Fig. 3g). Both gene sets were significantly enriched for ECM and integral membrane proteins; however, TNF regulation of cell death seemed to be regulated in MCT-induced RV samples, similar to human decompensated RV samples, while not in the PAB model, which suggests more molecular similarity of the MCT model to the human RV remodeling. Further, focusing on 259 genes that were commonly regulated in decompensated RV samples in all three datasets (Fig. 3g), we found that ECM along with the response to several cytokines as well as vasoconstriction and vasodilation mechanisms were commonly upregulated in all three profiles (Fig. 3h).
Sex differences influence the molecular signature of right ventricular remodeling
Furthermore, to check for the potential differences in terms of transcriptomic signature between the female and male participants, as sex differences are shown to have an impact on RV adaptation and function21,22, we investigated whether (i) the inclusion of males and females has an impact on the downstream analysis, and (ii) there are transcriptome differences between male and female participants separately. Although there were more females in the diseased group compared to controls (P value = 0.04), sex did not significantly affect the downstream analysis between RV subgroups (Fig. 4a).
Further comparing the molecular differences within the female group, we did not observe significant alteration in compensated RV, while we observed greater changes in decompensated RVs compared with control RV group, such as downregulation of oxidative phosphorylation, upregulation of early estrogen responses, endothelial cell dysfunction, EMT and ECM remodeling pathways, similar to the DEG analysis of the entire group. When we compared similar pairs in male samples, taking the P value < 0.05 to determine the DEGs, we found 739 genes dysregulated in the compensated RV group and 38 genes dysregulated in the male decompensated RV group. Enrichment analysis revealed that ECM remodeling, EMT, leukocyte migration and activation and apoptotic regulation occurred in the male compensated RV group, while they were similarly active along with enhanced activation of apoptotic pathways in the decompensated state. This suggests that male participants may develop maladaptive hypertrophy earlier than females (Fig. 4b). Furthermore, upregulation of estrogen-mediated signaling pathways, in female decompensated RVs compared with normal and compensated RVs, suggests that sex hormones influence RV remodeling in female participants (Fig. 4c), and females might take a different route to develop a decompensated state or maintain the compensated state of the RV longer.
We then performed RNA-seq of both male and female RV data from a rat model of MCT-induced PH to see if this also reflects in animal model. Hemodynamics analysis showed higher deterioration of RV function in male MCT rats compared with their female counterparts especially in terms of cardiac output (Fig. 4d). When comparing the compensated RV versus control group in female and male rats separately, we found similar signaling pathways were upregulated such as the mitotic cell cycle. In contrast, comparison of the decompensated RV samples showed differences between female and male groups, despite the similar ECM remodeling. Comparative analysis showed that in male decompensated RV samples, calcium signaling, amino acid metabolism and histone demethylation were upregulated (Fig. 4e), whereas in female decompensated RVs, fatty acid ß-oxidation and various inflammatory responses were exclusively enriched (Fig. 4f). A summary of altered pathways in male and female MCT-induced rats relative to their RV dysfunction suggests that the delayed maladaptive RV remodeling happens in female rats compared to males, similarly to what we observed in human RV hypertrophy and dilation (Fig. 4g).
Subclassification of decompensated right ventricle in MCT-induced rats
Although RV transcriptome data from MCT rats were classified into three groups based on hemodynamics, unsupervised clustering (Methods) allowed us to find four molecular clusters in the RV transcriptome of MCT rats (Fig. 5a,b). Using k-means clustering, we identified a distinct subgroup of decompensated RV samples that had a similar molecular phenotype to the compensated group, and a distinguished transcriptional signature compared with the rest of decompensated RV samples. We subsequently relabeled these three samples as ‘early-decompensated’ (Fig. 5b) and the other six samples as ‘late-decompensated’ RV samples. DEG analysis of early and late-decompensated subgroups revealed 804 significant DEGs that defined the transition signatures from early- to late-decompensated RV function in the rat model of MCT-induced PH (Fig. 5c).
The transition from compensated RV to early-decompensated RV was associated with a reduction in triglyceride metabolism, fatty acid β-oxidation and cell cycle regulation (Fig. 5d and Supplementary Fig. 6a,b), while a significant increase in NF-κB signaling, ECM and collagen reorganization, as well as active immune cell recruitment and hypoxia response regulation were markedly associated with late-decompensated RV (Fig. 5d and Supplementary Fig. 6c,d). Additional glucose and amino acid metabolism were also observed specifically in late-decompensated RV, which confirms the hypoxic metabolic shift occurs in the transition from early to late stages of RV dysfunction. Figure 5e shows the cumulative differentially regulated genes (n = 576) associated with MCT-induced RV remodeling along with their linked pathways.
Based on the transcriptomic analysis confirmation of the transcriptional landscape between compensated and late-decompensated RV states, the hemodynamic data from MCT-induced PH rats were reassessed to validate the early state of decompensation. The transcriptomic status of RV was closer to the compensated RV than to the decompensated RV, which was also consistent with the significant increase in right ventricle systolic pressure (RVSP), right ventricle end-diastolic pressure (RVEDP), total pulmonary resistance and RV hypertrophy that distinguished the early and late states of decompensation; however, the functional shift from the compensated to the early-decompensated RV states was mirrored by a significant reduction in cardiac output (CO) and stroke volume (SV) (Fig. 5f–k). Thus, the combined transcriptomic and hemodynamic data demonstrates the presence of an intermediate early-decompensated phase between the compensated and late-decompensated states in the progressive remodeling of RV in MCT-induced PH.
Molecular subclassification of human compensated and decompensated right ventricle
We further investigated the underlying molecular heterogeneity within the human RV samples by applying a similar unsupervised clustering to the human transcriptome and identified five distinct clusters (A to E; Fig. 6a). To further identify these clusters, we performed DEG analysis for all the clusters compared with the normal RV group (Fig. 6b). Comparison of clusters C and E showed a higher number of DEGs, compared with clusters B and D. Cluster A showed no DEGs as it contains all the normal samples, and a few compensated and decompensated RVs with similar gene expression profiles, which may be caused by their functional similarity, such as in cardiac output, that reflects at the molecular level.
To validate the identity of the clusters, we performed another DEG analysis between the RV subgroups (groups 1 to 6), in which each pair had only compensated or decompensated samples from the same cluster. The results were similar to those of the clusters DEG analysis (Supplementary Fig. 5b and Fig. 6b), which demonstrates a similar identity of clusters with the subgroups of RV. Therefore, we assigned each cluster as a subgroup of compensated (B and C) or decompensated (D and E) RVs, based on the majority of the samples in each. These results suggest that molecular differences between RV subgroups can differentiate the compensated and decompensated RV samples beyond primary hemodynamics classification. Surprisingly, clusters distribution was not different between PAH-associated or dilated cardiomyopathy (DCM)-associated decompensated RV samples (Fig. 6c).
Furthermore, we applied a supervised learning approach (Methods) to see if any of the confounding metadata factors (sex, age, sample type and batches) could predict the specific gene expression pattern observed in the compensated-to-decompensated RV transition. We identified 49 genes with a mean receiver operating characteristic area-under-the-curve (ROC AUC) value ≥ 0.6 among 184 identified bimodal genes (Supplementary Data 3), which revealed that the most impact is driven by sampling batches, while participants’ sex and age did not have significant correlation with the identified genes. However, two other subsets of genes, (independent of sex, age and sample type) were found to be highly expressed in PAH decompensated, and compensated RV, respectively. This suggests that our unsupervised clustering and, therefore, the DEG analysis between the clusters mainly rely on molecular differences within the RV subgroups and are not derived by any of the background factors (Supplementary Fig. 5c).
Subgroups of human right ventricle show distinct pathway regulation
Accumulative enrichment analysis of each cluster compared to normal RV samples is demonstrated in Fig. 6d, and the heatmap shows the expression of 659 selected genes, corresponding to all the top dysregulated pathways in different RV subgroups (Fig. 6e). Notably, cluster C shows a very similar molecular signature to cluster E, especially in collagen reorganization, inflammatory responses and mitochondrial metabolic shift (Supplementary Fig. 7), which suggests that both clusters have an extremely fibrotic RV, eventually developing end-stage RV failure. This was in line with the participant’s clinical data that confirms severe hypertrophic RV in two of three samples of cluster C. However, in head-to-head differential enrichment analysis, they showed several profound differences such as in notch signaling pathways and citric acid levels (tricarboxylic acid cycle; Fig. 6f).
Moreover, enrichment analysis of DEGs between decompensated subgroups (clusters D and E) revealed impaired fatty acid metabolism in cluster D, while pathways related to EMT, immune response and ECM remodeling were enriched in cluster E (Fig. 6g). Comparing these findings with RV changes in MCT rats, we found one cluster (D) mainly associated with fatty acid dysregulation, similar to early-decompensated RV, while the other (cluster E) showed inflammation, resembling late-stage PAH-associated RV failure. Consequently, the human decompensated subgroups (D and E) were redefined as early-decompensated and late-decompensated phases for further analysis. In addition, enrichment analysis of DEGs between PAH-associated and DCM-associated decompensated RVs, highlighted upregulated inflammation, EMT and distinct ECM regulation in PAH-associated RV failure (Fig. 6h), mirroring cluster E characteristics. Altogether, subgrouping of human RV samples not only unveiled pathways from compensated to decompensated states but also determined diverse RV failure routes in different disease conditions.
Common regulated pathways in MCT rat and human right ventricular remodeling
Combination of MCT rat and human selected DEGs (576 and 659 genes, respectively) revealed 85 commonly dysregulated genes between the two species (Extended Data Fig. 3a,b). Pathway enrichment analysis of these 85 genes highlighted three biological pathways: ECM reorganization, cell cycle and hypoxia regulation (Extended Data Fig. 3c,d). We confirmed expression of several associated genes of each pathway in RV tissues and showed their correlation with RVEDP in MCT rats. Hypoxia-related genes (HMOX1, ENO3 and ANGPT1), as well as multiple cell cycle genes (FGF9, WNT5A and IL18) were significantly correlated with RVEDP in MCT rats, suggesting amplification of hypoxic response23 and altered glycolytic activity associated with impaired angiogenesis in decompensated RVs24 (Supplementary Figs. 8 and 9).
Furthermore, we performed transcription factor (TF) enrichment analysis for the associated DEGs in each subgroup of RVs, in which we identified forkhead box protein M1 (FOXM1) protein as a top regulated TF in compensated and early-decompensated RVs of MCT rats, accompanied by other components (E2F TF family, centromere protein A (CENPA) and DNA methyltransferase-1 (DNMT1)) forming a core regulatory network responsible for cell cycle alterations in the hypertrophic RV in line with previous studies25. In addition, FOSL1 turned up as an upstream TF in the late-decompensated RV of MCT rats and its direct association with CEBPE confirms the existence of a pro-apoptotic network activated in the late stage of RV decompensation (Supplementary Fig. 10).
Extracellular matrix alteration in pulmonary hypertension animal models and human right ventricular remodeling
As mentioned above, ECM remodeling and collagen organization were among the top commonly regulated pathway across species (Fig. 3h). Besides collagens, increased transcription of structural components of cardiac ECM such as fibronectin (FN1) and fibromodulin (FMOD), as well as nonstructural proteins like tenascin (TNC) and thrombospondin (THBS1), was observed in decompensated RV, in line with previous studies of cardiac ECM26. In addition, we observed elevated expression of ECM regulators such as matrix metallopeptidase 9 (MMP9) and tissue inhibitors of MMPs (TIMP1), as well as a significant correlation of FN1, MMP9 and NPPA expression with RVEDP (Extended Data Fig. 3 and Supplementary Fig. 11), which corroborates evidence of active collagen remodeling and the fibrotic phenotype of end-stage RV failure. Among all the differentially expressed ECM genes, considering the DEGs between PAH-associated versus DCM-decompensated RVs, and late-versus-early-decompensated subgroups, we shortlisted 13 ECM-related target genes, for further validation at the protein level in PAH.
Next, to confirm the alteration of those 13 ECM-associated transcripts at the protein level, we evaluated publicly available proteome datasets from human RV samples as well as different animal models. We primarily found 7 proteins to be regulated in 4 RV proteomics from different PH models as well as PAH participants7–11 (Extended Data Fig. 4a). We confirmed the remaining ECM molecules (ITGA10, ITGB6, HMMR, MMP9, SPP1 and TIMP1) with no proteomics record in public datasets, as either regulated in plasma of PAH participants (MMP9 and TIMP1)12,14, or similarly regulated in transcriptomes from other rat models of PH5,6. The remaining molecules were confirmed by western blotting in MCT rat RV tissues (Extended Data Fig. 4b,c).
Integration of right ventricular transcriptomes with pulmonary arterial hypertension plasma proteomes
To evaluate the potential of dysregulated ECM proteins (the predominantly dysregulated pathway in both human and rat RV transcriptomes; Fig. 7a) as circulating biomarkers for PAH-associated RV dysfunction, we performed a proteomic profiling from the plasma of 35 PAH participants including 20 participants with compensated and 15 with decompensated RV states, serving as discovery cohort (German). Classification of compensated and decompensated RV was done based on the RV end-systolic elastance (Ees)/arterial elastance (Ea) ratio measured by magnetic resonance imaging (MRI) as previously described19, and Ees/Ea = 0.8 was considered as the cutoff (Methods). Using an unbiased approach, we first selected the top significant DEGs in all pairs of our transcriptomes (P < 0.05), which included about 30% of total transcripts (15,305 genes). Then, we selected the top 25% of differentially expressed proteins of our discovery cohort (23 proteins). Next, we combined both differentially regulated candidates from transcriptomes and plasma proteomes to find common targets. Interestingly, more than 60% of plasma regulated proteins (15 of 23) were commonly dysregulated in both datasets, where 8 proteins were functionally linked to ECM remodeling or cell adhesion (Fig. 7b). We selected those 8 ECM proteins for downstream analysis, and cross-checked their differentially regulation in plasma between compensated and decompensated groups in the corresponding proteome cohort. Of the 8 proteins, 5 (CRTAC1, NID1, SPARCL1, MEGF9 and C1QTNF1) were selected to continue further confirmation analysis (Fig. 7c and Methods). Among these 5 proteins, NID1 and C1QTNF1 were upregulated in participants with decompensated RVs, whereas the other 3 were upregulated in the participants with compensated RV (Extended Data Fig. 5).
Plasma levels of extracellular matrix proteins act as potential biomarkers
We then assessed the clinical correlation of these five deregulated ECM proteins with mean pulmonary arterial pressure (mPAP), pulmonary vascular resistance (PVR) and cardiac index (CI), as well as N-terminal prohormone of brain natriuretic peptide (NT-proBNP levels) in the PAH participants. Increased protein expression of NID1 and C1QTNF1 was significantly correlated with increased mPAP and proBNP levels, while NID1 was also significantly correlated with increased PVR and decreased CI. In contrast, a decreased level of MEGF9 was strongly correlated with PVR, CI and proBNP levels, altogether suggesting a strong association of ECM protein regulation with RV-associated prognostic factors of PAH in our discovery cohort (Extended Data Fig. 5).
To validate these results, we analyzed expression levels of all five proteins in an independent plasma proteomics analysis of a PAH cohort (United Kingdom) including 61 PAH participants and 56 controls (Methods). Notably, we observed a significant increase in the expression of NID1 and C1QTNF1 as well as decreased expression of CRTAC1 and MEGF9 in participants with decompensated RV compared to participants with compensated RV (Extended Data Fig. 6) in line with the discovery cohort findings. Similarly, NID1 showed a negative correlation with RV functional parameters such as CI, while a positive correlation with mPAP and PVR, and proBNP levels, was observed; however, this effect was opposite for downregulated proteins (CRTAC1 and MEGF9). C1QTNF1 and SPARCL1 showed a positive correlation only with proBNP levels in the validation cohort (Extended Data Fig. 6).
To determine whether plasma levels correspond to changes in RV tissues, additional quantitative PCR with reverse transcription (RT–qPCR) and western blot analyses of these five molecules in MCT-induced rats and the comparison with their human transcriptome were performed. Upregulation of NID1 and C1QTNF1 and downregulation of MEGF9 at both RNA and protein levels in RV tissue corresponded with their plasma level alterations in PAH participants. However, the observed downregulation pattern of two other proteins (especially CRTAC1) in the plasma contrasted with the increased mRNA levels in the RV, which could be due to the lower protein expressions in RV, or the secretion is more pronounced in other PAH-affected organs such as the lung (Extended Data Fig. 7)27,28.
Plasma signature of five extracellular matrix proteins classifies right ventricular states
We further assessed the ability of these five proteins to predict RV dysfunction and disease severity in PAH using a random forest model (Methods), followed by a ROC curve analysis. We demonstrated that the panel of five proteins could significantly differentiate between compensated and decompensated groups of PAH participants in our discovery cohort with ROC AUC = 0.83 (P = 0.001, accuracy = 81%; Fig. 7d). Based on the calculated feature importance score (Methods), CRTAC1 and NID1 were highly predictive in the discovery cohort (Fig. 7d), while in the validation cohort, NID1 and C1QTNF1 had the most predictive performance in discriminating compensated and decompensated states of the RV samples of PAH participants (Fig. 7e). Considering both cohorts, this result suggests NID1 demonstrates the most significant biomarker potential, while the combination of NID1 with C1QTNF1 and CRTAC1 had highest overall performance compared to other combinations in discriminating two subgroups of PAH (AUC = 0.88 and 0.77 in discovery and validation cohorts, respectively; Extended Data Fig. 8).
To evaluate the potential of these five proteins to predict survival, we used participant follow-up data from the validation cohort. Among 61 PAH participants, 31 were deceased or received lung transplantation, while the other 30 survived during the median follow-up of 4.5 years. Interestingly, ROC analysis revealed that the panel of five proteins could significantly discriminate PAH participants with lower survival outcome (Fig. 7f,g). NID1 and SPARCL1 stand as independent predictors for PAH participants’ survival (AUC = 0.73 for both; Extended Data Fig. 9a–j), while the combination of NID1 with SPARCL1 or MEGF9 increased the survival prediction performance to AUC = 0.76 and 0.74, respectively (Extended Data Fig. 9).
Furthermore, transplant-free survival estimation was performed based on the cutoff values introduced by ROC analysis of single proteins and for all the possible combinations (Methods and Extended Data Fig. 9), in which we confirmed NID1 and SPARCL1 can discriminate PAH participants with poorer transplant-free survival (log-rank P value = 0.007 and 0.002, respectively), while NID1 in combination with each of the other proteins (SPARCL1, C1QTNF1 or MEGF9) remained significantly predictive.
NID1 and C1QTNF1 distinguish early- and late-decompensated right ventricle
Moreover, we investigated whether the panel of five proteins can predict the early stage versus the late stage of RV dysfunction. In this regard, we utilized participant MRI data to characterize early and late stages of RV decompensation in both cohorts. Because RV–PA coupling mirrors right ventricular adaptation and provides a very sensitive measure of early adaptation and maladaptation, we decided to define the different stages of RV decompensation based on the ratio of end-systolic elastance to afterload. In our discovery cohort, we used a gold-standard pressure volume loop analysis to define Ees/Ea as described before19, while in the validation cohort, the ratio of SV to end-systolic volume (SV/ESV), a validated surrogate was used, based on MRI data as previously established29,30. Thus, participants with 0.8 < Ees/Ea < 1.2 or 0.5 < SV/ESV < 0.8 were considered as having early-decompensated RV, and participants with Ees/Ea < 0.8 or SV/ESV < 0.5 were considered as being in the late-decompensated state (Fig. 8a,b). We then compared the plasma expression levels of five proteins in the identified decompensated subgroups as well as in participants with compensated RV. Interestingly, C1QTNF1 and CRTAC1 showed a significant regulation between early- to late-decompensated RVs, while NID1, MEGF9 and SPARCL1 were strongly regulated only in the late stage of decompensation in discovery cohort (Fig. 8c–g). In the validation cohort, NID1 and CRTAC1 showed a significant dysregulation in PAH patients with early-decompensated RV and further in the late stage, while C1QTNF1 and MEGF9 were regulated only in late-stage PAH patients with decompensated RV (Fig. 8h–l). Irrespective of slight differences in the regulation between the two cohorts, the panel of five proteins enabled us to discriminate late-versus-early decompensation RV states of PAH participants (AUC ROC = 0.71 and 0.73 in both cohorts, respectively; Fig. 8m). Moreover, four ECM proteins were enough to significantly predict early versus late stages of RV decompensation in PAH participants, which is in line with less significant changes in SPARCL1 protein levels both in plasma and RV (Fig. 8n). Furthermore, feature importance measurements indicated that NID1 and C1QTNF1 have the highest overall performance in both cohorts in predicting the late stage of RV decompensation, along with CRTAC1 or MEGF9 as an additional promising factor in both cohorts, respectively (Extended Data Fig. 10).
Discussion
In this study, utilizing human RV and rodent models of RV dysfunction (MCT and PAB), we determined a comprehensive transcriptomic signature of RV hypertrophy (compensated) and failure (decompensated). By integrating plasma proteomics of two independent PAH participants cohorts, we further revealed several important findings (Fig. 8o). First, unsupervised clustering revealed the presence of molecular subgroups within each compensated and decompensated RV states beyond their hemodynamics characterization. Second, we identified and validated several molecular pathways associated with ECM, cell cycle, energy metabolism and hypoxia, TNF and other inflammatory factors along with their key corresponding genes that underlie the pathological transition from adaptive to maladaptive RV to failure. Third, the sex-specific differences in RV adaptation in both human and MCT rats have been deciphered. Fourth, we provided evidence for five potential ECM-related biomarkers (NID1, C1QTNF1, CRTAC1, MEGF9 and SPARCL1) that were altered at both RV transcriptome and plasma proteome levels, which could classify compensated and decompensated RV and predict worse clinical outcomes in PAH participants. Lastly, we showed plasma levels of NID1 and C1QTNF1 can determine early- versus late-decompensated RV states in PAH participants. Integrative transcriptomic analysis of RV remodeling with plasma proteins suggests a path toward the identification of state-specific biomarkers and therapeutic targets using the omics-based phenotyping of RV in PAH.
This study aimed to compare compensated and decompensated RV from human and two well-known PH animal models (MCT and PAB-induced RV dysfunction). ECM remodeling emerged as the most dysregulated pathway across all datasets, in addition to similar inflammatory responses in the decompensated RV across all datasets, while molecular signatures of MCT rat decompensated RV were stronger than in PAB and, therefore, resembled human RV dysfunction better. However, each animal model has unique features and drawbacks. MCT closely mirrors human RV remodeling, but itself affects the heart31. In contrast, the PAB model focuses on vascular-independent mechanisms, potentially preventing severe RV failure. Our comprehensive RV transcriptomic profiles stand out, confirming a continuous transition through initial adaptive stages, to the uncoupled maladaptive stages32, and uniquely identifies genes and pathways linked to each RV stage/subgroup, providing deeper insights into disease mechanisms.
We identified two compensated and two decompensated states of human RV based on molecular signature. A subgroup of human compensated RV (cluster C) with a similar signature to end-stage decompensated RV (cluster E) suggested that a particular compensatory RV state could develop (faster) into maladaptive hypertrophy. Furthermore, comparison of two decompensated RV subgroups showed that strong reorganization of ECM starts in early stages, while it also majorly impacts on fatty acid dysregulation. Moreover, specific pathways such as TNF activation via NF-kB, EMT and activated inflammatory response could discriminate PAH-associated decompensated RV versus the non-PAH-decompensated RV.
RNA-seq analysis of RV samples from human and MCT rats, including both sexes, provided valuable insights into the sex-specific RV adaptation. The results highlighted that despite key changes in the transition from the compensated to decompensated RV independent of sex, females may develop a different route to decompensated RV and prolong the compensated phase. We observed that the protective mechanism against decompensated RV in female animal models mainly depends on fatty acid metabolism, whereas the deterioration signature with massive remodeling of the ECM occurs in the earlier phase in males, in line with previously published findings33. Similarly, cell death signaling is activated earlier in male PAH participants, whereas the decompensated phase in females is strongly regulated by hypoxic metabolism, estrogen response and fatty acid regulation. Notably, differential TF complexes (HIF-1A/NF-kB1 versus IRF5/EGR2), as well as distinctive histone demethylase activity (Kdm6a versus Kdm5d) showed that sex differences observed during RV remodeling are caused not only by sex hormones, but also by sex chromosome-mediated epigenetic mechanisms34.
Moreover, ECM component alteration was predominant in our study. As described before35, myocardial fibrosis, mainly due to the reorganization of collagen fibrils, plays a crucial role in the development of RV failure1,35, and in the formation of both adaptive and maladaptive RV responses in PH36,37. We demonstrated that increased transcription of ECM components occurs in the decompensated RVs of both PH animal models and PAH participants. Validation of selected ECM molecules at the protein level7–11, along with western blotting in RV tissues suggested a high correlation between mRNA and protein abundance, indicating their potential role in ECM remodeling that facilitates the transition from adaptive to maladaptive RV in PAH.
Combined analysis of RV transcriptome and plasma proteomes from PAH participants confirmed deregulation of five ECM molecules that serve as cardiac-specific biomarkers. Among these, C1QTNF1 (refs. 38–42), SPARCL1 (refs. 43,44) and CRTAC1 (ref. 45) have been previously shown as biomarkers for various cardiovascular diseases. In addition, the roles of NID1 (ref. 46), C1QTNF1 (refs. 42,47), SPARCL1 (ref. 48), CRTAC1 (ref. 45) and MEGF9 (ref. 49) in cardiac remodeling confirmed their relevance to and the prognostic significance for RV dysfunction in PAH. Notably, the panel of five proteins (also when excluding SPARCL1) could discriminate compensated versus early and late stages of RV decompensation in both cohorts. NID1 (also in combination with the other four) could significantly discriminate PAH participants with lower survival, which suggests the panel of five ECM proteins can serve as a prognostic biomarker alone or in combination with other diagnostic methods.
The major limitation in molecular profiling (omics) approaches, as in our study, is the limited access to clinical samples from humans at different stages of RV disease, as well as lacking the longitudinal assessment during disease progression, resulting in a low number of samples for comprehensive computational analysis. In the same regard, we must be cautious in drawing the indisputable conclusion, as the low number of samples are likely to be associated with insufficient performance power in machine learning approaches such as clustering and linear regression (resulting in lower ROC). Secondly, the heterogeneity within human RV samples, which reflects the etiologic complexity underlying the pathophysiology of RV dysfunction in PAH, mitigates the differential expression discoveries, and therefore transcriptomic data may reflect dysregulation of regulatory mechanisms associated with disease progression partially. Nevertheless, transcriptome profiling is still a valuable technique for mapping the structural and functional alterations of RV myocardium.
Additionally, despite high concordance between mRNA and protein abundance in RV tissues, a proteomic study provides more valuable insights. On the other hand, the low number of gene–protein matches when integrating proteomic with transcriptomic datasets may be explained by posttranslational regulations, but also by the low coverage and sensitivity of current proteomic assays, especially regarding ECM components of cardiac proteomics50. Similarly, the lower detection rate of the O-link panel used for biomarker screening may limit the discovery aspect51. Future efforts in the technological development of clinical proteomics, focusing on increasing the specificity and quantity of proteins, as well as cell-specific discovery approaches, are essential to unravel the stage-specific regulatory landscape of RV dysfunction associated with cardiopulmonary disease.
Methods
Inclusion and ethics
Studies with human RV tissues or cells were performed with the approval of Laval University and the Biosafety and Ethics committees of the University Institute of Cardiology and Respirology of Quebec (CER 20773, CER 20735, CER 21747). Preclinical PAH experiments in rats were performed according to the guidelines of the Canadian Council on Animal Care and approved by the Animal Care and Use committees of Laval University (2014-176 and 2018-015). Studies on PAH plasma were performed with the approval of the local ethics committee (AZ 58/15) at University Hospital Giessen and Marburg, Department of Pneumology and Critical Care Medicine, Germany from 2016 to 2018 for the German cohort. Participants in this study signed an informed consent form before the sample collection. The UK cohort plasma proteome analysis was done with the approval of Laval University and the IUCPQ Biosafety and Ethics Committees (CER 20735), and the University of Sheffield from the Sheffield Teaching Hospitals Observational Study of Patients with Pulmonary Hypertension, Cardiovascular and Lung Disease (UK REC ref. 18/YH/0441) from 2013 to 2018. All participants gave informed consent to be part of the study beforehand.
Human right ventricular tissue collection
Our experimental procedures for using human tissues or cells conformed to the principles outlined in the Declaration of Helsinki. They were performed with the approval of Laval University and the Biosafety and Ethics committees of the University Institute of Cardiology and Respirology of Quebec (CER 20773, CER 20735, CER 21747). All experiments were performed in accordance with the latest preclinical PAH research guidelines52,53. Tissues were obtained from participants who had previously given written, informed consent. Initially, 45 participants (male and female) were classified as control, or with compensated, or decompensated RV condition, based on clinical history and CI. The procedures and criteria for the acquisition of control, compensated and decompensated RV samples were as previously described20. In brief, participants with RV dilation, preserved CI (>2.2 l min−1 per m2), or normal (≥17 mm) tricuspid annular plane systolic excursion measured by echocardiography were considered as compensated hypertrophy, whereas PAH participants who received a lung transplantation or who died with RV failure were considered as those with decompensated RV. Normal distribution of sex per group of participants (normal/PAH and compensated/decompensated) was tested by Fisher’s exact test to confirm that there was no significant effects of sex in any of the comparisons (Fig. 4a).
Animal model of pulmonary hypertension and right ventricular hypertrophy
Experiments were performed according to the guidelines of the Canadian Council on Animal Care and approved by the Animal Care and Use committees of Laval University (2014-176 and 2018-015). For the animal studies, male and female Sprague Dawley rats (Rattus norvegicus) at age 8–12 weeks, (Charles River Laboratories) were subjected to the treatment protocol as previously described20. In brief, for the MCT model, a single subcutaneous injection of MCT (60 mg per kg body weight) was applied (control rats received saline), and the RV function was monitored weekly using echocardiography. Terminal right heart catheterization (RHC) was performed on anesthetized, closed-chest rats. The whole experiment procedure was 5 weeks for the MCT rat model.
For the PAB model, following anesthesia, the PA was separated from the aorta and left atrium, and was tied against a 19-gauge needle and then released quickly. PAB-operated rats with peak velocities > 3.5 m s−1 per m2 at the banding site (assessed by echocardiography), as well as sham-operated rats without tying the pulmonary trunk were included for the study. Between weeks 3 and 8 following PAB operation, sham and PAB-operated rats were euthanized following echocardiography, and underwent terminal RHC at the end of protocol, based on clinical symptoms of RV failure. The numbers of initial animal groups were 30, 30 and 15 for first and second batches of MCT and PAB experiments, respectively. The number of samples left after removing the outliers were 26, 27 and 14, respectively.
We classified rat RVs into control, compensated RV and decompensated RV (and into early or late decompensated for the MCT model) as previously described20, according to CO, RVEDP and clinical RV failure signs. After performing measurements, the heart, lungs and other major organs were harvested.
Histological analyses
RV fibrosis and cardiomyocyte cross-sectional area analysis was performed as described before. Briefly, rat RVs were harvested, fixed with 4% formaldehyde, sectioned at 5 μm, and subsequently stained with H&E (for RV hypertrophy) or Masson’s trichrome (for fibrosis). Cardiomyocyte cross-sectional area was obtained by tracing the outlines of cardiomyocytes with a clear nucleus image in H&E-stained images. The quantification of fibrosis and cross-sectional area of cardiomyocytes were determined in at least ten randomly chosen areas with ImageJ software54.
Right ventricle tissue isolation, library preparation and RNA-seq
Human RV tissues were manually dissected in cardiac biopsy or autopsy, according to the protocol previously described20. RV tissues from MCT-treated, and PAB-induced rats were harvested after hemodynamic measurements and snap frozen for further assessment. Total RNA from each batch of RV samples was isolated separately using the RNeasy Mini Kit (miRNeasy Micro Kit, when less sample provided; Qiagen) and snap frozen. Purification of total RNA for both human and animal tissues was performed, as described in the RNeasy handbook, using DNase I digestion (RNase-free DNase Set, Qiagen). RNA integrity was verified using a LabChip GX Touch 24 (PerkinElmer). Approximately 10 ng and 2 µg of total RNA, respectively, were used as starting material for library preparation using the SMARTer Stranded Total RNA-seq Kit - Pico Input Mammalian (Takara Bio), and the VAHTS Stranded mRNA-seq Library Prep Kit (Vazyme), following the manufacturers’ protocols. Sequencing was performed using the NextSeq 500 platform (Illumina) with paired-end setup (first batch of human RV tissues) and 75-bp single-end setup (the rest of datasets).
Bulk RNA-seq data analysis
Raw reads were assessed for quality, adaptor content and duplication rates using FastQC55. Trimmomatic version 0.39 was used to trim reads following a quality drop (below a mean of Q15) in a window of five nucleotides56. Trimmed and filtered reads between 15 and 150 nucleotides were cleared for further analysis and aligned against the Ensembl Human Genome (version hg38; GRCh38.27) and Rat Genome (version Rn06) using STAR software version 2.6.1 to include multi-mapping reads, with the parameters: outFilterMismatchNoverLmax, 0.1; outFilterScoreMinOverLread, 0.9; outFilterMatchNminOverLread, 0.9; alignIntronMax, 200,000; and outFilterMultimapNmax, 999 (ref. 57). The number of reads that aligned to genes was counted using ‘featureCounts’ version 1.6.5 from the ‘Subread’ package58. Only reads mapping at least partially inside exons were aggregated and counted per gene. Reads aligning to multiple regions or genes were excluded.
Raw count values for each organism were normalized separately using DESeq2 (version 1.30.0)59, then transformed into regularized logarithm values for further analysis. Because we had two different batches of human samples, we performed a batch effect correction on this data using the sva package60. The normalized and batch-corrected data were the basis of downstream analysis.
DEGs were identified using DESeq2 (ref. 59). Those with a Benjamini–Hochberg-corrected P value ≤ 0.05 were considered to be upregulated. Those with log2FC ≥ 0.58 and ≤ −0.58 were considered to be downregulated. The annotations were enriched with UniProt data (release 25.06.2019), based on Ensembl gene identifiers (activities at the Universal Protein Resource).
Dimension reduction analyses (PC analyses) were performed on normalized and regularized, log-transformed DESeq2 counts using the PC analysis and FactoMineR packages for R software (Foundation for Statistical Computing). Heatmaps were generated, using the ‘complexheatmap’ R package, as visualizations of the most differentially regulated genes. For assembly of the heatmap, normalized count values of all DEGs were transformed by a z-score transformation per row. Volcano plots were produced to show DEG regulation, per contrast, based on DESeq2 normalized counts.
Unsupervised clustering for 40 RV samples from human and for 26 MCT-induced rat samples was performed using k-means clustering. k-means clustering is a well-known, and widely used unsupervised machine learning algorithm used to partition a dataset into distinct groups or clusters based on their similarities. It starts by randomly initializing cluster centroids and assigns data points to the nearest centroid. The centroids are then updated iteratively by recalculating their means based on the assigned data points, and the process continues until convergence. The algorithm aims to minimize the within-cluster sum of squared distances, producing a final clustering solution. k-means clustering is an efficient method; however, since its results can be sensitive to initialization parameters, it requires multiple runs to ensure robustness. Therefore, we tested different values of k on normalized counts to optimize the clustering for each dataset. After comparing the possibilities, with the optimized k = 4 for rat samples and k = 5 for human samples were selected for further analysis. Second-step DEG analysis was performed using DESeq2 with the same criteria as in the first step, in which we initially performed DEGs between all identified clusters (clusters B–E for human, and B–D for MCT rat RVs) versus control samples from cluster A. Furthermore, we performed a comparison between subgroups based on their RV state (compensated versus decompensated), considering their different clusters. Finally, DEG analysis was performed for some extra pairs of comparisons between identified clusters.
All groups of DEG lists were submitted to gene-set enrichment analyses using KOBAS 3 (ref. 61). The resulting bar plots show Gene Ontology terms and KEGG pathways, with corrected FDR < 0.05 considered to indicate significantly deregulated pathways, for each pair of comparisons in both directions. The numbers of under-represented and over-represented genes are shown along with each pathway.
Combinatorial heatmaps visualize all the specific deregulated genes, in some of the common, significantly altered pathways, in the human and rat datasets. All samples were distributed unsupervised through the x axis of the heatmap, which has been highlighted by the identified subgroups. Violin plots were generated using r-log-normalized count values for all samples, divided into subgroups along the x axis. When the differentially regulated pathways were identified and summarized in both datasets, we used a generalized additive model to represent the relative expression of corresponding genes in each selected pathway, through RV states that ranged from normal to late decompensated.
Metadata analysis/methods
Using participant background metadata and the whole gene expression profile, we used a simple generalized linear regression (using ‘glmnet’ R package62,63) to first find binominal genes (184), and then determine whether the assigned expressions could be predicted by (or correlated with) any of the given metadata (batch, sex, age and sample type). In addition, leave-one-out cross-validation was used to evaluate the performance of the model. Of the 184 bimodal genes, only 49 genes demonstrated AUC greater than 0.6, indicating batch as the most influential predictor among the selected variables, while the participant background differences such as age/sex did not have a significant effect. We also used standard error estimation (‘glmnet’ R package) to calculate the accuracy of prediction for the selected genes, which resulted in error rate ≥ 0.5 for most of them. This implies a very high misclassification error, which is mainly due to the small sample size per group/type, as it becomes unavoidable in our study with 40 participant samples.
Gene expression quantification by real-time quantitative PCR
Real-time quantitative PCR analyses were performed as previously described64. Briefly, total RNA from cells/tissues and plasma were extracted using TRIzol (Invitrogen), according to the manufacturer’s instructions. Total RNA was reverse-transcribed into complementary DNA and qPCR was performed, using appropriate primers (all the oligonucleotide sequences are provided in (Supplementary Tables 1 and 2). Glyceraldehyde 3-phosphate dehydrogenase or 18 S ribosomal RNA were used as housekeeping genes for analysis. The relative expression levels of each candidate gene were determined by the ΔCt method. Values are expressed as means ± s.e.m. The Shapiro–Wilk normality test was used to determine whether the collected data was normally distributed. Comparisons of means between two groups used the unpaired t-test, for normally distributed samples, or the Mann–Whitney U test, for non-normally distributed samples. Comparisons between means among three or more groups were performed by one-way ANOVA for normally distributed samples, followed by Tukey’s multiple-comparisons test.
Western blotting and quantification
RV tissues were homogenized in RIPA lysis buffer (Thermo Scientific), quantified and lysates were separated on 7% and 10% polyacrylamide gels and transferred to PVDF membranes. After blocking, the membranes were probed with corresponding primary antibody overnight at 4 °C. This was followed by 2 h of incubation with secondary antibodies conjugated with horseradish peroxidase and protein levels were detected by chemiluminescence with the SuperSignal West Femto substrate solution (Thermo Scientific). The image was developed using iBright Image reader (Thermo Scientific), and densitometric analysis of the blots was obtained using Fiji ImageJ. Expression was quantified using band intensity values (in arbitrary units), which were normalized to vinculin. A list of the antibodies used in this study is provided in (Supplementary Table 3).
Plasma sample collection
For the German cohort (discovery), blood samples from 35 IPAH participants (male and female) participating in the ‘Right Heart 1 trial’ (NCT03403868) were collected at University Hospital Giessen and Marburg, Department of Pneumology and Critical Care Medicine (Germany) from 2016 to 2018. Fasting blood samples were obtained during RHC and immediately frozen at −80° for the following proteomic assay. All data collection and experimental procedures were performed with the approval of the local ethics committee (AZ 58/15).
For the UK cohort (served as validation), blood samples from 61 PAH participants (male and female) undergoing RHC were obtained at Sheffield Pulmonary Vascular Unit (United Kingdom) from March 2013 to February 2018. All experimental procedures were performed with the approval of Laval University and the IUCPQ Biosafety and Ethics Committees (CER 20735), and the Sheffield Teaching Hospitals Observational Study of Pulmonary Hypertension, Cardiovascular and other Respiratory Diseases Scientific Advisory Board (UK REC ref. 18/YH/0441), with support from the NIHR Sheffield Clinical Research Facility. Fasting blood samples were collected following RHC measurements in both cohorts of study, and immediately frozen at −80° until proteomics analysis.
Characterization of pulmonary arterial hypertension participants in two proteome cohorts
Similar to the transcriptome cohort, participants who were not diagnosed with any cardiac or respiratory diseases were selected as the control group in the validation proteome cohort. The discovery cohort does not contain any control group. PAH participants from the discovery cohort were characterized using the coupling ratio of Ees/Ea as previously described19. In brief, participants with Ees/Ea ≥ 0.8 were considered as those with a compensated RV state, while Ees/Ea < 0.8 defined the worsening RV condition and considered as participants with a decompensated RV state. In the validation cohort, primary characterization was performed based on CI value; participants with compensated RV identified with CI > 2.2 l min−1 per m2 and decompensated RV with CI < 2.2 l min−1 per m2. Normal distribution of age and sex per group of participants (compensated to decompensated) was tested in both cohorts using Welch two-sample t-test for age, and Fisher’s exact test for sex, to confirm that there were no significant effects of these two factors in any of the comparisons.
Plasma proteome assay and analysis
Proteome assays were performed using a high-throughput multiplex proximity extension assay technology (Olink Bioscience) as previously described65. The exploration cohort was generated by cardiometabolic protein panel, version 3601 (including 92 proteins), and for the second cohort, a 384-plex panel (including 372 proteins) focusing on cardiometabolic proteins was used. The assay utilizes epitope-specific binding and hybridization of a set of paired oligonucleotide antibody probes, which is subsequently amplified using a qPCR, resulting in log2-normalized protein expression (NPX) values where a high value corresponds to a higher protein expression65. Statistical analysis to identify differentially regulated proteins was performed using the unpaired t-test between two groups (German cohort), for normally distributed samples, and among three groups (validation cohort) by one-way ANOVA, followed by Tukey’s post hoc test for multiple-comparisons correction. For the exploration cohort, no correction was used for P values to extend the protein selection for further confirmation using a validation cohort.
Correlations of different functional RV parameters with selected protein expression as well as proBNP levels were assessed by a simple linear regression model (for a single variable), and multiple regression model (when correction for age/sex was considered). Regression analyis P values were reported for each single comparison.
Random forest model and importance score measurement
We used a random forest regression-based model to test subgroup classification performance by the panel of five protein expression levels. We first ran the model on each PAH cohort independently for all the five proteins together, and then for different combinations. Random forest model testing, along with the tenfold cross-validation method to avoid overfitting, was implemented using the R package ‘ranger’66. Then, ROC AUC was used to assess the random forest model performance respective to each classification/combination. Corresponding ROC P values were calculated using a Mann–Whitney (Wilcoxon) test for the H0 = ‘the AUC is equal to 0.5’, which is implemented in R package ‘verification’ (https://cran.r-project.org/web/packages/verification/index.html)67.
Furthermore, we used the permutation approach to measure variable importance (in this case, features are proteins) using the ‘ranger’ package. This accuracy-based approach enabled us to calculate the importance of a specific variable using an out-of-bag estimation method. In brief, the importance score is defined as the difference between the calculated prediction accuracy and the prediction accuracy while the measured variable is randomly shuffled and all other variables remain the same.
Survival curve analysis
Transplant-free survival was analyzed from sampling to death or lung transplantation. The cutoff date was 30 September 2021. ROC curves were constructed for protein alone or several proteins to assess the optimal cutoff value and their performance in predicting transplant-free survival. Logistic regression models were used to calculate the combined models using several proteins for ROC analyses. The Kaplan–Meier analysis was used to estimate transplant-free survival rate, and a log-rank test was used for comparisons between groups using the optimal cutoff point with the ROC curves. For survival curves using several proteins, in each participant, each protein counted as one if the protein was greater than or equal to the level of the cutoff. For each participant, the numbers of selected proteins that were equal to the cutoff level or more than the cutoff level was summed, and participants were divided into the groups of the summed number. A significance level inferior to 5% (P < 0.05) was considered statistically significant.
All statistical analyses were made with R version 4.0.3 (R Foundation for Statistical Computing), or SPSS version 27 (IBM), or Prism version 9.4.0 (GraphPad).
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Supplementary information
Source data
Acknowledgements
This study was funded by the German Research Foundation (DFG), Collaborative Research Center (CRC) 1213 (Project A01, A05 grants to S.S.P., and A10N grant to R.S.) and European Research Council (ERC) Consolidator Grant (866051 to S.S.P.). R.S., W.S. and S.S.P. are supported by the Excellence Cluster Cardio Pulmonary Institute (CPI; EXC2026) and the German Center for Lung Research (DZL), and W.S., R.S. and S.S.P. are supported by the Max Planck Society. F.K. and S.R.N. are funded by the DFG CRC 1213. This research was also supported by a grant from the Cardiovascular Medical Research and Education Fund (CMREF) to O.B. and S.P. O.B. and S.B. are funded by the Canadian Institute of Health Research (CIHR, IC121617). O.B. holds a junior scholar award from the Fonds de Recherche du Québec: Santé (FRQS). A.J.S. supported in part by the Wellcome Trust (grant no. 205188/Z/16/Z). The research of AL is supported by grants from British Heart Foundation PG/11/116/29288, FS/18/52/ 33808, RE/18/4/34215. S.B. holds a distinguished research scholar from FRQS.
Extended data
Author contributions
F.K., S.B., O.B. and S.S.P. conceived and designed the research study. F.K., P.C., C.V., S.R.N., S.M., N.S., J.O., S.B.-B., F.P. and S.G. conducted the experiments and acquired the data. S.B.-B., S.P., O.B. and S.B. provided the samples. A.J.S., S.A. and K.T. provided imaging data. A.L., D.G.K., S.B.-B., K.T. and S.P. provided plasma samples and clinical data. F.K., P.C., T.Y., C.K. and M.L. analyzed the data. A.K. and J.B.Z. helped with statistical analysis and machine learning applications. F.K., P.C., R.S., W.S., S.B. and S.S.P. drafted the paper. All authors provided critical feedback and helped shape the research, analysis and paper.
Peer review
Peer review information
Nature Cardiovascular Research thanks Ruibing Chen, Timothy McMahon, and the other, anonymous, reviewers for their contribution to the peer review of this work. Primary Handling Editor: Elvira Forte, in collaboration with the Nature Cardiovascular Research team.
Data availability
All the expression data (human and MCT and PAB rat RV) generated in this study are deposited in the Gene Expression Omnibus under accession number GSE240941. Source data are provided with this paper including the clinical data of all the participants in transcriptome and plasma proteome cohorts. All other data supporting the findings in this study are available within the article or Supplementary Information.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Sébastien Bonnet, Email: sebastien.bonnet@criucpq.ulaval.ca.
Soni Savai Pullamsetti, Email: soni.pullamsetti@mpi-bn.mpg.de.
Extended data
is available for this paper at 10.1038/s44161-023-00338-3.
Supplementary information
The online version contains supplementary material available at 10.1038/s44161-023-00338-3.
References
- 1.Konstam, M. A. et al. Evaluation and management of right-sided heart failure: a scientific statement from the American Heart Association. Circulation137, e578–e622 (2018). 10.1161/CIR.0000000000000560 [DOI] [PubMed] [Google Scholar]
- 2.Keranov, S. et al. CILP1 as a biomarker for right ventricular maladaptation in pulmonary hypertension. Eur. Respir. J.57, 1901192 (2021). 10.1183/13993003.01192-2019 [DOI] [PubMed] [Google Scholar]
- 3.van der Bruggen, C. E. E., Tedford, R. J., Handoko, M. L., van der Velden, J. & de Man, F. S. RV pressure overload: from hypertrophy to failure. Cardiovasc. Res.113, 1423–1432 (2017). 10.1093/cvr/cvx145 [DOI] [PubMed] [Google Scholar]
- 4.Ryan, J. J. et al. Right ventricular adaptation and failure in pulmonary arterial hypertension. Can. J. Cardiol.31, 391–406 (2015). 10.1016/j.cjca.2015.01.023 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Park, J. F. et al. Transcriptomic analysis of right ventricular remodeling in two rat models of pulmonary hypertension: identification and validation of epithelial-to-mesenchymal transition in human right ventricular failure. Circ. Heart. Fail.14, e007058 (2021). 10.1161/CIRCHEARTFAILURE.120.007058 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Kobayashi, T. et al. Multi‑omics analysis of right ventricles in rat models of pulmonary arterial hypertension: consideration of mitochondrial biogenesis by chrysin. Int. J. Mol. Med.49, 69 (2022). 10.3892/ijmm.2022.5124 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Hindmarch, C. C. T. et al. An integrated proteomic and transcriptomic signature of the failing right ventricle in monocrotaline-induced pulmonary arterial hypertension in male rats. Front. Physiol.13, 966454 (2022). 10.3389/fphys.2022.966454 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Havlenova, T. et al. Right versus left ventricular remodeling in heart failure due to chronic volume overload. Sci. Rep.11, 17136 (2021). 10.1038/s41598-021-96618-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Holda, M. K. et al. Myocardial proteomic profile in pulmonary arterial hypertension. Sci. Rep.10, 14351 (2020). 10.1038/s41598-020-71264-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Qin, X. et al. Proteomic and metabolomic analyses of right ventricular failure due to pulmonary arterial hypertension. Front. Mol. Biosci.9, 834179 (2022). 10.3389/fmolb.2022.834179 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Boucherat, O. et al. Identification of LTBP-2 as a plasma biomarker for right ventricular dysfunction in human pulmonary arterial hypertension. Nat. Cardiovasc. Res.1, 748–760 (2022). 10.1038/s44161-022-00113-w [DOI] [PubMed] [Google Scholar]
- 12.Rhodes, C. J. et al. Plasma proteome analysis in patients with pulmonary arterial hypertension: an observational cohort study. Lancet Respir. Med.5, 717–726 (2017). 10.1016/S2213-2600(17)30161-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Kariotis, S. et al. Biological heterogeneity in idiopathic pulmonary arterial hypertension identified through unsupervised transcriptomic profiling of whole blood. Nat. Commun.12, 7104 (2021). 10.1038/s41467-021-27326-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Arvidsson, M., Ahmed, A., Saleby, J., Hesselstrand, R. & Radegran, G. Plasma matrix metalloproteinase 2 is associated with severity and mortality in pulmonary arterial hypertension. Pulm. Circ.12, e12041 (2022). 10.1002/pul2.12041 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Llucia-Valldeperas, A., de Man, F. S. & Bogaard, H. J. Adaptation and maladaptation of the right ventricle in pulmonary vascular diseases. Clin. Chest Med.42, 179–194 (2021). 10.1016/j.ccm.2020.11.010 [DOI] [PubMed] [Google Scholar]
- 16.Campo, A. et al. Outcomes of hospitalisation for right heart failure in pulmonary arterial hypertension. Eur. Respir. J.38, 359–367 (2011). 10.1183/09031936.00148310 [DOI] [PubMed] [Google Scholar]
- 17.Tello, K., Gall, H., Richter, M., Ghofrani, A. & Schermuly, R. Right ventricular function in pulmonary (arterial) hypertension. Herz44, 509–516 (2019). 10.1007/s00059-019-4815-6 [DOI] [PubMed] [Google Scholar]
- 18.Naeije, R. & Manes, A. The right ventricle in pulmonary arterial hypertension. Eur. Respir. Rev.23, 476–487 (2014). 10.1183/09059180.00007414 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Tello, K. et al. Relevance of the TAPSE/PASP ratio in pulmonary arterial hypertension. Int. J. Cardiol.266, 229–235 (2018). 10.1016/j.ijcard.2018.01.053 [DOI] [PubMed] [Google Scholar]
- 20.Omura, J. et al. Identification of long noncoding RNA H19 as a new biomarker and therapeutic target in right ventricular failure in pulmonary arterial hypertension. Circulation142, 1464–1484 (2020). 10.1161/CIRCULATIONAHA.120.047626 [DOI] [PubMed] [Google Scholar]
- 21.Jacobs, W. et al. The right ventricle explains sex differences in survival in idiopathic pulmonary arterial hypertension. Chest145, 1230–1236 (2014). 10.1378/chest.13-1291 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Keen, J., Prisco, S. Z. & Prins, K. W. Sex differences in right ventricular dysfunction: insights from the bench to bedside. Front. Physiol.11, 623129 (2020). 10.3389/fphys.2020.623129 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Dunn, L. L. et al. Hmox1 (heme oxygenase-1) protects against ischemia-mediated injury via stabilization of HIF-1alpha (hypoxia-inducible factor-1alpha). Arterioscler. Thromb. Vasc. Biol.41, 317–330 (2021). [DOI] [PubMed] [Google Scholar]
- 24.Giusti, B. et al. Gene expression profile of rat left ventricles reveals persisting changes following chronic mild exercise protocol: implications for cardioprotection. BMC Genomics10, 342 (2009). 10.1186/1471-2164-10-342 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Bolte, C. et al. Expression of Foxm1 transcription factor in cardiomyocytes is required for myocardial development. PLoS ONE6, e22217 (2011). 10.1371/journal.pone.0022217 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Hortells, L., Johansen, A. K. Z. & Yutzey, K. E. Cardiac fibroblasts and the extracellular matrix in regenerative and nonregenerative hearts. J. Cardiovasc. Dev. Dis.6, 29 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Mayr, C. H. et al. Integrative analysis of cell state changes in lung fibrosis with peripheral protein biomarkers. EMBO Mol. Med.13, e12871 (2021). 10.15252/emmm.202012871 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Steck, E. et al. Chondrocyte secreted CRTAC1: a glycosylated extracellular matrix molecule of human articular cartilage. Matrix Biol.26, 30–41 (2007). 10.1016/j.matbio.2006.09.006 [DOI] [PubMed] [Google Scholar]
- 29.Sanz, J. et al. Right ventriculo-arterial coupling in pulmonary hypertension: a magnetic resonance study. Heart98, 238–243 (2012). 10.1136/heartjnl-2011-300462 [DOI] [PubMed] [Google Scholar]
- 30.Richter, M. J. & Tello, K. Against the odds: risk stratification with cardiac magnetic resonance imaging in pulmonary arterial hypertension. Am. J. Respir. Crit. Care Med.201, 403–405 (2020). 10.1164/rccm.201910-2069ED [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Akhavein, F., St-Michel, E. J., Seifert, E. & Rohlicek, C. V. Decreased left ventricular function, myocarditis, and coronary arteriolar medial thickening following monocrotaline administration in adult rats. J. Appl. Physiol.103, 287–295 (2007). 10.1152/japplphysiol.01509.2005 [DOI] [PubMed] [Google Scholar]
- 32.Jiang, B. et al. Marked strain-specific differences in the SU5416 rat model of severe pulmonary arterial hypertension. Am. J. Respir. Cell Mol. Biol.54, 461–468 (2016). 10.1165/rcmb.2014-0488OC [DOI] [PubMed] [Google Scholar]
- 33.Bal, E., Ilgin, S., Atli, O., Ergun, B. & Sirmagul, B. The effects of gender difference on monocrotaline-induced pulmonary hypertension in rats. Hum. Exp. Toxicol.32, 766–774 (2013). 10.1177/0960327113477874 [DOI] [PubMed] [Google Scholar]
- 34.Umar, S. et al. The Y chromosome plays a protective role in experimental hypoxic pulmonary hypertension. Am. J. Respir. Crit. Care Med.197, 952–955 (2018). 10.1164/rccm.201707-1345LE [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Andersen, S., Nielsen-Kudsk, J. E., Vonk Noordegraaf, A. & de Man, F. S. Right ventricular fibrosis. Circulation139, 269–285 (2019). 10.1161/CIRCULATIONAHA.118.035326 [DOI] [PubMed] [Google Scholar]
- 36.Rain, S. et al. Right ventricular myocardial stiffness in experimental pulmonary arterial hypertension: relative contribution of fibrosis and myofibril stiffness. Circ. Heart. Fail.9, e002636 (2016). 10.1161/CIRCHEARTFAILURE.115.002636 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Frangogiannis, N. G. Fibroblasts and the extracellular matrix in right ventricular disease. Cardiovasc. Res.113, 1453–1464 (2017). 10.1093/cvr/cvx146 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Lu, L. et al. C1q/TNF-related protein-1: an adipokine marking and promoting atherosclerosis. Eur. Heart J.37, 1762–1771 (2016). 10.1093/eurheartj/ehv649 [DOI] [PubMed] [Google Scholar]
- 39.Muendlein, A. et al. The novel adipokine CTRP1 is significantly associated with the incidence of major adverse cardiovascular events. Atherosclerosis286, 1–6 (2019). 10.1016/j.atherosclerosis.2019.04.222 [DOI] [PubMed] [Google Scholar]
- 40.Shen, L., Wang, S., Ling, Y. & Liang, W. Association of C1q/TNF-related protein-1 (CTRP1) serum levels with coronary artery disease. J. Int. Med. Res.47, 2571–2579 (2019). 10.1177/0300060519847372 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Su, Z., Tian, S. & Liang, W. Circulating CTRP1 levels are increased and associated with the STOD in essential hypertension in Chinese patients. Cardiovasc. Ther.2019, 4183781 (2019). 10.1155/2019/4183781 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Gu, Y. et al. CTRP1 aggravates cardiac dysfunction post myocardial infarction by modulating TLR4 in macrophages. Front. Immunol.12, 635267 (2021). 10.3389/fimmu.2021.635267 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.di Salvo, T. G. et al. Right ventricular myocardial biomarkers in human heart failure. J. Card. Fail.21, 398–411 (2015). 10.1016/j.cardfail.2015.02.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Keranov, S. et al. SPARCL1 as a biomarker of maladaptive right ventricular remodelling in pulmonary hypertension. Biomarkers25, 290–295 (2020). 10.1080/1354750X.2020.1745889 [DOI] [PubMed] [Google Scholar]
- 45.Kuwashiro, T. et al. Oxidized albumin and cartilage acidic protein-1 as blood biomarkers to predict ischemic stroke outcomes. Front. Neurol.12, 686555 (2021). 10.3389/fneur.2021.686555 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Zbinden, A. et al. Nidogen-1 mitigates ischemia and promotes tissue survival and regeneration. Adv. Sci.8, 2002500 (2021). 10.1002/advs.202002500 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Yang, Y. et al. Association Between C1q/TNF-related protein-1 levels in human plasma and epicardial adipose tissues and congestive heart failure. Cell. Physiol. Biochem.42, 2130–2143 (2017). 10.1159/000479915 [DOI] [PubMed] [Google Scholar]
- 48.Schellings, M. W. et al. Absence of SPARC results in increased cardiac rupture and dysfunction after acute myocardial infarction. J. Exp. Med.206, 113–123 (2009). 10.1084/jem.20081244 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Li, M. et al. Identification of post-myocardial infarction blood expression signatures using multiple feature selection strategies. Front. Physiol.11, 483 (2020). 10.3389/fphys.2020.00483 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Michelhaugh, S. A. & Januzzi, J. L. Jr. Finding a needle in a haystack: proteomics in heart failure. JACC Basic Transl. Sci.5, 1043–1053 (2020). 10.1016/j.jacbts.2020.07.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Raffield, L. M. et al. Comparison of proteomic assessment methods in multiple cohort studies. Proteomics20, e1900278 (2020). 10.1002/pmic.201900278 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Bonnet, S. et al. Translating research into improved patient care in pulmonary arterial hypertension. Am. J. Respir. Crit. Care Med.195, 583–595 (2017). 10.1164/rccm.201607-1515PP [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Provencher, S. et al. Standards and methodological rigor in pulmonary arterial hypertension preclinical and translational research. Circ. Res.122, 1021–1032 (2018). 10.1161/CIRCRESAHA.117.312579 [DOI] [PubMed] [Google Scholar]
- 54.Shimauchi, T. et al. TRPC3-Nox2 complex mediates doxorubicin-induced myocardial atrophy. JCI Insight2, e93358 (2017). 10.1172/jci.insight.93358 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Andrews, S. FastQC: a quality control tool for high throughput sequence data. http://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (2010).
- 56.Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics30, 2114–2120 (2014). 10.1093/bioinformatics/btu170 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics29, 15–21 (2013). 10.1093/bioinformatics/bts635 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Liao, Y., Smyth, G. K. & Shi, W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics30, 923–930 (2014). 10.1093/bioinformatics/btt656 [DOI] [PubMed] [Google Scholar]
- 59.Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol.15, 550 (2014). 10.1186/s13059-014-0550-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Leek, J. T., Johnson, W. E., Parker, H. S., Jaffe, A. E. & Storey, J. D. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics28, 882–883 (2012). 10.1093/bioinformatics/bts034 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Bu, D. et al. KOBAS-i: intelligent prioritization and exploratory visualization of biological functions for gene enrichment analysis. Nucleic Acids Res.49, W317–W325 (2021). 10.1093/nar/gkab447 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Tay, J. K., Narasimhan, B. & Hastie, T. Elastic net regularization paths for all generalized linear models. J. Stat. Softw.106, 1 (2023). 10.18637/jss.v106.i01 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Friedman, J., Hastie, T. & Tibshirani, R. Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw.33, 1–22 (2010). 10.18637/jss.v033.i01 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Pullamsetti, S. S. et al. Lung cancer-associated pulmonary hypertension: role of microenvironmental inflammation based on tumor cell-immune cell cross-talk. Sci. Transl. Med.9, eaai9048 (2017). 10.1126/scitranslmed.aai9048 [DOI] [PubMed] [Google Scholar]
- 65.Assarsson, E. et al. Homogenous 96-plex PEA immunoassay exhibiting high sensitivity, specificity, and excellent scalability. PLoS ONE9, e95192 (2014). 10.1371/journal.pone.0095192 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Wright, M. N. & Ziegler, A. ranger: a fast implementation of random forests for high dimensional data in C++ and R. J. Stat. Softw.77, 1–17 (2017). 10.18637/jss.v077.i01 [DOI] [Google Scholar]
- 67.Mason, S. J. & Graham, N. E. Areas beneath the relative operating characteristics (ROC) and relative operating levels (ROL) curves: statistical significance and interpretation. Q. J. R. Meteorolog. Soc.128, 2145–2166 (2002). 10.1256/003590002320603584 [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All the expression data (human and MCT and PAB rat RV) generated in this study are deposited in the Gene Expression Omnibus under accession number GSE240941. Source data are provided with this paper including the clinical data of all the participants in transcriptome and plasma proteome cohorts. All other data supporting the findings in this study are available within the article or Supplementary Information.