Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

medRxiv logoLink to medRxiv
[Preprint]. 2024 Oct 15:2024.10.12.24315151. [Version 1] doi: 10.1101/2024.10.12.24315151

Rare variants and survival of patients with idiopathic pulmonary fibrosis

Aitana Alonso-Gonzalez 1, David Jáspez 2, José M Lorenzo-Salazar 2, Shwu-Fan Ma 3, Emma Strickland 3, Josyf Mychaleckyj 4, John S Kim 3, Yong Huang 3, Ayodeji Adegunsoye 5, Justin M Oldham 6, Iain Steward 7, Philip L Molyneaux 7,8, Toby M Maher 7,9, Louise V Wain 10,11, Richard J Allen 10,11, R Gisli Jenkins 7, Jonathan A Kropski 12,13,14, Brian Yaspan 15, Timothy S Blackwell 16, David Zhang 17, Christine Kim Garcia 17,18, Fernando J Martinez 19, Imre Noth 3, Carlos Flores 1,2,20,21
PMCID: PMC11527076  PMID: 39484282

Abstract

Background

The clinical course of idiopathic pulmonary fibrosis (IPF) is highly variable and unpredictable, with multiple genetic variants influencing IPF outcomes. Notably, rare pathogenic variants in telomere-related genes are associated with poorer clinical outcomes in these patients. Here we assessed whether rare qualifying variants (QVs) in monogenic adult-onset pulmonary fibrosis (PF) genes are associated with IPF survival. Using polygenic risk scores (PRS), we also evaluated the influence of common IPF risk variants in individuals carrying these QVs.

Methods

We identified QVs in telomere and non-telomere genes linked to monogenic PF forms using whole-genome sequences (WGS) from 888 Pulmonary Fibrosis Foundation Patient Registry (PFFPR) individuals. We also derived a PRS for IPF (PRS-IPF) from 19 previously published common sentinel IPF variants. Using regression models, we then examined the mutual relationships of QVs and PRS-IPF and their association with survival. Validation of results was sought in WGS from an independent IPF study (PROFILE, n=472), and results from the two cohorts were meta-analyzed.

Results:

Carriers of QVs in monogenic adult-onset PF genes, representing nearly 1 out of 6 IPF patients, were associated with lower PRS-IPF (Odds Ratio [OR]: 1.79; 95% Confidence Interval [CI]: 1.15–2.81; p=0.010) and shorter survival (Hazard Ratio [HR]: 1.53; 95% CI: 1.12–2.10; p=7.3×10−3). Notably, carriers of pathogenic variants at telomere genes showed the strongest association with survival (HR: 1.76; 95% CI: 1.13–2.76; p=0.013). The meta-analysis of the results showed a consistent direction of effect across both cohorts.

Conclusions:

We revealed the opposite effects of QVs and PRS-IPF on IPF survival. Thus, a distinct IPF molecular subtype might be defined by QVs in monogenic adult-onset PF genes. Assessing the carrier status for QVs and modelling PRS-IPF promises to further contribute to predicting disease progression among IPF patients.

INTRODUCTION

Idiopathic pulmonary fibrosis (IPF) is a rare and progressive disease characterized by lung scarring and poor prognosis, with a median survival of 3–5 years after diagnosis (1). The clinical course of the disease varies greatly among patients and is difficult to predict. While some patients maintain relatively stable lung function for many years, 10–15% of them experience a rapid decline (2) and may succumb to their disease before initiation of effective therapy or lung transplantation. Identifying those in need of immediate therapy is crucial to improve clinical management.

Genetic studies have revealed that both rare and common genetic variants contribute to IPF susceptibility (38). However, the incorporation of genetic data in IPF diagnosis remains limited, as current guidelines rely primarily on radiological or histological criteria to identify interstitial pneumonia patterns. Instead, genetic testing is increasingly valuable for predicting disease prognosis (9). Genome-wide association studies (GWAS) have revealed multiple common loci involved in IPF progression and survival. For instance, the mucin 5B, oligomeric mucus/gel-forming gene (MUC5B) risk allele (rs3570950-T), which is the strongest common genetic risk factor known for IPF, is also linked to slower disease progression (10), although this association might be subject to an index event bias (11). A novel genetic risk locus involving the antisense RNA gene of protein kinase N2 (PKN2) gene has also shown association with forced vital capacity (FVC) decline, a common measure for monitoring disease progression (12). In addition, the first GWAS of IPF survival found a variant in PCSK6 associated with differential patient survival (13).

Rare qualifying variants (QVs) in telomere-related genes were associated with poor clinical outcomes among IPF patients (7) and among patients with other interstitial lung diseases (ILD), such as hypersensitivity pneumonitis (14). Specifically, QVs associate with progressive PF, a rapid decline in lung function, and reduced survival (1517). However, previous studies have focused mostly on the effects of a few genes (TERT, RTEL1, and PARN), even though many other telomere genes are known to be associated with monogenic PF.

Altogether, this evidence supports that multiple genetic factors are involved in distinct mechanisms of IPF pathogenesis and rates of progression. In addition, patients with the MUC5B risk allele are less likely to carry rare likely deleterious variants in adult-onset PF genes than non-carriers of the MUC5B risk allele, suggesting that the expectation of additive effects of common and rare variants may not hold in this case (7,18). Nevertheless, the effect of QVs across all known monogenic adult-onset PF genes (including telomere and non-telomere genes) in survival and the modifier role of polygenic risk scores (PRS) of common risk variants of IPF (PRS-IPF) in QVs carriers remains to be elucidated.

Using whole-genome sequencing (WGS) from IPF patients, we aimed to determine the prevalence of QVs in monogenic adult-onset PF genes and examine the mutual relationships of QVs and PRS-IPF and their association with IPF survival.

METHODS

Study design and sample description

We assessed the association of QVs and PRS-IPF with the primary outcome in patients with IPF. In the discovery stage, we utilized data from the Pulmonary Fibrosis Foundation Patient Registry (PFFPR) (19). In the second stage, we employed data from the Prospective Observation of Fibrosis in the Lung Clinical Endpoints (PROFILE) cohort for validation (20,21), ensuring the robustness of our findings. In the PFFPR, the primary outcome was the time from initial diagnosis to either death or lung transplantation. In the PROFILE cohort, the primary outcome was the time from diagnosis to death. For both cohorts, right censoring was applied at 60 months.

In the validation stage, we included 888 patients clinically diagnosed with IPF from the PFFPR, with baseline and longitudinal demographic and clinical information recorded in the United States since March 2016. In the second stage, 472 clinically diagnosed IPF patients from the PROFILE cohort, recruited in UK from 2010 to 2017, were included and followed for three years to track disease progression (Figure 1). For further details, see Supplementary methods, and Supplementary Table 1.

Figure 1. Patient cohorts included from the PFFPR and PROFILE studies.

Figure 1.

Both studies were conducted according to The Code of Ethics of the World Medical Association (Declaration of Helsinki) and written informed consent was obtained from all participants. The Research Ethics Committees at each participating centre approved the study.

Sequencing and bioinformatics analysis methods

In the PFFPR, library preparation and sequencing were performed by Psomagen (Rockville, MD). Genomic DNA libraries were prepared using the TruSeq DNA PCR Free kit (Illumina Inc.) and sequenced on an Illumina NovaSeq 6000 instrument (Illumina Inc.) with 150 bp paired-end reads at an average depth of 30X. At least 80% of the genome was covered by ≥20 reads, and ≥90% was covered by ≥10 reads. WGS was processed using the Illumina DRAGEN Bio-IT Platform Germline Pipeline v3.10.4 (Illumina Inc.) using the Illumina DRAGEN Multigenome Graph hg38 as the reference genome. Only variants with a “PASS” filter were included in subsequent analyses.

For the PROFILE cohort, WGS was performed at Human Longevity Inc. using the Illumina NovaSeq 6000 system with 150 pb paired-end reads. Coverage of at least 10X was achieved in over 98% of the Consensus Coding Sequence Release 22 (CCDS), with an average read depth of 42X across the CCDS as described previously (4). Sequences were processed using the Illumina DRAGEN Bio-IT Platform Germline Pipeline v3.0.7, with the GRCh38 as the reference genome.

In both cohorts, quality control (QC) included identifying QC outliers, detecting kinship between patients, checking for cross-contamination of samples, and identifying sex discordance, using metrics from different tools. Figure 1 summarized the number of individuals excluded and the reasons for exclusion. For further details, see Supplementary methods.

Identification of QVs in monogenic adult-onset PF genes

We restricted the identification of QVs to a curated list of 13 PF genes, categorized as either telomere related (TERC, TERT, TINF2, DKC1, RTEL1, PARN, NAF1, and ZCCHC8) or non-telomere related (SFTPC, SFTPA1, SFTPA2, SPDL1, and KIF15) (Supplementary Table 2). With the exception of SPDL1, and KIF15, this list includes genes with a known dominant inheritance pattern (presuming that QVs in these genes would have higher penetrance) and genes commonly found in familial IPF cohorts, despite they also occur in sporadic cases (7).

KIF15 and SPDL1 were incorporated to the list as recent largescale sequencing studies identified them as PF-related genes (4,22,23). Both genes are critical for mitosis, pointing to a novel, non-telomeric mechanism underlying IPF. Rare deleterious variants in KIF15 and three telomere genes (TERT, PARN and RTEL1) have been previously associated with IPF risk, early onset, and progression to early-age lung transplantation or death (23). In SPDL1, a rare missense variant was confirmed as a new IPF risk allele, although carriers did not exhibit distinct clinical features (4). For simplicity, we refer to this gene set as monogenic adult-onset PF genes.

Variants in these genes were filtered based on read depth (DP) <10, mapping quality (MQ) <50, or the percentage of missing genotypes (FMISS) >0.05 in the cohort. The remaining variants were annotated using the Variant Effect Predictor tool v109.3 (24). Variants with a global allele frequency (AF) >0.0005 in gnomAD v2.1 were excluded from the study. For our analyses, we retained protein-truncating variants (including frameshift, stop-gained, start-loss, and splicing variants) and missense variants with a CADD >15.

For the non-coding RNA gene TERC, due to the difficulty in predicting functional effects in non-coding genes, variants were considered for the analysis if their global population AF was <0.0005 and they were annotated by ClinVar as pathogenic (P), likely pathogenic (LP), or of uncertain significance (VUS).

All variants that met the aforementioned criteria were annotated as the total set of QVs. Additionally, QVs were manually classified according to ACMG guidelines (25) as P, LP, or VUS. Variants classified as P or LP comprised the set of pathogenic variants, while ClinVar variants were cross-referenced and annotated as VUS, P, or LP.

For sensitivity analysis, we defined six additional categories for QVs based on specific thresholds for population AF and different predicted variant effect. We also assessed a category of rare synonymous variants, expected to capture neutral variation, to use as a null model in in the association analyses. These criteria are summarized in Supplementary Table 3.

Principal components of genetic heterogeneity in the cohorts

Principal components (PC) were calculated after excluding single nucleotide polymorphisms (SNPs) with a minor allele frequency (MAF) <0.01 from WGS data, using BCFtools (https://samtools.github.io/bcftools/bcftools.html). Genotyping QC was then performed using PLINK v.1.9. First, SNPs with a genotyping call rate (CR) <95% or those deviating significantly from Hardy-Weinberg equilibrium (HWE) (p<1.×10−6) were removed. After linkage disequilibrium pruning (indep-pairwise 100 5 0.01), the main PCs of genetic variation were calculated based on 110,951 independent SNPs in the PFFPR and 143,214 independent SNPs in PROFILE (see Supplementary Figure 1).

Estimation of the PRS of IPF in the cohorts

The PRS-IPF for each patient was derived using the 19 previously published genome-wide significant IPF variants (Supplementary Table 4) using PRSice-2 (26). Briefly, PRS were calculated as the number of risk alleles carried by each individual, multiplied by the effect size of the variant as described in the GWAS study (26) summed across all variants included in the score:

PRS=i=1nβiGi

where βi is the OR (in the case of binary traits) from variant i,Gi represents the number of risk alleles carried at the variant i and n represents the conditionally independent signals identified elsewhere. Raw polygenic scores were then standardized as z-scores using the following formula:

PRSz=PRs-Mean(PRS)standarddeviations(PRS)

As part of the sensitivity analysis, we also used the same methodology to derive PRS for telomere length (PRS-TL) based on the 20 common variants that were previously found associated with leukocyte telomere length (TL) (27) (Supplementary Table 5). In this case, since TL is a quantitative trait, βi is represented by beta coefficients in the PRS formula.

Statistical analysis

Descriptive statistics were provided as mean (standard deviation, SD) or median (interquartile range, IQR) and valid percentage for continuous (quantitative) and categorical (binary) data, respectively. Categorical variables were compared using a Chi-squared test or a Fisher’s exact test as indicated.

To examine the relationship between the presence of QVs and the PRS, we first used the Student’s t-test and the Kolmogorov-Smirnov (KS) test to compare the mean PRS values and their distributions between QVs carriers and non-carriers. Additionally, we assessed this relationship using logistic regression models, adjusting for sex, age at diagnosis, and the two main PCs of genetic heterogeneity, which accounted for a significant proportion of genetic variance (Supplementary Figure 1C).

To examine the association between QVs, PRS, and survival, we used Cox proportional hazards regression models adjusted for sex, age at diagnosis, the two main PCs for genetic heterogeneity, smoking history, DLCO% predicted, FVC % predicted, and MUC5B risk allele carrier status necessary. The proportional hazards assumption of each covariate was assessed by plotting scaled Schoenfeld residuals against transformed time, revealing no evidence of non-proportional hazards.

The Survival R package (v.3.5–7) was used to calculate p-values, hazard ratios (HR), and 95% confidence intervals (CI). For visualizing survival differences, we generated Kaplan-Meier survival plots and tested the differences using log-rank tests.

Results from PFFPR and PROFILE studies were meta-analysed under a fixed-effects model to assess the directional concordance of associations, using the Meta R package (28). Statistical analyses were performed with R v.4.3.1, with p-values <0.05 considered significant.

RESULTS

Prevalence of QVs in the PFFPR

We identified 131 QVs in monogenic adult-onset PF genes in 144 patients from the PFFPR resulting in a diagnostic yield of finding a QV of 16.2%, with a 95% CI of 13.8–18.6 (Supplementary Figure 2, Table 1). Most patients (96.5%) carried a single QV, while five patients (3.5% of the QV carriers) had two or more QVs, including combinations such as NAF1/TERT, KIF15/RTEL1, TERT/SPDL1, TINF2/TERT/RTEL1, and RTEL1/RTEL1. Consistent with previous studies, the prevalence of QVs was higher in patients with a familial history of disease (27.3%) compared to those with sporadic disease (13.5%) (p=3.08×10−5).

Table 1.

Count of qualifying variants in monogenic adult-onset PF genes identified in the PFFPR patients.*

Gene Number of variants P/LP VUS Gene category

KIF15 15 3 12 Non_Telomere
SPDL1 8 2 7 Non_Telomere
SFTPC 5 0 5 Non_Telomere
SFTPA2 1 0 1 Non_Telomere
SFTPA1 1 0 1 Non_Telomere
ZCCHC8 5 1 4 Telomere
TINF2 3 3 0 Telomere
PARN 16 12 4 Telomere
RTEL1 33 18 15 Telomere
DKC1 1 0 1 Telomere
TERC 4 0 4 Telomere
NAF1 6 1 5 Telomere
TERT 31 18 13 Telomere

Total variants 131 58 73
*

Filtered by frequency (AF< 0.0005) and CADD score (>15). P/LP, Pathogenic or likely pathogenic; VUS, variant of uncertain significance.

Most QVs were in telomere genes (75.6%), while nearly a quarter were found in non-telomere genes (22.9%). The highest number of QVs were identified in telomere-related genes including RTEL1 (25.2%), TERT (23.7%), and PARN (12.2%). These genes also had the highest proportion of P/LP variants (31.0%, 31.0%, and 20.7%, respectively) (Table 1). In total, 42.7% of QVs were previously annotated in ClinVar as VUS, LP, or P.

Consistent with previous findings (7,18), carriers of the risk MUC5B allele (rs35705950 TT or TC genotype) had lower prevalence of QVs (13.5%) compared to those carrying the protective GG genotype (22.6%) (p=1.03×10−3).

Association of QVs and PRS-IPF in the PFFPR

Given the potential non-additive effects of QVs and the MUC5B common variant, we investigated whether individuals with lower polygenic risk were more likely to carry QVs compared to those with higher polygenic risks. We first compared the mean and distribution of PRS-IPF between QV carriers and non-carriers and found significant differences (Student’s t-test, p=1.30×10−3; KS-test, p=3.74×10−4) (Figure 2A). When patients were stratified into PRS-IPF tertiles, the prevalence of QVs was higher in the lowest tertile patients (low PRS-IPF) than in the highest tertile patients (high PRS-IPF), which associated with an increased risk of carrying a QV in the patients classified in the low PRS-IPF tertile (OR=1.79, 95% CI=1.15–2.81, p=0.010) (Figure 2B). The association persisted when the cohort was divided into two PRS-IPF categories, low and high (OR=1.74, 95% CI=1.20–2.53, p=3.57×10−3 (Supplementary Figure 3).

Figure 2. Association between prevalence of qualifying variants (QVs) and PRS-IPF in the PFFPR.

Figure 2.

A) Distribution of PRS-IPF in QV carriers (1) and non-carriers (0). Vertical dotted lines represent the mean value of the distribution. B) Risk of carrying a QV for patients with low polygenic risk (T1) and high polygenic risk (T3) compared to those in the middle tertile. The odds ratios (OR) and the 95% confidence intervals (CI) were estimated using logistic regression adjusted for sex, age of diagnosis, and the two main principal components.

Excluding the MUC5B locus from the PRS-IPF calculations yielded non-significant differences in the mean and distribution of PRS-IPF between QV carriers and non-carriers (Supplementary Figure 4A). However, QVs remained more common in the lowest PRS tertile patients compared to the highest (OR=1.60, 95% CI=1.01–2.54, p=0.05) (Supplementary Figure 4B).

To explore if these observations were independent of genetically predicted TL, we then assessed the association between the prevalence of QVs and PRS-TL, focusing only on the QVs in telomere genes. No significant associations were found between QVs and PRS-TL under the two assumptions (Supplementary Figures 57).

Association of QVs with survival in the PFFPR

Given that carriers of the MUC5B risk allele are associated with better survival and are less likely to carry QVs, we hypothesized that QV carriers might have poorer survival. Indeed, QVs carriers were associated with reduced survival (HR=1.53, 95% CI=1.12–2.10, p=7.33×10−3; log-rank test, p=0.022). The result was consistent when the analysis was limited to QVs that were classified as pathogenic or likely pathogenic variants (HR=1.71, 95%CI=1.11–2.65, p=0.016; log-rank test, p=0.043). However, no significant association was found for ClinVar variants alone (HR=1.35, 95% CI=0.87–2.09, p=0.18) (Figure 3A, Supplementary Figure 8). As an internal control, we found no association between survival and rare synonymous variants in the same monogenic adult-onset PF genes (HR=1.38, 95%CI=0.80–2.38, p=0.24) (Figure 3A).

Figure 3. Qualifying variants (QVs), MUC5B risk allele, PRS-IPF, and family history effect on survival.

Figure 3.

A) PFFPR. B) PROFILE. All analysis were performed using Cox regression models adjusted for sex, age of diagnosis, the two main principal components, smoking history, forced vital capacity (FVC) % predicted, and diffusing capacity for carbon monoxide (DLCO) % predicted, and the MUC5B risk allele whenever necessary. The X-axis shows Hazard-ratios (HR); the grey line corresponds to the HR=1.0. The circles correspond to adjusted HR and horizontal lines correspond to 95% confidence intervals (CI).

Further analyses showed that QVs in telomere-related genes had the largest effect on survival (HR=1.76, 95%CI=1.13–2.76, p=0.013; log-rank test, p=0.029), and QVs in PARN were particularly associated with worse survival (HR=2.28, 95%CI=1.11–4.68, p=0.03; log-rank test, p=0.035) (Figure 3A, Supplementary Figure 8).

We performed additional sensitivity analyses. First, excluding PARN QV carriers attenuated effect size, but the results remained significant (HR=1.46, 95% CI=1.04–2.03, p=0.03), indicating that other genes also contribute to the association with worse survival (Supplementary Figure 9). Second, as the probability of carrying QVs is higher among cases with a family history of PF, and family history of PF predicts reduced survival (29), we tried to account for the risk attributed to family history of PF. We found no relationship between family history of PF and survival (HR=1.09, 95%CI=0.80–1.49, p=0.59), suggesting that family history is not a major factor in this cohort’s survival outcomes (Figure 3A). Third, the criteria for defining QVs are subject to different choices and predictors. Using alternative and stricter definitions of QVs in the analyses (Supplementary Table 3), we found that the effect had consistent directionality across all QV definitions (Supplementary Figure 10). Additionally, we observed that applying more stringent in silico predictors resulted in a higher risk of reduced survival (ultrarare PTV HR=2.44, 95% CI=1.14–5.24, p=0.02). However, these criteria also led to a reduced number of identified carriers, thereby decreasing the power to detect significant differences.

Associations of PRS-IPF with survival in the PFFPR

Since both rare and common IPF genetic variants are associated with IPF survival, and QVs associate with worse survival, we then examined whether the polygenic component of IPF was also associated with survival. As PRS-IPF values were mainly influenced by the MUC5B effect, we did not adjust for the risk MUC5B genotype in these analyses, although we did for relevant individual covariates. We found that the lowest PRS-IPF tertile was associated with the worst survival (log-rank test, p=1.8×10−4; HR=1.61, 95% CI=1.25–2.07, p=1.9×10−4) (Figure 3A, Figure 4). In contrast, PRS-TL was not associated with survival, whether analyzed by tertiles or by high vs. low-risk groups (Supplementary Figure 11, Supplementary Figure 12).

Figure 4. Association between PRS-IPF and survival in PFFPR.

Figure 4.

Patients with low polygenic risk of IPF (T1) have worse survival in comparison with patients with high polygenic risk of IPF (T2 and T3).

A sensitivity analysis, excluding the MUC5B locus from the PRS-IPF calculation, yielded non-significant results (Supplementary Figure 13). To explore whether the association of PRS-IPF and survival was solely driven by the known association of MUC5B, we stratified patients by QV carrier status and assessed its effect in each group. The analyses showed that patients with lower PRS-IPF were associated with worse survival in both groups, although this association was attenuated among carriers (HR=1.76, 95% CI=0.99–3.15, p=0.055) compared to non-carriers (HR=1.54, 95% CI=1.16–2.04, p=2.5×10−3) (Supplementary Figure 14A, Supplementary Figure 14B). Similar results were obtained when we assessed the effect of the MUC5B rs35705950 genotypes on survival among both groups (Supplementary Figure 14C, Supplementary Figure 14D). Our findings suggest that the strong observed association between PRS-IPF and survival is mainly driven by MUC5B. While it is suspected that QVs may exert an independent but opposite effect on IPF survival, we cannot rule out that the observed differences were influenced by disparities in the group sizes.

Validation of results in PROFILE and meta-analysis

Using the same classification of QVs as in PFFPR, the diagnostic yield of finding a QVs in PROFILE was 15.67 % (95% CI=12.4–19.0%). However, PROFILE IPF patients showed no statistical differences in the mean and the distribution of PRS-IPF between QV carriers and non-carriers (Student’s t-test, p=0.17; KS-test, p=0.24). Similarly, although not statistically significant, an enrichment of QVs in the patients with lower polygenic component of IPF was observed compared to those with higher polygenic component (OR=1.29, 95% CI=0.78–2.15, p=0.31) (Supplementary Figure 15).

Despite these results, the association of QVs and IPF survival were validated in PROFILE patients (Figure 3B). As for PFFPR, the survival analyses of PROFILE patients focusing on the LP/P variants, both of all genes or only of telomeric genes, also showed the largest effect sizes (HR=1.98, 95% CI=1.28–3.05, p=2.1×10−3; log-rank test, p=0.023). However, we did not replicate the association of QVs in PARN with survival. Instead, the association with worse survival in PROFILE was observed for TERT QV carriers (HR=3.55, 95% CI=1.85–6.82, p=1.4×10−4; log-rank test, p=0.03) (Figure 3B, Supplementary Figure 16).

The meta-analysis of results from PFFPR and PROFILE cohorts showed a consistent direction of effect across all categories and supported a robust association between QVs (including “all variants”, “pathogenic”, “telomeric”, and “pathogenic telomeric”) and PRS-IPF tertiles with survival (Figure 5). In the meta-analysis, RTEL1 QVs were also nominally associated with IPF survival despite not reaching nominal significance in any of the two cohorts by separate. No association with IPF survival was found for QVs in SPDL1 and KIF15.

Figure 5. Meta-analysed results from adjusted PFFPR and PROFILE (N=1360) Cox regression models.

Figure 5.

DISCUSSION

Our study demonstrates that IPF patients carrying QVs in monogenic adult-onset PF genes are at increased risk of reduced survival compared to non-carriers. Additionally, we found that QV carriers tend to exhibit a lower polygenic risk component for IPF, as measured by PRS, suggesting non-additive effects between rare and common genetic variants. This indicates the existence of a distinct genetic subtype of IPF patients, defined by the interplay between these rare and common variants. These key findings were replicated across two independent studies encompassing a total of 1,360 IPF patients.

We describe for the first time that QVs in monogenic adult-onset PF genes are present in 1 in 6 to 1 in 7 IPF patients, a prevalence consistent with a recent WGS study, despite using different criteria for defining QVs (7). In comparison, our study used less stringent classification criteria for QV carrier, relying on one pathogenicity predictor instead of three, and including SPDL1 gene, recently identified as an IPF susceptibility gene (4). Previous studies primarily focused on telomere-related genes such as TERT, PARN, TERC, and RTEL1, resulting in lower diagnostic yield (8.5–11.5% carriers of QVs) (18,30). Some earlier studies were enriched for familial IPF patients (31), who tend to carry more QVs than sporadic cases (21.4% carriers of QVs). Unlike these studies, we expanded our analysis to include additional well-defined telomere genes in the analysis, such as DKC1, NAF1, ZCCHC8, and TINF2.

Most QVs were found in telomere-related genes (75.6% in PFFPR, 68.9% in PROFILE). The association between variants in these genes and worse IPF outcomes is well-established. Carriers of QVs in TERT, PARN, TERC, or RTEL1 tend to have earlier disease onset, more rapid lung function decline, and poorer survival compared to non-carriers (7,15,18), findings that are mirrored in individuals with short TL (7,17). However, the exact correlation between rare telomere-related variants and TL remains unclear, and the effect sizes of known common variants on TL are too small to fully explain this relationship (7).

Our study stands out by assessing the aggregate effect of QVs across both telomere and non-telomere genes on IPF survival. While the effect sizes were smaller compared to models that included only telomere-related variants, the most robust associations were found when considering all QVs across telomere and non-telomere genes. As expected, we identified few QVs in surfactant metabolism genes, and observed a comparable high burden of QVs in KIF15 and SPDL1, two recently reported IPF genes in European cohorts. A common SPDL1 missense variant (rs116483731) has been linked to IPF in the PROFILE cohort, particularly in the Finnish population (4), though without significant clinical differences between carriers and non-carriers. In contrast, KIF15 rs138043992 carriers from the FinnGen FinnIPF study demonstrated early disease onset and progression (23). In the PRRPF cohort, these two risk variants were excluded from the analysis based on the applied a global genome AF cut-off. We found that carriers of KIF15 and SPDL1 QVs did not exhibit worse survival. However, the same direction of effect was observed among KIF15 QV carriers in both the PFFPR and PROFILE cohorts. Future studies with larger sample sizes may be required to better define the clinical course of IPF patients with KIF15 QVs.

Identifying prognostic markers in PF is crucial to improve the clinical management. As antifibrotic drugs like nintedanib and pirfenidone can only slow disease progression (3234), early identification of patients with poorer prognosis could guide decisions on more aggressive treatments, such as lung transplantation. This information could also enhance the efficiency of clinical trials (35). Previous studies support that a family history of PF associates with patient survival, highlighting the importance of systematically ascertaining family histories for ILD patients to better inform prognosis (29). However, familial PF could encompass a broad group of ILD patients whose disease progression is dependent on different factors. For instance, one will be considering the risk linked to rare telomere-related variants (36,37), to common genetic variants (38), and to short age-adjusted TL (39), since they are interrelated with the family history of PF. Our findings revealed non-additive opposite effects of QVs and PRS-IPF on survival, while no apparent association was found with PRS-TL. Moreover, we did not find association between family history and IPF survival, although these results should be interpreted with caution considering that self-reported family history is prone to recall bias.

In contrast, our results show that QVs are robust predictors of IPF patient prognosis. Despite this, the use of genetic testing in IPF patients is not yet a generalized practice and current guidelines only recommend its use in familial forms of PF (9,40,41). Nonetheless we and others show the existence of a significant burden of QVs in cases in which familial history is not confirmed. It is therefore an urgent necessity to define additional criteria to improve the diagnostic yield of genetic tests in these patients. For some complex diseases, PRS have been suggested as a promising strategy for prioritizing patients who should undergo genetic sequencing (42). For example, in prostate cancer, the high penetrant variant HOXB13 G84E is most common in cases with the lowest PRS (43). The success of this approach depends on the strength of the PRS and the genetic architecture of the disease (42). IPF meets several criteria that make it suitable for it since: 1) There is modest genetic heterogeneity in this disease and the number of susceptibility genes appears to be markedly less than other complex diseases (26); 2) Its etiology is driven by both rare, highly penetrant variants which explain monogenic-like presentations, and common variants with low effect sizes that contribute for a polygenic disorder (8); and 3) the PRS-IPF accurately identifies individuals at high risk of suffering interstitial lung abnormalities (ILA) and IPF (44). In agreement, we have found that PRS-IPF values are inversely associated with the likelihood that QVs are present in the patient. This supports the idea that PRS could serve as a valuable tool for prioritizing those patients who should undergo a deeper sequence-based analysis.

We acknowledge some limitations. First, we are aware that other genes not considered for the study are also involved in IPF risk. To test the hypothesis, the analyses were restricted to a very limited number of genes showing dominant inheritance and presumably high penetrance. This resulted in the exclusion of well-defined monogenic PF genes, such as ABCA3 which shows recessive inheritance. In addition, there is not co-segregation data, TL measures, or functional evidence to accurately classify most QVs as P or LP. Therefore, we recognize that some variants categorized as QVs may be VUS or benign. To address this limitation, we have provided alternative QV definitions based on different in silico predictors and AF cutoffs which consistently showed the same direction of effect in the survival analysis. For PRS analyses, we relied on simple models based on sentinel variants from existing GWAS. For IPF, this implies that the PRS-IPF results are mainly driven by the effect of MUC5B. However, a recent study proved that a common genetic variant score complements the MUC5B variant in accurately identifying individuals at high risk of suffering ILA and IPF (44). Therefore, the use of whole-genome PRS instead of the sentinel variant-based PRS model might have benefits by offering robust associations across cohorts. Moreover, it is important to highlight that most IPF GWAS conducted to date mainly involve participants of European genetic ancestry. As a result, the PRS-IPF derived in this study might be much less accurate for non-European ancestry participants of the cohorts. Finally, despite the main findings were consistent across PFFPR and PROFILE cohorts, we acknowledge that there were differences in the definition of the primary endpoint or the received antifibrotic treatment between the two studies which could have influenced the results. Additionally, patient recruitment in PROFILE started before the approval of pirfenidone and nintedanib, which might explain the reduced median survival of patients compared to PFFPR. These differences might explain some of the inconsistencies, such as the association with survival found for PARN or TERT QV carriers only in one of the cohorts but not in the other.

CONCLUSIONS

We found a robust association of telomere and non-telomere gene QVs with IPF patient survival. We also show that those QVs were non-additive to common IPF risk variants on survival. This study highlights the potential significance of identifying QVs in telomere and non-telomere genes linked to monogenic forms of PF in clinical practice.

Supplementary Material

Supplement 1

ACKNOWLEDGEMENTS

We thank all patients who participated in the PFF Patient Registry. We also thank investigators and other staff at participating PFF Care Centers for providing clinical data and blood samples, the PFF which established and has maintained the Patient Registry since 2016, and lastly, the many generous donors.

FUNDING

Instituto de Salud Carlos III (PI20/00876, PI23/00980), co-financed by the European Regional Development Funds (ERDF), “A way of making Europe” from the EU; ITER agreements (OA17/008 and OA23/043); Cabildo Insular de Tenerife (CGIEU0000219140 and A0000014697); NIH/NHLBI grants UG3HL145266 and R01HL171918. LVW reports funding from the Medical Research Council (MR/V00235X/1).

DATA AVAILABILITY

Data supporting the findings are available as part of the manuscript or from the supplementary files. Access to the raw whole-genome sequence dataset is restricted to qualified researchers under an agreement with the PFFPR and PROFILE steering committees to protect the privacy of the participants. For further information and to apply for access to the data from the PFFPR prior to public deposit in BioLINCC, please contact the chair of the ancillary committee (Dr. Noth) or any other member of the steering committee (https://www.pulmonaryfibrosis.org/pff-registry/pff-patient-registry). For further information and to apply for access to the PROFILE data, please contact admin.mtwc@imperial.ac.uk. Data access requests must be reviewed before release.

REFERENCES

  • 1.Lederer DJ, Martinez FJ. Idiopathic Pulmonary Fibrosis. N Engl J Med. 2018. May 10;378(19):1811–23. [DOI] [PubMed] [Google Scholar]
  • 2.Pires FS, Damas C, Mota P, Melo N, Costa D, Jesus JM, et al. Slow versus rapid progressors in idiopathic pulmonary fibrosis. European Respiratory Journal [Internet]. 2011. Sep 1 [cited 2024 Mar 14];38(Suppl 55). Available from: https://erj.ersjournals.com/content/38/Suppl_55/p656 [Google Scholar]
  • 3.Allen RJ, Porte J, Braybrooke R, Flores C, Fingerlin TE, Oldham JM, et al. Genetic variants associated with susceptibility to idiopathic pulmonary fibrosis in people of European ancestry: a genome-wide association study. The Lancet Respiratory medicine. 2017. Nov;5(11):869–80. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Dhindsa RS, Mattsson J, Nag A, Wang Q, Wain LV, Allen R, et al. Identification of a missense variant in SPDL1 associated with idiopathic pulmonary fibrosis. Commun Biol. 2021. Mar 23;4(1):1–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Fingerlin TE, Murphy E, Zhang W, Peljto AL, Brown KK, Steele MP, et al. Genome-wide association study identifies multiple susceptibility loci for pulmonary fibrosis. Nature Genetics. 2013;45(6):613–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Partanen JJ, Häppölä P, Zhou W, Lehisto AA, Ainola M, Sutinen E, et al. Leveraging global multi-ancestry meta-analysis in the study of idiopathic pulmonary fibrosis genetics. Cell Genomics. 2022;2(10):100181. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Zhang D, Newton CA, Wang B, Povysil G, Noth I, Martinez FJ, et al. Utility of whole genome sequencing in assessing risk and clinically relevant outcomes for pulmonary fibrosis. European Respiratory Journal [Internet]. 2022. Dec 1 [cited 2023 Oct 12];60(6). Available from: https://erj.ersjournals.com/content/60/6/2200577 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Peljto AL, Blumhagen RZ, Walts AD, Cardwell J, Powers J, Corte TJ, et al. Idiopathic Pulmonary Fibrosis Is Associated with Common Genetic Variants and Limited Rare Variants. Am J Respir Crit Care Med. 2023. May 1;207(9):1194–202. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Raghu G, Remy-Jardin M, Richeldi L, Thomson CC, Inoue Y, Johkoh T, et al. Idiopathic Pulmonary Fibrosis (an Update) and Progressive Pulmonary Fibrosis in Adults: An Official ATS/ERS/JRS/ALAT Clinical Practice Guideline. Am J Respir Crit Care Med. 2022. May 1;205(9):e18–47. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Peljto AL, Zhang Y, Fingerlin TE, Ma SF, Garcia JGN, Richards TJ, et al. Association between the MUC5B promoter polymorphism and survival in patients with idiopathic pulmonary fibrosis. JAMA. 2013. Jun;309(21):2232–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Cai S, Allen RJ, Wain LV, Dudbridge F. Reassessing the association of MUC5B with survival in idiopathic pulmonary fibrosis. Ann Hum Genet. 2023. Sep;87(5):248–53. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Allen RJ, Oldham JM, Jenkins DA, Leavy OC, Guillen-Guio B, Melbourne CA, et al. Longitudinal lung function and gas transfer in individuals with idiopathic pulmonary fibrosis: a genome-wide association study. The Lancet Respiratory Medicine [Internet]. 2022. Nov; Available from: 10.1016/S2213-2600(22)00251-X [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Oldham JM, Allen RJ, Lorenzo-Salazar JM, Molyneaux PL, Ma SF, Joseph C, et al. PCSK6 and Survival in Idiopathic Pulmonary Fibrosis. Am J Respir Crit Care Med. 2023. Jun 1;207(11):1515–24. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Ley B, Torgerson DG, Oldham JM, Adegunsoye A, Liu S, Li J, et al. Rare Protein-Altering Telomere-related Gene Variants in Patients with Chronic Hypersensitivity Pneumonitis. Am J Respir Crit Care Med. 2019. Nov 1;200(9):1154–63. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Newton CA, Batra K, Torrealba J, Kozlitina J, Glazer CS, Aravena C, et al. Telomere-related lung fibrosis is diagnostically heterogeneous but uniformly progressive. European Respiratory Journal. 2016;48(6):1710–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Borie R, Kannengiesser C, Hirschi S, Pavec JL, Mal H, Bergot E, et al. Severe hematologic complications after lung transplantation in patients with telomerase complex mutations. The Journal of heart and lung transplantation : the official publication of the International Society for Heart Transplantation. 2015. Apr;34(4):538–46. [DOI] [PubMed] [Google Scholar]
  • 17.Stuart BD, Lee JS, Kozlitina J, Noth I, Devine MS, Glazer CS, et al. Effect of telomere length on survival in patients with idiopathic pulmonary fibrosis: An observational cohort study with independent validation. The Lancet Respiratory Medicine. 2014;2(7):557–65. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Dressen A, Abbas AR, Cabanski C, Reeder J, Ramalingam TR, Neighbors M, et al. Analysis of protein-altering variants in telomerase genes and their association with MUC5B common variant status in patients with idiopathic pulmonary fibrosis: a candidate gene sequencing study. Lancet Respir Med. 2018. Aug;6(8):603–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Wang BR, Edwards R, Freiheit EA, Ma Y, Burg C, de Andrade J, et al. The Pulmonary Fibrosis Foundation Patient Registry. Rationale, Design, and Methods. Ann Am Thorac Soc. 2020. Dec;17(12):1620–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Maher TM. PROFILEing idiopathic pulmonary fibrosis: rethinking biomarker discovery. European Respiratory Review. 2013. Jun 1;22(128):148–52. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Maher TM, Oballa E, Simpson JK, Porte J, Habgood A, Fahy WA, et al. An epithelial biomarker signature for idiopathic pulmonary fibrosis: an analysis from the multicentre PROFILE cohort study. The Lancet Respiratory Medicine. 2017. Dec 1;5(12):946–55. [DOI] [PubMed] [Google Scholar]
  • 22.Zhang D, Povysil G, Kobeissy PH, Li Q, Wang B, Amelotte M, et al. Rare and Common Variants in KIF15 Contribute to Genetic Risk of Idiopathic Pulmonary Fibrosis. Am J Respir Crit Care Med. 2022. Jul 1;206(1):56–69. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Hollmén M, Laaka A, Partanen JJ, Koskela J, Sutinen E, Kaarteenaho R, et al. KIF15 missense variant is associated with the early onset of idiopathic pulmonary fibrosis. Respir Res. 2023. Sep 30;24(1):240. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Harrison PW, Amode MR, Austine-Orimoloye O, Azov AG, Barba M, Barnes I, et al. Ensembl 2024. Nucleic Acids Research. 2024. Jan 5;52(D1):D891–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Richards S, Aziz N, Bale S, Bick D, Das S, Gastier-Foster J, et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med. 2015. May;17(5):405–24. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Allen RJ, Stockwell A, Oldham JM, Guillen- B, Schwartz DA, Maher TM, et al. wide association study across five cohorts identifies five novel loci associated with idiopathic pulmonary fibrosis. 2022;1–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Li C, Stoma S, Lotta LA, Warner S, Albrecht E, Allione A, et al. Genome-wide Association Analysis in Humans Links Nucleotide Metabolism to Leukocyte Telomere Length. Am J Hum Genet. 2020. Mar 5;106(3):389–404. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Schwarzer G, Carpenter J, Rücker G. Meta-Analysis with R. 2015.
  • 29.Cutting CC, Bowman WS, Dao N, Pugashetti JV, Garcia CK, Oldham JM, et al. Family History of Pulmonary Fibrosis Predicts Worse Survival in Patients With Interstitial Lung Disease. CHEST. 2021. May 1;159(5):1913–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Petrovski S, Todd JL, Durheim MT, Wang Q, Chien JW, Kelly FL, et al. An Exome Sequencing Study to Assess the Role of Rare Genetic Variation in Pulmonary Fibrosis. American journal of respiratory and critical care medicine. 2017. Jul;196(1):82–93. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Liu Q, Zhou Y, Cogan JD, Mitchell DB, Sheng Q, Zhao S, et al. The Genetic Landscape of Familial Pulmonary Fibrosis. Am J Respir Crit Care Med. 2023. May 15;207(10):1345–57. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Justet A, Klay D, Porcher R, Cottin V, Ahmad K, Molina MM, et al. Safety and efficacy of pirfenidone and nintedanib in patients with idiopathic pulmonary fibrosis and carrying a telomere-related gene mutation. European Respiratory Journal. 2021;57(2):4–7. [DOI] [PubMed] [Google Scholar]
  • 33.Richeldi L, du Bois RM, Raghu G, Azuma A, Brown KK, Costabel U, et al. Efficacy and safety of nintedanib in idiopathic pulmonary fibrosis. N Engl J Med. 2014. May 29;370(22):2071–82. [DOI] [PubMed] [Google Scholar]
  • 34.King TE, Bradford WZ, Castro-Bernardini S, Fagan EA, Glaspole I, Glassberg MK, et al. A phase 3 trial of pirfenidone in patients with idiopathic pulmonary fibrosis. N Engl J Med. 2014. May 29;370(22):2083–92. [DOI] [PubMed] [Google Scholar]
  • 35.Bowman WS, Newton CA, Linderholm AL, Neely ML, Pugashetti JV, Kaul B, et al. Proteomic biomarkers of progressive fibrosing interstitial lung disease: a multicentre cohort analysis. The Lancet Respiratory Medicine. 2022. Jun 1;10(6):593–602. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Armanios MY, Chen JJL, Cogan JD, Alder JK, Ingersoll RG, Markin C, et al. Telomerase Mutations in Families with Idiopathic Pulmonary Fibrosis. New England Journal of Medicine. 2007;356(13):1317–26. [DOI] [PubMed] [Google Scholar]
  • 37.Stuart BD, Choi J, Zaidi S, Xing C, Holohan B, Chen R, et al. Exome sequencing links mutations in PARN and RTEL1 with familial pulmonary fibrosis and telomere shortening. Nature Genetics. 2015;47(5):512–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Leavy OC, Ma SF, Molyneaux PL, Maher TM, Oldham JM, Flores C, et al. Proportion of Idiopathic Pulmonary Fibrosis Risk Explained by Known Common Genetic Loci in European Populations. American Journal of Respiratory and Critical Care Medicine. 2020. Nov;203(6):775–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Pulmonary fibrosis in non-mutation carriers of families with short telomere syndrome gene mutations - PubMed [Internet]. [cited 2024 Jun 19]. Available from: https://pubmed.ncbi.nlm.nih.gov/34580961/ [DOI] [PubMed]
  • 40.Borie R, Kannengiesser C, Antoniou K, Bonella F, Crestani B, Fabre A, et al. European Respiratory Society statement on familial pulmonary fibrosis. European Respiratory Journal [Internet]. 2023. Mar 1 [cited 2023 Oct 12];61(3). Available from: https://erj.ersjournals.com/content/61/3/2201383 [DOI] [PubMed] [Google Scholar]
  • 41.Zhang D, Eckhardt CM, McGroder C, Benesh S, Porcelli J, Depender C, et al. Clinical Impact of Telomere Length Testing for Interstitial Lung Disease. CHEST [Internet]. 2024. Jun 28 [cited 2024 Jul 2];0(0). Available from: https://journal.chestnet.org/article/S0012-3692(24)00808-0/abstract [DOI] [PubMed] [Google Scholar]
  • 42.Lu T, Zhou S, Wu H, Forgetta V, Greenwood CMT, Richards JB. Individuals with common diseases but with a low polygenic risk score could be prioritized for rare variant screening. Genet Med. 2021. Mar;23(3):508–15. [DOI] [PubMed] [Google Scholar]
  • 43.Darst BF, Sheng X, Eeles RA, Kote-Jarai Z, Conti DV, Haiman CA. Combined Effect of a Polygenic Risk Score and Rare Genetic Variants on Prostate Cancer Risk. Eur Urol. 2021. Aug;80(2):134–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Moll M, Peljto AL, Kim JS, Xu H, Debban CL, Chen X, et al. A Polygenic Risk Score for Idiopathic Pulmonary Fibrosis and Interstitial Lung Abnormalities. Am J Respir Crit Care Med. 2023. Oct;208(7):791–801. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement 1

Data Availability Statement

Data supporting the findings are available as part of the manuscript or from the supplementary files. Access to the raw whole-genome sequence dataset is restricted to qualified researchers under an agreement with the PFFPR and PROFILE steering committees to protect the privacy of the participants. For further information and to apply for access to the data from the PFFPR prior to public deposit in BioLINCC, please contact the chair of the ancillary committee (Dr. Noth) or any other member of the steering committee (https://www.pulmonaryfibrosis.org/pff-registry/pff-patient-registry). For further information and to apply for access to the PROFILE data, please contact admin.mtwc@imperial.ac.uk. Data access requests must be reviewed before release.


Articles from medRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES