Abstract
Background
Improving lung cancer risk assessment is required because current early-detection screening criteria miss most cases. We therefore examined the utility for lung cancer risk assessment of a DNA Repair score obtained from OGG1, MPG, and APE1 blood tests. In addition, we examined the relationship between the level of DNA repair and global gene expression.
Methods
We conducted a blinded case-control study with 150 non–small cell lung cancer case patients and 143 control individuals. DNA Repair activity was measured in peripheral blood mononuclear cells, and the transcriptome of nasal and bronchial cells was determined by RNA sequencing. A combined DNA Repair score was formed using logistic regression, and its correlation with disease was assessed using cross-validation; correlation of expression to DNA Repair was analyzed using Gene Ontology enrichment.
Results
DNA Repair score was lower in case patients than in control individuals, regardless of the case’s disease stage. Individuals at the lowest tertile of DNA Repair score had an increased risk of lung cancer compared to individuals at the highest tertile, with an odds ratio (OR) of 7.2 (95% confidence interval [CI] = 3.0 to 17.5; P < .001), and independent of smoking. Receiver operating characteristic analysis yielded an area under the curve of 0.89 (95% CI = 0.82 to 0.93). Remarkably, low DNA Repair score correlated with a broad upregulation of gene expression of immune pathways in patients but not in control individuals.
Conclusions
The DNA Repair score, previously shown to be a lung cancer risk factor in the Israeli population, was validated in this independent study as a mechanism-based cancer risk biomarker and can substantially improve current lung cancer risk prediction, assisting prevention and early detection by computed tomography scanning.
Lung cancer is the leading cause of cancer deaths, being responsible for about 156 000 deaths per year in the United States (1,2), and 388 000 deaths per year in Europe (3). Detection of lung cancer at an early stage dramatically increases the 5-year survival rate from less than 5% at stage IV to 54%–73% at stage I, highlighting the effectiveness of early detection in reducing lung cancer mortality (4). Identification of people at high risk for lung cancer is important for empowering effective early detection, and currently the main lung cancer risk factors are age and smoking history (5–9). Indeed, recommendations for computed tomography (CT) screening for early lung cancer detection in the United States are based on age and heavy smoking, which were used in the National Lung Screening Trial (NLST) (10). This trial showed that CT screening followed by treatment led to a 20% reduction in lung cancer mortality (10), confirmed by the NELSON trial, which recently showed a 26% reduction in lung cancer mortality in men after CT screening and treatment (11), and consistent with the earlier single-arm I-ELCAP study (12). If the NLST selection criteria were to be applied to all eligible individuals in the United States, estimations show that only 27% of lung cancer case patients would be detected (13). Although this is a substantial proportion, it does suggest that additional risk factors await discovery and employment.
Because DNA repair is crucial for removing DNA damage and avoiding mutations (14–16), suboptimal DNA repair is likely to cause cancer risk. This is clearly illustrated in the extreme cases of germline mutations in DNA Repair genes, which cause repair deficiency and high predisposition to cancer (17–19). Similarly, suboptimal DNA Repair activity was reported to be associated with cancer risk for a variety of DNA Repair mechanisms and cancer types (20–25).
Here we examined, in a blinded, case-control study, whether a low DNA Repair score, calculated from the DNA Repair enzymatic activities of OGG1 (8-oxoguanine DNA glycosylase), MPG (methylpurine DNA glycosylase), and APE1 (apurinic/apyrimidinic endonuclease 1) (also known as OMA score), all of which act primarily on oxidative DNA damage, is a risk biomarker for lung cancer in the United Kingdom as it was in the Israeli population (23,25–27). In addition, we examined whether variation in the DNA Repair score measured in peripheral blood mononuclear cells (PBMC) correlates with systematic effects on gene expression in airway cells of lung cancer patients and control individuals.
Methods
Study Participants
Patients at least age 18 years referred to Royal Papworth Hospital, Cambridge, Thoracic Oncology service for investigation of suspected lung cancer were recruited between October 2013 and November 2015. Those with a history of previous lower or upper airway cancer were excluded, as were patients with symptoms who were initially recruited to the study but later were found to have no evidence of cancer. Control individuals were healthy volunteers recruited from the Cambridge BioResource, a panel of approximately 17 600 volunteers, both with and without health conditions, who are willing to be approached to participate in research studies (https://www.cambridgebioresource.group.cam.ac.uk). They were selected in an attempt to form a sex and age composition comparable to the cases group. Disease stage was reported according to the seventh edition of the tumor, lymph node, metastasis (TNM) classification of malignant tumors (28). Research ethics approvals for sample collection from participants in this study were given by East of England Cambridge Central REC 13/EE/0012 and the National Research Ethics Service Committee South East Coast – Surrey 13/LO/0889. Table 1 summarizes the characteristics of case patients and control individuals.
Table 1.
Characteristics | Control individuals (N = 143) | Case patients (N = 150) | P * |
---|---|---|---|
Sex, No. (%) | .291 | ||
Male | 79 (55.2) | 93 (62.0) | |
Female | 64 (44.8) | 57 (38.0) | |
Age, mean (SD) | 59.7 (10.0) | 68.6±9.6 | <.001 |
Smoking status, No. (%) | <.001 | ||
Current | 48 (33.6) | 62 (41.3) | |
Ex-smoker | 50 (35.0) | 87 (58.0) | |
Never | 45 (31.5) | 1 (0.7) | |
Pack-years, mean (SD) | 16.8±20.3 | 43.2±27.6 | <.001 |
Histology, No. (%) | |||
Adenocarcinoma | — | 83 (55.3) | |
Squamous | — | 56 (37.3) | |
NSCLC-NOS | — | 11 (7.3) | |
COPD, No. (%) | <.001 | ||
None | 112 (83.6) | 56 (41.8) | |
Mild | 9 (6.7) | 26 (19.4) | |
Moderate | 10 (7.5) | 31 (23.1) | |
Severe | 3 (2.2) | 16 (11.9) | |
Severity unknown | 0 (0.0) | 5 (3.7) |
P value comparing the distribution of case patients with control individuals: For categorial values the chi-square test was used; for continuous variables the unpaired t test was used. COPD = chronic obstructive pulmonary disease; NSCLC-NOS = non–small cell lung cancer–not otherwise specified; — = not applicable; Missing values: pack-years: three control individuals; COPD: nine control individuals and 16 case patients.
Specimens, Protein Extracts, DNA Repair Assays, and RNA Sequencing
Each participant donated 17 mL blood (two 8.5 mL ACD vacutainers) from which, within 2–4 hours, PBMC were isolated by Ficoll fractionation and frozen as previously described (23,29). Protein extracts, DNA substrates, and DNA Repair assays were performed as previously described (24,25,30,31) and are briefly described in the Supplementary Methods (available online). The outline of the DNA Repair assays is described in Figure 1. Bronchial and nasal sample collection, along with RNA extraction, sequencing, and analysis, are described in the Supplementary Methods (available online).
Statistical Analysis
Personal characteristics of case patients and control individuals were compared using unpaired t tests for continuous variables and χ2 tests for categorical variables. The smoking status distribution differed between case patients and control individuals, with approximately 30% of the control individuals being never smokers compared to only one case patient. In addition, whereas case patients and control subjects were quite well-matched by sex, the control individuals were approximately 9 years younger than the case patients on average. Therefore, all comparisons of case patients and control individuals were adjusted for smoking and age, as well as sex. The adjustment was performed by using regression models in which these factors were included as covariates. Because of the large imbalance in the smoking variable, these regression analyses essentially compare the 98 ever-smoked control individuals with the 149 ever-smoked case patients, and little information is gained from the never-smokers. Indeed, in additional analyses where we excluded the never-smokers, the regression results were virtually unchanged from those reported in Results. Chronic obstructive pulmonary disease (COPD) was reported more commonly among the lung cancer case patients than among the control individuals; no adjustment for COPD was made in our analyses because it is likely a mediating factor rather than a confounder in the association of DNA Repair enzymes with lung cancer, and adjusting for mediating factors is not recommended (32).
The association of enzymatic DNA repair activity levels with lung cancer risk was evaluated using an unconditional logistic regression model in which the presence or absence of lung cancer was the binary dependent variable, and the covariates were the DNA Repair enzyme levels (OGG1, MPG, and APE1) together with age (continuous), sex, and smoking status (smoker, past smoker, never smoker) as adjusting variables. From this model, the DNA Repair scores were derived using the estimated regression coefficients for the OGG1, MPG, and APE1 variables. The resulting formula of the DNA Repair score for each study participant was DNA Repair score = 0.00621 × APE1 – 0.047 × OGG1 – 0.0223 × MPG. Examination of the scores for case patients and control individuals showed them to be approximately normally distributed within each group. This score was then entered as a covariate in a new logistic regression model, either as a continuous variable or in tertile categories (according to the score distribution in the control individuals), with the same adjusting variables, to obtain odds ratio estimates and receiver-operating characteristic (ROC) curves to describe the strength of the DNA Repair score–lung cancer association. The DNA Repair score formula optimized for this cohort of participants was different from the previous study (25), most likely because of the revised APE1 assay in this study and the fact that different cohorts of modest size were used in the two studies. To overcome the bias arising from use of the same data to develop the DNA Repair score and evaluate its strength of association, the odds ratios and ROC curves were reestimated using “leave-one-out” cross-validation (33). Confidence limits (bias-correction and acceleration method) for the area under the curve (AUC) of the ROC and the comparison test of two AUCs were computed using the nonparametric bootstrap. The continuous scores were also compared with case-control status, age group (≤55, 56–69, ≥70 years), sex, smoking status, and histology and disease stage (among case patients) using analysis of covariance, in each case adjusting for the other factors (Table 2). A P value equal to or less than .05 was considered statistically significant and all tests were two-sided.
Table 2.
Control individuals (n = 140) |
Case patients (n = 149) |
||||
---|---|---|---|---|---|
Characteristics | No. | DNA Repair score mean (95% CI) | P | No. | DNA Repair score mean (95% CI) |
All† | 140 | 4.00 (3.84 to 4.16) | 149 | 2.67 (2.50 to 2.84) | |
P Case vs control‡ < .001 | |||||
Histology | |||||
SQCC | – | – | 56 | 2.69 (2.45 to 2.93) | |
Adenocarcinoma | – | – | 82 | 2.62 (2.38 to 2.86) | |
Unknown | – | – | 11 | 2.92 (1.87 to 3.98) | |
P SQCC vs adenocarcinoma§ = .65 | |||||
Age, y | |||||
≤55 | 43 | 4.28 (3.96 to 4.59) | 16 | 2.36 (1.82 to 2.89) | |
56–69 | 75 | 4.00 (3.81 to 4.20) | 64 | 2.86 (2.55 to 3.16) | |
≥70 | 22 | 3.45 (2.95 to 3.94) | 69 | 2.56 (2.35 to 2.78) | |
P Age trend‖ = .075 | |||||
Sex | |||||
Male | 77 | 3.82 (3.59 to 4.05) | 93 | 2.71 (2.49 to 2.93) | |
Female | 63 | 4.21 (3.99 to 4.44) | 56 | 2.60 (2.32 to 2.88) | |
P M vs F‖ = .47 | |||||
Smoking status | |||||
Never smoked | 45 | 4.07 (3.79 to 4.36) | 1 | 4.85 | |
Past smoker | 49 | 3.98 (3.72 to 4.24) | 87 | 2.60 (2.40 to 2.81) | |
Current smoker | 46 | 3.95 (3.62 to 4.27) | 61 | 2.72 (2.43 to 3.02) | |
P Ever-smoked vs never smoked‖ = .32; | |||||
P Current smoker vs others‖ = .65 | |||||
COPD | |||||
No | 109 | 4.02 (3.85 to 4.19) | 56 | 2.86 (2.55 to 3.18) | |
Yes | 22 | 3.82 (3.30 to 4.34) | 77 | 2.57 (2.35 to 2.79) | |
P Yes vs no¶ = .13 |
The DNA Repair score was defined as: 0.00621 × APE1 − 0.047 × OGG1 − 0.0223 × MPG, where APE1, OGG1, and MPG each represent the measured enzyme activity of an individual. The weights for each component used in the DNA Repair score were calculated from the logistic regression and chosen to optimize strength of association of the score with lung cancer for the observed data set. APE1 = apurinic/apyrimidinic endonuclease 1; CI = confidence interval; COPD = chronic obstructive pulmonary disease; MPG = methylpurine DNA glycosylase; OGG1 = 8-oxoguanine DNA glycosylase; SQCC = squamous cell carcinoma.
Values were missing for three control individuals (two current and one past smoker) and one case patient (current smoker).
Two-sided P value comparing the distribution of case patients with control individuals: analysis of covariance was used adjusting the comparison for continuous age, sex, and smoking status.
Two-sided P value for comparing SQCC with adenocarcinoma: analysis of covariance was used adjusting for case-control-status, continuous age, sex, and smoking status.
Two-sided P values obtained from analysis of covariance for factor of interest, adjusted for case-control status and other factors from among age, sex, and smoking status.
Two-sided P value comparing COPD vs no COPD; using analysis of covariance adjusting for continuous age and sex.
Results
DNA Repair Score in Lung Cancer Patients Compared to Control Individuals in the United Kingdom
To examine whether a low DNA Repair score is associated with lung cancer risk in the United Kingdom, we conducted a case-control study with 150 non–small cell lung cancer case patients and 143 control individuals for whom we tested the enzymatic activities for OGG1, MPG, and APE1 in PBMC. Participant accrual, blood collection, and PBMC isolation were performed in the United Kingdom, after which the PBMC samples were coded and sent unidentified to Israel to determine their DNA Repair activities. After completion of DNA Repair testing, the study was unblinded, and the clinical and DNA Repair data were used for the statistical analysis. From the logistic regression relating levels of OGG1, MPG, and APE1 to lung cancer risk (adjusting for age, sex, and smoking status), the DNA Repair score (defined in the footnote to Table 2) was derived. Results of the analysis of this DNA Repair score are presented in Table 2 and Figure 2. The mean score in patients was 2.67 (95% confidence interval [CI] = 2.50 to 2.84) and was lower than in the control individuals: 4.00 (95% CI = 3.84 to 4.16; P < .001; Table 2). Consistently, the distribution of DNA Repair score values among patients was shifted to lower values compared to control individuals (Figure 2). There were no statistically significant differences between the mean scores of men and women; individuals with or without COPD; never, past, and current smokers; and patients with adenocarcinoma or squamous cell carcinoma (Table 2). There appeared to be a decrease in DNA Repair score with age among control individuals, but it did not reach statistical significance (Table 2).
Association Between Low DNA Repair Score and Lung Cancer
When the DNA Repair score was assessed as a continuous variable in relation to lung cancer risk, a 1-SD decrease in the DNA Repair score was associated with an increased adjusted odds ratio of 2.7 (95% CI = 1.9 to 3.7; P < .001), and 2.5 (95% CI = 1.8 to 3.4; P < .001) after cross-validation (Table 3). When the score was categorized into tertiles, using the tertile with the highest score as reference, the adjusted odds ratio for lung cancer in the lowest tertile was 11.5 (95% CI = 4.3 to 31.2; P < .001), and 7.2 (95% CI = 3.0 to 17.5; P < .001) after cross-validation (Table 3). These results suggest that a low DNA Repair score is strongly associated with lung cancer risk.
Table 3.
Variable | Score value | No. of control individuals (%) | No. of case patients (%) | Adjusted OR† (95% CI) | P | Cross-validation Adjusted OR‡ (95% CI) | P |
---|---|---|---|---|---|---|---|
Score per 1 SD decrease§ | 0.98 U | 140 (100.0) | 149 (100.0) | 2.7 (1.9 to 3.7) | <.001 | 2.5 (1.8 to 3.4) | <.001 |
Score by tertiles‖ | |||||||
Highest tertile | ≥4.52 | 47 (33.6) | 6 (4.0) | 1.0 (Referent) | — | 1.0 (Referent) | — |
Middle tertile | 3.68–4.51 | 46 (32.9) | 22 (14.8) | 3.0 (1.0 to 8.7) | .048 | 1.6 (0.6 to 4.2) | — |
Lowest tertile | ≤3.68 | 47 (33.6) | 121 (81.2) | 11.5 (4.3 to 31.2) | <.001 | 7.2 (3.0 to 17.5) | — |
DNA Repair score definition: 0.00621 × APE1 – 0.047 × OGG1 – 0.0223 × MPG. APE1 = apurinic/apyrimidinic endonuclease 1; CI = confidence interval; MPG = methylpurine DNA glycosylase; OGG1 = 8-oxoguanine DNA glycosylase; OR = odds ratio.
Unconditional logistic regression adjusted for age, sex, and smoking status (smoker, past smoker, never smoker).
Cross-validated odds ratios calculated according to the SDs and tertiles of the cross-validated DNA Repair scores.
Score was fitted in the unconditional logistic regression model as a continuous variable. 0.98 U is the 1 SD in the control group for the DNA Repair scores. For each model, the estimated odds ratios for smoking were slightly different and the range was as follows: current vs never = 57.1–59.4; current v ex = 1.27–1.29.
Tertiles of the control individuals’ values. The upper tertile was chosen as the referent.
Patients' DNA Repair Score by Disease Stage Compared to Healthy Individuals
The DNA Repair score for case patients was examined within each disease stage and each TNM category. The mean score in patients with stage IA lung cancer was 2.95 (95% CI = 2.56 to 3.35), statistically significantly lower than the mean of control individuals (4.00, 95% CI = 3.84 to 4.16; P = .0009), adjusted for age, sex, and smoking status (Figure 3 and Supplementary Table 1, available online). Within TNM categories, the mean DNA Repair scores in lung cancer patients with T1A, N0, and M0 were 2.80 (95% CI = 2.35 to 3.25), 2.90 (95% CI = 2.64 to 3.17), and 2.76 (95% CI = 2.58 to 2.95), respectively, each statistically significantly lower than in control subjects (DNA Repair score = 4.00), with P values of .011 for the first, and less than .0001 for the latter two comparisons. These findings are consistent with low DNA Repair score being a risk factor for lung cancer, possibly acting from the early stages of lung carcinogenesis. The tests performed show a further small yet statistically significant decline in the DNA Repair score for more advanced disease stage, N category and M category, and there is also a statistically nonsignificant decline over T categories (Supplementary Table 1, available online).
Sensitivity, Specificity, and ROC of DNA Repair Score Testing
Plotting the sensitivity and specificity of the DNA Repair score combined with age and smoking status in ROC curves yielded an AUC of 0.89 (95% CI = 0.82 to 0.93) and 0.88 after cross-validation (Figure 4B). When DNA Repair score was assessed separately from age and smoking, the AUC was 0.81 for the DNA Repair score (0.80 after cross-validation; Figure 4A). The AUC for age and smoking status without the DNA repair score was 0.83 (95% CI = 0.75 to 0.89) (Figure 4B). The added value of the DNA Repair score in estimating lung cancer risk on top of the risk predicted based on age and smoking was statistically significant (P = .0002) and is illustrated by the following sensitivity values: at a specificity of 95%, the sensitivity of smoking and age as risk factors is 34%, with the DNA Repair score exhibiting a similar independent sensitivity of 36%. When the two are combined, the sensitivity is increased to 54%. Similarly, at a specificity of 90%, the combined sensitivity is 64%, higher than smoking and age (43%) or DNA Repair score (50%) risk factors alone.
Correlation Between DNA Repair Score and Expression of Biological Pathways in Lung Cancer Patients
Having the DNA Repair scores of a group of people, we sought to examine whether interpersonal variations in DNA repair correlate with systemic effects in the human body. To that end, we examined the relationship between the DNA Repair score and whole transcriptome RNAseq analysis in airway epithelial cells. Analysis included 213 participants (92 healthy control individuals and 121 lung cancer patients) selected after performing quality control procedures and data cleaning on a larger RNAseq dataset (see Supplementary Methods, available online). The correlation between RNA expression levels and DNA Repair score was tested separately for case patients and control subjects, by regressing the RNA expression levels of each gene in the dataset on the DNA Repair score values using the DESeq2 tool (34). Sex, age, smoking status, and experimental batch were incorporated as adjusting variables. Few genes were statistically significantly correlated with scores at a false discovery rate threshold of 0.01 (Supplementary Methods, available online).
Hypothesizing that there is a signal in the data that is distributed over many genes, with small effect size in each gene, we examined whether there are pathways that are statistically significantly enriched with varying score values, using Gene Set Enrichment Analysis (GSEA) (35). For each group (case patients and control subjects), we ranked approximately 15 000 genes based on the statistics of DESeq2 and used this ranking as input to GSEA with default parameters. We tested for overrepresentation of genes belonging to specific pathways as defined by gene ontology terms (Supplementary Tables 2 and 3, available online). Pathways identified by GSEA were manually curated and divided into three groups: immune system-related pathways, cell cycle pathways, and other pathways (see keywords in legend to Supplementary Table 3, available online). The analysis revealed 185 immune response pathways that were statistically significantly enriched, spanned by 3154 unique genes negatively correlated with the DNA Repair score. Strikingly, this correlation was specific for the lung cancer patients and was not observed in the control individuals’ samples (Figure 5, Supplementary Table 3, available online). The dramatic enrichment in the immune response pathways was robust against subsampling, indicating that it is not a sampling bias (Supplementary Methods and Supplementary Figure 3, available online).
Another set of pathways that exhibited negative correlation with the DNA Repair score represents cell cycle pathways (Supplementary Table 3, available online). This correlation was found both in lung cancer patients and control individuals, with a larger number of pathways enriched in the control participants. RNAseq analysis of 37 bronchial samples, which were obtained only from case patients, also showed an enrichment of immune system–related pathways (Figure 5). In conclusion, the expression data suggest that low DNA Repair score correlates with broad upregulation of immune response pathways in lung cancer patients, but not in control individuals.
Discussion
The results presented here indicate that a personal low baseline DNA Repair score, composed of the enzymatic activities of OGG1, MPG, and APE1, is a strong risk factor for lung cancer. These results validate a previous study performed in an Israeli population (25) and exhibit comparable estimated relative risk values (Supplementary Figure 4, available online), suggesting that low DNA Repair score is a general lung cancer risk factor in different countries and populations.
Although the studies were retrospective, several lines of evidence suggest that a low DNA Repair score may play a causative role in lung cancer: DNA Repair score is lower in patients at any disease staging (overall disease stage and each TNM staging separately) than in control individuals, including stage I disease, consistent with low personal baseline DNA repair contributing to the carcinogenesis process; We have previously compared DNA Repair activity in blood samples before and after surgery of non–small cell lung cancer patients and found no correlation between DNA Repair activity and the presence of the tumor (27); In a study on the role of low DNA Repair in the risk of head and neck cancer, we found that DNA Repair activity at diagnosis, and 3–4 years after treatment and cure, was similar within the 95% confidence interval (26). This suggests that baseline DNA repair is a personal feature characteristic of an individual and is not affected by the disease; and the general well-established role of DNA repair is eliminating DNA damage and avoiding mutations and cancer (36).
Nevertheless, to definitively prove the causative role of the DNA Repair biomarker in lung cancer, a prospective study is needed.
Screening for early detection of lung cancer based on the NLST criteria (age, heavy smoking) is expected to apply to about 7.5% of all smokers and ex-smokers in the United States and detect about 27% of lung cancers (13). This leaves 92.5% of smokers and former smokers in the United States and 73% of lung cancer case patients with no proper risk assessment for early detection of lung cancer. Our results suggest that adding the DNA Repair score to the age and smoking considerations is likely to improve risk assessment of individuals. For example, using the Liverpool Lung Project model (see Data Supplements for the calculation, available online), a male aged 65 years, having smoked for 30 years with no other risk factors, has a projected 1.6% 5-year risk of developing lung cancer (37). However, assuming the person also has a low DNA Repair score (≤5th percentile), his risk increases approximately fourfold to 6.9% (Figure 4C;Supplementary Table 4, available online). Conversely, if the person had smoked for 50 years, then his projected risk would be 4.5%, but knowing that the person had a relatively high DNA Repair score (≥75th percentile), his risk would drop to 1.1% (Supplementary Table 4, available online). These results illustrate how using the DNA Repair score could greatly improve lung cancer risk assessment, aiding selection of patients for lung cancer early detection and paving the way to broadening primary and secondary prevention of lung cancer beyond the NLST criteria.
Aside from the association of low DNA Repair score with lung cancer risk, we observed a striking correlation between low DNA Repair score and a broad activation of immune-related pathways in lung cancer patients, measured by RNAseq gene expression (Supplementary Tables 2 and 3, available online). The correlation withstood two types of robustness tests, indicating that the effect did not occur by chance. Why was such an effect not observed in control individuals? It is possible that activation of the immune system requires the presence of cancer cells or cells presenting neoantigens, and in addition, the level of DNA repair was lower in case patients than in control individuals. A low DNA repair might cause a higher mutational burden and an increase in neoantigens, contributing to the activation of the immune system (38,39), similar to the situation in tumors with a mismatch repair deficiency (40–42). In addition, the tested DNA Repair enzymes affect the immune system via other mechanisms (43–47). Because only about 1% of lung cancer case patients exhibit a mismatch repair deficiency (48,49), a low DNA Repair score might serve as a biomarker for response to immunotherapy of these patients, but of course this needs to be directly tested.
The data obtained thus far indicate that DNA Repair score testing may provide a highly useful risk biomarker that when combined with smoking history and age can provide effective risk estimates for lung cancer, beyond the classical smoking history and age risk factors (Figure 4C). Such risk assessment may have applications in referring low-dose CT screening to high-risk individuals who are ineligible under current NLST criteria and aiding management of indeterminate pulmonary nodules, assisting prevention, early detection, and therapy of lung cancer.
Funding
This work was funded by grants from NIH/NCI/EDRN (#1 U01 CA111219), the Flight Attendant Medical Research Institute, Florida, the Mike Rosenbloom Foundation and Weizmann Institute of Science to ZL and TPE; and by grants from Cancer Research UK to BP and to the Cancer Research UK Cambridge Centre; and by a UK National Institute for Health Research Senior Fellowship to BP; and by the Cambridge Biomedical Research Centre and the Cancer Research UK Cambridge Centre to RCR. Volunteer participant recruitment through the Cambridge Bioresource was funded by the Cambridge Biomedical Research Centre.
Notes
Affiliations of authors: Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel (TPE, YLD, YE, ZL); Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK (KBM, FMG, MOR, HK, BP); Department of Oncology, University of Cambridge, Cambridge, UK (KBM, FMG, MOR, HK, RCR, BP); Bioinformatics Unit, Grand Israel National Center for Personalized Medicine, Weizmann Institute of Science, Rehovot, Israel (BM); Biostatistics Unit, Gertner Institute for Epidemiology and Public Health Policy Sheba Medical Center, Tel Hashomer, Israel (RF, LSF); Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy (FMG); Department of Thoracic Oncology, Royal Papworth Hospital, Cambridge, UK (RCR).
We gratefully acknowledge the participation of all NIHR Cambridge BioResource volunteers, and thank the NIHR Cambridge BioResource Centre and staff for their contribution. We thank the National Institute for Health Research and NHS Blood and Transplant.
We thank Dr Brendan Dougherty, Dr David Meek, Dr Nick Carroll, Dr Nicky Simler, Amanda Stone, Anne Joy, Tania Pettit, Ann Thomson, Victoria Senior, Theresa Green, Amy Gladwell for subject recruitment, and data and sample collection; Tim Young, Julia Knight, and Natalie Allen for excellent technical assistance in sample processing; the study participants, for enabling us to perform this study; and Prof Eytan Domany (Department of Physics of Complex Systems, Weizmann Institute of Science) for his advice on the bioinformatics analysis.
Conflict of interest statement. The corresponding authors (ZL and TPE) have an issued patent on the OGG1 risk factor for lung cancer and patent applications issued and pending for the OGG1, MPG, and APE1 panel of DNA Repair biomarkers. The other authors declare no conflict of interest.
The funders had no role in the design of the study; the collection, analysis, and interpretation of the data; the writing of the manuscript; and the decision to submit the manuscript for publication.
Supplementary Material
References
- 1. Torre LA, Siegel RL, Jemal A.. Lung cancer statistics In: Ahmad A, Gadgeel S, eds. Lung Cancer and Personalized Medicine: Current Knowledge and Therapies. Switzerland: Springer Nature; 2016:1–19. [Google Scholar]
- 2. National Cancer Institute Surveillance, Epidemiology, and End Results Program SEER Cancer Statistics Review, 1975–2014, Lung and bronchus. https://seer.cancer.gov/archive/csr/1975_2014/results_merged/sect_15_lung_bronchus.pdf.
- 3. Ferlay J, Colombet M, Soerjomataram I, et al. Cancer incidence and mortality patterns in Europe: estimates for 40 countries and 25 major cancers in 2018. Eur J Cancer. 2018;103(11):356–387. [DOI] [PubMed] [Google Scholar]
- 4. National Cancer Institute NIH. SEER Cancer Statistics Review, 1975–2016. https://seer.cancer.gov/csr/1975_2016/results_merged/sect_15_lung_bronchus.pdf.
- 5. D’Amelio AM Jr, Cassidy A, Asomaning K, et al. Comparison of discriminatory power and accuracy of three lung cancer risk models. Br J Cancer. 2010;103(3):423–429. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Schmidt-Hansen M, Berendse S, Hamilton W, Baldwin DR.. Lung cancer in symptomatic patients presenting in primary care: a systematic review of risk prediction tools. Br J Gen Pract. 2017;67(659):e396–e404. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Marcus MW, Raji OY, Field JK.. Lung cancer screening: identifying the high risk cohort. J Thorac Dis . 2015;7(suppl 2):S156–S162. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Li K, Husing A, Sookthai D, et al. Selecting high-risk individuals for lung cancer screening: a prospective evaluation of existing risk models and eligibility criteria in the German EPIC Cohort. Cancer Prev Res (Phila). 2015;8(9):777–785. [DOI] [PubMed] [Google Scholar]
- 9. Katki HA, Kovalchik SA, Petito LC, et al. Implications of nine risk prediction models for selecting ever-smokers for computed tomography lung cancer screening. Ann Intern Med . 2018;169(1):10–19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Aberle DR, Adams AM, Berg CD, et al. Reduced lung-cancer mortality with low-dose computed tomographic screening. N Engl J Med. 2011;365(5):395–409. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. de Koning H, Van Der Aalst C, Haaf T, Oudkerk M.. Effect of volume CT lung cancer screening: mortality results of the NELSON randomised-controlled population-based trial. In: IASLC 19th World Conference on Lung Cancer. J Thoracic Oncol. 2018;13(10). [Google Scholar]
- 12. International Early Lung Cancer Action Program Investigators, Henschke CI, Yankelevitz DFet al. Survival of patients with stage I lung cancer detected on CT scanning. N Engl J Med. 2006;355(17):1763–1771. [DOI] [PubMed] [Google Scholar]
- 13. Pinsky PF, Berg CD.. Applying the National Lung Screening Trial eligibility criteria to the US population: what percent of the population and of incident lung cancers would be covered? J Med Screen. 2012;19(3):154–156. [DOI] [PubMed] [Google Scholar]
- 14. Vogelstein B, Kinzler KW.. Cancer genes and the pathways they control. Nat Med. 2004;10(8):789–799. [DOI] [PubMed] [Google Scholar]
- 15. Hanahan D, Weinberg RA.. Hallmarks of cancer: the next generation. Cell. 2011;144(5):646–674. [DOI] [PubMed] [Google Scholar]
- 16. Friedberg EC, Walker GC, Siede W, Wood RD, Schultz RA, Ellenberger T.. DNA Repair and Mutagenesis. 2nd ed. Washington, DC: ASM Press; 2006. [Google Scholar]
- 17. Modrich P. Mismatch repair, genetic stability, and cancer. Science. 1994;266(5193):1959–1960. [DOI] [PubMed] [Google Scholar]
- 18. Marteijn JA, Lans H, Vermeulen W, Hoeijmakers JH.. Understanding nucleotide excision repair and its roles in cancer and ageing. Nat Rev Mol Cell Biol. 2014;15(7):465–481. [DOI] [PubMed] [Google Scholar]
- 19. Welcsh PL, King MC.. BRCA1 and BRCA2 and the genetics of breast and ovarian cancer. Hum Mol Genet. 2001;10(7):705–713. [DOI] [PubMed] [Google Scholar]
- 20. Wei Q, Cheng L, Amos CI, et al. Repair of tobacco carcinogen-induced DNA adducts and lung cancer risk: a molecular epidemiologic study. J Natl Cancer Inst. 2000;92(21):1764–1772. [DOI] [PubMed] [Google Scholar]
- 21. Sakiyama T, Kohno T, Mimaki S, et al. Association of amino acid substitution polymorphism in DNA repair genes TP53, POLI, REV1 and LIG4 with lung cancer risk. Int J Cancer. 2005;114(5):730–737. [DOI] [PubMed] [Google Scholar]
- 22. Bau DT, Mau YC, Ding SL, Wu PE, Shen CY.. DNA double-strand break repair capacity and risk of breast cancer. Carcinogenesis. 2007;28(8):1726–1730. [DOI] [PubMed] [Google Scholar]
- 23. Paz-Elizur T, Krupsky M, Blumenstein S, Elinger D, Schechtman E, Livneh Z.. Reduced DNA repair activity for oxidative damage and the risk of lung cancer. J Natl Cancer Inst. 2003;95(17):1312–1319. [DOI] [PubMed] [Google Scholar]
- 24. Leitner-Dagan Y, Sevilya Z, Pinchev M, et al. Enzymatic MPG DNA repair assays for two different oxidative DNA lesions reveal associations with increased lung cancer risk. Carcinogenesis. 2014;35(12):2763–2770. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Sevilya Z, Leitner-Dagan Y, Pinchev M, et al. Low integrated DNA repair score and lung cancer risk. Cancer Prev Res. 2014;7(4):398–406. [DOI] [PubMed] [Google Scholar]
- 26. Paz-Elizur T, Ben-Yosef R, Elinger D, et al. Reduced repair of the oxidative 8-oxoguanine DNA damage and the risk of head and neck cancer. Cancer Res. 2006;66(24):11683–11689. [DOI] [PubMed] [Google Scholar]
- 27. Paz-Elizur T, Elinger D, Blumenstein S, Krupsky M, Schechtman E, Livneh Z.. Novel molecular targets for risk identification: DNA repair enzyme activities. Cancer Biomark. 2007;3(3):129–133. [DOI] [PubMed] [Google Scholar]
- 28. Goldstraw P, Crowley J, Chansky K, et al. International Association for the Study of Lung Cancer International Staging C, Participating I. The IASLC Lung Cancer Staging Project: proposals for the revision of the TNM stage groupings in the forthcoming (seventh) edition of the TNM classification of malignant tumours. J Thorac Oncol. 2007;2(8):706–714. [DOI] [PubMed] [Google Scholar]
- 29. Paz-Elizur T, Elinger D, Leitner-Dagan Y, et al. Development of an enzymatic DNA repair assay for molecular epidemiology studies: distribution of OGG activity in healthy individuals. DNA Repair. 2007;6(1):45–60. [DOI] [PubMed] [Google Scholar]
- 30. Sevilya Z, Leitner-Dagan Y, Pinchev M, et al. Development of APE1 enzymatic DNA repair assays: low APE1 activity is associated with increase lung cancer risk. Carcinogenesis. 2015;36(9):982–991. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Leitner-Dagan Y, Sevilya Z, Pinchev M, et al. N-methylpurine DNA glycosylase and OGG1 DNA repair activities: opposite associations with lung cancer risk. J Natl Cancer Inst. 2012;104(22):1765–1769. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Wang T, Li H, Su P, et al. Sensitivity analysis for mistakenly adjusting for mediators in estimating total effect in observational studies. BMJ Open. 2017;7(11):e015640. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Geisser S. Predictive Inference. New York, NY: Chapman and Hall; 1993. [Google Scholar]
- 34. Love MI, Huber W, Anders S.. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15(12):550.. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Subramanian A, Tamayo P, Mootha VK, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA. 2005;102(43):15545–15550. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Paz-Elizur T, Sevilya Z, Leitner-Dagan Y, Elinger D, Roisman LC, Livneh Z.. DNA repair of oxidative damage in human carcinogenesis: potential application for cancer risk assessment and prevention. Cancer Lett. 2008;266(1):60–72. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Cassidy A, Myles JP, van Tongeren M, et al. The LLP risk model: an individual risk prediction model for lung cancer. Br J Cancer. 2008;98(2):270–276. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Rizvi NA, Hellmann MD, Snyder A, et al. Cancer immunology. Mutational landscape determines sensitivity to PD-1 blockade in non-small cell lung cancer. Science. 2015;348(6230):124–128. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Gubin MM, Zhang X, Schuster H, et al. Checkpoint blockade cancer immunotherapy targets tumour-specific mutant antigens. Nature. 2014;515(7528):577–581. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Le DT, Durham JN, Smith KN, et al. Mismatch repair deficiency predicts response of solid tumors to PD-1 blockade. Science. 2017;357(6349):409–413. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Germano G, Lamba S, Rospo G, et al. Inactivation of DNA repair triggers neoantigen generation and impairs tumour growth. Nature. 2017;552(7683):116–120. [DOI] [PubMed] [Google Scholar]
- 42.Administration USFD. FDA grants accelerated approval to pembrolizumab for first tissue/site agnostic indication. 2017. https://www.fda.gov/drugs/resources-information-approved-drugs/fda-grants-accelerated-approval-pembrolizumab-first-tissuesite-agnostic-indication. Accessed September 14, 2019.
- 43. Guikema JE, Linehan EK, Tsuchimoto D, et al. APE1- and APE2-dependent DNA breaks in immunoglobulin class switch recombination. J Exp Med. 2007;204(12):3017–3026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Stavnezer J, Linehan EK, Thompson MR, et al. Differential expression of APE1 and APE2 in germinal centers promotes error-prone repair and A: T mutations during somatic hypermutation. Proc Natl Acad Sci USA. 2014;111(25):9217–9222. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Xu J, Husain A, Hu W, Honjo T, Kobayashi M.. APE1 is dispensable for S-region cleavage but required for its repair in class switch recombination. Proc Natl Acad Sci USA. 2014;111(48):17242–17247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Visnes T, Cazares-Korner A, Hao W, et al. Small-molecule inhibitor of OGG1 suppresses proinflammatory gene expression and inflammation. Science. 2018;362(6416):834–839. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Vlahopoulos S, Adamaki M, Khoury N, Zoumpourlis V, Boldogh I.. Roles of DNA repair enzyme OGG1 in innate immunity and its significance for lung cancer. Pharmacol Ther. 2018;194(2):59–72. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Imielinski M, Berger AH, Hammerman PS, et al. Mapping the hallmarks of lung adenocarcinoma with massively parallel sequencing. Cell. 2012;150(6):1107–1120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Govindan R, Ding L, Griffith M, et al. Genomic landscape of non-small cell lung cancer in smokers and never-smokers. Cell. 2012;150(6):1121–1134. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.