Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2012 Apr 15.
Published in final edited form as: Cancer Res. 2011 Apr 12;71(8):3009–3017. doi: 10.1158/0008-5472.CAN-10-2510

Lung cancer diagnosis from proteomic analysis of preinvasive lesions

SM Jamshedur Rahman 1, Adriana L Gonzalez 1, Ming Li 1, Erin H Seeley 1, Lisa J Zimmerman 1, Xueqiong J Zhang 1, M Lisa Manier 1, Sandy Olson 1, Ronak N Shah 1, Alison Miller 1, Joe B Putnam 1, York E Miller 1, Wilbur A Franklin 1, William J Blot 1, David P Carbone 1, Yu Shyr 1, Richard M Caprioli 1, Pierre P Massion 1
PMCID: PMC3110721  NIHMSID: NIHMS279327  PMID: 21487035

Abstract

Early detection may help improve survival from lung cancer. In this study our goal was to derive and validate a signature from the proteomic analysis of bronchial lesions that could predict the diagnosis of lung cancer. Using previously published studies of bronchial tissues we selected a signature of 9 matrix-assisted laser desorption ionization mass spectrometry (MALDI MS) mass to charge ratio features to build a prediction model diagnostic of lung cancer. The model was based on MALDI MS signal intensity (MALDI score) from bronchial tissue specimens from our 2005 published cohort of 51 patients. The performance of the prediction model in identifying lung cancer was tested in an independent cohort of bronchial specimens from 60 patients. The probability of having lung cancer based on the proteomic analysis of the bronchial specimens was characterized by an area under the receiver operating characteristic curve of 0.77 (95% CI 0.66 to 0.88) in this validation cohort. Eight of the 9 features were identified and validated by Western blotting and immunohistochemistry. These results demonstrate that proteomic analysis of endobronchial lesions may facilitate the diagnosis of lung cancer and the monitoring of high risk individuals for lung cancer in surveillance and chemoprevention trials.

Keywords: Lung cancer, Proteomic signature, Early detection, Biomarkers, Tumorigenesis

Introduction

Informative biomarkers for early detection would be of great utility in improving the outcome of patients with lung cancer, the leading cause of cancer deaths in the United States and worldwide (1). Recent attempts at early detection strategies using low dose chest CT are showing promise in reducing lung cancer related mortality (2). Yet, continued research is needed in assessing individuals for their risk in developing and for having lung cancer (3,4).

Although only a minority of preinvasive lung lesions progress to cancer, they are associated with an increased risk of having lung cancer (510). These lesions are likely to harbor molecular alterations indicative of lung cancer risk in the entire affected epithelium, including tumors of different histological subtypes (11, 12). However, this histology-based assessment of these lesions is limited in many ways. First, grading is semi-quantitative at best and the agreement between two observers for the reporting of the histopathology according to the World Health Organization (WHO) criteria has not been studied extensively (13, 14). Distinguishing mild from moderate dysplasia and severe dysplasia from carcinoma in situ is difficult, for example. Consequently, most studies evaluate bronchial preinvasive lesions that are either low or high grade. Secondly, the effect of the structural integrity of the biopsy on the interpretability of the histology features remains a significant obstacle to this area of research. Thus, the development of a quantitative measure of probability of having lung cancer based on the molecular analysis of the bronchial epithelium, and independent of the histological grade, may have important implications. Our study was therefore designed to measure the probability of having a lung cancer based on the proteomic analysis of endobronchial lesions.

We recently identified patterns of protein expression in fresh frozen bronchial lesions obtained by autofluorescence bronchoscopy and resected lung tissue samples at different stages of tumor progression using matrix-assisted laser desorption ionization mass spectrometry (MALDI MS) (15). Our proteomic analysis generated signatures discriminating subtypes of lesions in a continuum from normal lung to invasive lung tumor.

Here we took our original observation further and hypothesized that the bronchial epithelium expresses a subset of proteins that once increased confer an increased probability of having lung cancer. To assess the probability of having a lung cancer based on the analysis of bronchial lesions, we are moving away from a pathological scale based on the grade of the lesions to a quantitative scale based on the proteomic analysis of these bronchial specimens. We selected 9 MALDI MS features (mass to charge ratio [m/z]) as a signature from two previous studies (15, 16). The novel proteomic signature of 9 features gives rise to a score based on MALDI MS signal intensity that is used as a continuous variable to predict the diagnosis of lung cancer and circumvents many of the problems associated with a histology-based assessment of the lesions. We identified 8 of the 9 features and tested their expression by immunohistochemistry (IHC) in formalin fixed paraffin-embedded (FFPE) tissues and Western blotting.

Materials and Methods

Study design

The study design is summarized in Figure 1. To derive a bronchial epithelium-based signature predictive of lung cancer, results from two independent studies published from our laboratory were examined for top ranking discriminatory features (m/z) identified by MALDI MS. The first study led to features distinguishing normal lung from invasive lung tumors (n=93) (16), and the second study led to features distinguishing normal bronchial epithelium, preinvasive bronchial lesions and invasive lung tumors (n=51) (15). Nine features (m/z) were selected from these 144 patients as candidates of a novel signature from the intercept of best classifiers from these two previous studies based on their statistical significance with false discovery rate-adjusted p values <0.01 and based on the expert visual confirmation of the characteristics of the peak. The m/z values of the selected features were 4749, 4965, 6175, 8451, 8565, 9955, 11048, 12275, and 12345. The features (m/z) were identified and their expression was verified by biochemical methods. To evaluate the proteomic signature of bronchial lesions as a biomarker diagnostic of lung cancer, we built a prediction model using the Rahman et al. 2005 cohort (51 patients, 41 with and 10 without lung cancer) (15) and validated it in a new dataset (60 patients, 40 with and 20 without lung cancer). The intensity of the signature (9 m/z values) was transformed to a MALDI MS score to predict the diagnosis of having lung cancer (see statistical analysis).

Figure 1.

Figure 1

Overview of the study design. Proteomic data of two independent studies were examined for top ranking discriminatory features identified by MALDI MS. A signature of 9 features was selected from the intercept of best classifiers from these two previous studies based on their statistical significance with false discovery rate-adjusted p values <0.01 and based on the expert visual confirmation of the characteristics of the peak. To assess the association of the signature with cancer status we first build a model using Rahman et al. 2005 cohort (n=51). The intensity of the signature (9 m/z values) was transformed to MALDI MS score to predict the risk of having lung cancer. The model was then tested in the validation cohort (n=60)

The project was approved by the local Institutional Review Board and informed consent was obtained on all individuals at all institutions.

Patients and tissue samples

A total of 69 bronchial biopsies and surgical lung tissue specimens were collected from the 60 patients in the validation cohort (Table 1 and supplementary Table 1) who participated between 2003 and 2008 at Vanderbilt University Medical Center (VUMC), the Nashville Veteran Affairs Medical Center (VAMC) or the University of Colorado Health Science Center (UCHSC). Multiple bronchial specimens were obtained at the time of bronchoscopy or surgical resection. Patients undergoing bronchoscopy for clinical suspicion of lung cancer were consented for this research study and agreed to undergo autofluorescence bronchoscopy and to provide bronchial specimens at predetermined normal sites (with normal fluorescence ratio) and at suspicious sites (with increased fluorescence ratio (17) in addition to the samples taken for clinical diagnosis. Patients with lung cancer in both the validation cohort and the Rahman et al. 2005 cohort included different subtypes and all stages of lung cancer (Supplementary Tables 1 and 2). As shown in these supplementary tables some patients in both cohorts provided more than one tissue specimen. Specimens collected for research were snap frozen and later cut, stained and graded by the pathologists (WAF and ALG) according to the WHO nomenclature (14). Only bronchial epithelium specimens were considered in the analysis so that no alveolar tissue sample was used in either the training or test set. Normal bronchial epithelium specimens were obtained from surgical specimens at least 2 cm away from the tumor. Two pathologists (ALG and WAF) reviewed the hematoxylin and eosin (H&E) stained sections of each lesion.

Table 1.

Characteristics of the patients in the validation cohort according to cancer status.

Patients No cancer
(n=20)
Cancer
(n=40)
Age, years (IQR) * 69 (62–74) 62 (54–71)
Gender
   Male, n (%) 18 (90) 27 (68)
   Female, n (%) 2 (10) 13 (32)
Smoking status
   Never smoker, n (%) 2 (10) 2 (5)
   Ex-smoker, n (%) 8 (40) 24 (60)
   Current, n (%) 10 (50) 14 (35)
   PKY, (IQR) 50 (38–68) 50 (40–80)

Abbreviation: PKY, pack year (pack per day × number of years smoked), IQR, interquartile range.

*

IQR (1st–3rd) values are the median.

Acquisition of MALDI-time of flight (TOF) MS data

Bronchial biopsy specimens were frozen in liquid nitrogen immediately after collection, embedded in Tissue-Tek Optimal Cutting Temperature compound (IMEB, USA) and stored at −80°C. For MALDI MS profiling, a 12 µm section was cut and thaw-mounted directly onto a gold-coated MALDI plate followed by fixation in graded ethanol washes (70, 90, and 95% for 30 sec each). An adjacent 7 µm section was also cut and stained with H&E following standard protocols. Photomicrographs of H&E stained tissue sections were obtained with a Mirax Scan digital microscope slide scanner (Mirax, Budapest, Hungary) at a pixel resolution of 0.23 µm. The serial 12 µm histology images were annotated using Mirax viewer software by a pathologist (ALG) using a small circular shape (200 µm) to mark areas of interest for matrix spotting (Supplementary Figure 1). Color coding was used to designate areas of different histology grade within the tissue section. Annotated histology images were then overlaid with the MALDI plate images using Adobe Photoshop (San Jose, CA). Fiducial points were placed into at least four distinctive areas that are visible in the MALDI plate image and served as landmarks for registering the MALDI plate to the robotic spotter. Using these fiducial points, coordinates of the histology annotations are transformed into a system recognized by the spotter. An acoustic robotic microspotter (LabCyte, Sunnyvale, CA) was used to place crystalline matrix (25 mg/ml sinapinic acid in 1:1 acetonitrile: 0.2% trifluoroacetic acid) spots (~120 pL) that were typically 180–220 µm in diameter (18). Subsequent to spotting, spot placement was evaluated for accuracy compared to the location designated by the pathologist. Spots that deviated significantly from their desired locations were eliminated from further analysis. Tissue profile spectra were acquired from the spotted areas using an Autoflex II (Bruker Daltonics, Billerica, MA) MALDI-TOF MS equipped with a SmartBeamTM Laser (Nd:YAG, 355 nanometer) Bruker Autoflex II (Billerica, MA). An accelerating voltage 20 kV was used with an extraction voltage of 18.65 kV and an Einzel lens voltage of 6.00 kV. Delayed extraction (350 ns) was optimized for resolution at m/z 12000. Data were collected over the range of 2–40 kDa. A total of 400 laser shots were collected for each profile spectrum. Spectra were converted to ASCII files for preprocessing and statistical analysis.

Validation of the identified protein expression by Western immunoblotting

Validation of the identified proteins was performed by Western immunoblotting using surgically resected specimens of squamous cell carcinomas of the lung, as well as adjacent histologically normal tissue obtained from the Vanderbilt lung Specialized Programs of Research Excellence (SPORE) tissue repository. Samples were snap-frozen and stored at −80°C until protein lysates were prepared. Protein concentration of the supernatant was determined by the bicinchoninic acid (BCA) protein assay (Thermo Scientific, Rockford, IL) and 40 µg of total protein from each sample was separated on 15% SDS-polyacrylamide gels (ubiquitin, macrophage migration inhibitory factor [MIF], thymosin β4, cytochrome c) or 4–12% Bis-Tris gels (Invitrogen, Carlsbad, CA) (acyl-coA binding protein [ACBP], and cystatin A [CSTA]). Proteins were transferred onto a polyvinylidene diflouride (PVDF) membrane and treated with blocking solution before treatment with primary antibodies for thymosin β4 (1:1000, Santa Cruz Biotechnology, Santa Cruz, CA), ubiquitin (1:1000, R&D Systems, Minneapolis, MN), ACBP (1:250, Abcam, Cambridge, MA), CSTA (1:2500, R&D Systems, Minneapolis, MN), cytochrome c (1:1000, Cell Signaling, Beverly, MA), and MIF (1:500, R&D Systems, Minneapolis, MN). The blots were treated with horseradish peroxidase (HRP)-conjugated secondary antibodies and developed with the enhanced chemiluminescence system (Perkin-Elmer, Waltham, MA). Immunoblotting of β-actin (Sigma, Saint Louis, MO) was used as a loading control.

Validation of the identified protein expression by immunohistochemistry (IHC)

Five µ tissue sections were cut from FFPE lung tissue blocks, deparaffinized by placing in 3 xylene baths, 10 minutes each, hydrated through a series of ethanol baths of decreasing concentration, and placed in a buffer bath of TBS. Endogenous peroxide was quenched using Dako Peroxidase Blocking Reagent and non-specific staining blocked using Dako Serum-Free Protein Block according to manufacturer’s instructions. Optimal staining for each antibody was obtained by using a citrate antigen retrieval and the following primary antibody dilutions: thymosin β4 (1:500, BioDesign, Saco, ME), ubiquitin (1:400, Dako, Carpinteria, CA), ACBP (1:80, Abcam, Cambridge, MA), CSTA (1:8000, Sigma, Saint Louis, MO), ProteinTech Group, Chicago, IL), cytochrome c (1:1000, Cell Signaling, Beverly, MA), and MIF (1:200, Santa Cruz Biotechnology, Santa Cruz, CA). Primary antibodies were applied at room temperature for 1 hour. All further steps and chromogen development were carried out according to manufacturer’s instructions (Dako EnVision+ System, HRP). Slides were counterstained using a Gills #1 hematoxylin, rinsed appropriately and dehydrated through a series of ethanol baths of increasing concentration into xylene. A non-aqueous coverslipping material was used (Cytoseal or Permount). Results of the IHC staining were analyzed by the pathologist (ALG).

Tissue microarrays (TMA) of lung cancer subtypesfrom 59 patients were prepared from paraffin blocks. There were 21 squamous carcinomas, 20 adenocarcinomas, 4 bronchioalveolar carcinomas, 4 large cell carcinomas, 2 non-small cell lung cancer (NSCLC), 5 carcinoids, and one each of adenosquamous, sarcoma, and small cell lung cancer (SCLC) represented on the TMA. Archived tissue blocks from 1989 to 2002 were retrieved from the files of VUMC and the Nashville VAMC pathology departments. For all tissue blocks, H&E-stained sections were reviewed by our pathologist (ALG). Areas to be punched for array production were carefully marked. Tissues from 26 patients had spots in triplicates, 13 of which also had single spot of corresponding normal tissue obtained at least 2 centimeter away from the tumor. Tissues from 33 patients had spots in duplicates. Cores 0.6 mm in diameter were taken from the selected area of each specimen and inserted into a recipient paraffin block. Five micron sections were cut from the arrays and mounted onto charged slides. Every fifteenth section was stained by H&E to confirm the presence of the histological feature of interest (tumor or normal histology). Immunohistochemical staining of the TMA for thymosin β4, ACBP, CSTA and MIF were was performed according the protocol described above. IHC staining was scored 0 to 3 based on intensity by ALG. Scoring for ubiquitin was not done because of poor staining quality.

Statistical analysis

Prior to statistical analysis, the MALDI MS data were preprocessed for calibration, baseline correction, denoising, normalization, peak detection and quantification by a preprocessing package: Wavespec software (19) (The raw data can be found at http://www.vicc.org/biostatistics/supp.php). We selected 9 features from the intercept of best classifiers to distinguish between normal or low grade preinvasive lesions and high grade or invasive cancer from two previously described datasets (15, 16). The selection was based on the statistical significance of the discriminatory features (m/z) with false discovery rate-adjusted p values <0.01 and based on the expert visual confirmation of the characteristics of the peak. With these pre-determined 9 candidate features we carried out the following analysis to access their ability to predict whether these bronchial samples were associated with a patient with a lung cancer. First, we generated a weighted compound score (MALDI score) based on the weighted flexible compound covariate method (WFCCM) (16, 20): the preselected 9 features were reduced to one “summarized” feature by building MALDI score for each patient (see supplementary materials). For easier interpretation and comparisons, the MALDI scores were re-scaled to a range between 0 and 5. Secondly, a logistic regression model was applied to test the hypothesis that the summarized 9 features profile (MALDI score) and histology group are associated, thus, this MALDI score has predictive ability in differentiating cancer patients from normal controls. The model was derived and internally validated using the Rahman et al. 2005 cohort (15). A bias-corrected estimate of predictive accuracy was obtained by bootstrap method (21). The class prediction performance was assessed by a widely used measurement of diagnostic discrimination: the area under receiver operating characteristic (ROC) curve and it follows the standards for reporting of diagnosis accuracy (22). Thirdly, we then applied this internally validated model to the new independent validation cohort described above by carrying the estimated coefficients of this model to the independent validation set. The prediction ability in terms of area under ROC curve and the predicted probability of having lung cancer with the change of MALDI score were provided for validation set. To complete our analysis, we applied a permutation analysis for an alternative interpretation on the same hypothesis we were testing, and the permutation test reconfirmed such association. From patients who provided more than one bronchial biopsy, including biopsies of histological normal and invasive cancer tissues, we used the average of these signal intensities to represent the patient in the analysis (see supplementary materials).

Results

Patient characteristics

Characteristics of the 60 patients in the validation cohort are summarized in Table 1. Characteristics of individual patients of the validation cohort and the Rahman et al. 2005 cohort (15) are given in supplementary Tables 1 and 2. Representative images of H&E stained tissue sections of each histological type and images with spots for automatic targeted MALDI matrix deposition are shown in Supplementary Figure 1.

Performance of the signature and assessment of the probability of having lung cancer

The proteomic signature of 9 features (m/z) was first used to build a prediction model using Rahman et al. 2005 cohort to assess the probability of having lung cancer. The resultant MALDI score obtained as a weighted sum (see statistical section and supplementary materials) was used as a continuous variable in the validation cohort. The model showed that the MALDI score in this independent data set is statistically significantly associated with the cancer status. Based on the model the estimated probability of having lung cancer can be calculated for any patient in the cohort with a MALDI score value. The estimated probabilities for some values of MALDI score are summarized in Table 2. In order to understand the characteristics of MALDI score better two graphical summaries are provided in Figure 2. The distribution of MALDI scores across patients with and without cancer in the validation cohort (n=60) is presented in Figure 2A and MALDI score prediction performance through the prediction model, is presented by an area under the ROC curve in Figure 2B. By bootstrap method, the bias-corrected area under the ROC curve is 0.90 (with 95% CI [0.82, 0.98]) in the Rahman et al 2005 cohort and 0.77 (95% CI [0.66, 0.88]) in the validation cohort.

Table 2.

Probability of predicting lung cancer based on MALDI MS score.

Score Probability 95% CI
0 0.10 (0.00, 0.3)
1 0.25 (0.00, 0.5)
2 0.48 (0.29, 0.67)
3 0.72 (0.60, 0.85)
4 0.88 (0.75, 1)
5 0.95 (0.87, 1)

Abbreviation: MALDI MS, matrix assisted laser desorption ionization mass spectrometry, CI, confidence interval.

Figure 2.

Figure 2

Performance of the signature. A.) Boxplots representing the distribution of MALDI scores from patients of the validation cohort (n=60). Patients with (n=40) and without (n=20) lung cancer were analyzed. Weighted compound score was generated based on the weighted flexible compound covariate method (WFCCM). The 9 candidate features (m/z) were reduced to one summarized feature to build MALDI score for each patient. B.) Receiver-operator characteristic curves demonstrating performance of the signature to classify patients with and without cancer in the published cohort (solid line) and in the validation cohort (dotted line). AUC, area under the curve.

Identification of the MALDI MS features of the signature

Out of 9 features we have identified 8 corresponding to 6 different proteins listed in Table 3. Two of the 6 proteins were newly identified in the current study. One of those features at m/z 9955 was identified ACBP following in-gel digestion and LC-MS/MS analysis (see Supplementary Methods and Supplementary Figure 2). Based upon the features identified, the m/z value was consistent with the loss of the initial methionine residue and an acetylation modification to the N-terminal serine residue. Examination of the 42 Da mass shift in the b-ion series in the MS/MS spectra for the acetylated peptide, SQAEFEK and its miscleavage product SQAEFEKAAEEVR, supports such a modification as shown in Supplementary Figure 3A. Another protein identified was CSTA. Analysis of the gel band corresponding to m/z 11048 was found to contain an acetylated lysine residue in the peptide sequence NKDDELTGF. The modification was confirmed by examination of the MS/MS spectra containing the 42 Da mass shift in the b-ion series as shown in Supplementary Figure 3B. The unmodified NKDDELTGF peptide was also identified, consistent with the m/z 11006 observed in the MALDI MS spectrum. All peptides identified for both ACBP and CSTA are listed in Supplementary Table 3. The other 4 proteins identified by our group are thymosin β4 (23), ubiquitin (16), cytochrome c (24) and MIF (25).

Table 3.

Identity of the candidate features in the signature.

Selected candidate
features (m/z)
Identification
4749 Not determined
4965 Thymosin β4
6175 Macrophage migration inhibitory factor++
8451 Des-ubiquitin (Ubiquitin-Gly-Gly)
8565 Ubiquitin
9955 Acyl-coA binding protien
11048 Cystatin A
12275 Cytochrome c
12345 Macrophage migration inhibitory factor

Abbreviation: m/z, mass to charge ratio.

Proteomic validation of the MALDI MS features components of the signature

Overexpression of each of the 6 identified proteins was then confirmed by Western blotting in lung cancers as shown in Figure 3. Representative data of 3 surgical specimens of squamous cell carcinomas of the lung and adjacent histologically normal lung tissue are presented. Protein expression was further validated in FFPE squamous cell carcinoma of the lung and normal tissues by IHC as shown in Figure 4. Squamous cell carcinomas showed moderate to strong cytoplasmic and nuclear expression of all these proteins. Thymosin β4 and cytochrome c were predominantly expressed in the cytoplasm. In normal bronchial epithelium weak to moderate staining of MIF was observed, while staining was negative to weak forthymosin β4, ubiquitin, ACBP, CSTA and cytochrome c. We also validated the expression of thymosin β4, ACBP, CSTA and MIF in a TMA representing lung cancer tissues from 59 patients by IHC. The immunostaining signals of these proteins were localizing to the cytoplasm. The summary of the results (intensity score 0 to 3) are presented in Supplementary Table 4. Of interest, CSTA staining intensity was associated with prolonged survival (p=0.03). We found no significant association between staining intensity scores and gender, smoking status, histology or pathological stage.

Figure 3.

Figure 3

Validation of the expression of candidate biomarker proteins by Western immunoblotting. Tissue lysates were prepared from matched pairs of fresh-frozen normal (N) and squamous lung tumor (T) tissues. Three representative pairs of tissues were used for each immunoblotting. Amount of protein loaded in each lane was 40 microgram. One of the lanes was added with molecular weight marker in each gel. Immunoblotting of β-actin was performed as a loading control.

Figure 4.

Figure 4

Validation of the expression of identified proteins by IHC. Photomicrographs (40x) of IHC staining of normal bronchial epithelium (left panel) and invasive squamous carcinoma of the lung (middle panel) are shown for 6 identified proteins. Scale bar: 50 µm. Peaks (m/z) corresponding to 6 identified proteins from the average spectra of normal bronchial epithelium (blue arrow) and invasive tumor (red arrow) of the normalized MALDI MS data are represented in the right panel.

Discussion

In this study we report a proteomic signature derived from the bronchial mucosa that confers a quantifiable probability of having lung cancer. We identified all but one of the 9 features of this signature (Table 3). Overexpression of the identified proteins was confirmed by Western blotting (Figure 3) and immunohistochemical staining (Figure 4 and Supplementary Table 4) using normal lung and NSCLC. The bronchial biopsy proteomic signature provides a quantitative measure of the probability of having lung cancer that goes beyond the histological evaluation of preinvasive lesions.

Preinvasive lesions are prevalent in the airways of patients at increased risk for and with lung cancer (810, (26, 27). Our ultimate goal is to predict the probability of developing lung cancer and to identify the molecular determinants of lung cancer development. In this study, we made an important step towards this goal by providing a quantitative measure of the probability of having lung cancer in individuals presenting with endobronchial lesions (Figure 2A). Importantly, our diagnostic biomarker signature goes beyond histologic interpretation. It is quantifiable and used as a continuous variable. It was formally validated in an independent dataset of specimens from 3 institutions (Supplementary Table 1). The differential expression of these candidate biomarker proteins revealed by proteomic technologies also allows us to detect changes in protein expression during lung tumor progression which may otherwise remain undetected by histology alone. Moreover, the development of a proteomic diagnostic signature for lung cancer provides an opportunity to further our understanding about the role of these proteins in various cellular interactions and in pathways of lung tumorigenesis. In Figure 2A we showed the difference in the MALDI score between patients with and without lung cancer giving the existing overlap. Although such overlap between box plots is common in the biomarker field, logistical regression analysis indicated that the difference is statistically significant. Yet, these results do not necessarily imply a clinically useful classifier. Further prospective validation of the biomarker is therefore needed to address clinical utility. Finally, this signature may lead to an improved understanding of the progression of preinvasive lesions to an invasive phenotype, potentially enabling earlier therapeutic intervention and allowing us to further select patients to enroll in surveillance studies.

Scarcity and the small size of preinvasive lesions of the airways remain the major challenges for studies of bronchial mucosa in lung tumor development. To accomplish our goal, we obtained specimens from three institutions (VUMC and VAMC in Nashville, TN, and UCHSC in Denver, CO). We also took advantage of significant improvements in proteomic technologies, including automated spotting and related software, MS instrumentation, database resources and statistical analysis tools (19). Histology directed spotting of matrix using Mirax viewer software, for example, allowed us to limit data acquisition specifically from areas containing greater than 70% of the epithelial cells (18, 28).

Technological advances have also allowed us to confidently identify the proteins represented by our signature. Out of the 6 identified proteins contained in the signature, CSTA and ACBP deserve special attention. Although the role of ACBP in lung tumorigenesis is unknown, ACBP plays a crucial role in lipid metabolism and acts as acyl-CoA transporter and intracellular pool former. In spite of the fact that ACBP is ubiquitously expressed (29), its differential expression in normal epithelium and invasive tissues was clearly demonstrated by MALDI MS. To our knowledge, this is the first report of overexpression of ACBP in lung tumorigenesis.

Cystatins are members of a protein family with endogenous inhibitors of cysteine proteases such as cathepsins B, H and L. Studies demonstrated an inverse relationship between increased cathepsin B expression and decreased CSTA in a variety of human tumors (30). The importance of CSTA in the development of malignant tumors (31) and its overexpression in lung cancer have been reported (32). Lung tumor cells are suggested to produce both cysteine proteases and cystatins in vitro which are regulated differently in histologic subtypes of lung cancers; however, their diagnostic and prognostic role in large pathological materials has not been studied extensively. One study involving preinvasive bronchial lesions and various lung carcinomas suggests that CSTA expression is associated with malignant transformation and predicts a risk of tumor recurrence as well as poor survival (32). Our studies suggest that CSTA is one of the promising candidate biomarkers for lung cancer and deserves further investigation in lung tumorigenesis. Of interest also is that cathepsin B which is a cysteine protease cleaves two terminal glycine residues of ubiquitin generating des-ubiquitin (33). Both monomeric ubiquitin and des-ubiquitin corresponding to m/z values 8565 and 8451 are among the 9 candidates of the signature described in this study, and in our previous study in established tumors (16). Overexpression of cathepsin B inhibitor CSTA, and des-ubiquitin, a product of cathepsin B proteolytic activity on ubiquitin, suggests a delicate balance between cathepsin B and CSTA may somehow perturb cellular homeostasis contributing to the overexpression of des-ubiquitin.

MIF is expressed and secreted in response to mitogens and integrin-dependent cell adhesion. Extracellular MIF is required in the steady-state activation of Rho GTPase family members, leading to cell growth and migratory phenotypes. Previous reports demonstrated that MIF expression is increased in premalignant, malignant and metastatic tumors and a potential therapeutic target in NSCLC (34, 35). By use of a small interfering RNA (siRNA) knockdown of MIF, the migratory and invasive potential of human lung adenocarcinoma cells was abolished (36).

Thymosin β4 sequesters cytoplasmic monomeric actin (37) and is a potent regulator of actin polymerization. Overexpression of thymosin β4 in tumors has been suggested to stimulate lung tumor metastasis by activating cell migration and angiogenesis (38). The gene expression of thymosin β4 was also significantly associated with metastasis in NSCLC patients and it was suggested as a prognostic parameter for NSCLC (39). A combined proteomic indicator including thymosin β4, thymosin β10, and calmodulin was shown to be able to classify NSCLC patients according to good or poor prognosis (40).

Our study has limitations. The patient sample sizes, especially for non-cancer cases, in the training and validation sets were not large. Although the demographics of our study population reflect well individuals who are at high risk for lung cancer, the results may not generalize to those with lower epidemiological risk. Our proteomic signature represents a direct measure of alterations of the airway epithelium in the field of carcinogenesis, but the signature has not yet been prospectively validated as a predictor of lung cancer development. Also, we could not evaluate the possibility that the presence of such a signature represents a true susceptibility factor rather than a tumor marker. Our study also does not address the diagnostic value of the proteomic analysis of alveolar atypical hyperplasia thought to be possible precursors of adenocarcinoma in situ (formerly called bronchioalveolar carcinoma or BAC). Among other study limitations, although the methodology used to assess the probability of having lung cancer may be found useful as an MS based analysis, the signature may require further development in the form of an immune-based multiplex assay to reach broad clinical utility.

Looking ahead we see this novel signature as a significant advance in the search for early detection strategies. As opposed to histologic evaluation which provides a few discrete subtypes with potential significant inter-observer variability (41, 42) the quantitative nature of our signature, after further validation, may be able to better assess the diagnosis of lung cancer from endobronchial biopsies. We will need to further validate the diagnostic potential of the signature prospectively in a cohort of high risk individuals. Determining the probability of having lung cancer based on a novel proteomic signature of bronchial lesions may have important clinical implications. This signature may facilitate the diagnosis of lung cancer and the monitoring of high risk individuals for lung cancer in surveillance and chemoprevention trials.

Supplementary Material

1
2
3
4
5
6
7
8

Acknowledgments

We thank all patients who volunteered to participate in this study and provided tissue samples. We thank Candace Murphy and Anna Ostrander for assistance with sample collection, Harriet Davis and Gabe Garcia for clinical data management, Jamie Allen for assistance with sample processing for MALDI MS data acquisition and organization, and Salisha Hill and Kristen Cheek for their assistance with LC-MS/MS. This work was supported by the National Institute of Health (CA102353 to P.P.M., the Lung SPORE P50 CA 90949 to D.P.C. and the Lung SPORE P50 CA58187 to Y.E.M and W.A.F.), Vanderbilt-Ingram Cancer Center Core Support Grant (P30 CA68485 to P.P.M. and R.M.C.), Department of Defense (W81XWH-05-1-0179 to R.M.C.) and NIH/NIGMS (5R01-GM58008 to R.M.C.).

References

  • 1.Jemal A, Bray F, Center MM, Ferlay J, Ward E, Forman D. Global cancer statistics. CA Cancer J Clin. 2001 doi: 10.3322/caac.20107. [DOI] [PubMed] [Google Scholar]
  • 2.Marshall E. Cancer screening. The promise and pitfalls of a cancer breakthrough. Science. 2010;330:900–901. doi: 10.1126/science.330.6006.900-b. [DOI] [PubMed] [Google Scholar]
  • 3.Franklin WA, Gazdar AF, Haney J, Wistuba II, La Rosa FG, Kennedy T, et al. Widely dispersed p53 mutation in respiratory epithelium. A novel mechanism for field carcinogenesis. J Clin Invest. 1997;100:2133–2137. doi: 10.1172/JCI119748. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Spira A, Beane JE, Shah V, Steiling K, Liu G, Schembri F, et al. Airway epithelial gene expression in the diagnostic evaluation of smokers with suspect lung cancer. Nat Med. 2007;13:361–366. doi: 10.1038/nm1556. [DOI] [PubMed] [Google Scholar]
  • 5.Kennedy TC, McWilliams A, Edell E, Sutedja T, Downie G, Yung R, et al. Bronchial intraepithelial neoplasia/early central airways lung cancer: ACCP evidence-based clinical practice guidelines (2nd edition) Chest. 2007;132:221S–233S. doi: 10.1378/chest.07-1377. [DOI] [PubMed] [Google Scholar]
  • 6.Bota S, Auliac JB, Paris C, Metayer J, Sesboue R, Nouvet G, et al. Follow-up of bronchial precancerous lesions and carcinoma in situ using fluorescence endoscopy. Am J Respir Crit Care Med. 2001;164:1688–1693. doi: 10.1164/ajrccm.164.9.2012147. [DOI] [PubMed] [Google Scholar]
  • 7.Miller YE, Blatchford P, Hyun DS, Keith RL, Kennedy TC, Wolf H, et al. Bronchial epithelial Ki-67 index is related to histology, smoking, and gender, but not lung cancer or chronic obstructive pulmonary disease. Cancer Epidemiol Biomarkers Prev. 2007;16:2425–2431. doi: 10.1158/1055-9965.EPI-07-0220. [DOI] [PubMed] [Google Scholar]
  • 8.Paris C, Benichou J, Bota S, Sagnier S, Metayer J, Eloy S, et al. Occupational and nonoccupational factors associated with high grade bronchial pre-invasive lesions. Eur Respir J. 2003;21:332–341. doi: 10.1183/09031936.03.00001503. [DOI] [PubMed] [Google Scholar]
  • 9.Keith RL, Miller YE, Gemmill RM, Drabkin HA, Dempsey EC, Kennedy TC, et al. Angiogenic squamous dysplasia in bronchi of individuals at high risk for lung cancer. Clin Cancer Res. 2000;6:1616–1625. [PubMed] [Google Scholar]
  • 10.Banerjee AK. Preinvasive lesions of the bronchus. J Thorac Oncol. 2009;4:545–551. doi: 10.1097/JTO.0b013e31819667bd. [DOI] [PubMed] [Google Scholar]
  • 11.Loewen G, Natarajan N, Tan D, Nava E, Klippenstein D, Mahoney M, et al. Autofluorescence bronchoscopy for lung cancer surveillance based on risk assessment. Thorax. 2007;62:335–340. doi: 10.1136/thx.2006.068999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.McWilliams A, Mayo J, MacDonald S, leRiche JC, Palcic B, Szabo E, et al. Lung cancer screening: a different paradigm. Am J Respir Crit Care Med. 2003;168:1167–1173. doi: 10.1164/rccm.200301-144OC. [DOI] [PubMed] [Google Scholar]
  • 13.Franklin WA. Diagnosis of lung cancer: pathology of invasive and preinvasive neoplasia. Chest. 2000;117:80S–89S. doi: 10.1378/chest.117.4_suppl_1.80s. [DOI] [PubMed] [Google Scholar]
  • 14.Brambilla E, Travis WD, Colby TV, Corrin B, Shimosato Y. The new World Health Organization classification of lung tumours. Eur Respir J. 2001;18:1059–1068. doi: 10.1183/09031936.01.00275301. [DOI] [PubMed] [Google Scholar]
  • 15.Rahman SM, Shyr Y, Yildiz PB, Gonzalez AL, Li H, Zhang X, et al. Proteomic patterns of preinvasive bronchial lesions. Am J Respir Crit Care Med. 2005;172:1556–1562. doi: 10.1164/rccm.200502-274OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Yanagisawa K, Shyr Y, Xu BJ, Massion PP, Larsen PH, White BC, et al. Proteomic patterns of tumour subsets in non-small-cell lung cancer. Lancet. 2003;362:433–439. doi: 10.1016/S0140-6736(03)14068-8. [DOI] [PubMed] [Google Scholar]
  • 17.Hirsch FR, Prindiville SA, Miller YE, Franklin WA, Dempsey EC, Murphy JR, et al. Fluorescence versus white-light bronchoscopy for detection of preneoplastic lesions: a randomized study. J Natl Cancer Inst. 2001;93:1385–1391. doi: 10.1093/jnci/93.18.1385. [DOI] [PubMed] [Google Scholar]
  • 18.Aerni HR, Cornett DS, Caprioli RM. Automated acoustic matrix deposition for MALDI sample preparation. Anal Chem. 2006;78:827–834. doi: 10.1021/ac051534r. [DOI] [PubMed] [Google Scholar]
  • 19.Chen S, Li M, Hong D, Billheimer D, Li H, Xu BJ, et al. A novel comprehensive wave-form MS data processing method. Bioinformatics. 2009;25:808–814. doi: 10.1093/bioinformatics/btp060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Shyr Y, Kim K. Weighted flexible compound covariate method for classifying microarray data. In: Berrar D, editor. A Practical Approach to Microarray Data Analysis. Kluwer Academic, NY: 2003. pp. 186–200. [Google Scholar]
  • 21.Efron BaT R. An introduction to the Bootstrap. New York: Chapman and Hall; 1993. [Google Scholar]
  • 22.Bossuyt PM, Reitsma JB, Bruns DE, Gatsonis CA, Glasziou PP, Irwig LM, et al. The STARD statement for reporting studies of diagnostic accuracy: explanation and elaboration. Clin Chem. 2003;49:7–18. doi: 10.1373/49.1.7. [DOI] [PubMed] [Google Scholar]
  • 23.Stoeckli M, Chaurand P, Hallahan DE, Caprioli RM. Imaging mass spectrometry: a new technology for the analysis of protein expression in mammalian tissues. Nat Med. 2001;7:493–496. doi: 10.1038/86573. [DOI] [PubMed] [Google Scholar]
  • 24.Hardesty WM. Proteomic analysis and classification of metastatic melanoma by MLADI imaging mass spectrometry. Nashville, TN: Vanderbilt University; 2010. [Google Scholar]
  • 25.Chaurand P, Fouchecourt S, DaGue BB, Xu BJ, Reyzer ML, Orgebin-Crist MC, et al. Profiling and imaging proteins in the mouse epididymis by imaging mass spectrometry. Proteomics. 2003;3:2221–2239. doi: 10.1002/pmic.200300474. [DOI] [PubMed] [Google Scholar]
  • 26.Haussinger K, Becker H, Stanzel F, Kreuzer A, Schmidt B, Strausz J, et al. Autofluorescence bronchoscopy with white light bronchoscopy compared with white light bronchoscopy alone for the detection of precancerous lesions: a European randomised controlled multicentre trial. Thorax. 2005;60:496–503. doi: 10.1136/thx.2005.041475. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Auerbach O, Gere JB, Forman JB, et al. Changes in the bronchial epithelium in relation to smoking and cancer of the lung: report of progress. N Engl J Med. 1957;256:97–104. doi: 10.1056/NEJM195701172560301. [DOI] [PubMed] [Google Scholar]
  • 28.Cornett DS, Reyzer ML, Chaurand P, Caprioli RM. MALDI imaging mass spectrometry: molecular snapshots of biochemical systems. Nat Methods. 2007;4:828–833. doi: 10.1038/nmeth1094. [DOI] [PubMed] [Google Scholar]
  • 29.Neess D, Kiilerich P, Sandberg MB, Helledie T, Nielsen R, Mandrup S. ACBP--a PPAR and SREBP modulated housekeeping gene. Mol Cell Biochem. 2006;284:149–157. doi: 10.1007/s11010-005-9039-9. [DOI] [PubMed] [Google Scholar]
  • 30.Li W, Ding F, Zhang L, Liu Z, Wu Y, Luo A, et al. Overexpression of stefin A in human esophageal squamous cell carcinoma cells inhibits tumor cell growth, angiogenesis, invasion, and metastasis. Clin Cancer Res. 2005;11:8753–8762. doi: 10.1158/1078-0432.CCR-05-0597. [DOI] [PubMed] [Google Scholar]
  • 31.Strojan P, Budihna M, Smid L, Svetic B, Vrhovec I, Kos J, et al. Prognostic significance of cysteine proteinases cathepsins B and L and their endogenous inhibitors stefins A and B in patients with squamous cell carcinoma of the head and neck. Clin Cancer Res. 2000;6:1052–1062. [PubMed] [Google Scholar]
  • 32.Leinonen T, Pirinen R, Bohm J, Johansson R, Rinne A, Weber E, et al. Biological and prognostic role of acid cysteine proteinase inhibitor (ACPI, cystatin A) in non-small-cell lung cancer. J Clin Pathol. 2007;60:515–519. doi: 10.1136/jcp.2006.038711. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Herring K. Identificatin of protein markers of drug-induced nephrotoxicity by MALDI MS: in vivo discovery of ubiquitin-T. Nashville: Vanderbilt University; 2009. [Google Scholar]
  • 34.McClelland M, Zhao L, Carskadon S, Arenberg D. Expression of CD74, the receptor for macrophage migration inhibitory factor, in non-small cell lung cancer. Am J Pathol. 2009;174:638–646. doi: 10.2353/ajpath.2009.080463. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Campa MJ, Wang MZ, Howard B, Fitzgerald MC, Patz EF., Jr Protein expression profiling identifies macrophage migration inhibitory factor and cyclophilin a as potential molecular targets in non-small cell lung cancer. Cancer Res. 2003;63:1652–1656. [PubMed] [Google Scholar]
  • 36.Rendon BE, Roger T, Teneng I, Zhao M, Al-Abed Y, Calandra T, et al. Regulation of human lung adenocarcinoma cell migration and invasion by macrophage migration inhibitory factor. J Biol Chem. 2007;282:29910–29918. doi: 10.1074/jbc.M704898200. [DOI] [PubMed] [Google Scholar]
  • 37.Kobayashi T, Okada F, Fujii N, Tomita N, Ito S, Tazawa H, et al. Thymosin-beta4 regulates motility and metastasis of malignant mouse fibrosarcoma cells. Am J Pathol. 2002;160:869–882. doi: 10.1016/s0002-9440(10)64910-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Cha HJ, Jeong MJ, Kleinman HK. Role of thymosin beta4 in tumor metastasis and angiogenesis. J Natl Cancer Inst. 2003;95:1674–1680. doi: 10.1093/jnci/djg100. [DOI] [PubMed] [Google Scholar]
  • 39.Ji P, Diederichs S, Wang W, Boing S, Metzger R, Schneider PM, et al. MALAT-1, a novel noncoding RNA, and thymosin beta4 predict metastasis and survival in early-stage non-small cell lung cancer. Oncogene. 2003;22:8031–8041. doi: 10.1038/sj.onc.1206928. [DOI] [PubMed] [Google Scholar]
  • 40.Xu BJ, Gonzalez AL, Kikuchi T, Yanagisawa K, Massion PP, Wu H, et al. MALDI-MS derived prognostic protein markers for resected non-small cell lung cancer. Proteomics ClinAppl. 2008;2:1508–1517. doi: 10.1002/prca.200800094. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Nicholson AG, Perry LJ, Cury PM, Jackson P, McCormick CM, Corrin B, et al. Reproducibility of the WHO/IASLC grading system for pre-invasive squamous lesions of the bronchus: a study of inter-observer and intra-observer variation. Histopathology. 2001;38:202–208. doi: 10.1046/j.1365-2559.2001.01078.x. [DOI] [PubMed] [Google Scholar]
  • 42.Kujan O, Khattab A, Oliver RJ, Roberts SA, Thakker N, Sloan P. Why oral histopathology suffers inter-observer variability on grading oral epithelial dysplasia: an attempt to understand the sources of variation. Oral Oncol. 2007;43:224–231. doi: 10.1016/j.oraloncology.2006.03.009. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1
2
3
4
5
6
7
8

RESOURCES