Signal stratification of autoantibody levels in serum samples and its application to the early detection of lung cancer

Graham F Healey; Stephen Lam; Peter Boyle; Geoffrey Hamilton-Fairley; Laura J Peek; John F R Robertson

doi:10.3978/j.issn.2072-1439.2013.08.65

. 2013 Oct;5(5):618–625. doi: 10.3978/j.issn.2072-1439.2013.08.65

Signal stratification of autoantibody levels in serum samples and its application to the early detection of lung cancer

Graham F Healey ¹, Stephen Lam ², Peter Boyle ³, Geoffrey Hamilton-Fairley ¹, Laura J Peek ⁴, John F R Robertson ^5,^✉

PMCID: PMC3815735 PMID: 24255775

Abstract

Background

Further signal stratification for the EarlyCDT®-Lung test should facilitate interpretation of the test, leading to more precise interventions for particular patients.

Methods

Samples were measured for the presence of autoantibodies to seven tumor-associated antigens (TAAs) (p53, NY-ESO-1, CAGE, GBU4-5, SOX2, MAGE A4, and HuD). In addition to the current test cut-offs (determined using a previously reported Validation case-control sample set, set A; n=501), new high and low cut-offs were set in order to maximize the test’s positive and negative predictive values (PPV and NPV, respectively). All three sets of cut-offs were applied to two confirmatory datasets: (I) the case-control set B (n=751), and (II) Population-derived set C (n=883), and all three datasets combined (n=2,135).

Results

For the Validation dataset, cancer/non-cancer positivity for current cut-offs was 41%/9% (PPV =0.109, 1 in 9). The high positive stratum improved this to 25%/2% (PPV =0.274, 1 in 4). The low negative stratum improved this to 8%/23% (NPV =0.990, 1 in 105). This provides a 25-fold difference in lung cancer probability between the highest and lowest groups.

The test performs equally well in subjects who fulfilled the entry risk criteria for the National Lung Screening Trial (NLST) and subjects who did not meet the NLST criteria.

Conclusions

The EarlyCDT®-Lung test has been converted to a four-stratum test by the addition of high and low sets of cut-offs: patients are thus stratified into four risk categories. This stratification will enable personalization of subsequent screening and treatment programs for high risk individuals or patients with lung nodules.

KEYWORDS : Lung cancer, autoantibody (AAb), tumor-associated antigen, risk stratification

Introduction

The technical and clinical validation of an autoantibody (AAb) assay for the early detection of lung cancer (EarlyCDT®-Lung) has recently been described (1-3). In a series of case-control studies, where the cases were newly diagnosed lung cancer patients, circulating AAbs to a panel of tumor-associated antigens (TAAs) were measured in serum samples. Validation of the 7 AAb panel showed that EarlyCDT®-Lung can, with a specificity of 93%, detect elevated levels of AAbs in peripheral blood samples for up to 41% of all primary lung cancers (3). In combination with imaging techniques, the test is now commercially available to assist clinicians in the early detection of lung cancer in a high-risk population.

Currently a single test threshold (“cut-off”) for each AAb measured in the panel classifies the samples into two strata, i.e., positive or negative for AAbs associated with lung cancer. This two-stratum test yields a useful binary classification, but given the range of intervention options available to the clinician, refinement of the result is desirable. A four-stratum test is therefore now proposed with additional sets of low and high cut-offs to classify the results into high positive, positive, negative and low negative strata indicating relatively very high, high, low and very low levels of AAbs, respectively; the level of AAbs measured relates to the probability of lung cancer (i.e., risk of having the disease) (Table 1). This allows more refined intervention for different sub-groups of patients.

Table 1. Result categories for the four-stratum EarlyCDT®-Lung test.

Block	Rule	Result	Risk^a
1	At least one AAb > H	High positive	Very high
2	All AAbs < H, but at least one > C	Positive	High
3	All AAbs < C, but at least one > L	Negative	Low
4	All AAbs < L	Low negative	Very low

Open in a new tab

AAb, autoantibody; L, low cut-off; C, current cut-off; H, high cut-off; ^a, Risk (i.e., probability) of having a lung cancer at the time of the test.

Materials and methods

Assay procedure

AAbs to seven TAAs (p53, NY-ESO-1, CAGE, GBU4-5, SOX2, MAGE A4, and HuD) were measured using EarlyCDT®-Lung (Oncimmune USA LLC, De Soto, KS, USA), a commercially available blood test based on indirect enzyme-linked immunosorbent assay (ELISA) methods, that uses microtiter plates coated with semi-log serial dilutions of recombinant antigens (1). AAb levels were measured as optical density units, background-corrected and then converted to calibrated reference units (RU). Each patient serum sample was assayed in duplicate on each plate and a titration curve obtained for each antigen. A sample was declared positive if there was a clear titration curve, and if the RU at either of the two highest points on the titration curve was above its respective cut-off for at least one antigen. Quality control samples were interspersed in the sample order.

Patient samples

Three separate sets of serum samples were used in this work, two case-control sets described previously, and a new population-based set. All patients provided written informed consent for their samples to be used in this study.

Sample set A (Validation case-control)

This set comprised 235 patients with lung cancer from UK, US, Ukraine, and Russia (obtained at or just after diagnosis) representing 87% of the cancers in a previously published dataset (Group 3, n=269) (2) for which enough volume was available to complete the panel of seven AAbs (3). There were 179 non-small-cell lung cancers (NSCLCs, 76%), 53 small-cell lung cancers (SCLCs, 23%), and three others (1%). The controls, all recruited in the US from the general population, came from the same sample set and comprised 266 healthy volunteers with no history of cancer, 235 of whom were matched to the cases by age, gender, and smoking history (2).

Sample set B (Post-validation case-control)

Four groups of patients (Groups 1-4) with newly diagnosed lung cancer, but prior to treatment, plus controls matched by gender, age (±4 years) and smoking history (as far as possible), were combined into a single dataset, as previously reported (4). Group 1 comprised 32 cases with SCLC from a single UK center and Group 2 comprised 161 cases from multiple European centers. Controls (±4 years) came from a prospective collection of cancer-free smokers in the Midlands of England and the Midwest of America. Group 3 comprised 120 cases from a single center in Vancouver, Canada, matched to 113 high-risk lung-cancer-free controls. Group 4 comprised 23 cases matched to 109 controls. The total sample set comprised 336 lung cancer cases, including 301 NSCLC (90%) and 35 SCLC (10%), and 415 normal control sera. The incomplete matching in Groups 2 to 4 was mainly due to controls being excluded if they had been used for another group or if sample volume was insufficient.

Sample set C (Population)

This set comprised 847 commercially-derived samples collected consecutively between November 2010 and February 2012 from individuals deemed by their clinicians as being at high risk of developing lung cancer. Clinical follow-up information available through a prospective audit is known for all these individuals of whom 36/847 (4.3%) were diagnosed [using computed tomography (CT) and/or biopsy] with lung cancer within 6 months after taking the test. Ethnicity was known for 823 (97%) of patients.

Derivation of cut-offs

The current test cut-offs divide the samples into two strata, positive or negative, corresponding to high and low lung cancer risk respectively, so as to maximize the sensitivity for a specificity of about 90% (2). As previously reported (2,3), the specificity was also adjusted for the presence of an estimated small number of undiagnosed cancers in the control group. In the Population dataset, individuals were defined as ‘cancer-free’ if a lung cancer diagnosis was not obtained within six months after testing (manuscript in preparation).

Using sample set A, a new set of high cut-offs, splitting the two-stratum positives, was derived by adding a multiple of the standard deviation of the distribution of controls to the current cut-off for each autoantibody to optimize specificity and sensitivity to yield a high positive predictive value (PPV). Similarly, a set of low cut-offs, splitting the two-stratum negatives, was derived by subtracting multiples of standard deviations to yield a high negative predictive value (NPV). The main calculations were performed assuming a cancer prevalence of 2.7% (2), but tabulation for 4%, being the typical five-year lung cancer risk for an average smoker, was also carried out. All analyses were carried out using SAS® (Version 9.1.3, Cary, NC, USA).

Statistical analysis

The new cut-offs were applied to all datasets, separately and combined, thus sorting patients into four strata on the basis of their EarlyCDT®-Lung AAb levels (Table 1). To check the consistency of the classification, the percentages of samples within the new strata were compared across datasets for cases and controls separately using Fisher Exact tests (5).

Using the specificity and sensitivity for each stratum, the PPV and NPV were then derived using the number of samples in the stratum versus their complement, the samples not in the stratum. A continuous estimate of five-year demographic risk based on gender, age and, where available, smoking history was also derived using a modified version of the Spitz model (6). The demographic factors and staging on the stratification were investigated using multinomial modelling (SAS®, Proc GENMOD and Proc FREQ).

A further analysis compared subjects who could or could not be classified according to the main National Lung Screening Trial (NLST) trial inclusion criteria, i.e., age (55-74 years old) and smoking history (≥30 pack years and quit <15 years ago) (7). In the combined dataset (n=1,802), 531 subjects met the NLST criteria (29%) 116 of whom (22%) were lung cancers, while 1,271 did not (non-NLST) (71%) 328 of whom (26%) were lung cancers (Tables 2,3). There were more under-age subjects than over-age for sets A & B: 21% under-age, 68% NLST and 13% over-age. For sample set C (population set), the figures were 31%, 58% and 11%, respectively. Also, Non-NLST subjects had all smoked less. Two associations were tested (χ² tests): (I) between cancer status and NLST eligibility for each separate test stratum, and (II) between EarlyCDT-Lung positivity and NLST eligibility for cancers and non-cancer subjects separately.

Table 2. Incidence, stratified by NLST Inclusion, all datasets pooled.

Dataset	Cut-offs	Two-stratum		Four-stratum
		Positivity (C/N) (PPV, 1/PPV)	Negativity (C/N) [NPV, 1/(1-NPV)]	Positivity (C/N) (PPV, 1/PPV)		Negativity (C/N) [NPV, 1/(1-NPV)]
		Current positive	Current negative	High positive	Positive	Negative	Low negative
NLST^a (116C/415N)	Incidence Unadjusted 2.7%	43/36 37%/9% (0.106, 1 in 9)	73/379 63%/91% (0.981, 1 in 53)	31/9 27%/2% (0.255, 1 in 4)	12/27 10%/7% (0.042, 1 in 24)	56/270 48%/65% (0.980, 1 in 50)	17/109 15%/26% (0.985, 1 in 66)
Non-NLST^b (328C/943N)	Incidence Unadjusted 2.7%	103/89 31%/9% (0.085, 1 in 12)	225/854 69%/91% (0.979, 1 in 49)	61/23 19%/2% (0.175, 1 in 6)	42/66 13%/7% (0.048, 1 in 21)	198/595 60%/63% (0.974, 1 in 39)	27/259 8%/28% (0.992, 1 in 121)
^cP-value		0.91	0.03	0.56	0.37	0.005	0.22
^dP-value		C 0.26/N 0.65		C 0.03/N 0.91

Open in a new tab

^aIndividuals who met the NLST criteria for lung cancer screening; ^bIndividuals who did not meet the NLST criteria for lung cancer screening; ^cAssociation of Cancer status and NLST eligibility using χ² test; ^dAssociation of EarlyCDT-Lung positivity and NLST eligibility using χ² test. C, cancers; N, normals (cancer-free controls).

Table 3. Incidence, stratified by NLST Inclusion: case-control and population set separate.

Dataset	Dataset	Cut-offs	Two-stratum		Four-stratum
			Positivity (C/N) (PPV, 1/PPV)	Negativity (C/N) [NPV, 1/(1-NPV)]	Positivity (C/N) (PPV, 1/PPV)		Negativity (C/N) [NPV, 1/(1-NPV)]
			Current positive	Current negative	High positive	Positive	Negative	Low negative
Case-control (409C/617N)	NLST^a (98C/167N)	Incidence Unadjusted 2.7%	38/17 39%/10% (0.096, 1 in 10)	60/15 61%/90% (0.981, 1 in 54)	29/5 30%/3% (0.215, 1 in 5)	9/12 9%/7% (0.034, 1 in 29)	47/117 48%/70% (0.981, 1 in 54)	13/33 13%/20% (0.982, 1 in 55)
	Non-NLST^b (311C/450N)	Incidence Unadjusted 2.7%	95/41 31%/9% (0.085, 1 in 12)	216/409 69%/91% (0.979, 1 in 48)	56/14 18%/3% (0.138, 1 in 7)	39/27 13%/6% (0.055, 1 in 18)	191/304 61%/68% (0.975, 1 in 41)	25/105 8%/23% (0.991, 1 in 106)
^aP-value			0.92	0.11	0.51	0.19	0.02	0.20
^bP-value			C 0.13/N 0.69		C 0.02/N 0.78
Population (35C/741N)	NLST^a (18C/248N)	Incidence Unadjusted 2.7%	5/19 28%/8% (0.091, 1 in 11)	13/229 72%/92% (0.979, 1 in 47)	2/4 11%/2% (0.160, 1 in 6)	3/15 17%/6% (0.071, 1 in 14)	9/153 50%/62% (0.978, 1 in 45)	4/76 22%/31% (0.980, 1 in 51)
	Non-NLST^b (17C/493N)	Incidence Unadjusted 2.7%	8/48 47%/10% (0.118, 1 in 8)	9/445 53%/90% (0.984, 1 in 62)	5/9 29%/2% (0.309, 1 in 3)	3/39 18%/8% (0.058, 1 in 17)	7/291 41%/59% (0.981, 1 in 53)	2/154 12%/31% (0.990, 1 in 97)
^cP-value			0.47	0.01	0.92	0.26	0.07	0.09
^dP-value			C 0.24/N 0.35		C 0.54/N 0.79

Open in a new tab

^aIndividuals who met the NLST criteria for lung cancer screening; ^bIndividuals who did not meet the NLST criteria for lung cancer screening; ^cAssociation of Cancer status and NLST eligibility using χ² test; ^dAssociation of EarlyCDT-Lung positivity and NLST eligibility using χ² test. C, cancers; N, normals (cancer-free controls).

Results

Patient samples

The patient demographics were summarized for the sample sets A, B & C separately (Table 4, full details in Table 5). Demographics for sample sets A & B were representative of patients with lung cancer, with more males than females, age ranging from 23 to 90 years and more than half of patients being at least 60 years old. In sample set C (the Population dataset), however, there was a higher percentage of females, suggesting that females are more likely to be proactive about their health, and with a median age of the cases about 10 years older than for the controls. The pattern of smoking was similar over all three datasets, although with a tendency for the cases to be current smokers and controls to be ex-smokers. Mean demographic risk in sample set C was higher for cases than for controls, reflecting the differences noted above.

Table 4. Brief summary of demographics.

Dataset	Controls/cases
Dataset	Males	Median age	Smoker	Ex-smoker
Set A	70%/73%	65/65	35%/46%	54%/29%
Set B	64%/65%	62/67	19%/52%	57%/33%
Set C	36%/42%	60/70	45%/50%	41%/44%

Open in a new tab

Table 5. Summary of demographics by dataset.

	Sample set A dataset		Sample set B dataset		Sample set C (Population) dataset
	Controls N=266	Cases N=235	Controls N=415	Cases N=336	Controls (Cancer-free) N=811	Cases N=36
Tumor type, n (%)
NSCLC	n/a	179 (76%)	n/a	301 (89%)	n/a	32 (88%)
Stage I		79 (34%)		170 (51%)		16 (44%)
Stage II		48 (20%)		45 (13%)		5 (13%)
Stage III		14 (6%)		44 (13%)		8 (22%)
Stage IV		3 (1%)		21 (6%)		2 (6%)
Stage unknown		35 (15%)		21 (6%)		1 (3%)
SCLC	n/a	53 (23%)	n/a	35 (11%)	n/a	2 (6%)
Limited SCLC		23 (10%)		6 (2%)		1 (3%)
Extensive SCLC		7 (3%)		26 (8%)		1 (3%)
Stage unknown		23 (10%)		3 (1%)		0 (0%)
Type unknown	n/a	3 (1%)	n/a	0 (0%)	n/a	2 (6%)
Gender, n (%)
Male	185 (70%)	171 (73%)	265 (64%)^a	218 (65%)	290 (36%)	15 (42%)
Female	81 (30%)	64 (27%)	148 (36%)	118 (35%)	521 (64%)	21 (58%)
Age
Age, median [min-max]	65 [38-86]	65 [42-85]	62 [23-87]	67 [23-90]	60 [35-85]	70 [49-85]
Age, mean +/- sem	64 +/- 0.6	65 +/- 0.6	62 +/- 0.5	65 +/- 0.6	61 +/- 0.4	70 +/- 1.4
Race, n (% of known)
Caucasian	n/r	n/r	n/r	n/r	721 (91%, n=789)	29 (85%, n=34)
African-American	n/r	n/r	n/r	n/r	37 (5%, n=789)	4 (12%, n=34)
Smoker
Yes, n (%) Pk-yrs, mean +/- sem Risk (Modified Spitz)	93 (35%) 36 +/- 2 (n=92) 3.0 (0.3)	108 (46%) 31 +/- 2 (n=86) 3.3 (0.3)	78 (19%) 31 +/- 3 (n=69) 3.1 (0.3)	175 (52%) 32 +/- 2 (n=147) 3.4 (0.2)	361 (45%) 41 +/- 1 (n=346) 2.4 (0.1)	18 (50%) 45 +/- 6 (n=18) 5.9 (0.6)
Ex, n (%)	144 (54%) 32 +/- 3 (n=105) 3.8 (0.2)	67 (29%) 38 +/- 4 (n=37) 3.7 (0.3)	237 (57%) 31 +/- 2 (n=223) 3.4 (0.1)	112 (33%) 39 +/- 2 (n=72) 5.1 (0.3)	331 (41%) 40 +/- 2 (n=315) 3.7 (0.1)	16 (44%) 52 +/- 9 (n=16) 5.8 (0.4)
Pk-yrs, mean +/- sem Risk (Modified Spitz)
No, n (%)	29 (11%)	24 (10%)	99 (24%)	43 (13%)	117 (14%)	2 (6%)
Risk (Modified Spitz)	0.2 (0.02)	0.1 (0.02)	0.1 (0.01)	0.2 (0.01)	0.1 (0.01)	0.1 (0.02)
Unknown	0 (0%)	36 (15%)	1 (0%)	6 (2%)	2 (0%)	0 (0%)

Open in a new tab

^aGender unknown for two subjects. n/a, not applicable; n/r, information not recorded; NSCLC, non-small-cell lung carcinoma; SCLC, small-cell lung carcinoma; sem, standard error of the mean; Pk-yrs, Pack-years. Rounding applied to percentages to ensure 100% totals.

Analysis using the current and newly defined test cut-offs

For both the current two-stratum test and the new four-stratum test, using the cut-offs derived from sample set A, the sensitivity, specificity, PPV and NPV were calculated for each dataset and for all datasets combined (Table 6, full details in Table 7). For convenience the NPV is also presented in its reciprocal form (1 in X), i.e., the probability of cancer given a negative result.

Table 6. Summary of PPV and NPV for two-stratum and four-stratum test.

Dataset	Two-stratum test		Four-stratum test
Dataset	Positive PPV	Negative NPV	High positive PPV	Positive PPV	Negative NPV	Low negative NPV
Set A	10.9% (1 in 9)	1.8% (1 in 57)	27.4% (1 in 4)	5.5% (1 in 18)	2.0% (1 in 49)	1.0% (1 in 105)
Set B	7.6% (1 in 13)	2.1% (1 in 47)	11.3% (1 in 9)	5.4% (1 in 19)	2.3% (1 in 43)	1.5% (1 in 65)
Set C	10.3% (1 in 10)	1.9% (1 in 53)	22.6% (1 in 4)	6.3% (1 in 16)	2.1% (1 in 47)	1.5% (1 in 67)
Combined	9.4% (1 in 11)	2.0% (1 in 51)	19.3% (1 in 5)	5.4% (1 in 18)	2.3% (1 in 43)	1.1% (1 in 90)

Open in a new tab

PPV, positive predictive value; NPV, negative predictive value, in its reciprocal form, i.e., 1-NPV. Based on a population lung cancer prevalence of 2.7%.

Table 7. Incidence with PPV and NPV for two-stratum and four-stratum formats.

Dataset	Cut-offs	Two-stratum		Four-stratum
		Positivity (C/N) (PPV, 1/PPV)	Negativity (C/N) [NPV, 1/(1-NPV)]	Positivity (C/N) (PPV, 1/PPV)		Negativity (C/N) [NPV, 1/(1-NPV)]
		Current positive^a	Current negative^b	High positive^c	Positive^d	Negative^e	Low negative^f
Sample set A (235C/266N)	Incidence Unadjusted 2.7% 4.0%	97/25 41%/9% (0.109, 1 in 9) (0.155, 1 in 6)	138/241 59%/91%, (0.982, 1 in 57) (0.974, 1 in 38)	60/5 25%/2% (0.274, 1 in 4), (0.361, 1 in 3)	37/20 16%/7% (0.055, 1 in 18) (0.080, 1 in 12)	119/179 51%/68% (0.980, 1 in 49) (0.970, 1 in 33)	19/62 8%/23% (0.990, 1 in 105) (0.986, 1 in 70)
Sample set B (336C/415N)	Incidence Unadjusted 2.7% 4.0%	99/41 30%/10% (0.076, 1 in 13) (0.111, 1 in 9)	237/374 70%/90% (0.979, 1 in 47) (0.968, 1 in 32)	56/15 17%/4% (0.113, 1 in 9) (0.161, 1 in 6)	43/26 13%/6% (0.054, 1 in 19) (0.078, 1 in 13)	196/284 58%/68% (0.977, 1 in 43) (0.966, 1 in 29)	41/90 12%/22% (0.985, 1 in 65) (0.977, 1 in 44)
Sample set C (36C/811N)	Incidence Unadjusted 2.7% 4.0%	13/71 36%/9% (0.103, 1 in 10) (0.147, 1 in 7)	23/740 64%/91% (0.981, 1 in 53) (0.972, 1 in 35)	7/15 19%/2% (0.226, 1 in 4) (0.305, 1 in 3)	6/56 17%/7% (0.063, 1 in 16) (0.091, 1 in 11)	17/492 47%/60% (0.979, 1 in 47) (0.969, 1 in 32)	6/248 17%/31% (0.985, 1 in 67) (0.978, 1 in 45)
Combined (607C/1492N)	Incidence Unadjusted 2.7% 4.0%	209/137 34%/9% (0.094, 1 in 11) (0.135, 1 in 7)	398/1,355 66%/91% (0.980, 1 in 51) (0.971, 1 in 34)	123/35 20%/2% (0.193, 1 in 5) (0.265, 1 in 4)	86/102 14%/7% (0.054, 1 in 18) (0.079, 1 in 13)	332/955 55%/64% (0.977, 1 in 43) (0.966, 1 in 29)	66/400 11%/27% (0.989, 1 in 90) (0.983, 1 in 60)
Sample set A (235C/266N)	Adjusted 2.7% 4.0%	41%/7% (0.145, 1 in 7) (0.203, 1 in 5)	59%/93% (0.983, 1 in 58) (0.974, 1 in 39)	25%/0.04% (0.947, 1 in 1.06) (0.964, 1 in 1.04)	16%/7% (0.061, 1 in 16) (0.089, 1 in 11)	51%/69% (0.980, 1 in 50) (0.970, 1 in 34)	8%/24% (0.991, 1 in 109) (0.986, 1 in 73)
P-value^g				0.03/0.16	0.49/0.80	0.12/0.013	0.13/0.001

Open in a new tab

^aIncludes all samples above current cut-off; ^bIncludes all samples below current cut-off; ^cIncludes all samples above high cut-off; ^dIncludes only samples between current and high cut-off; ^eIncludes only samples between current and low cut-off; ^fIncludes all samples below low cut-off; ^gP-value for comparison of percentages across Sample set A, B & C datasets (cancers/normals separately) using χ² test. C, cancers; N, normals (cancer-free controls); NPV, negative predictive value; PPV, positive predictive value. PPV and NPV calculated for a population with a prevalence of 2.7% or 4.0% using unrounded specificity/sensitivity values.

Sample set A

The two-stratum test gave a cancer/normal positivity of 41%/9% (PPV =0.109, 1 in 9), and the four-stratum test high positive stratum improved this to 25%/2% (PPV =0.274, 1 in 4) (Tables 6,7). The two-stratum test also gave a cancer/normal negativity of 59%/91% (NPV =0.982, 1 in 57), and the four-stratum test low negative stratum improved this to 8%/23% (NPV =0.990, 1 in 105). For the demographic split, no difference (5% level) between strata was seen for gender (P=0.99), age category (P=0.053) or smoking status (P=0.37), similarly for staging profile (P=0.16).

Sample set B

The two-stratum test gave a cancer/normal positivity of 30%/10% (PPV =0.076, 1 in 13), and the high positive stratum improved this to 17%/4% (PPV =0.113, 1 in 9). The two-stratum test also gave a cancer/normal negativity of 70%/90% (NPV =0.979, 1 in 47), and the low negative stratum improved this to 12%/22% (NPV =0.985, 1 in 65) (Table 6). No difference between strata was seen for gender (P=0.88), age category (P=0.62) or smoking status (P=0.57), similarly for staging profile (P=0.21).

Sample set C

The two-stratum test gave a cancer/normal positivity of 36%/9% (PPV =0.103, 1 in 10), and the high positive stratum improved this to 19%/2% (PPV =0.226, 1 in 4). The two-stratum test also gave a cancer/normal negativity of 64%/91% (NPV =0.981, 1 in 53), and the low negative stratum improved this slightly to 17%/31% (NPV =0.985, 1 in 67) (Table 6). Again, no difference between strata was seen for gender (P=0.20), age category (P=0.07) or smoking status (P=0.51), similarly for the proportion of Caucasians (P=0.22). There were too few cancers to investigate staging.

Combined set

The three datasets were pooled into a single Combined dataset, with 607 cases and 1,492 controls. The two-stratum test gave a cancer/normal positivity of 34%/9% (PPV =0.094, 1 in 11) and the high positive stratum improved this to 20%/2% (PPV =0.193, 1 in 5). The two-stratum cut-offs also gave a cancer/normal negativity of 66%/91% (NPV =0.980, 1 in 51) and the low negative stratum improved this to 11%/27% (NPV =0.989, 1 in 90) (Table 6). Further analysis showed clear consistency of the EarlyCDT-Lung risk profile across age decades (Tables 8,9).

Table 8. Age by EarlyCDT-Lung risk stratum, combined dataset (Cancers).

Stratum	Age group				Total
Stratum	20-49	50-59	60-69	70-90	Total
Very low risk	2 (5%)	16 (12%)	28 (14%)	20 (9%)	66 (11%)
Low risk	34 (78%)	74 (53%)	94 (47%)	130 (57%)	332 (55%)
High risk	2 (5%)	18 (13%)	29 (15%)	37 (16%)	86 (14%)
Very high risk	5 (12%)	31 (22%)	47 (24%)	40 (18%)	123 (20%)
Total	43	139	198	227	607

Open in a new tab

Table 9. Age by EarlyCDT-Lung risk stratum, combined dataset (Controls).

Stratum	Age group				Total
Stratum	20-49	50-59	60-69	70-90	Total
Very low risk	58 (33%)	139 (30%)	128 (26%)	75 (21%)	400 (27%)
Low risk	98 (57%)	281 (60%)	324 (66%)	252 (71%)	955 (64%)
High risk	14 (8%)	38 (8%)	27 (5%)	23 (6%)	102 (7%)
Very high risk	4 (2%)	8 (2%)	15 (3%)	8 (2%)	35 (2%)
Total	174	466	494	358	1,492

Open in a new tab

Percentages calculated within column. Test for an association between stratum profile within age.

Comparison across datasets

The positivity percentages were consistent across the three datasets for both cases and controls (Table 6). For the low negative stratum, the percentage of negatives in sample set C ‘controls’ (31%) was higher than for the other two datasets (23% and 22% respectively, P=0.001), which could reflect the higher number of younger cancer-free individuals in the population sample set C. Even despite the age difference, some dataset-to-dataset variation is to be expected, and the difference was not great. This consistency confirmed that the new sample set A cut-offs were directly applicable to sample sets B & C.

Risk analysis

In standard demographic models (e.g., Spitz) (6), risk increases with age and degree of smoking. To assess the independence of demographic risk and EarlyCDT-Lung result, a single threshold was applied to demographic risk to classify samples into low and high risk. This allowed 2×2×2 tables of positivity (demographic risk, EarlyCDT-Lung result, cancer status) to be compiled, bearing in mind the matching in the case-control sets. No evidence was found for a departure from independence (proportionality) of demographic risk and EarlyCDT-Lung.

The modification of the personalized continuous demographic risk by the four-stratum test is also under investigation. Based on DLR (diagnostic likelihood ratio) calculations (8) for typical cases (e.g., middle-aged moderate smokers), going from a positive result in the two-stratum test to a high positive result in the four-stratum test changed the risk increase from 4.3- to 12.7-fold. Similarly, going from a two-stratum negative result to a four-stratum low negative result changed the risk decrease from 1.5- to 2.9-fold.

EarlyCDT-Lung and NLST criteria

There was no evidence that the cancer rate differed between NLST and Non-NLST cohorts at the positive end of the test (Table 2). At the negative end, there were some small differences, but these were not consistent across sample sets A & B (more cancers in the Non-NLST cohort) and sample set C (Population dataset) (more cancers in the NLST cohort). Statistical significance was generally only seen when the table frequencies were high, and in fact, the differences were not large in the NPV estimates.

There was also little evidence for a difference between NLST and Non-NLST cohorts in their positivity profile. The only comparison significant at 5% was for the sample sets A & B cancers in the four-stratum test where there were more high positives in the NLST cohort (30%) than in the Non-NLST (18%) (Table 3), but this finding was not repeated in sample set C (Population dataset).

Discussion

Improvements in diagnostic test sensitivity and specificity, and hence PPV and NPV, facilitate clinical intervention decisions. This report confirms that the addition of high and low cut-offs to EarlyCDT®-Lung enables stratification of patients into very high risk for lung cancer, with improved PPV, or very low risk, with improved NPV.

Three lung cancer case-control sets were assessed. The case demographics were representative of patients with lung cancer: a predominance of males, more than half of patients >60 years of age, and over half the patients with early-stage lung cancer (i.e., NSCLC stages 1 or 2 or limited SCLC).

For the high positive stratum the specificity was set at 98%. In sample set A, this lowered the sensitivity from 41% for the positive stratum to 25%, but overall the PPV was greatly increased from 10.9% (1 in 9) to 27.4% (1 in 4). Similarly, for the low negative stratum the NPV increased from 98.2% (1 in 57) to 99.0% (1 in 105). The cost for this improvement is reduced performance for the two intermediate strata; for the positive stratum the PPV fell to 5.5% (1 in 18), whilst for the negative stratum the NPV fell to 98.0% (1 in 49).

Importantly, the consistency of performance when applied to sample sets B and C was found to be excellent (Tables 6,7). These data suggest that the EarlyCDT®-Lung measurements may provide a continuous variable in terms of lung cancer risk. We term this the Occurrence Score^TM and it is under development.

There was no evidence for an association between demographic factors and EarlyCDT-Lung strata. The analysis clearly suggested that EarlyCDT®-Lung is adding to demographic risk independently.

The varied origin of the sample sets supports the general applicability of the results. Nodule data was not available for the case-control datasets. In the Population dataset, however, a positive EarlyCDT®-Lung result did add to the risk of a lung nodule being cancer (manuscript in preparation). The described AAb technology and CT imaging are potentially additive rather than competitive since the presence of AAbs provides an opportunity for early detection of lung cancer, even in early-stage disease, and may therefore be useful in the management of high-risk individuals. Thus, for example, combining a low negative EarlyCDT®-Lung result with a negative CT scan would lead to a very high NPV (manuscript in preparation).

Finally, this study compared the EarlyCDT-Lung strata with whether or not patients met the entry criteria for the NLST study. Only 65% of participants in the Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial (PLCO) who developed lung cancers met the NLST criteria (9), and in another recent US study of early stage lung cancer patients (n=267) fewer than half met the criteria and would not be covered under current screening paradigms (10). In our analysis of EarlyCDT-Lung, little evidence was found that the cancer rate differed between NLST and Non-NLST cohorts, indicating that EarlyCDT-Lung provides similar risk stratification for these cohorts. Thus we can now identify individuals initially deemed at a risk lower than the NLST criteria whose risk after EarlyCDT-Lung is equivalent to the entry criteria for the NLST. This provides a rationale for identification and CT screening of individuals who fall outside the NLST criteria.

Conclusions

EarlyCDT-Lung is recommended as a tool for physicians to assess a patient’s probability of lung cancer thereby facilitating the early detection of lung cancer. By applying two additional cut-offs, we have converted the test to a four-stratum version to allow further stratification of patients into different risk categories. This enhanced stratification can be used on a population that fulfills the NLST criteria to identify super high risk sub-groups. In addition, we have shown that EarlyCDT-Lung can increase the risk estimates for certain Non-NLST patients, and bring them into the NLST range, thus facilitating more appropriate intervention for such patients.

Acknowledgements

Funding: This work was supported by the University of Nottingham and Oncimmune Ltd.

The manuscript was drafted by the authors. However, the authors would like to acknowledge and thank Varinia Munoz, from Complete Medical Communications, who provided editorial assistance funded by Oncimmune Ltd. Thanks also to Jared Allen, Oncimmune Ltd., for data-processing and IT support.

Disclosure: JFRR is Chief Scientific Officer and a shareholder in Oncimmune Ltd, a University of Nottingham spinout company; G. Hamilton-Fairley is a shareholder and Executive Chairman of Oncimmune Ltd; G. Healey is an employee of Oncimmune Ltd.; L. J. Peek is an employee of Oncimmune LLC. All remaining authors have declared no conflicts of interest.

References

1.Murray A, Chapman CJ, Healey G, et al. Technical validation of an autoantibody test for lung cancer. Ann Oncol 2010;21:1687-93 [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Boyle P, Chapman CJ, Holdenrieder S, et al. Clinical validation of an autoantibody test for lung cancer. Ann Oncol 2011;22:383-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Chapman CJ, Healey GF, Murray A, et al. EarlyCDT®-Lung test: improved clinical utility through additional autoantibody assays. Tumour Biol 2012;33:1319-26 [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Lam S, Boyle P, Healey GF, et al. EarlyCDT-Lung: an immunobiomarker test as an aid to early detection of lung cancer. Cancer Prev Res (Phila) 2011;4:1126-34 [DOI] [PubMed] [Google Scholar]
5.Armitage P, Berry G, Matthews JNS. Analysing means and proportions. In: Statistical methods in medical research, 4th edition. Oxford: John Wiley and Sons, 2001:83-146. [Google Scholar]
6.Spitz MR, Hong WK, Amos CI, et al. A risk model for prediction of lung cancer. J Natl Cancer Inst 2007;99:715-26 [DOI] [PubMed] [Google Scholar]
7.National Lung Screening Trial Research Team , Aberle DR, Adams AM, et al. Reduced lung-cancer mortality with low-dose computed tomographic screening. N Engl J Med 2011;365:395-409 [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Pepe MS. eds. The statistical evaluation of medical tests for classification and prediction. Oxford: Oxford University Press, 2003. [Google Scholar]
9.Tammemagi CM, Pinsky PF, Caporaso NE, et al. Lung cancer risk prediction: Prostate, Lung, Colorectal And Ovarian Cancer Screening Trial models and validation. J Natl Cancer Inst 2011;103:1058-68 [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Taiwo EO, Yorio JT, Yan J, et al. How have we diagnosed early-stage lung cancer without radiographic screening? A contemporary single-center experience. PLoS One 2012;7:e52313. [DOI] [PMC free article] [PubMed] [Google Scholar]

[r1] 1.Murray A, Chapman CJ, Healey G, et al. Technical validation of an autoantibody test for lung cancer. Ann Oncol 2010;21:1687-93 [DOI] [PMC free article] [PubMed] [Google Scholar]

[r2] 2.Boyle P, Chapman CJ, Holdenrieder S, et al. Clinical validation of an autoantibody test for lung cancer. Ann Oncol 2011;22:383-9 [DOI] [PMC free article] [PubMed] [Google Scholar]

[r3] 3.Chapman CJ, Healey GF, Murray A, et al. EarlyCDT®-Lung test: improved clinical utility through additional autoantibody assays. Tumour Biol 2012;33:1319-26 [DOI] [PMC free article] [PubMed] [Google Scholar]

[r4] 4.Lam S, Boyle P, Healey GF, et al. EarlyCDT-Lung: an immunobiomarker test as an aid to early detection of lung cancer. Cancer Prev Res (Phila) 2011;4:1126-34 [DOI] [PubMed] [Google Scholar]

[r5] 5.Armitage P, Berry G, Matthews JNS. Analysing means and proportions. In: Statistical methods in medical research, 4th edition. Oxford: John Wiley and Sons, 2001:83-146. [Google Scholar]

[r6] 6.Spitz MR, Hong WK, Amos CI, et al. A risk model for prediction of lung cancer. J Natl Cancer Inst 2007;99:715-26 [DOI] [PubMed] [Google Scholar]

[r7] 7.National Lung Screening Trial Research Team , Aberle DR, Adams AM, et al. Reduced lung-cancer mortality with low-dose computed tomographic screening. N Engl J Med 2011;365:395-409 [DOI] [PMC free article] [PubMed] [Google Scholar]

[r8] 8.Pepe MS. eds. The statistical evaluation of medical tests for classification and prediction. Oxford: Oxford University Press, 2003. [Google Scholar]

[r9] 9.Tammemagi CM, Pinsky PF, Caporaso NE, et al. Lung cancer risk prediction: Prostate, Lung, Colorectal And Ovarian Cancer Screening Trial models and validation. J Natl Cancer Inst 2011;103:1058-68 [DOI] [PMC free article] [PubMed] [Google Scholar]

[r10] 10.Taiwo EO, Yorio JT, Yan J, et al. How have we diagnosed early-stage lung cancer without radiographic screening? A contemporary single-center experience. PLoS One 2012;7:e52313. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Signal stratification of autoantibody levels in serum samples and its application to the early detection of lung cancer

Graham F Healey

Stephen Lam

Peter Boyle

Geoffrey Hamilton-Fairley

Laura J Peek

John F R Robertson

Abstract

Background

Methods

Results

Conclusions

Introduction

Table 1. Result categories for the four-stratum EarlyCDT®-Lung test.

Materials and methods

Assay procedure

Patient samples

Sample set A (Validation case-control)

Sample set B (Post-validation case-control)

Sample set C (Population)

Derivation of cut-offs

Statistical analysis

Table 2. Incidence, stratified by NLST Inclusion, all datasets pooled.

Table 3. Incidence, stratified by NLST Inclusion: case-control and population set separate.

Results

Patient samples

Table 4. Brief summary of demographics.

Table 5. Summary of demographics by dataset.

Analysis using the current and newly defined test cut-offs

Table 6. Summary of PPV and NPV for two-stratum and four-stratum test.

Table 7. Incidence with PPV and NPV for two-stratum and four-stratum formats.

Sample set A

Sample set B

Sample set C

Combined set

Table 8. Age by EarlyCDT-Lung risk stratum, combined dataset (Cancers).

Table 9. Age by EarlyCDT-Lung risk stratum, combined dataset (Controls).

Comparison across datasets

Risk analysis

EarlyCDT-Lung and NLST criteria

Discussion

Conclusions

Acknowledgements

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases