Combining simple blood tests to identify primary care patients with unexpected weight loss for cancer investigation: Clinical risk score development, internal validation, and net benefit analysis

Brian D Nicholson; Paul Aveyard; Constantinos Koshiaris; Rafael Perera; Willie Hamilton; Jason Oke; F D Richard Hobbs

doi:10.1371/journal.pmed.1003728

. 2021 Aug 31;18(8):e1003728. doi: 10.1371/journal.pmed.1003728

Combining simple blood tests to identify primary care patients with unexpected weight loss for cancer investigation: Clinical risk score development, internal validation, and net benefit analysis

Brian D Nicholson ^1,^*, Paul Aveyard ¹, Constantinos Koshiaris ¹, Rafael Perera ¹, Willie Hamilton ², Jason Oke ¹, F D Richard Hobbs ¹

PMCID: PMC8407560 PMID: 34464384

Abstract

Background

Unexpected weight loss (UWL) is a presenting feature of cancer in primary care. Existing research proposes simple combinations of clinical features (risk factors, symptoms, signs, and blood test data) that, when present, warrant cancer investigation. More complex combinations may modify cancer risk to sufficiently rule-out the need for investigation. We aimed to identify which clinical features can be used together to stratify patients with UWL based on their risk of cancer.

Methods and findings

We used data from 63,973 adults (age: mean 59 years, standard deviation 21 years; 42% male) to predict cancer in patients with UWL recorded in a large representative United Kingdom primary care electronic health record between January 1, 2000 and December 31, 2012. We derived 3 clinical prediction models using logistic regression and backwards stepwise covariate selection: Sm, symptoms-only model; STm, symptoms and tests model; Tm, tests-only model. Fifty imputations replaced missing data. Estimates of discrimination and calibration were derived using 10-fold internal cross-validation. Simple clinical risk scores are presented for models with the greatest clinical utility in decision curve analysis. The STm and Tm showed improved discrimination (area under the curve ≥ 0.91), calibration, and greater clinical utility than the Sm. The Tm was simplest including age-group, sex, albumin, alkaline phosphatase, liver enzymes, C-reactive protein, haemoglobin, platelets, and total white cell count. A Tm score of 5 balanced ruling-in (sensitivity 84.0%, positive likelihood ratio 5.36) and ruling-out (specificity 84.3%, negative likelihood ratio 0.19) further cancer investigation. A Tm score of 1 prioritised ruling-out (sensitivity 97.5%). At this threshold, 35 people presenting with UWL in primary care would be referred for investigation for each person with cancer referred, and 1,730 people would be spared referral for each person with cancer not referred. Study limitations include using a retrospective routinely collected dataset, a reliance on coding to identify UWL, and missing data for some predictors.

Conclusions

Our findings suggest that combinations of simple blood test abnormalities could be used to identify patients with UWL who warrant referral for investigation, while people with combinations of normal results could be exempted from referral.

Dr. Brian D Nicholson and colleagues investigate whether combinations of routine blood tests could be used to stratify patients in UK with unexpected weight loss based on their risk of cancer.

Author summary

Why was this study done?

The risk of an early and late stage cancer diagnosis is increased during the 3 to 6 months following the first record of unexpected weight loss (UWL) in primary care. UWL presents a diagnostic challenge as it is associated with a wide range of other benign and serious conditions.
Diagnostic strategies that avoid the harms of unnecessary invasive and costly cancer investigation are required for patients with UWL. Our research has shown that the absence of individual or pairs of co-occurring clinical features does not reduce the likelihood of cancer enough to sufficiently rule-out patients from further cancer investigation. It has also identified that primary care clinicians commonly request multiple blood tests when patients present with UWL.
We aimed to identify whether the presence or absence of risk factors, symptoms, signs, and blood test results could be used together to rule-out more accurately the need for cancer investigation in patients with UWL.

What did the researchers do and find?

We analysed the electronic health records of 63,693 adults with UWL recorded between January 1, 2000 and December 31, 2012 to derive 3 clinical scores including symptoms, symptoms and test results, and test results, to predict the risk of cancer within 6 months.
The scores including test results were discriminative between patients with and without cancer, were well calibrated at the levels of risk that decisions to investigate are made in primary care, and showed superior clinical utility compared to the symptoms-only model.

What do these findings mean?

Simple scores including age-group, sex, and 7 simple primary care blood tests (albumin, alkaline phosphatase, C-reactive protein, haemoglobin, liver enzymes, platelets, and total white cell count) could be used to select patients with UWL who do not warrant further cancer investigation in addition to those that do.
Further research is required to validate these scores in external datasets from settings, populations, and subgroups of interest, to understand how to maximise uptake in primary care, and to assess whether the use of this approach to cancer investigation might impact cancer outcomes.

Introduction

Unexpected weight loss (UWL) is a presenting feature of cancer for which there remains no consensus on the most appropriate investigation strategy in primary care [1]. Patients with UWL recorded by their primary care clinician are more likely to be diagnosed with the following cancers within 3 months: pancreatic, cancer of unknown primary, gastro-oesophageal, lymphoma, hepatobiliary, lung, bowel, and renal tract [2]. This association is greatest in males once aged 60 years or older and in females 80 years or older [2,3]. Current investigation guidelines focus on selecting patients for single-site cancer investigation based on simple combinations of clinical features (individual risk factors, signs, symptoms, and blood test abnormalities) [3–5].

As most patients presenting to primary care with UWL will not have cancer, diagnostic strategies that avoid the harms of unnecessary invasive and costly investigation are also required for patients at a low risk of cancer [1]. Our previous work has shown that the presence of individual co-occurring clinical features increases the likelihood of cancer sufficiently to rule-in cancer investigation [5]. However, the absence of individual co-occurring clinical features, including pairs of normal inflammatory markers, do not reduce the likelihood of cancer sufficiently enough to rule-out patients from further cancer investigations [5,6]. Primary care clinicians commonly request multiple blood tests when patients present with UWL [5,7]. There is little guidance on how clinicians should interpret these blood tests in combination or which are most relevant for use in clinical practice [1,6]. When baseline investigations are normal, a watchful waiting approach may be preferable to invasive testing [8].

Prediction models have been developed to identify the most helpful combinations of clinical features for use in clinical practice [9,10]. However, these studies were based on small cohorts from secondary care; they recommend conflicting approaches and include some investigations uncommon in primary care. Research using data from primary care is therefore needed to investigate whether the absence of risk factors and co-occurring clinical findings in the context of normal test results could reduce the risk of cancer to sufficiently rule-out patients with UWL from invasive cancer investigation.

We aimed to derive and internally validate prediction models using co-occurring risk factors, symptoms, signs, and blood test data to identify those clinical features that could be used together to stratify cancer risk in patients attending primary care with UWL.

Methods

The protocol was approved by the Independent Scientific Advisory Committee (ISAC) of the MHRA (protocol number 16_164A2A) [11]. Ethics approval for observational research using the CPRD with approval from ISAC was granted by a National Research Ethics Service committee (Trent Multiresearch Ethics Committee, REC reference number 05/MRE04/87). We followed the TRIPOD (S1 TRIPOD Checklist) reporting guidelines [12]. Stata (version 15) was used for all analyses.

Cohort design and population

We selected a cohort of patients with UWL indicated by the presence of a code for UWL previously been shown to be linked to measured weight loss [13,14]. Patients were selected for the derivation cohort if UWL was first coded between January 1, 2000 and December 31, 2012 in the Clinical Practice Research Datalink (CPRD). The CPRD is an anonymised database of primary care records database covering a representative 6.9% of the United Kingdom population [15]. Patients were included if they were ≥18 years of age, registered with a CPRD general practice, eligible for linkage to NCRAS and Office for National Statistics (ONS) data, and at least 12 months of data before their first UWL code (the “index date”). These UWL Read codes equated to a mean weight loss of ≥5% within a 6-month period in our previous internal validation study of weight-related coding in CPRD [13]. UWL may be coded following a range of clinical scenarios, including UWL reported as the patient’s presenting complaint, after targeted history taking, following weight measurement as part of the clinical examination, or as part of a routine health check or chronic disease review [5]. Patients were excluded if they had a prescription of weight-reducing medication (orlistat) or a code for bariatric surgery in the previous 6 months, or if they had been previously diagnosed with cancer.

Outcome definition

The outcome was any cancer diagnosed within 6 months of the index date identified in the CPRD or NCRAS, using an existing library of codes [2]. Patients were followed up until the date of the first cancer diagnosis or for 6 months, whichever occurred first. Six months was chosen as previous research has shown that this is the period associated with an increased risk of cancer diagnosis following a presentation of UWL to primary care [2]. Cancers classified as nonmelanoma skin cancer, in situ, benign, ill-defined, or uncertain were excluded.

Predictor variables

Sociodemographic features, recorded on or before the index date, were extracted for each patient (Table 1). Preexisting comorbidities were identified using a previously described approach [13]. Clinical features shown to be associated with cancer when recorded in the 3 months before to 1 month after the UWL date were identified within that time period [5]. Continuous results of blood tests commonly requested within this time period were also identified using entity codes in CPRD, and outliers and erroneous results were dropped [5] (Table 2).

Table 1. Baseline characteristics of the derivation cohort and imputation dataset.

	Derivation cohort		Imputation dataset
	n (%)	mean (SD)	%/mean (SD)
Sex
Male	26,758 (41.8)		41.8
Female	37,215 (58.2)		58.2
Age-group (years)
18–39	14,290 (22.3)		22.3
40–49	8,016 (12.5)		12.5
50–59	8,511 (13.3)		13.3
60–69	9,017 (14.1)		14.1
70–79	11,565 (18.1)		18.1
80+	12,574 (19.7)		19.7
Smoking status
Current	9,629 (15.1)		31.0^*
Ex-smoker	7,164 (11.1)		22.9^*
Nonsmoker	14,457 (22.6)		46.0^*
Missing	32,723 (51.2)		-
Alcohol consumption
Current	18,435 (28.8)		66.9^*
Nondrinker	8,095 (12.7)		29.1^*
Past drinker	1,087 (1.7)		4.0^*
Missing	36,356 (56.8)		-
Body mass index
Underweight	6,691 (10.9)		12.4^*
Normal	33,846 (52.9)		60.0^*
Overweight	10,790 (16.9)		18.8^*
Obese	5,141 (8.0)		8.8^*
Missing	7,235 (11.3)		-
Comorbidities
0	8,870 (13.9)		13.9
1	12,762 (20.0)		20.0
2	12,641 (19.8)		19.8
3	10,378 (16.2)		16.2
4	7,638 (11.9)		11.9
5+	11,684 (18.3)		18.3
Co-occurring clinical features
Appetite loss	1,592 (2.5)		2.5
Abdominal mass	233 (0.4)		0.4
Abdominal pain	3,762 (5.9)		5.9
Back pain	3,277 (5.1)		5.1
Change in bowel habit	641 (1.0)		1.0
Chest pain (noncardiac)	1,866 (2.9)		2.9
Chest signs	90 (0.1)		0.1
Dyspepsia	1,776 (2.8)		2.8
Dysphagia	717 (1.1)		1.1
Haemoptysis	181 (0.3)		0.3
Iron deficiency anaemia	780 (1.2)		1.2
Jaundice	126 (0.2)		0.2
Lymphadenopathy	209 (0.3)		0.3
Venous thromboembolism	201 (0.3)		0.3
Liver function tests
Albumin (g/L)	41,622 (65.1)	41.1 (4.8)	41.2 (4.8)^*
Alkaline phosphatase (iu/L)	41,278 (64.5)	90.8 (57.5)	89.9 (56.8)^*
AST/ALT (iu/L)	10,911 (17.1)	25.9 (20.7)	25.4 (18.9)^*
Bilirubin (umol/L)	41,895 (65.5)	11.1 (6.9)	11.1 (6.8)^*
Full blood count
Haemoglobin (g/L)	46,129 (72.1)	13.5 (1.6)	13.5 (1.6)^*
Lymphocytes (×10⁹/L)	42,502 (66.5)	1.9 (0.7)	1.9 (0.8)^*
Mean cell volume (fL)	45,010 (70.4)	91.1 (6.2)	91.1 (6.2)^*
Monocytes (×10⁹/L)	42,607 (66.6)	0.6 (0.4)	0.6 (0.4)^*
Neutrophils (×10⁹/L)	40,592 (63.5)	4.6 (2.1)	4.6 (2.1)^*
Platelets (×10⁹/L)	45,239 (70.7)	272.5 (88.5)	272.2 (88.3)^*
Total white cell count (×10⁹/L)	44,714 (69.9)	7.3 (2.5)	7.3 (2.5)^*
Inflammatory markers
C-reactive protein (mg/L)	14,703 (23.0)	13.3 (30.1)	11.7 (27.3)^*
Erythrocyte sedimentation rate (mm/h)	22,931 (35.9)	17.4 (21.4)	17.6 (21.0)^*
Cancer diagnosis within 6 months
Yes	908 (1.4)		1.4
No	63,065 (98.6)		98.6
Total	63,973		50×63973

Open in a new tab

*Imputed data.

ALT, alanine aminotransferase; AST, aspartate aminotransferase; SD, standard deviation.

Table 2. Sm—Symptoms-only model.

Covariate		Coefficient (95% CI)	Odds Ratio (95% CI)	p-value
Demographics
Age-group (ref = 60–69 years)	18–39	−3.85 (−4.75–2.96)	0.02 (0.01–0.05)	<0.001
	40–49	−1.93 (−2.40–1.46)	0.14 (0.09–0.23)	<0.001
	50–59	−1.11 (−1.43–0.78)	0.33 (0.24–0.46)	<0.001
	70–79	0.63 (0.43–0.82)	1.87 (1.53–2.28)	<0.001
	80+	0.72 (0.52–0.93)	2.06 (1.69–2.52)	<0.001
Sex (ref = female)	male	0.77 (0.63–0.91)	2.16 (1.87–2.48)	<0.001
Risk factors
Smoking status (ref = non-smoker)	current	0.61 (0.37–0.84)	1.84 (1.45–2.32)	<0.001
	ex-smoker	0.21 (−0.02–0.43)	1.23 (0.98–1.54)	0.068
Symptoms
Abdominal pain	Yes	0.77 (0.56–0.98)	2.17 (1.76–2.67)	<0.001
Appetite loss	Yes	0.61 (0.33–0.89)	1.84 (1.39–2.43)	<0.001
Dyspepsia	Yes	0.46 (0.16–0.75)	1.58 (1.18–2.12)	<0.001
Sign
Abdominal mass	Yes	1.09 (0.59–1.59)	2.98 (1.81–4.93)	<0.001
Chest signs	Yes	1.32 (0.60–2.04)	3.73 (1.81–7.68)	<0.001
Iron deficiency anaemia	Yes	1.15 (0.82–1.48)	3.16 (2.27–4.41)	<0.001
Jaundice	Yes	1.89 (1.33–2.45)	6.62 (3.77–11.6)	<0.001
Lymphadenopathy	Yes	1.54 (0.73–2.35)	4.67 (2.08–10.5)	<0.001
Venous thromboembolism	Yes	1.10 (0.47–1.73)	3.00 (1.60–5.64)	<0.001
Constant		−4.97 (−5.2–4.75)		<0.001

Open in a new tab

Multiple imputation

Multiple imputation was used to replace missing values for smoking status, alcohol intake, body mass index, and blood tests using the mi suite of commands in Stata [16,17]. Fifty imputed datasets were created. Multinomial logistic regression was used to impute categorical variables, and predictive mean matching (PMM) with 5 donors was used to impute continuous variables [18]. The imputation model included the outcome, all candidate variables to be included in the final predictions models, and auxiliary variables to increase the likelihood that the missing at random assumption was satisfied. These were a combination of variables found to predict missingness, personal characteristics, comorbidities, risk factors, other markers of inflammation, or full blood count components (S1 Text). For the primary analysis, continuous test results were dichotomised as abnormal/normal in each imputed dataset using standard laboratory ranges (S1 Table). Rubin’s rules were used to combine results across the imputed datasets [16].

Model derivation

Three prediction models were derived in the complete dataset: a symptoms-only model (Sm), a symptoms and tests model (STm), and a simple tests-only model (Tm). The mim Stata command was used to select variables for each model in the imputed data using backwards stepwise logistic regression, using a p-value of <0.01 for inclusion. Candidate variables for Sm included age-group, sex, smoking status, and clinical features found to be associated with a cancer diagnosis within 6 months in males and females (Table 2) [5]. Candidate variables for the STm also included the blood tests most commonly requested by GPs in patients with UWL and tests used in prognostic scores for patients with cancer (Table 3) [5,19,20]. For the Tm, candidate variables included age-group, sex, smoking status, and the blood tests, and as we intended to derive a parsimonious model, we chose the quantum over component tests; for example, the total white cell count was included rather than the white cell subtypes (Table 4). The most complex model (STm) had at least 15 events per variable [21].

Table 3. STm—Symptoms and test model.

Covariate		Coefficient (95% CI)	Odds ratio (95% CI)	p-value	Risk score*
Demographics
Age-group (ref = 60–69 yrs)	18–39 yrs	−3.34 (−4.24–2.44)	0.04 (0.01–0.09)	<0.001	−10
	40–49 yrs	−1.61 (−2.09–1.13)	0.20 (0.12–0.32)	<0.001	−5
	50–59 yrs	−0.96 (−1.3–0.63)	0.38 (0.27–0.53)	<0.001	−3
	70–79 yrs	0.36 (0.16–0.57)	1.44 (1.17–1.77)	0.001	1
	80+ yrs	0.33 (0.11–0.54)	1.38 (1.12–1.72)	0.003	1
Sex (ref = female)	Male	0.53 (0.38–0.69)	1.70 (1.46–1.99)	<0.001	2
Smoking status (ref = never)	Current	0.60 (0.35–0.85)	1.82 (1.41–2.34)	<0.001	2
	Ex-smoker	0.18 (−0.06–0.42)	1.20 (0.94–1.53)	0.144	1
Symptoms
Abdominal pain	Yes	0.64 (0.42–0.86)	1.90 (1.52–2.37)	<0.001	2
Change in bowel habit	Yes	0.71 (0.23–1.18)	2.03 (1.26–3.26)	0.004	2
Dyspepsia	Yes	0.47 (0.16–0.77)	1.59 (1.17–2.17)	0.003	1
Signs
Iron Deficiency Anaemia	Yes	0.51 (0.15–0.88)	1.67 (1.16–2.41)	0.006	2
Jaundice	Yes	0.85 (0.24–1.45)	2.33 (1.27–4.28)	0.006	3
Blood tests
Low albumin	Yes	0.36 (0.17–0.55)	1.43 (1.18–1.73)	<0.001	1
Raised alkaline phosphatase	Yes	0.67 (0.48–0.86)	1.95 (1.61–2.36)	<0.001	2
Raised AST/ALT	Yes	0.69 (0.39–0.99)	1.99 (1.47–2.70)	<0.001	2
Raised CRP	Yes	1.41 (1.18–1.64)	4.09 (3.24–5.16)	<0.001	4
Low haemoglobin	Yes	0.37 (0.19–0.55)	1.45 (1.21–1.74)	<0.001	1
Low mcv	Yes	−0.47 (−0.68–0.25)	0.63 (0.50–0.78)	<0.001	−1
Raised platelets	Yes	0.50 (0.31–0.70)	1.65 (1.36–2.02)	<0.001	2
Raised monocytes	Yes	0.68 (0.50–0.86)	1.97 (1.65–2.36)	<0.001	2
Low lymphocytes	Yes	0.36 (0.18–0.53)	1.43 (1.20–1.70)	<0.001	1
Constant		−6.09 (−6.36–5.81)		<0.001

Open in a new tab

Candidate covariates included in the backwards stepwise selection procedure using p < 0.01 to retain covariates included the following: sex, age-group, smoking status, abdominal mass, abdominal pain, appetite loss, back pain, chest pain (noncardiac), chest signs, change in bowel habit, dyspepsia, dysphagia, iron deficiency anaemia, jaundice, lymphadenopathy, venous thromboembolism, albumin, alkaline phosphatase, bilirubin, C-reactive protein, erythrocyte sedimentation rate, haemoglobin, liver enzymes (AST/ALT), lymphocytes, mean cell volume, monocytes, neutrophils, and platelets.

*Conversion factor = 0.32. The conversion factor is used to translate coefficients into the risk score, rounded to the nearest integer.

ALT, alanine aminotransferase; AST, aspartate aminotransferase; CRP, C-reactive protein; mcv, mean cell volume.

Table 4. Tm—Tests-only model.

Covariate		Coefficient (95% CI)	Odds ratio (95% CI)	p-value	Risk score*
Demographics
Age-group (ref = 60–69 years)	18–39	−3.31 (−4.20–2.41)	0.04 (0.01–0.09)	<0.001	−10
	40–49	−1.61 (−2.08–1.14)	0.20 (0.12–0.32)	<0.001	−5
	50–59	−0.94 (−1.28–0.61)	0.39 (0.28–0.54)	<0.001	−3
	70–79	0.36 (0.15–0.56)	1.43 (1.17–1.75)	0.001	1
	80+	0.26 (0.06–0.47)	1.30 (1.06–1.60)	0.012	1
Sex (ref = female)	male	0.61 (0.46–0.76)	1.84 (1.59–2.14)	<0.001	2
Blood tests
Low albumin	Yes	0.38 (0.19–0.57)	1.47 (1.21–1.78)	<0.001	1
Raised alkaline phosphatase	Yes	0.70 (0.51–0.89)	2.01 (1.66–2.43)	<0.001	2
Raised AST/ALT	Yes	0.70 (0.41–1.00)	2.02 (1.51–2.71)	<0.001	2
Raised CRP (>10)	Yes	1.50 (1.28–1.73)	4.50 (3.60–5.64)	<0.001	5
Raised total white cell count	Yes	0.60 (0.40–0.79)	1.82 (1.50–2.20)	<0.001	2
Raised platelets	Yes	0.52 (0.32–0.72)	1.68 (1.38–2.04)	<0.001	2
Low haemoglobin	Yes	0.44 (0.26–0.61)	1.55 (1.30–1.85)	<0.001	1
Constant		−5.69 (−5.93–5.46)		<0.001

Open in a new tab

Candidate covariates included in the backwards stepwise selection procedure using p < 0.01 to retain covariates included the following: sex, age-group, smoking status, albumin, alkaline phosphatase, bilirubin, CRP, erythrocyte sedimentation rate, haemoglobin, liver enzymes (AST/ALT), platelets, and total white cell count.

*Conversion factor = 0.32. The conversion factor is used to translate coefficients into the risk score, rounded to the nearest integer.

ALT, alanine aminotransferase; AST, aspartate aminotransferase; CRP, C-reactive protein.

Internal cross-validation

Ten-fold internal cross-validation was used to assess overall model performance using the mean predicted probability for each patient across all 50 imputation datasets with the cvauroc command in Stata [22]. Model performance was assessed using discrimination and calibration statistics. Discrimination was quantified using the area under the curve (AUC) and 95% confidence intervals calculated using bootstrap resampling. Calibration plots were generated using Stata’s pmcalplot command to assess how the predicted probabilities derived by each model correspond to the observed proportion of patients diagnosed with cancer [23].

Decision curve analysis

We then used decision curve analysis (DCA) to compare the standardised net benefit (SNB) and proportion of investigations avoided by the Sm, STm, and Tm with scenarios where no prediction model was used (i.e., treat everyone or treat nobody) across a range of risk thresholds (threshold probabilities) using the dca command in Stata [24]. In general, the strategy with the highest net benefit (the highest plotted curve) is considered to have the greatest clinical utility at any given risk threshold [25]. Net benefit represents the proportion of the studied population with true positive results minus the proportion with false positives multiplied by the odds of cancer at each risk threshold. To ease interpretation, we calculated the SNB to give the proportion of the maximum achievable utility attained by each model (SNB = NB / prevalence of cancer) [26]. An alternative presentation of DCA is the proportion of patients who would avoid further investigation without missing a cancer diagnosis at each risk threshold [27].

Clinical risk scores

Finally, to demonstrate how these models could be used in clinical practice, we followed established methods to develop 2 simple clinical risk scores for the STm and Tm [28]. The risk score associated with each variable was derived by multiplying each coefficient by the same conversion factor and rounding the result to the nearest whole number. We calculated the mean point score for each patient across the imputation datasets and constructed a 2 × 2 table using each total score as the cutoff. We calculated the sensitivity, specificity, positive likelihood ratio (PLR), negative likelihood ratio (NLR), positive predictive value (PPV), and negative predictive values (NPVs) for each score.

Sensitivity analysis

We refitted the models using the missing indicator method to assess our approach to multiple imputation. We refitted the models using continuous blood test results to explore the impact on discrimination and calibration statistics. We used Stata’s mfpmi command to select the most appropriate functional form for each continuous covariate in relation to the outcome [29].

Results

Study participants

In the derivation cohort of 63,973 adults aged ≥18 years with UWL recorded, 908 (1.4%) were diagnosed with cancer within 6 months of the index date, of whom 902 (99.3%) were aged ≥40 years. Table 1 summarises the baseline characteristics of the study population. Patients with UWL were more commonly females (58.2%), aged ≥60 years (51.8%), and of normal body mass index (52.9%). The most commonly recorded clinical features were abdominal pain (5.9%), back pain (5.1%), noncardiac chest pain (2.9%), and dyspepsia (2.8%) (Table 1). The most commonly recorded tests were haemoglobin (72.1%), platelets (70.7%), and total white cell count (69.9%).

Missing data

A total of 32,723 (51.15%) patients had missing data on smoking status, 36,356 (56.8%) on alcohol consumption, and 7,235 (11.3%) had no body mass index recorded (Table 1). The most commonly missing blood tests were liver enzymes (53,062, 82.9%), C-reactive protein (49,270, 77.0%), and erythrocyte sedimentation rate (41,042, 64.1%). Imputation diagnostics were deemed satisfactory for all imputed variables (S1 Fig). The direction of the estimates was the same, and the confidence intervals overlapping when comparing variables included in the final imputed and missing indicator models.

Model development

In the final Sm, 12 of 17 candidate variables were associated with cancer (Table 2), of which concurrent jaundice (adjusted odds ratio 6.62 (95% CI 3.77 to 11.63)) and lymphadenopathy (4.67 (2.08 to 10.47)) were most predictive. Out of 29 candidate variables, 17 were retained in the final STm (Table 3), of which a raised C-reactive protein (4.09 (3.24 to 5.16) and concurrent jaundice (2.33 (1.27 to 4.28)) were most predictive. Out of 12 candidate predictor variables, 9 were retained in the final Tm (Table 4), of which raised C-reactive protein (4.50 (3.60 to 5.64)) and raised liver enzymes (2.02 (1.51 to 2.71)) were most predictive.

Internal validation

The AUC for both the STm (0.92 (0.91 to 0.93)) and the Tm (0.91 (0.90 to 0.92)) showed discrimination, which was superior to the Sm (0.79 (0.78 to 0.81)) (Table 5). However, the calibration statistics showed that the Sm was better calibrated compared to the STm and Tm. The calibration plots showed that the difference in calibration statistics was mainly due to underprediction in the highest decile of risk for the STm and Tm that was not seen for the Sm (Fig 1). Refitting the STm and Tm with continuous blood test results instead of dichotomised blood test results made negligible difference to model performance (Table 5, S2 Fig, S2 Table).

Table 5. Prediction model discrimination and calibration statistics.

Models	Discrimination	Calibration
Models	AUC (95% CI)	E:O	CITL	Slope
Sm	0.79 (0.78–0.81)	0.995	0.005	0.993
STm (dichotomous primary analysis)	0.92 (0.91–0.93)	0.825	0.210	1.197
STm (continuous sensitivity analysis)	0.92 (0.91–0.93)	0.820	0.220	1.199
Tm (dichotomous primary analysis)	0.91 (0.90–0.92)	0.824	0.209	1.213
Tm (continuous sensitivity analysis)	0.92 (0.91–0.93)	0.826	0.211	1.199

Open in a new tab

AUC, area under the curve; CITL, calibration in the large; E:O, ratio of expected (predicted) probability vs observed frequency of the outcome; Sm, symptoms-only model; STm, symptoms and tests model; Tm, tests-only model.

An AUC of 0.5 represents chance, and 1 represents perfect ability to discriminate between patients who will and patients who will not be diagnosed with cancer [47]. Perfect calibration has a calibration slope of 1, a CITL of 0, and an O:E ratio of 1 [41].

Fig 1 — **Calibration plots for the Sm (1a and 1b), the STm (1c and 1d), and the Tm (1e and 1f).** Green points are deciles of predicted probability with error bars. The right hand panels (1b, 1d, and 1e) are zoomed in to show in detail the first 0.1% of predicted probability and observed frequency. AUC, area under the curve; CITL, calibration in the large; E:O, ratio of expected (predicted) probability vs observed frequency of the outcome; Sm, symptoms-only model; STm, symptoms and tests model; Tm, tests-only model.

Decision curve analysis

The STm had greatest clinical utility (Fig 2A). The STm had higher SNB than the Sm for the risk thresholds of 0.4% to 18%, and the Tm had greater net benefit to the Sm for 0.5% to 15% (Fig 2A, S3 Table). At a cancer risk threshold of 1%, these differences translate into a 55% reduction in further cancer testing if using the STm compared to investigating all patients, a 19% reduction compared to using the Sm, or a 2% reduction compared to the Tm (Fig 2B, S3 Table).

Examples of applying the clinical risk scores

Figs 3 and 4 show diagnostic accuracy statistics corresponding to each possible point score for the STm and the Tm, respectively. S4 and S5 Tables show how these statistics apply to 100,000 patients with UWL for the STm and the Tm risk score thresholds, respectively. Box 1 gives examples of how the STm score might be used in clinical practice, for example, by choosing the optimal threshold to sufficiently rule-out further cancer investigation.

Fig 3 — FN, false negative; FP, false negative; N/E, not estimable; NLR, negative likelihood ratio; NPV, negative predictive value; PLR, positive likelihood ratio; PPV, positive predictive value; Sn, sensitivity; Sp, specificity; STm, symptoms and tests model; TN, true negative; TP, true positive.

Fig 4 — FN, xxxx; FP, xxxx; N/E, xxxx; NLR, negative likelihood ratio; NPV, negative predictive value; PLR, positive likelihood ratio; PPV, positive predictive value; Sn, xxxx; Sp, xxxx; Tm, tests-only model; TN, xxxx; TP, xxxx.

Box 1. Consequences of applying the STm score in clinical practice (Tables 3 and S4)

Clinical example

A 52-year-old woman with UWL, no other clinical features, a low albumin, high alkaline phosphatase, and a raised C-reactive protein corresponds to an STm score of 4: sensitivity 93.9%, specificity 68.4%. At this threshold, 23 people would be referred for each person with cancer referred, and 784 people would be spared investigation for each person with cancer not referred. Per 100,000 people with UWL, 1,333 with cancer would be referred, 31,152 would be referred unnecessarily, 67,429 correctly spared referral, and 86 people with cancer would not be referred.

Example of maximising sensitivity to rule-out cancer

An STm score of 1 prioritises ruling-out cancer by maximising sensitivity: sensitivity 98.6%, specificity 43.4%. At this threshold, 40 people with UWL would be referred for each person with cancer referred, and 2,139 people with UWL spared referral for each person with cancer not referred. Per 100,000 people with UWL, 1,399 people with cancer would be referred, 55,797 people would be unnecessarily referred, 42,784 correctly spared referral, and 20 people with cancer not referred.

Example of a threshold to balance ruling-in and ruling-out cancer

An STm score of 6 would balance ruling-in (PLR 5.09) and ruling-out (NLR 0.16) the need for referral: sensitivity 86.3%, specificity 83.0%. At this threshold, there would be 14 people referred for investigation for each person with cancer referred, and 422 would be spared investigation for each person with cancer not referred. Per 100,000 people with UWL, 1,225 people with cancer would be referred, 16,759 patients would be unnecessarily referred, 81,822 correctly spared referral, and 194 people with cancer not referred.

Example of a threshold close to the NICE PPV threshold of 3%

An STm score of 2 is the closest to a PPV to 3%, the threshold chosen by NICE to warrant further investigation: sensitivity 97.9%, specificity 50.9%. At this threshold, 35 people would be referred for each person with cancer referred, and 1,730 patients would be spared investigation for each person with cancer not referred. Per 100,000 patients with UWL, 1,390 people with cancer would be referred, 48,403 patients would be unnecessarily referred, 50,178 correctly spared referral, and 29 people with cancer not referred.

Discussion

Summary of findings

Combinations of multiple simple test results were discriminative between patients with and without cancer, were well calibrated at the levels of risk that decisions to investigate are made in primary care, and showed superior clinical utility when compared to symptoms and signs. We present stand-alone risk scores that could be used by GPs to guide test selection and interpretation. The simplest includes age-group, sex, and 7 primary care blood tests (alkaline phosphatase, liver enzymes, albumin, C-reactive protein, haemoglobin, platelets, and total white cell count). They could be used to select patients with UWL who do not warrant further cancer investigation in addition to those that do.

Strengths and limitations

Our study design aimed to minimise bias. We excluded patients with objective evidence of intentional weight loss, restricted co-occurring clinical features to the time of the UWL presentation [5], and included only the first UWL code [2,30]. We were reliant on electronic health record (EHR) codes to define UWL as weight is not recorded frequently enough [13]. It is unclear how recording bias relates to coding for UWL, which occurs when GPs preferentially code clinical features that they associate with cancer and can lead to inflated estimates of association for these features [31]. We excluded patients with a past history of cancer and only included cancers coded within 6 months of the UWL date to ensure that we investigated a first diagnosis of a cancer associated with UWL.

By utilising multiple imputation to replace missing risk factor and continuous test result data, we could produce precise estimates for combinations of multiple covariates. Previous studies have not done this and have had to focus on single or pairs of blood test abnormalities [3]. We included auxiliary variables to increase the likelihood that missing values could be accurately predicted by the observed data (that they are missing at random) [16]. However, there is no established method to formally evaluate whether this was successful. Imputation allowed us to combine multiple blood test results and to show that once blood tests are modelled with sex and age, there appears to be no need to include additional risk factors and clinical features.

We dichotomised each blood test for the primary analysis to derive a simple risk score for use in clinical practice using the upper or lower boundary of the normal reference range. This can have limitations. Firstly, by dichotomising a continuous variable, information is lost by grouping slightly and extremely abnormal results together. Secondly, choosing raised values to define abnormal might be unhelpful for cancer sites associated with low values (and vice versa). Thirdly, we chose the upper limit of the normal range to define abnormal for blood tests where there is no consensus on how to define abnormal. Refitting the models to include continuous linear and fractional polynomial terms made no meaningful difference to model performance.

We required a testing strategy appropriate for a composite of all cancer types. The literature reports that the direction of blood test abnormalities are similar for most cancers and that a pro-inflammatory state underpins many cancers and cancer cachexia [19,20,32–39]. While this supports our approach, it remains likely the composite cancer outcome is partly responsible for the underprediction observed at the highest decile of risk. It is unlikely that this would have adverse clinical consequences because GPs’ decision to refer for invasive testing is likely to be triggered at lower thresholds than this.

We used 10-fold internal cross-validation to derive estimates of predictive performance [40]. However, internal validation may produce overoptimistic estimates and so external validation remains necessary to assess the generalisability of our findings in settings, populations, and subgroups of interest [41,42]. Primary care data will be identified from alternative clinical systems for the same time period or from the same clinical systems for a later time period to account for variation in UK practice, international settings with alternative approached to weight measurement and weight loss recording and where a different spectrum of patients consults with primary care, and in systems where similar blood tests are used with differing degrees of missingness.

Findings in context

One previous study developed a risk score to predict cancer in a cohort of 256 patients referred for the investigation of UWL (AUC 0.90 (95% CI 0.88 to 0.92)) including the following: age ≥80 years, white blood cells, albumin, alkaline phosphatase, and lactate dehydrogenase [10]. The AUC was notably lower when externally validated (0.70 (0.61 to 0.78) in a cohort of 290 consecutive patients referred to hospital with UWL [9]. This study also developed a simpler 3-variable score that included age, alkaline phosphatase, and albumin and gave an AUC of 0.74 (0.66 to 0.81) [9]. The models we developed here produced higher AUCs than these models and, more importantly, include a cohort of all patients presenting to primary care with UWL, not those referred to hospital for further investigation of UWL.

Two existing primary care prediction models incorporate multiple symptoms and risk factors to estimate cancer risk over a 2-year period for 1.24 million females (AUC 0.85 (95% CI 0.84 to 0.85)) and 1.26 million males (0.87 (0.88 to 0.89)) aged 25 to 89 years [43,44]. They also demonstrate good calibration at the lower deciles of risk and miscalibration at the highest decile of risk. The relative timing of symptoms was not reported. Blood tests were not included, except haemoglobin results in the 12 months before to 2 months after study entry were used to define anaemia as a baseline risk factor. Consequently, the design and reporting of these models make it impossible to understand the diagnostic value of symptoms and blood tests co-occurring with UWL.

Implications for research and clinical practice

DCA allowed us to demonstrate the importance of simple tests in comparison to symptoms and signs. It was used to define how risk thresholds related to the probable “yield” of referrals in diagnosing cancer through hospital-based invasive investigation [25] However, DCA cannot itself define the acceptable balance between the number of people referred to each person with cancer referred or between the number of people spared referral to each person with cancer not referred These trade-offs are for patients, clinicians, and society at large to decide and could be evaluated further in health economic analysis. The examples shown in Box 1 illustrates 4 examples of how a risk score could inform these decisions. Moreover, these examples could help GPs develop watchful waiting strategies for patients with intermediate risks of having cancer, perhaps scheduling periodic review and blood test reevaluation to examine whether the risk has changed.

There is a dearth of research on how risk scores for cancer are best adopted into clinical practice. We intended to derive an intuitive risk score that mirrors clinical practice by focussing on simple combinations of clinical features including commonly used and available blood tests. Each score could be completed by GPs by hand, used as an online calculator, or integrated as into the EHR. However, the limited literature shows that risk scores are underused, and GPs find predictions difficult to interpret or are distrustful of them, especially when they conflict with intuitive clinical judgement [45,46]. Further research is therefore required to understand their uptake and to assess whether their use impacts on cancer outcomes.

Conclusions

Disclaimer

The lead author affirms that this manuscript is an honest, accurate, and transparent account of the study being reported; that no important aspects of the study have been omitted; and that any discrepancies from the study as planned (and, if relevant, registered) have been explained.

Supporting information

S1 TRIPOD Checklist

(DOCX)

Click here for additional data file.^{(96.3KB, docx)}

S1 Fig. Multiple imputation diagnostics for the first 20 imputations for continuous blood test variables.

(DOCX)

Click here for additional data file.^{(1.3MB, docx)}

S2 Fig. Cross-validation plots for continuous STm model and the Tm-only model.

Green points are deciles of predicted probability with error bars. The right hand panels are zoomed in to show in detail the first 0.1% of predicted probability and observed frequency. AUC, area under the curve; CITL, calibration in the large; E:O, ratio of expected (predicted) probability vs observed frequency of the outcome; STm, symptoms and tests model; Tm, tests-only model.

(DOCX)

Click here for additional data file.^{(161.4KB, docx)}

S1 Table. Common laboratory test ranges in general practice.

(DOCX)

Click here for additional data file.^{(32.9KB, docx)}

S2 Table. Full specification of the continuous STm and the Tm. STm, symptoms and test model; Tm, tests-only model.

(DOCX)

Click here for additional data file.^{(26.1KB, docx)}

S3 Table. SNB and reduction in further investigation comparing models and investigating all patients or none.

SNB = net benefit / prevalence. SNB, standardised net benefit; STm, symptoms and test model; Sm, symptoms-only model; Tm, tests-only model.

(DOCX)

Click here for additional data file.^{(25.8KB, docx)}

S4 Table. STm risk score thresholds expressed per 100,000 patients with UWL investigated. STm, symptoms and test model; UWL, unexpected weight loss.

(DOCX)

Click here for additional data file.^{(23.1KB, docx)}

S5 Table. Tm risk score thresholds expressed per 100,000 patients with UWL investigated. Tm, tests-only model; UWL, unexpected weight loss.

(DOCX)

Click here for additional data file.^{(22.7KB, docx)}

S1 Text. Auxiliary variables used in the multiple imputation model.

(DOCX)

Click here for additional data file.^{(18.5KB, docx)}

Abbreviations

AUC: area under the curve
CPRD: Clinical Practice Research Datalink
DCA: decision curve analysis
EHR: electronic health record
ISAC: Independent Scientific Advisory Committee
NLR: negative likelihood ratio
NPV: negative predictive value
ONS: Office for National Statistics
PLR: positive likelihood ratio
PMM: predictive mean matching
PPV: positive predictive value
Sm: symptoms-only model
SNB: standardised net benefit
STm: symptoms and tests model
Tm: tests-only model
UWL: unexpected weight loss

Data Availability

This study is based on CPRD data and is subject to a full licence agreement which does not permit data sharing outside of the research team.

Funding Statement

BDN was supported by National Institute for Health Research (NIHR) Doctoral Research Fellowship number (DRF-2015-08-18). FDRH and RP acknowledges part-funding from the NIHR Oxford Medtech and In-Vitro Diagnostics Co-operative (MIC). FDRH, RP, and PA acknowledge part-funding from the NIHR Oxford and Thames Valley Applied Research Collaboration (ARC). FDRH, JO, RP, and PA acknowledge part-funding from the NIHR Oxford Biomedical Research Centre (BRC). PA is an NIHR senior investigator. RP acknowledges part-funding from the National Institute for Health Research (NIHR Programme Grant for Applied Research) and the Oxford Martin School. WH is co- Principal Investigator of the multi-institutional CanTest Research Collaborative funded by a Cancer Research UK Population Research Catalyst award (C8640/A23385). The views expressed are those of the authors and not necessarily those of the NHS, the NIHR, or the Department of Health. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

1.Nicholson BD, Aveyard P, Hamilton W, Hobbs FDR. When should unexpected weight loss warrant further investigation to exclude cancer? BMJ. 2019;366:l5271. doi: 10.1136/bmj.l5271 [DOI] [PubMed] [Google Scholar]
2.Nicholson BD, Hamilton W, Koshiaris C, Oke JL, Hobbs FDR, Aveyard P. The association between unexpected weight loss and cancer diagnosis in primary care: a matched cohort analysis of 65,000 presentations. Br J Cancer. 2020;122(12):1848–56. doi: 10.1038/s41416-020-0829-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Nicholson BD, Hamilton W, O’Sullivan J, Aveyard P, Hobbs FR. Weight loss as a predictor of cancer in primary care: a systematic review and meta-analysis. Br J Gen Pract. 2018;68(670):e311–e22. doi: 10.3399/bjgp18X695801 [DOI] [PMC free article] [PubMed] [Google Scholar]
4.NICE. Suspected cancer: recognition and referral (NG12). National Institute for Health and Care Excellence; 2015 [cited 2021 Jun 14]. Available from: https://www.nice.org.uk/guidance/ng12. [PubMed]
5.Nicholson BD, Aveyard P, Price SJ, Hobbs FR, Koshiaris C, Hamilton W. Prioritising primary care patients with unexpected weight loss for cancer investigation: diagnostic accuracy study. BMJ. 2020;370:m2651. doi: 10.1136/bmj.m2651 [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Nicholson BD, Oke JL, Aveyard P, Hamilton WT, Hobbs FDR. Individual inflammatory marker abnormalities or inflammatory marker scores to identify primary care patients with unexpected weight loss for cancer investigation? Br J Cancer. 2021. doi: 10.1038/s41416-021-01282-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Watson J, de Salis I, Hamilton W, Salisbury C. I’m fishing really’—inflammatory marker testing in primary care: a qualitative study. Br J Gen Pract. 2016;66(644):e200–6. doi: 10.3399/bjgp16X683857 [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Nicholson BD, Perera R, Thompson MJ. The elusive diagnosis of cancer: testing times. Br J Gen Pract. 2018;68(676):510–1. doi: 10.3399/bjgp18X699461 [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Baicus C, Rimbas M, Baicus A, Caraiola S. Cancer and involuntary weight loss: failure to validate a prediction score. PLoS ONE. 2014;9(4):e95286. doi: 10.1371/journal.pone.0095286 [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Hernandez JL, Matorras P, Riancho JA, Gonzalez-Macias J. Involuntary weight loss without specific symptoms: a clinical prediction score for malignant neoplasm. QJM. 2003;96(9):649–55. doi: 10.1093/qjmed/hcg107 [DOI] [PubMed] [Google Scholar]
11.Nicholson BD, Aveyard P, Hobbs FDR, Smith M, Fuller A, Perera R, et al. Weight loss as a predictor of cancer and serious disease in primary care: an ISAC-approved CPRD protocol for a retrospective cohort study using routinely collected primary care data from the UK. Diagn Progn Res. 2018;2:1. doi: 10.1186/s41512-017-0019-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Benchimol EI, Smeeth L, Guttmann A, Harron K, Moher D, Petersen I, et al. The REporting of studies Conducted using Observational Routinely-collected health Data (RECORD) statement. PLoS Med. 2015;12(10):e1001885. doi: 10.1371/journal.pmed.1001885 [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Nicholson BD, Aveyar P, Bankhea CR, Hamilto W, Hobb FDR, Lay-Flurrie S. Determinants and extent of weight recording in UK primary care: an analysis of 5 million adults’ electronic health records from 2000 to 2017. BMC Med. 2019;17(1):222. doi: 10.1186/s12916-019-1446-y [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Nicholson BD, Aveyard P, Hamilton W, Bankhead CR, Koshiaris C, Stevens S, et al. The internal validation of weight and weight change coding using weight measurement data within the UK primary care Electronic Health Record. Clin Epidemiol. 2019;11:145–55. doi: 10.2147/CLEP.S189989 [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Herrett E, Gallagher AM, Bhaskaran K, Forbes H, Mathur R, van Staa T, et al. Data Resource Profile: Clinical Practice Research Datalink (CPRD). Int J Epidemiol. 2015;44(3):827–36. doi: 10.1093/ije/dyv098 [DOI] [PMC free article] [PubMed] [Google Scholar]
16.van Burren S. Flexible Imputation of Missing Data. 2nd ed. Chapman & Hall; 2018. doi: 10.1186/s12942-018-0127-y [DOI] [Google Scholar]
17.Royston P. Multiple Imputation of Missing Values. Stata J. 2004;4(3):227–41. [Google Scholar]
18.Morris TP, White IR, Royston P. Tuning multiple imputation by predictive mean matching and local residual draws. BMC Med Res Methodol. 2014;14:75. doi: 10.1186/1471-2288-14-75 [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Baracos VE, Martin L, Korc M, Guttridge DC, Fearon KCH. Cancer-associated cachexia. Nat Rev Dis Primers. 2018;4:17105. doi: 10.1038/nrdp.2017.105 [DOI] [PubMed] [Google Scholar]
20.Fearon K, Strasser F, Anker SD, Bosaeus I, Bruera E, Fainsinger RL, et al. Definition and classification of cancer cachexia: an international consensus. Lancet Oncol. 2011;12(5):489–95. doi: 10.1016/S1470-2045(10)70218-7 [DOI] [PubMed] [Google Scholar]
21.van Smeden M, Moons KG, de Groot JA, Collins GS, Altman DG, Eijkemans MJ, et al. Sample size for binary logistic prediction models: Beyond events per variable criteria. Stat Methods Med Res. 2019;28(8):2455–74. doi: 10.1177/0962280218784726 [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Luque-Fernandez MA, Redondo-Sánchez D. Maringe C. cvauroc: Command to compute cross-validated area under the curve for ROC analysis after predictive modeling for binary outcomes. Stata J. 2019;19(3):615–25. [Google Scholar]
23.Ensor J, Snell K, Martin E. PMCALPLOT: Stata module to produce calibration plot of prediction model performance. Statistical Software Components S458486. Boston College Department of Economics; 2018. [Google Scholar]
24.Van Calster B, Wynants L, Verbeek JFM, Verbakel JY, Christodoulou E, Vickers AJ, et al. Reporting and Interpreting Decision Curve Analysis: A Guide for Investigators. Eur Urol. 2018;74(6):796–804. doi: 10.1016/j.eururo.2018.08.038 [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Vickers AJ, Van Calster B, Steyerberg EW. Net benefit approaches to the evaluation of prediction models, molecular markers, and diagnostic tests. BMJ. 2016;352:i6. doi: 10.1136/bmj.i6 [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Kerr KF, Brown MD, Zhu K, Janes H. Assessing the Clinical Impact of Risk Prediction Models With Decision Curves: Guidance for Correct Interpretation and Appropriate Use. J Clin Oncol. 2016;34(21):2534–40. doi: 10.1200/JCO.2015.65.5654 [DOI] [PMC free article] [PubMed] [Google Scholar]
27.D’Andrea D, Soria F, Zehetmayer S, Gust KM, Korn S, Witjes JA, et al. Diagnostic accuracy, clinical utility and influence on decision-making of a methylation urine biomarker test in the surveillance of non-muscle-invasive bladder cancer. BJU Int. 2019;123(6):959–67. doi: 10.1111/bju.14673 [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Bonnett LJ, Snell KIE, Collins GS, Riley RD. Guide to presenting clinical prediction models for use in clinical settings. BMJ. 2019;365:l737. doi: 10.1136/bmj.l737 [DOI] [PubMed] [Google Scholar]
29.Morris TP, White IR, Carpenter JR, Stanworth SJ, Royston P. Combining fractional polynomial model building with multiple imputation. Stat Med. 2015;34(25):3298–317. doi: 10.1002/sim.6553 [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Whiting PF, Rutjes AW, Westwood ME, Mallett S, Deeks JJ, Reitsma JB, et al. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med. 2011;155(8):529–36. doi: 10.7326/0003-4819-155-8-201110180-00009 [DOI] [PubMed] [Google Scholar]
31.Price SJ, Stapley SA, Shephard E, Barraclough K, Hamilton WT. Is omission of free text records a possible source of data loss and bias in Clinical Practice Research Datalink studies? A case-control study. BMJ Open. 2016;6(5):e011664. doi: 10.1136/bmjopen-2016-011664 [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Bailey SE, Ukoumunne OC, Shephard EA, Hamilton W. Clinical relevance of thrombocytosis in primary care: a prospective cohort study of cancer incidence using English electronic medical records and cancer registry data. Br J Gen Pract. 2017;67(659):e405–e13. doi: 10.3399/bjgp17X691109 [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Dolan RD, Lim J, McSorley ST, Horgan PG, McMillan DC. The role of the systemic inflammatory response in predicting outcomes in patients with operable cancer: Systematic review and meta-analysis. Sci Rep. 2017;7(1):16717. doi: 10.1038/s41598-017-16955-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Dolan RD, McSorley ST, Horgan PG, Laird B, McMillan DC. The role of the systemic inflammatory response in predicting outcomes in patients with advanced inoperable cancer: Systematic review and meta-analysis. Crit Rev Oncol Hematol. 2017;116:134–46. doi: 10.1016/j.critrevonc.2017.06.002 [DOI] [PubMed] [Google Scholar]
35.Hamilton W, Lancashire R, Sharp D, Peters TJ, Cheng KK, Marshall T. The importance of anaemia in diagnosing colorectal cancer: a case-control study using electronic primary care records. Br J Cancer. 2008;98(2):323–7. doi: 10.1038/sj.bjc.6604165 [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Merriel SW, Carroll R, Hamilton F, Hamilton W. Association between unexplained hypoalbuminaemia and new cancer diagnoses in UK primary care patients. Fam Pract. 2016;33(5):449–52. doi: 10.1093/fampra/cmw051 [DOI] [PubMed] [Google Scholar]
37.Newsome PN, Cramb R, Davison SM, Dillon JF, Foulerton M, Godfrey EM, et al. Guidelines on the management of abnormal liver blood tests. Gut. 2018;67(1):6–19. doi: 10.1136/gutjnl-2017-314924 [DOI] [PMC free article] [PubMed] [Google Scholar]
38.Smellie WSA, Forth J, Ryder S, Galloway MJ, Wood AC, Watson ID. Best practice in primary care pathology: review 5. J Clin Pathol. 2006;59(12):1229–37. doi: 10.1136/jcp.2006.037754 [DOI] [PMC free article] [PubMed] [Google Scholar]
39.Watson J, Salisbury C, Banks J, Whiting P, Hamilton W. Predictive value of inflammatory markers for cancer diagnosis in primary care: a prospective cohort study using electronic health records. Br J Cancer. 2019;120(11):1045–51. doi: 10.1038/s41416-019-0458-x [DOI] [PMC free article] [PubMed] [Google Scholar]
40.LeDell E, Petersen M, van der Laan M. Computationally efficient confidence intervals for cross-validated area under the ROC curve estimates. Electron J Stat. 2015;9(1):1583–607. doi: 10.1214/15-EJS1035 [DOI] [PMC free article] [PubMed] [Google Scholar]
41.Riley R, van der Windt D, Croft P, Moons KG. Prognosis Research in Healthcare: Concepts, Methods, and Impact. Oxford Univerity Press; 2019. [Google Scholar]
42.Riley RD, Ensor J, Snell KI, Debray TP, Altman DG, Moons KG, et al. External validation of clinical prediction models using big datasets from e-health records or IPD meta-analysis: opportunities and challenges. BMJ. 2016;i3140:353. doi: 10.1136/bmj.i3140 [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Hippisley-Cox J, Coupland C. Symptoms and risk factors to identify women with suspected cancer in primary care: derivation and validation of an algorithm. Br J Gen Pract. 2013;63(606):e11–21. doi: 10.3399/bjgp13X660733 [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Hippisley-Cox J, Coupland C. Symptoms and risk factors to identify men with suspected cancer in primary care: derivation and validation of an algorithm. Br J Gen Pract. 2013;63(606):e1–10. doi: 10.3399/bjgp13X660724 [DOI] [PMC free article] [PubMed] [Google Scholar]
45.Chiang PP, Glance D, Walker J, Walter FM, Emery JD. Implementing a QCancer risk tool into general practice consultations: an exploratory study using simulated consultations with Australian general practitioners. Br J Cancer. 2015;112(Suppl 1):S77–83. doi: 10.1038/bjc.2015.46 [DOI] [PMC free article] [PubMed] [Google Scholar]
46.Price S, Spencer A, Medina-Lara A, Hamilton W. Availability and use of cancer decision-support tools: a cross-sectional survey of UK primary care. Br J Gen Pract. 2019;69(684):e437–e43. doi: 10.3399/bjgp19X703745 [DOI] [PMC free article] [PubMed] [Google Scholar]
47.Moons KGM, Altman DG, Reitsma JB, Ioannidis JPA, Macaskill P, Steyerberg EW, et al. Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD): Explanation and Elaboration. Ann Intern Med. 2015;162(1):W1–W73. doi: 10.7326/M14-0698 [DOI] [PubMed] [Google Scholar]

PLoS Med. doi: 10.1371/journal.pmed.1003728.r001

Decision Letter 0

Richard Turner

19 Mar 2021

Dear Dr Nicholson,

Thank you for submitting your manuscript entitled "Combinations of simple blood tests to identify primary care patients with unexpected weight loss for cancer investigation." for consideration by PLOS Medicine for our upcoming Special Issue.

Your manuscript has now been evaluated by the PLOS Medicine editorial staff and by the Guest Editors, and I am writing to let you know that we would like to send your submission out for external assessment.

However, before we can send your manuscript to reviewers, we need you to complete your submission by providing the metadata that is required for full assessment. To this end, please login to Editorial Manager where you will find the paper in the 'Submissions Needing Revisions' folder on your homepage. Please click 'Revise Submission' from the Action Links and complete all additional questions in the submission questionnaire.

Please re-submit your manuscript within two working days, i.e. by .

Once your full submission is complete, your paper will undergo a series of checks in preparation for external assessment.

Feel free to email us at plosmedicine@plos.org if you have any queries relating to your submission.

Kind regards,

Richard Turner, PhD

Senior Editor, PLOS Medicine

rturner@plos.org

PLoS Med. doi: 10.1371/journal.pmed.1003728.r002

Decision Letter 1

Richard Turner

21 Apr 2021

Dear Dr. Nicholson,

Thank you very much for submitting your manuscript "Combinations of simple blood tests to identify primary care patients with unexpected weight loss for cancer investigation." (PMEDICINE-D-21-01229R1) for consideration at PLOS Medicine.

Your paper was discussed among the editors and evaluated by the Guest Editors for the Special issue. Comments from independent reviewers, including a statistical reviewer, are appended at the bottom of this email and any accompanying reviewer attachments can be seen via the link below:

[LINK]

In light of these reviews, we will not be able to accept the manuscript for publication in the journal in its current form, but we would like to invite you to submit a revised version that addresses the reviewers' and editors' comments fully. You will appreciate that we cannot make a decision about publication until we have seen the revised manuscript and your response, and we expect to seek re-review by one or more of the reviewers.

In revising the manuscript for further consideration, your revisions should address the specific points made by each reviewer and the editors. Please also check the guidelines for revised papers at http://journals.plos.org/plosmedicine/s/revising-your-manuscript for any that apply to your paper. In your rebuttal letter you should indicate your response to the reviewers' and editors' comments, the changes you have made in the manuscript, and include either an excerpt of the revised text or the location (eg: page and line number) where each change can be found. Please submit a clean version of the paper as the main article file; a version with changes marked should be uploaded as a marked up manuscript.

In addition, we request that you upload any figures associated with your paper as individual TIF or EPS files with 300dpi resolution at resubmission; please read our figure guidelines for more information on our requirements: http://journals.plos.org/plosmedicine/s/figures. While revising your submission, please upload your figure files to the PACE digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at PLOSMedicine@plos.org.

We hope to receive your revised manuscript by May 12 2021 11:59PM. Please email us (plosmedicine@plos.org) if you have any questions or concerns.

***Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.***

We ask every co-author listed on the manuscript to fill in a contributing author statement, making sure to declare all competing interests. If any of the co-authors have not filled in the statement, we will remind them to do so when the paper is revised. If all statements are not completed in a timely fashion this could hold up the re-review process. If new competing interests are declared later in the revision process, this may also hold up the submission. Should there be a problem getting one of your co-authors to fill in a statement we will be in contact. YOU MUST NOT ADD OR REMOVE AUTHORS UNLESS YOU HAVE ALERTED THE EDITOR HANDLING THE MANUSCRIPT TO THE CHANGE AND THEY SPECIFICALLY HAVE AGREED TO IT. You can see our competing interests policy here: http://journals.plos.org/plosmedicine/s/competing-interests.

Please use the following link to submit the revised manuscript:

https://www.editorialmanager.com/pmedicine/

Your article can be found in the "Submissions Needing Revision" folder.

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Please ensure that the paper adheres to the PLOS Data Availability Policy (see http://journals.plos.org/plosmedicine/s/data-availability), which requires that all data underlying the study's findings be provided in a repository or as Supporting Information. For data residing with a third party, authors are required to provide instructions with contact information for obtaining the data. PLOS journals do not allow statements supported by "data not shown" or "unpublished results." For such statements, authors must provide supporting data or cite public sources that include it.

Please let me know if you have any questions, and we look forward to receiving your revised manuscript.

Sincerely,

Richard Turner, PhD

Senior Editor, PLOS Medicine

rturner@plos.org

-----------------------------------------------------------

Requests from the editors:

We believe that the case for publication in PLOS Medicine would be enhanced if you were able to include an element of external validation, as suggested by one referee.

To your data statement, please add contact information for those wishing to access data from CPRD.

To your title, please add a study descriptor following a colon, e.g., "...: A diagnostic accuracy study".

In the "Methods and findings" subsection of your abstract, please quote summary demographic details for study participants, and mention the recruitment period.

Please add a new final sentence to the "Methods and findings" subsection of your abstract, which should begin "Study limitations include ..." or similar and should quote 2-3 of the study's main limitations.

We ask you to revisit the wording of the "Conclusions" subsection of your abstract, and similar statements elsewhere in the paper, e.g., at the end of the main text, which seem to us somewhat too strong. We suggest "Our findings suggest that combinations ... could be used ..." or similar.

After the abstract, please add a new and accessible "Author summary" section in non-identical prose. You may find it helpful to consult one or two recent research papers published in PLOS Medicine to get a sense of the preferred style.

Please avoid the phrase "discriminated excellently".

Please substitute "sex" for "gender" where appropriate, e.g., in table 1.

Throughout the text, please format reference call-outs as follows: "... clinical practice [8,9]." (noting the absence of spaces within the square brackets).

Please remove the information on funding and competing interests from the end of the main text. In the event of publication, this will appear in the article metadata, via entries in the submission form.

Please move the information on ethics approval from the end of the main text to the Methods section.

In the reference list, please ensure that journal names are abbreviated correctly and consistently (e.g., "PLoS ONE", "PLoS Med.").

Noting references 6 & 27, please ensure that all references have full access details.

Please include the study protocol or prespecified analysis plan as a supplementary file, referred to in your Methods section. Please highlight analyses that were not prespecified.

Please include a completed checklist for the most appropriate reporting guideline, e.g., TRIPOD, labelled "S1_TRIPOD_Checklist" or similar and referred to as such in the Methods section of your main text.

In the checklist, please refer to individual items by section (e.g., "Methods") and paragraph number rather than by line or page numbers, as the latter generally change in the event of publication.

Comments from the reviewers:

*** Reviewer #1:

This study aims to derive and internally validate prediction models using co-occurring risk factors, symptoms, signs and blood test data to identify clinical features that could be used together to stratify cancer risk in patients attending primary care with UWL.

Comments:

"The protocol has been published" and "We followed the TRIPOD and RECORD reporting guidelines".

Can the authors please supply the protocol and reporting checklists in the supplementary material?

"Multiple imputation was used to replace missing values for smoking status, alcohol intake, body mass index, and blood tests using the mi suite of commands in Stata (15, 16). Fifty imputed datasets were created. "

Did the authors undertake any sensitivity analyses for their missing value approach?

"Three prediction models were derived in the complete dataset: a symptoms only model (Sm), a symptoms and tests model (STm), and a simple tests only model (Tm). The mim Stata command was used to select variables for each model in the imputed data using backwards stepwise logistic regression, using a p-value of <0.01 for inclusion."

A technically appropriate statistical approach has been applied.

Internal cross-validation and conducting sensitivity analyses refitting models using continuous data, are rigorous and robust methodologies.

Overall, this is a clearly laid out and well written study. The authors have suitably acknowledged the main limitations, and present a thorough range of analyses in the supplementary material.

*** Reviewer #2:

It is important to develop predictive tools that can help clinicians select patients with unexpected weight loss for further testing. I applaud the authors for conducting this important work. I however have some comments and concerns:

1. How did you define UWL? In the first paragraph of the Methods section you write: "…12 months of data before their first UWL code (the "index date").". What is a "UWL code"? Some ICD-code? I suggest you specify.

2. For the outcome, you only consider cancer diagnosed within the first 6 months after the UWL diagnosis. This appears short to me. I see many cancer patients in my clinic for whom the diagnostic workup has been longer than 6 months. I know you found the mean time between UWL and cancer diagnosis in your previous study to be 181 days (reference 2 in your paper). I nevertheless suggest you consider to use, at least in a sensitivity analysis, 12 months or longer follow-up.

3. You used multiple imputation with 50 imputed datasets. For some variables, you had a very high proportion of missing (eg, 77% for CRP). Is 50 imputed datasets really enough? I suggest you motivate in your paper why you only used 50 imputed datasets.

4. You use 10-fold internal cross-validation to assess model performance. To properly internally validate the model, shouldn't you validate the whole model development process including variable selection? That is, if you use 10-fold cross validation, you should in each of the 10 runs re-select the variables, fit the model and test it in the 10th fold. And then finally average the performance. You could also consider a split-sample approach.

5. Why not test the performance of the previous models (reference 8, 9, 41, 42) in your data set? If you do that, you would have a chance to i) externally validate their models, and ii) benchmark your model against previous models.

6. I think it is a rather major concern that your dataset/CPRD represents only 6.9% of the UK population, you have a very high proportion of missing (ie, 70-80%) for key predictors, and there are obvious concerns with over-fitting. The value and impact of your study would increase substantially with external validation. I thus strongly suggest you try to find a dataset for external validation. If this is not possible, I suggest you add a paragraph discussing these limitations and clearly state that additional research is needed before your tool can be used in clinical practice.

*** Reviewer #3:

No comments basically

Very well written; very important analyses, and especially the explanation box1 quite helpful for many (lay) readers

Should authors define UWL somewhere

Typing error? Age 0+ in discussion

Should results lead to clear GP guidelines ? (It is not advisable to have individual GPS make their own cut-offs

***

Any attachments provided with reviews can be seen via the following link:

[LINK]

PLoS Med. 2021 Aug 31;18(8):e1003728. doi: 10.1371/journal.pmed.1003728.r003

Author response to Decision Letter 1

12 May 2021

Attachment

Submitted filename: Response to reviews.docx

Click here for additional data file.^{(124.3KB, docx)}

PLoS Med. doi: 10.1371/journal.pmed.1003728.r004

Decision Letter 2

Beryne Odeny

14 Jun 2021

Dear Dr. Nicholson,

Thank you very much for re-submitting your manuscript "Combining simple blood tests to identify primary care patients with unexpected weight loss for cancer investigation: clinical risk score development, internal validation, and net benefit analysis." (PMEDICINE-D-21-01229R2) for review by PLOS Medicine.

I have discussed the paper with my colleagues and the academic editor and it was also seen again by two reviewers. I am pleased to say that provided the remaining editorial and production issues are dealt with we are planning to accept the paper for publication in the journal.

The remaining issues that need to be addressed are listed at the end of this email. Any accompanying reviewer attachments can be seen via the link below. Please take these into account before resubmitting your manuscript:

[LINK]

In revising the manuscript for further consideration here, please ensure you address the specific points made by each reviewer and the editors. In your rebuttal letter you should indicate your response to the reviewers' and editors' comments and the changes you have made in the manuscript. Please submit a clean version of the paper as the main article file. A version with changes marked must also be uploaded as a marked up manuscript file.

Please also check the guidelines for revised papers at http://journals.plos.org/plosmedicine/s/revising-your-manuscript for any that apply to your paper. If you haven't already, we ask that you provide a short, non-technical Author Summary of your research to make findings accessible to a wide audience that includes both scientists and non-scientists. The Author Summary should immediately follow the Abstract in your revised manuscript. This text is subject to editorial change and should be distinct from the scientific abstract.

We expect to receive your revised manuscript within 1 week. Please email us (plosmedicine@plos.org) if you have any questions or concerns.

We ask every co-author listed on the manuscript to fill in a contributing author statement. If any of the co-authors have not filled in the statement, we will remind them to do so when the paper is revised. If all statements are not completed in a timely fashion this could hold up the re-review process. Should there be a problem getting one of your co-authors to fill in a statement we will be in contact. YOU MUST NOT ADD OR REMOVE AUTHORS UNLESS YOU HAVE ALERTED THE EDITOR HANDLING THE MANUSCRIPT TO THE CHANGE AND THEY SPECIFICALLY HAVE AGREED TO IT.

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript.

Please note, when your manuscript is accepted, an uncorrected proof of your manuscript will be published online ahead of the final version, unless you've already opted out via the online submission form. If, for any reason, you do not want an earlier version of your manuscript published online or are unsure if you have already indicated as such, please let the journal staff know immediately at plosmedicine@plos.org.

If you have any questions in the meantime, please contact me or the journal staff on plosmedicine@plos.org.

We look forward to receiving the revised manuscript by Jun 17 2021 11:59PM.

Sincerely,

Beryne Odeny,

Associate Editor

PLOS Medicine

plosmedicine.org

------------------------------------------------------------

Requests from Editors:

Thank you for your responses. Before we proceed, please address the following:

1. Given the lack of external validation, please temper the phrases “excellent discrimination” and “highly discriminative”, by removing the words “excellent” and “highly.” For example, use objective terms such as “reasonably discriminative”, “discriminative” or similar.

2. Please do not report P<0.01; report as P < 0.001.

3. Please define the abbreviations in Table 4: AST, ALT, CRP

4. Please use the "Vancouver" style for reference formatting and see our website for other reference guidelines.

5. Please include a weblink and access date for ref # 4

6. Please include line numbers in your next draft

Comments from Reviewers:

Reviewer #1: The authors have satisfactorily responded to each comment in turn, amending the manuscript accordingly.

Reviewer #2: I thank the authors for their responses to my comments and questions. The responses are scientifically satisfactory to me. That said, the major limitation of this study (ie, no external validation) remains.

Any attachments provided with reviews can be seen via the following link:

[LINK]

PLoS Med. 2021 Aug 31;18(8):e1003728. doi: 10.1371/journal.pmed.1003728.r005

Author response to Decision Letter 2

14 Jun 2021

Attachment

Submitted filename: Response to reviews 2.docx

Click here for additional data file.^{(20.9KB, docx)}

PLoS Med. doi: 10.1371/journal.pmed.1003728.r006

Decision Letter 3

Beryne Odeny

12 Jul 2021

Dear Dr Nicholson,

On behalf of my colleagues and the Academic Editors, Drs. Sara-Jane Dawson, Charles Swanton, and Chris Abbosch, am pleased to inform you that we have agreed to publish your manuscript "Combining simple blood tests to identify primary care patients with unexpected weight loss for cancer investigation: clinical risk score development, internal validation, and net benefit analysis." (PMEDICINE-D-21-01229R3) in PLOS Medicine.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. Please be aware that it may take several days for you to receive this email; during this time no action is required by you. Once you have received these formatting requests, please note that your manuscript will not be scheduled for publication until you have made the required changes.

In the meantime, please log into Editorial Manager at http://www.editorialmanager.com/pmedicine/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production process.

PRESS

We frequently collaborate with press offices. If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximise its impact. If the press office is planning to promote your findings, we would be grateful if they could coordinate with medicinepress@plos.org. If you have not yet opted out of the early version process, we ask that you notify us immediately of any press plans so that we may do so on your behalf.

We also ask that you take this opportunity to read our Embargo Policy regarding the discussion, promotion and media coverage of work that is yet to be published by PLOS. As your manuscript is not yet published, it is bound by the conditions of our Embargo Policy. Please be aware that this policy is in place both to ensure that any press coverage of your article is fully substantiated and to provide a direct link between such coverage and the published work. For full details of our Embargo Policy, please visit http://www.plos.org/about/media-inquiries/embargo-policy/.

Thank you again for submitting to PLOS Medicine. We look forward to publishing your paper.

Sincerely,

Beryne Odeny

Associate Editor

PLOS Medicine

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 TRIPOD Checklist

(DOCX)

Click here for additional data file.^{(96.3KB, docx)}

S1 Fig. Multiple imputation diagnostics for the first 20 imputations for continuous blood test variables.

(DOCX)

Click here for additional data file.^{(1.3MB, docx)}

S2 Fig. Cross-validation plots for continuous STm model and the Tm-only model.

(DOCX)

Click here for additional data file.^{(161.4KB, docx)}

S1 Table. Common laboratory test ranges in general practice.

(DOCX)

Click here for additional data file.^{(32.9KB, docx)}

S2 Table. Full specification of the continuous STm and the Tm. STm, symptoms and test model; Tm, tests-only model.

(DOCX)

Click here for additional data file.^{(26.1KB, docx)}

S3 Table. SNB and reduction in further investigation comparing models and investigating all patients or none.

SNB = net benefit / prevalence. SNB, standardised net benefit; STm, symptoms and test model; Sm, symptoms-only model; Tm, tests-only model.

(DOCX)

Click here for additional data file.^{(25.8KB, docx)}

S4 Table. STm risk score thresholds expressed per 100,000 patients with UWL investigated. STm, symptoms and test model; UWL, unexpected weight loss.

(DOCX)

Click here for additional data file.^{(23.1KB, docx)}

S5 Table. Tm risk score thresholds expressed per 100,000 patients with UWL investigated. Tm, tests-only model; UWL, unexpected weight loss.

(DOCX)

Click here for additional data file.^{(22.7KB, docx)}

S1 Text. Auxiliary variables used in the multiple imputation model.

(DOCX)

Click here for additional data file.^{(18.5KB, docx)}

Attachment

Submitted filename: Response to reviews.docx

Click here for additional data file.^{(124.3KB, docx)}

Attachment

Submitted filename: Response to reviews 2.docx

Click here for additional data file.^{(20.9KB, docx)}

Data Availability Statement

This study is based on CPRD data and is subject to a full licence agreement which does not permit data sharing outside of the research team.

[pmed.1003728.ref001] 1.Nicholson BD, Aveyard P, Hamilton W, Hobbs FDR. When should unexpected weight loss warrant further investigation to exclude cancer? BMJ. 2019;366:l5271. doi: 10.1136/bmj.l5271 [DOI] [PubMed] [Google Scholar]

[pmed.1003728.ref002] 2.Nicholson BD, Hamilton W, Koshiaris C, Oke JL, Hobbs FDR, Aveyard P. The association between unexpected weight loss and cancer diagnosis in primary care: a matched cohort analysis of 65,000 presentations. Br J Cancer. 2020;122(12):1848–56. doi: 10.1038/s41416-020-0829-3 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref003] 3.Nicholson BD, Hamilton W, O’Sullivan J, Aveyard P, Hobbs FR. Weight loss as a predictor of cancer in primary care: a systematic review and meta-analysis. Br J Gen Pract. 2018;68(670):e311–e22. doi: 10.3399/bjgp18X695801 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref004] 4.NICE. Suspected cancer: recognition and referral (NG12). National Institute for Health and Care Excellence; 2015 [cited 2021 Jun 14]. Available from: https://www.nice.org.uk/guidance/ng12. [PubMed]

[pmed.1003728.ref005] 5.Nicholson BD, Aveyard P, Price SJ, Hobbs FR, Koshiaris C, Hamilton W. Prioritising primary care patients with unexpected weight loss for cancer investigation: diagnostic accuracy study. BMJ. 2020;370:m2651. doi: 10.1136/bmj.m2651 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref006] 6.Nicholson BD, Oke JL, Aveyard P, Hamilton WT, Hobbs FDR. Individual inflammatory marker abnormalities or inflammatory marker scores to identify primary care patients with unexpected weight loss for cancer investigation? Br J Cancer. 2021. doi: 10.1038/s41416-021-01282-4 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref007] 7.Watson J, de Salis I, Hamilton W, Salisbury C. I’m fishing really’—inflammatory marker testing in primary care: a qualitative study. Br J Gen Pract. 2016;66(644):e200–6. doi: 10.3399/bjgp16X683857 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref008] 8.Nicholson BD, Perera R, Thompson MJ. The elusive diagnosis of cancer: testing times. Br J Gen Pract. 2018;68(676):510–1. doi: 10.3399/bjgp18X699461 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref009] 9.Baicus C, Rimbas M, Baicus A, Caraiola S. Cancer and involuntary weight loss: failure to validate a prediction score. PLoS ONE. 2014;9(4):e95286. doi: 10.1371/journal.pone.0095286 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref010] 10.Hernandez JL, Matorras P, Riancho JA, Gonzalez-Macias J. Involuntary weight loss without specific symptoms: a clinical prediction score for malignant neoplasm. QJM. 2003;96(9):649–55. doi: 10.1093/qjmed/hcg107 [DOI] [PubMed] [Google Scholar]

[pmed.1003728.ref011] 11.Nicholson BD, Aveyard P, Hobbs FDR, Smith M, Fuller A, Perera R, et al. Weight loss as a predictor of cancer and serious disease in primary care: an ISAC-approved CPRD protocol for a retrospective cohort study using routinely collected primary care data from the UK. Diagn Progn Res. 2018;2:1. doi: 10.1186/s41512-017-0019-9 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref012] 12.Benchimol EI, Smeeth L, Guttmann A, Harron K, Moher D, Petersen I, et al. The REporting of studies Conducted using Observational Routinely-collected health Data (RECORD) statement. PLoS Med. 2015;12(10):e1001885. doi: 10.1371/journal.pmed.1001885 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref013] 13.Nicholson BD, Aveyar P, Bankhea CR, Hamilto W, Hobb FDR, Lay-Flurrie S. Determinants and extent of weight recording in UK primary care: an analysis of 5 million adults’ electronic health records from 2000 to 2017. BMC Med. 2019;17(1):222. doi: 10.1186/s12916-019-1446-y [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref014] 14.Nicholson BD, Aveyard P, Hamilton W, Bankhead CR, Koshiaris C, Stevens S, et al. The internal validation of weight and weight change coding using weight measurement data within the UK primary care Electronic Health Record. Clin Epidemiol. 2019;11:145–55. doi: 10.2147/CLEP.S189989 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref015] 15.Herrett E, Gallagher AM, Bhaskaran K, Forbes H, Mathur R, van Staa T, et al. Data Resource Profile: Clinical Practice Research Datalink (CPRD). Int J Epidemiol. 2015;44(3):827–36. doi: 10.1093/ije/dyv098 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref016] 16.van Burren S. Flexible Imputation of Missing Data. 2nd ed. Chapman & Hall; 2018. doi: 10.1186/s12942-018-0127-y [DOI] [Google Scholar]

[pmed.1003728.ref017] 17.Royston P. Multiple Imputation of Missing Values. Stata J. 2004;4(3):227–41. [Google Scholar]

[pmed.1003728.ref018] 18.Morris TP, White IR, Royston P. Tuning multiple imputation by predictive mean matching and local residual draws. BMC Med Res Methodol. 2014;14:75. doi: 10.1186/1471-2288-14-75 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref019] 19.Baracos VE, Martin L, Korc M, Guttridge DC, Fearon KCH. Cancer-associated cachexia. Nat Rev Dis Primers. 2018;4:17105. doi: 10.1038/nrdp.2017.105 [DOI] [PubMed] [Google Scholar]

[pmed.1003728.ref020] 20.Fearon K, Strasser F, Anker SD, Bosaeus I, Bruera E, Fainsinger RL, et al. Definition and classification of cancer cachexia: an international consensus. Lancet Oncol. 2011;12(5):489–95. doi: 10.1016/S1470-2045(10)70218-7 [DOI] [PubMed] [Google Scholar]

[pmed.1003728.ref021] 21.van Smeden M, Moons KG, de Groot JA, Collins GS, Altman DG, Eijkemans MJ, et al. Sample size for binary logistic prediction models: Beyond events per variable criteria. Stat Methods Med Res. 2019;28(8):2455–74. doi: 10.1177/0962280218784726 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref022] 22.Luque-Fernandez MA, Redondo-Sánchez D. Maringe C. cvauroc: Command to compute cross-validated area under the curve for ROC analysis after predictive modeling for binary outcomes. Stata J. 2019;19(3):615–25. [Google Scholar]

[pmed.1003728.ref023] 23.Ensor J, Snell K, Martin E. PMCALPLOT: Stata module to produce calibration plot of prediction model performance. Statistical Software Components S458486. Boston College Department of Economics; 2018. [Google Scholar]

[pmed.1003728.ref024] 24.Van Calster B, Wynants L, Verbeek JFM, Verbakel JY, Christodoulou E, Vickers AJ, et al. Reporting and Interpreting Decision Curve Analysis: A Guide for Investigators. Eur Urol. 2018;74(6):796–804. doi: 10.1016/j.eururo.2018.08.038 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref025] 25.Vickers AJ, Van Calster B, Steyerberg EW. Net benefit approaches to the evaluation of prediction models, molecular markers, and diagnostic tests. BMJ. 2016;352:i6. doi: 10.1136/bmj.i6 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref026] 26.Kerr KF, Brown MD, Zhu K, Janes H. Assessing the Clinical Impact of Risk Prediction Models With Decision Curves: Guidance for Correct Interpretation and Appropriate Use. J Clin Oncol. 2016;34(21):2534–40. doi: 10.1200/JCO.2015.65.5654 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref027] 27.D’Andrea D, Soria F, Zehetmayer S, Gust KM, Korn S, Witjes JA, et al. Diagnostic accuracy, clinical utility and influence on decision-making of a methylation urine biomarker test in the surveillance of non-muscle-invasive bladder cancer. BJU Int. 2019;123(6):959–67. doi: 10.1111/bju.14673 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref028] 28.Bonnett LJ, Snell KIE, Collins GS, Riley RD. Guide to presenting clinical prediction models for use in clinical settings. BMJ. 2019;365:l737. doi: 10.1136/bmj.l737 [DOI] [PubMed] [Google Scholar]

[pmed.1003728.ref029] 29.Morris TP, White IR, Carpenter JR, Stanworth SJ, Royston P. Combining fractional polynomial model building with multiple imputation. Stat Med. 2015;34(25):3298–317. doi: 10.1002/sim.6553 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref030] 30.Whiting PF, Rutjes AW, Westwood ME, Mallett S, Deeks JJ, Reitsma JB, et al. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med. 2011;155(8):529–36. doi: 10.7326/0003-4819-155-8-201110180-00009 [DOI] [PubMed] [Google Scholar]

[pmed.1003728.ref031] 31.Price SJ, Stapley SA, Shephard E, Barraclough K, Hamilton WT. Is omission of free text records a possible source of data loss and bias in Clinical Practice Research Datalink studies? A case-control study. BMJ Open. 2016;6(5):e011664. doi: 10.1136/bmjopen-2016-011664 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref032] 32.Bailey SE, Ukoumunne OC, Shephard EA, Hamilton W. Clinical relevance of thrombocytosis in primary care: a prospective cohort study of cancer incidence using English electronic medical records and cancer registry data. Br J Gen Pract. 2017;67(659):e405–e13. doi: 10.3399/bjgp17X691109 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref033] 33.Dolan RD, Lim J, McSorley ST, Horgan PG, McMillan DC. The role of the systemic inflammatory response in predicting outcomes in patients with operable cancer: Systematic review and meta-analysis. Sci Rep. 2017;7(1):16717. doi: 10.1038/s41598-017-16955-5 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref034] 34.Dolan RD, McSorley ST, Horgan PG, Laird B, McMillan DC. The role of the systemic inflammatory response in predicting outcomes in patients with advanced inoperable cancer: Systematic review and meta-analysis. Crit Rev Oncol Hematol. 2017;116:134–46. doi: 10.1016/j.critrevonc.2017.06.002 [DOI] [PubMed] [Google Scholar]

[pmed.1003728.ref035] 35.Hamilton W, Lancashire R, Sharp D, Peters TJ, Cheng KK, Marshall T. The importance of anaemia in diagnosing colorectal cancer: a case-control study using electronic primary care records. Br J Cancer. 2008;98(2):323–7. doi: 10.1038/sj.bjc.6604165 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref036] 36.Merriel SW, Carroll R, Hamilton F, Hamilton W. Association between unexplained hypoalbuminaemia and new cancer diagnoses in UK primary care patients. Fam Pract. 2016;33(5):449–52. doi: 10.1093/fampra/cmw051 [DOI] [PubMed] [Google Scholar]

[pmed.1003728.ref037] 37.Newsome PN, Cramb R, Davison SM, Dillon JF, Foulerton M, Godfrey EM, et al. Guidelines on the management of abnormal liver blood tests. Gut. 2018;67(1):6–19. doi: 10.1136/gutjnl-2017-314924 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref038] 38.Smellie WSA, Forth J, Ryder S, Galloway MJ, Wood AC, Watson ID. Best practice in primary care pathology: review 5. J Clin Pathol. 2006;59(12):1229–37. doi: 10.1136/jcp.2006.037754 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref039] 39.Watson J, Salisbury C, Banks J, Whiting P, Hamilton W. Predictive value of inflammatory markers for cancer diagnosis in primary care: a prospective cohort study using electronic health records. Br J Cancer. 2019;120(11):1045–51. doi: 10.1038/s41416-019-0458-x [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref040] 40.LeDell E, Petersen M, van der Laan M. Computationally efficient confidence intervals for cross-validated area under the ROC curve estimates. Electron J Stat. 2015;9(1):1583–607. doi: 10.1214/15-EJS1035 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref041] 41.Riley R, van der Windt D, Croft P, Moons KG. Prognosis Research in Healthcare: Concepts, Methods, and Impact. Oxford Univerity Press; 2019. [Google Scholar]

[pmed.1003728.ref042] 42.Riley RD, Ensor J, Snell KI, Debray TP, Altman DG, Moons KG, et al. External validation of clinical prediction models using big datasets from e-health records or IPD meta-analysis: opportunities and challenges. BMJ. 2016;i3140:353. doi: 10.1136/bmj.i3140 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref043] 43.Hippisley-Cox J, Coupland C. Symptoms and risk factors to identify women with suspected cancer in primary care: derivation and validation of an algorithm. Br J Gen Pract. 2013;63(606):e11–21. doi: 10.3399/bjgp13X660733 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref044] 44.Hippisley-Cox J, Coupland C. Symptoms and risk factors to identify men with suspected cancer in primary care: derivation and validation of an algorithm. Br J Gen Pract. 2013;63(606):e1–10. doi: 10.3399/bjgp13X660724 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref045] 45.Chiang PP, Glance D, Walker J, Walter FM, Emery JD. Implementing a QCancer risk tool into general practice consultations: an exploratory study using simulated consultations with Australian general practitioners. Br J Cancer. 2015;112(Suppl 1):S77–83. doi: 10.1038/bjc.2015.46 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref046] 46.Price S, Spencer A, Medina-Lara A, Hamilton W. Availability and use of cancer decision-support tools: a cross-sectional survey of UK primary care. Br J Gen Pract. 2019;69(684):e437–e43. doi: 10.3399/bjgp19X703745 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pmed.1003728.ref047] 47.Moons KGM, Altman DG, Reitsma JB, Ioannidis JPA, Macaskill P, Steyerberg EW, et al. Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD): Explanation and Elaboration. Ann Intern Med. 2015;162(1):W1–W73. doi: 10.7326/M14-0698 [DOI] [PubMed] [Google Scholar]

PERMALINK

Combining simple blood tests to identify primary care patients with unexpected weight loss for cancer investigation: Clinical risk score development, internal validation, and net benefit analysis

Brian D Nicholson

Paul Aveyard

Constantinos Koshiaris

Rafael Perera

Willie Hamilton

Jason Oke

F D Richard Hobbs

Roles

Abstract

Background

Methods and findings

Conclusions

Author summary

Why was this study done?

What did the researchers do and find?

What do these findings mean?

Introduction

Methods

Cohort design and population

Outcome definition

Predictor variables

Table 1. Baseline characteristics of the derivation cohort and imputation dataset.

Table 2. Sm—Symptoms-only model.

Multiple imputation

Model derivation

Table 3. STm—Symptoms and test model.

Table 4. Tm—Tests-only model.

Internal cross-validation

Decision curve analysis

Clinical risk scores

Sensitivity analysis

Results

Study participants

Missing data

Model development

Internal validation

Table 5. Prediction model discrimination and calibration statistics.

Fig 1.

Decision curve analysis

Fig 2.

Examples of applying the clinical risk scores

Fig 3. Full breakdown of STm model performance.

Fig 4. Full breakdown of Tm model performance.

Box 1. Consequences of applying the STm score in clinical practice (Tables 3 and S4)

Clinical example

Example of maximising sensitivity to rule-out cancer

Example of a threshold to balance ruling-in and ruling-out cancer

Example of a threshold close to the NICE PPV threshold of 3%

Discussion

Summary of findings

Strengths and limitations

Findings in context

Implications for research and clinical practice

Conclusions

Disclaimer

Supporting information

Abbreviations

Data Availability

Funding Statement

References

Decision Letter 0

Richard Turner

Roles

Decision Letter 1

Richard Turner

Roles

Author response to Decision Letter 1

Decision Letter 2

Beryne Odeny

Roles

Author response to Decision Letter 2

Decision Letter 3

Beryne Odeny

Roles

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS