Skip to main content
JNCI Journal of the National Cancer Institute logoLink to JNCI Journal of the National Cancer Institute
. 2023 Jan 27;115(5):514–522. doi: 10.1093/jnci/djad017

Agreement between individual and neighborhood income measures in patients with colorectal cancer in Canada

Laura E Davis 1,, Alyson L Mahar 2, Erin C Strumpf 3,4
PMCID: PMC10165486  PMID: 36708004

Abstract

Introduction

With increasing interest in income-related differences in cancer outcomes, accurate measurement of income is imperative. Misclassification of income can result in wrong conclusions as to the presence of income inequalities. We determined misclassification between individual- and neighborhood-level income and their association with overall survival among colorectal cancer (CRC) patients.

Methods

The Canadian Census Health and Environment Cohorts were used to identify CRC patients diagnosed from 1992 to 2017. We used neighborhood income quintiles from Statistics Canada and created individual income quintiles from the same data sources to be as similar as possible. Agreement between individual and neighborhood income quintiles was measured using cross-tabulations and weighted kappa statistics. Cox proportional hazards and Lin semiparametric hazards models were used to determine the effects of individual and neighborhood income independently and jointly on survival. Analyses were also stratified by rural residence.

Results

A total of 103 530 CRC patients were included in the cohort. There was poor agreement between individual and neighborhood income with only 17% of respondents assigned to the same quintile (weighted kappa = 0.18). Individual income had a greater effect on relative and additive survival than neighborhood income when modeled separately. The interaction between individual and neighborhood income demonstrated that the most at risk for poor survival were those in the lowest individual and neighborhood income quintiles. Misclassification was more likely to occur for patients residing in rural areas.

Conclusion

Cancer researchers should avoid using neighborhood income as a proxy for individual income, especially among patients with cancers with demonstrated inequalities by income.


Colorectal cancer (CRC) is the third most commonly diagnosed cancer in men and women and the second leading cause of cancer death worldwide (1,2). For patients with CRC, income effects every step along the cancer continuum. Individuals experiencing low income are more likely to participate in health behaviors, such as smoking and inactivity, have increased incidence of CRC, lower screening rates, later stage at diagnosis, and poorer survival (3-5).

Accurate measurement of income is required to understand cancer inequalities and target interventions to reduce inequities. However, the choice of income measure is often determined by data availability instead of appropriateness. For example, individual-level income measurements are rarely available in population-based datasets; therefore, researchers often use area-based measures (6). This is especially common in cancer studies; for example, a recent systematic review of socioeconomic status and cancer survival found that only 15 of 66 studies examining socioeconomic status (excluding education) measured individual socioeconomic status (7). Using area-level income measurements as a proxy for individual-level income is potentially problematic for several reasons. If the area is large or heterogeneous, the association observed at the area level will not reflect the association at the individual level (8). In addition, the mechanistic pathways through which individual- and area-level socioeconomic factors effect outcomes may differ, resulting in different interpretations and interventions (9-11). For example, individual income might affect cancer outcomes through material and social resources, such as the ability to pay for peripheral costs of cancer care like parking and childcare (9,12). On the other hand, neighborhood income might influence cancer outcomes through features of the physical environment, such as proximity to cancer centers or neighborhood social supports (9,12).

Several studies have demonstrated poor agreement between individual and neighborhood income in the public health literature, but none have examined this in cancer specifically, where patients might be at a higher risk of experiencing low income (10,13-17). These studies often fail to compare the same measure of income at the individual and neighborhood level resulting in inaccurate comparisons (10,12,15). Moreover, few studies consider geography as a confounding characteristic. This is especially important because heterogenous geographical areas might increase the chance of ecologic fallacy (10,13,15,18).

In this study we aimed to 1) estimate misclassification between income measured at the individual and neighborhood level across rural and nonrural residences and 2) measure the association of individual and neighborhood income independently and jointly on overall survival in a cohort of CRC patients.

Methods

Data and study population

This was a retrospective cohort study of a subset of participants diagnosed with CRC from the 1991-2011 Canadian Census Health and Environment Cohorts (CanCHEC). The CanCHEC is a national population-based cohort derived by Statistics Canada that links census, vital statistics, and cancer registry databases to follow individuals who respond to the long form census for mortality and cancer outcomes (19). The Canadian Cancer Registry collects cancer data on all Canadians with cancer, except Quebec after 2010 (20). The census is performed every 5 years in Canada, and approximately 1 in every 5 households completes the long form census. The current study used census data from 1991, 1996, 2001, and 2006 and the National Household Survey (NHS) in 2011. The NHS was a voluntary survey performed in 2011 instead of the census. Individuals are eligible to complete the long form census or NHS if they were residents of Canada on census day and were not living in an institution, such as nursing homes, penitentiaries, or group homes. The postal code conversation file (PCCF+) was linked to the CanCHEC using the participants’ postal code at census to obtain neighborhood income quintiles and rural residence.

We included Canadians aged 35 years and older with a new diagnosis of CRC (International Classification of Diseases–O-3 codes: C180, C182-C189, C199, C209) between January 1, 1992, and December 31, 2017. Individuals had to have completed at least one long form census in the 10 years prior to diagnosis (census cycles 1991-2011). A 10-year time frame was chosen to maximize sample size and minimize any changes in income or residence that may have occurred between the census and diagnosis. In cases where participants completed more than 1 census, the closest questionnaire to the cancer diagnosis was used. Individuals with a missing date of diagnosis, missing postal code at census, postal code that could not be linked to the PCCF+, and those with a death date before their diagnosis date were excluded.

Exposure

Income was measured at the individual and neighborhood level. We created the individual-level income measure to be as consistent as possible with neighborhood-level income, which was obtained from the PCCF+ (summarized in Table 1).

Table 1.

Individual and neighborhood income definitionsa

Income definition concepts Individual income Neighborhood income
Data source Census (self-reported in 1991, 1996, and 2001 and linked to tax records in 2006 and 2011) PCCF+ (uses Statistics Census Profile Data)
Date of measurement Date of census Census year using postal code at time of census
Income Total income for all household members Total income in each EA and DA (calculated by multiplying the median income in each EA and DA by the number of households)
Household size adjustment Statistics Canada single person equivalents Sum of the single person equivalents of the EA and DA (calculated by multiplying the number of households by the single person equivalence separately for each household size of 1 to 5 or more family members and summing)
Before or after tax Before tax Before tax
Quintiles Created by authors in each weighted census population by CMA, CA, other region Created by Statistics Canada in each weighted census population by CMA, CA, other region
a

CA = census agglomeration; CMA = census metropolitan area; DA = dissemination area; EA = enumeration area; PCCF+ = postal code conversation file.

Individual income was defined as adjusted before-tax household income using the long form census. All sources of income from the previous calendar year were summed for each household and adjusted for number of household members using the single person equivalence scales from Statistics Canada’s low-income cutoffs (21). Before-tax income was used for consistency across all census years as after-tax income was only available in 2011. Income was self-reported in years 1991, 1996, and 2001 with the option to consent to income tax linkage in 2006 and 2011. All individuals included in the cohort consented to the use of income tax linkage. Continuous individual income was adjusted to 2011 Canadian dollars using the Statistics Canada consumer price index (22). Individual income quintiles were created by ordering individual household incomes within each census metropolitan area (CMA), census agglomeration (CA), and other region by province and census year within the full-weighted census cohort, as opposed to within the CRC cohort. We then divided each category into 5 equal groups to create quintiles specific for each CMA, CA, and other region. CMAs are large urban areas of at least 100 000 people, CAs are smaller areas of <10 000 population, and other regions incorporate urban fringe and rural areas (23). CMA- and CA-specific quintiles take into account differences in cost of living across regions.

Neighborhood income quintiles were created by Statistics Canada for the PCCF+. The PCCF+ neighborhood income is the most widely used area-level income measure in Canada (24). The PCCF+ uses Statistics Canada Census Profile Data at the dissemination area (DA) or enumeration area (EA) level to calculate area-level adjusted household income (25). EAs and DAs are Statistics Canada’s smallest geographical area representing approximately 400-700 individuals per area (26). Total income for each DA and EA is calculated by multiplying the median income of that area by the total number of households. This number is then adjusted for household size by dividing by the sum of the single-person equivalents of the DA and EA to obtain median household income per single person equivalent for each DA and EA. CMA and CA quintiles are then constructed by ranking DAs and EAs within each CMA, CA, and other region by province from lowest to highest then dividing into fifths (27). Statistics Canada did not create a neighborhood income quintile variable in 2011 because of the use of the NHS instead of the census, therefore we assigned 2006 neighborhood income quintiles to individuals who responded to the 2011 census.

Outcome

Death from any cause was defined according to the vital statistics database. Follow-up time was defined as the number of days from the date of CRC diagnosis in the CCR to the date of death from any cause, end of study (December 31, 2019), or loss to follow-up. Loss to follow-up was defined as those without a death date who had at least 4 years of consecutively missing postal codes without a returning postal code.

Covariates

Individual characteristics obtained from the census were age at census and sex. Tumor location (colon or rectum) and stage at diagnosis were obtained from the CCR (28). Stage was only presented for individuals diagnosed in 2010 or after, as this was when the CCR started prioritizing the routine collection of stage data for lung, colorectal, breast, and prostate cancers (29). Geographic characteristics were measured at the time of census and included province or territory of residence, whether the person had moved in the 5 years previous to the census, and residence in a rural area, defined according to the PCCF+ as residing in a census subdivision with a population of less than 1000 and a population density of less than 400 persons per square kilometer.

Statistical analysis

We examined misclassification between individual and neighborhood income quintiles using cross tabulations overall and stratified by rural residence. Weighted kappa statistics were calculated to determine the degree of nonrandom concordance between the 2 income measures (30). Continuous individual income was described within individual and neighborhood income quintiles. The cohort was described using means, medians, and interquartile ranges (IQRs) for continuous variables and proportions for categorical variables, overall, by census year and by individual and neighborhood income quintiles. In keeping with data confidentiality guidelines from Statistics Canada, number of observations is rounded to the nearest 5.

Cox proportional hazards regression and Lin semiparametric additive risk models were used to determine the relative and additive associations between exposures and overall survival. Four models were specified to estimate the association of individual income, neighborhood income, individual and neighborhood income, and the interaction between individual income and neighborhood income with survival. The interaction models included terms for individual income and neighborhood income, and the interaction term between the two. Models were stratified by rural and nonrural residence. The reference for all models was the highest income quintile 5. Multivariable models were adjusted for age, sex, tumor location, census year, and province or territory at census. We reported adjusted and unadjusted hazard ratios (HRs) or risk differences (RD) for additional deaths per 1000 person years and their 95% confidence intervals (CIs), and statistical significance was considered at P > .05. Proportional hazards were evaluated for all models through graphical diagnostics of the weighted Schoenfeld residuals for the Cox models and the Kolmogorov-Smirnov and Cramer von Mises tests for the additive risk models. A sensitivity analysis was performed stratifying the cox proportional hazards models by stage at diagnosis to see if associations differed by stage.

Ethics approval was obtained from the McGill University research ethics board, and Statistics Canada. SAS (version 9.4) was used for all analyses except the additive risk regression models, which were analyzed using the timereg package in R (31).

Results

Study cohort

There were 122 040 adults aged 35 years or older with a first CRC diagnosis between 1992 and 2017 and who responded to the long form census in the 10 years before their cancer diagnosis. Exclusions included 14 145 with a missing postal code on the date of census, 4345 with a postal code that could not be linked in the PCCF+ and therefore did not have information on neighborhood income, and 20 with a death date before their diagnosis date, resulting in a final cohort of 103 530. The average time from measurement of income to diagnosis was 4.9 (SD = 3) years and was similar across individual and neighborhood income quintiles. There were some differences in individual and tumor characteristics across census years (Supplementary Table 1, available online).

Misclassification of individual and neighborhood income

Overall agreement between individual and neighborhood income quintiles at the time of census was poor (weighted kappa = 0.18). Of the patients, 17% were assigned to the same individual and neighborhood income quintile, and 54% were assigned either to the same quintile or one above or below. Individuals residing in rural areas were more likely to be misclassified than those in urban residences, with a weighted kappa of 0.11 compared with 0.20 in urban areas (Table 2; Supplementary Table 2, available online). The range of individual income in 2011 Canadian dollars across individual income quintiles was wider, and the variation within quintiles was smaller compared with neighborhood income quintiles (Figure 1). Median individual income within individual income quintiles varied from $18 187 (IQR = $14 051-$21 591) in quintile 1 (Q1) to $93 902 (IQR= $78 975-$117 947) in quintile 5 (Q5), whereas neighborhood quintiles varied from $30 451 (IQR = $19 960-$47 626) in Q1 to $54 882 (IQR = $33 608-$83 200) in Q5.

Table 2.

Agreement between individual and neighborhood income quintilesa

Individual income quintile Neighborhood income quintile, No. (%)
Total
Q1 Q2 Q3 Q4 Q5
Q1 7480 (32.3) 5380 (23.3) 4210 (18.2) 3430 (14.8) 2640 (11.4) 23 140
Q2 5375 (22.9) 5520 (23.5) 4965 (21.1) 4255 (18.1) 3395 (14.4) 23 505
Q3 3520 (17.8) 4335 (21.9) 4200 (21.2) 4170 (21.1) 3565 (18.0) 19 790
Q4 2545 (13.8) 3540 (19.3) 3905 (21.3) 4215 (22.9) 4170 (22.7) 18 375
Q5 1860 (9.9) 2635 (14.1) 3455 (18.5) 4330 (23.1) 6435 (34.4) 18 715
Total 20 780 21 410 20 735 20 400 20 205 10 3530
a

Q = quintiles.

Figure 1.

Figure 1.

Individual income distribution by individual and neighborhood income quintiles (whiskers represent 10th and 90th percentiles as opposed to minimum and maximum values because of data confidentiality). Q = quintiles; CAD = Canadian Dollars.

Characteristics by individual and neighborhood income

A greater number of individuals with CRC were categorized in the lowest individual income quintile (Q1 = 23 145 vs Q5 = 18 715), whereas individuals were evenly spread across neighborhood income quintiles (Q1 = 20 780 vs Q5 = 20 205) (Table 3). Patients in the lowest individual income quintile were older and more likely to be female compared with those in the highest income quintile (median age = 68 years in Q1 vs 61 years in Q5; Q1 = 54% female vs Q5 = 37% female), whereas age and sex distributions were similar across neighborhood income quintiles. Patients in Q1 for both individual and neighborhood income were more likely to die during the follow-up period compared with Q5, with a greater difference by individual income (Q1 = 71% vs Q5 = 51% for individual income and Q1 = 66% vs Q5 = 57% for neighborhood income).

Table 3.

Individual characteristics by individual and neighborhood income quintiles

Characteristics Individual income quintiles
Neighborhood income quintiles
Q1 (lowest quintile, n = 23 145) Q2 (n = 23 505) Q3 (n = 19 790) Q4 (n = 19 275) Q5 (highest quintile, n = 18 715) Q1 (lowest quintile, n = 20 780) Q2 (n = 21 410) Q3 (n = 20 735) Q4 (n = 20 400) Q5 (highest quintile, n = 20 205)
Variable No. (%) No. (%) No. (%) No. (%) No. (%) No. (%) No. (%) No. (%) No. (%) No. (%)
Age at census, mean (SD), y 68 (12) 67 (12) 64 (12) 62 (12) 61 (11) 65 (12) 65 (12) 65 (12) 64 (12) 64 (12)
 Age at census
  35-44 y 1035 (4.7) 1095 (4.7) 1210 (6.1) 1165 (6.3) 1050 (5.6) 1065 (5.1) 1025 (4.8) 1140 (5.5) 1185 (5.8) 1140 (5.7)
  45-54 y 2220 (9.6) 2175 (9.3) 2800 (14.1) 3585 (19.5) 4465 (23.9) 2730 (13.1) 2865 (13.4) 3105 (15.0) 3245 (15.9) 3295 (16.3)
  55-64 y 4900 (21.2) 4410 (18.8) 4980 (25.2) 5400 (29.4) 6245 (33.4) 4945 (23.8) 5160 (24.1) 5205 (25.1) 5340 (26.2) 5280 (26.1)
  65-74 y 7470 (32.3) 8840 (37.6) 6510 (32.9) 5100 (27.8) 4280 (22.9) 6630 (31.9) 6925 (32.4) 6355 (30.7) 6200 (30.4) 6085 (30.1)
  75-84 y 5960 (25.7) 5810 (24.7) 3570 (18.0) 2580 (14.0) 2195 (11.7) 4350 (20.9) 4460 (20.8) 4050 (19.5) 3625 (17.8) 3630 (18.0)
  85 y or older 1560 (6.7) 1180 (5.0) 725 (3.7) 550 (3.0) 485 (2.6) 1065 (5.1) 975 (4.6) 880 (4.2) 810 (4.0) 770 (3.8)
Sex
 Male 10 680 (41.2) 13 230 (56.3) 11 730 (59.3) 11 040 (60.1) 11 840 (63.3) 11 255 (54.2) 11 930 (55.7) 11 740 (56.6) 11 810 (57.9) 11 785 (58.3)
 Female 12 460 (53.8) 10 275 (43.7) 8060 (40.7) 7340 (39.9) 6875 (36.7) 9525 (45.8) 9475 (44.3) 9000 (43.4) 8590 (42.1) 8420 (41.7)
Province or territory at census
Newfoundland and Labrador 545 (2.4) 670 (2.8) 520 (2.6) 495 (2.7) 445 (2.4) 535 (2.6) 555 (2.6) 535 (2.6) 575 (2.8) 480 (2.4)
 Prince Edward Island 155 (0.7) 105 (0.5) 100 (0.5) 125 (0.7) 120 (0.6) 125 (0.6) 105 (0.5) 125 (0.6) 140 (0.7) 110 (0.6)
 Nova Scotia 960 (4.2) 990 (4.2) 850 (4.3) 745 (4.1) 810 (4.3) 900 (4.3) 855 (4.0) 880 (4.2) 860 (4.2) 865 (4.3)
 New Brunswick 655 (2.8) 735 (3.1) 585 (3.0) 480 (2.6) 525 (2.8) 665 (3.2) 580 (2.7) 610 (3.0) 570 (2.8) 555 (2.8)
 Quebec 5035 (21.8) 4810 (20.5) 3730 (18.8) 3320 (18.1) 3455 (18.5) 4525 (21.8) 4425 (20.7) 3995 (12.3) 3725 (18.3) 3675 (18.2)
 Ontario 8935 (38.6) 9185 (39.1) 7905 (39.9) 7575 (41.2) 7625 (40.7) 7615 (36.7) 8375 (39.1) 8330 (40.2) 8390 (41.1) 8510 (42.1)
 Manitoba 970 (4.2) 1010 (4.3) 940 (4.7) 845 (4.6) 880 (4.7) 890 (4.3) 1045 (4.9) 965 (4.7) 915 (4.5) 815 (4.0)
 Saskatchewan 815 (3.5) 915 (3.9) 735 (3.7) 690 (3.8) 705 (3.8) 800 (3.8) 810 (3.8) 785 (3.8) 760 (3.7) 710 (3.5)
 Alberta 2080 (9.0) 1850 (7.9) 1630 (8.2) 1465 (8.0) 1535 (8.2) 1745 (8.4) 1760 (8.2) 1760 (8.5) 1655 (8.1) 1640 (8.1)
 British Columbia 2910 (12.6) 3160 (13.4) 2730 (13.8) 2555 (13.9) 2540 (13.6) 2910 (14.0) 2830 (13.2) 2645 (12.8) 2740 (13.4) 2780 (13.8)
 Territories combined 75 (0.3) 80 (0.3) 70 (0.4) 75 (0.4) 80 (0.4) 70 (0.3) 70 (0.3) 105 (0.5) 75 (0.4) 65 (0.3)
Rural (pop < 1000)
 Urban 17 505 (75.6) 17 560 (74.7) 15 170 (76.7) 14 270 (77.7) 14 495 (77.4) 15 935 (76.7) 16 470 (76.9) 15 610 (75.3) 15 410 (75.5) 15 580 (77.1)
 Rural 5640 (24.4) 5945 (25.3) 4620 (23.3) 4105 (22.3) 4220 (22.6) 4845 (23.3) 4940 (23.1) 5130 (24.7) 4995 (24.5) 4625 (22.9)
Tumor location
 Rectal 7525 (32.5) 7665 (32.6) 6820 (34.4) 6275 (34.1) 6530 (34.9) 7005 (33.7) 7195 (33.6) 6960 (33.6) 6915 (33.9) 6740 (33.4)
 Colon 15 615 (67.5) 15 840 (67.4) 12 975 (65.6) 12 105 (65.9) 12 180 (65.1) 13 775 (66.3) 14 215 (66.4) 13 780 (66.4) 13 485 (66.1) 13 465 (66.7)
Status
 Alive at end of follow-up 6750 (29.2) 8035 (34.2) 8160 (41.21) 8485 (46.2) 9205 (59.2) 7090 (34.1) 8015 (37.4) 8240 (39.7) 8580 (42.1) 8705 (43.1)
 Died 16 395 (70.8) 15 475 (65.8) 11 635 (58.8) 9890 (53.8) 9510 (50.8) 13 690 (65.9) 13 395 (62.6) 12 495 (60.3) 11 820 (57.9) 11 500 (56.9)
Stage at diagnosisa
 Stage 0-I 930 (18.8) 1285 (20.5) 1235 (21.9) 1190 (22.2) 1275 (23.8) 1030 (20.0) 1135 (20.5) 1185 (21.1) 1295 (22.7) 1270 (22.8)
 Stage II 1170 (23.7) 1515 (24.2) 1320 (23.4) 1220 (22.8) 1130 (21.0) 1250 (24.2) 1300 (23.5) 1275 (22.7) 1310 (23.0) 1225 (22.0)
 Stage III 1210 (24.5) 1555 (24.8) 1505 (26.7) 1450 (27.1) 1460 (27.3) 1315 (25.5) 1450 (26.2) 1475 (26.3) 1495 (26.2) 1450 (26.0)
 Stage IV 905 (18.4) 1130 (18.0) 965 (17.1) 875 (16.3) 920 (14.2) 910 (17.7) 960 (17.4) 1000 (17.8) 945 (16.5) 980 (17.5)
 Unknown 300 (6.0) 295 (4.7) 220 (3.9) 185 (3.5) 185 (3.4) 245 (4.8) 245 (4.4) 220 (3.9) 240 (4.2) 235 (4.2)
 Missing 430 (8.7) 490 (7.8) 405 (7.2) 430 (8.1) 395 (7.4) 405 (7.9) 445 (8.0) 460 (8.2) 420 (7.4) 420 (7.6)
a

Stage at diagnosis only for individuals diagnosed in 2010 or later. Q = quintile; Pop = population.

Relative and additive survival by individual and neighborhood income

Unadjusted individual income had a greater association with survival than neighborhood income when modeled separately (Q1 vs Q5 individual: HR = 1.69, RD = 50.74; neighborhood: HR = 1.20, RD = 19) (Figure 2, Table 4). After adjusting for individual covariates, the association of individual income on survival was attenuated, whereas neighborhood income remained similar (Q1 vs Q5 individual: HR = 1.36, RD = 32.23; neighborhood: HR = 1.20, RD = 19). This could indicate that confounders are not appropriately controlled for at the neighborhood level because of only controlling for individual-level covariates. To facilitate comparability between the 2 measures, unadjusted effects are reported moving forward. When both measures were included in the same model, the estimates for neighborhood income were attenuated, whereas individual income remained similar to its unadjusted estimate. This suggests that some, but not all, of the effect of neighborhood income on survival is accounted for by individual income.

Figure 2.

Figure 2.

Five-year Kaplan–Meier survival by individual and neighborhood income quintiles.

Table 4.

Cox proportional hazards regression and Lin semiparametric hazards regression for the association between income and survival

Model Cox regression model
Lin’s regression model
Unadjusted Adjusted Unadjusted Adjusted
HR (95% CI) HR (95% CI) RD (95% CI) per 1000 py RD (95% CI) per 1000 py
Individual (referent = Q5)
 Q1 1.69 (1.65 to 1.73) 1.36 (1.32 to 1.39) 50.74 (31.35 to 70.12) 32.23 (15.49 to 48.97)
 Q2 1.52 (1.48 to 1.56) 1.23 (1.20 to 1.26) 37.60 (19.42 to 55.77) 19.86 (4.19 to 35.52)
 Q3 1.26 (1.22 to 1.29) 1.12 (1.09 to 1.16) 17.99 (0.68 to 35.31) 9.45 (-5.36 to 24.26)
 Q4 1.10 (1.07 to 1.13) 1.06 (1.03 to 1.09) 6.86 (-9.74 to 23.46) 4.23 (-10.15 to 18.61)
Neighborhood (referent = Q5)
 Q1 1.20 (1.17 to 1.23) 1.20 (1.18 to 1.23) 23.83 (4.73 to 42.94) 19.00 (2.75 to 35.25)
 Q2 1.11 (1.08 to 1.13) 1.11 (1.08 to 1.14) 13.69 (-4.63 to 32.00) 9.45 (-6.15 to 25.05)
 Q3 1.09 (1.06 to 1.12) 1.09 (1.06 to 1.12) 8.98 (-9.34 to 27.29) 7.81 (-7.79 to 23.41)
 Q4 1.04 (1.02 to 1.07) 1.04 (1.02 to 1.07) 3.20 (-14.33 to 20.72) 3.60 (-11.28 to 18.48)
Individual and neighborhood
Individual (referent = Q5)
 Q1 1.64 (1.60 to 1.69) 1.32 (1.23 to 1.36) 47.82 (28.43 to 67.20) 29.50 (12.76 to 46.24)
 Q2 1.49 (1.45 to 1.53) 1.20 (1.17 to 1.24) 35.77 (17.60 to 53.94) 18.00 (2.34 to 33.66)
 Q3 1.24 (1.21 to 1.27) 1.11 (1.08 to 1.14) 16.83 (-0.49 to 34.14) 8.18 (-6.70 to 23.06)
 Q4 1.09 (1.06 to 1.12) 1.05 (1.02 to 1.08) 6.17 (-10.36 to 22.69) 3.47 (-10.92 to 17.86)
Neighborhood (referent = Q5)
 Q1 1.11 (1.08 to 1.14) 1.12 (1.10 to 1.15) 10.29 (-8.74 to 29.32) 11.50 (-4.89 to 27.89)
 Q2 1.05 (1.02 to 1.08) 1.06 (1.03 to 1.08) 4.38 (-13.72 to 22.48) 4.45 (-11.21 to 20.11)
 Q3 1.03 (1.00 to 1.06) 1.05 (1.03 to 1.08) 2.42 (-15.61 to 20.44) 4.27 (-11.39 to 19.93)
 Q4 1.00 (0.97 to 1.02) 1.02 (1.00 to 1.05) −0.61 (-17.85 to 16.63) 1.53 (-13.35 to 16.41)
Individuala neighborhood (referent = IQ5 and NQ5)
 IQ1 and NQ1 1.87 (1.79 to 1.95) 1.53 (1.47 to 1.60) 60.48 (56.20 to 64.77) 44.03 (34.65 to 53.40)
 IQ2 and NQ2 1.62 (1.55 to 1.70) 1.32 (1.25 to 1.38) 42.56 (38.17 to 46.94) 24.65 (14.75 to 34.56)
 IQ3 and NQ3 1.30 (1.23 to 1.37) 1.30 (1.23 to 1.38) 19.71 (15.42 to 23.99) 12.41 (2.61 to 22.21)
 IQ4 and NQ4 1.12 (1.06 to 1.18) 1.09 (1.04 to 1.15) 7.79 (3.84 to 11.75) 5.64 (-3.77 to 15.05)
a

Adjusted models control for age, sex, tumor location, census year, province or territory at census. CI = confidence interval; HR = hazard ratio; IQ = individual quintile; NQ = neighborhood quintile; py = person years; Q = quintile; RD = risk difference.

The overall P value for the interaction between individual and neighborhood income was not statistically significant (cox P = .55); however, the individual estimates suggest some multiplicative and additive effects of individual and neighborhood income with survival. Compared with those in individual and neighborhood Q5, the most at risk for poor survival were those in the lowest individual and neighborhood income quintiles (HR for individual Q1 and neighborhood Q1 = 1.87, RD = 60.48). The presence of multiplicative and additive interaction for patients categorized with the same individual and neighborhood income quintiles suggests that these 2 indicators are measuring different concepts resulting in a joint effect on survival.

After stratifying by rural residence, individual and neighborhood income had a smaller association with survival in rural areas compared with urban areas, and additive effects for neighborhood income were not statistically significant (Supplementary Table 3, available online). Similar patterns were observed after stratifying by stage at diagnosis (Supplementary Table 4, available online).

Discussion

We found weak agreement between individual- and neighborhood-level income in Canadian CRC patients, with even weaker agreement for those residing in rural areas. Although both individual and neighborhood income were associated with survival, individual income had a stronger effect on survival, with the estimate for neighborhood income crossing the null when looking at additive effects and in rural areas. Furthermore, the presence of joint effects of individual and neighborhood income on survival suggests that these 2 measures are acting independently and jointly on outcomes.

The results from our study are in line with other studies demonstrating low agreement between individual- and neighborhood-level income measures (10-14,32). Other studies examining misclassification of the PCCF+ neighborhood income found slightly stronger agreement (10,13,14). For example, 29% and 27% perfect agreement was found by Buajitti et al. (13) and Pichora et al. (10), respectively, compared with 17% found in our study. These studies compared individual income from the Canadian Community Health Survey, which is entirely self-reported, with neighborhood income from the PCCF+, which uses a combination of self-reported and tax-reported data from the long form census (10,13). Our study used the same measures from the same populations by using the census data for both individual and neighborhood income quintiles, which could explain the difference in agreement that we observed. Moreover, our study could indicate that agreement between individual and neighborhood income is worse in the CRC population compared with the general population.

We also demonstrated lower agreement and a slightly weaker effect of individual and neighborhood income on survival in rural areas compared with nonrural areas. This is likely because of the larger geographical areas assigned to rural regions, which result in more heterogenous populations compared with smaller urban neighborhoods. This strong heterogeneity likely results in an attenuation of the effect on survival. Our study also presents relative and additive effects of individual and neighborhood income on survival, whereas other studies present only relative effects. Reporting additive effects is especially important when examining inequalities by income because they provide information about the magnitude of inequalities in a population and are more relevant to policy makers (33,34).

Presently, cancer organizations worldwide are acknowledging the pervasive inequalities in cancer outcomes by income and are calling for decreased barriers and equitable access to cancer care (35-37). However, to create successful, evidence-based interventions to reduce health inequalities by income, it is imperative to have a strong understanding of the relationship between income and cancer outcomes. Using neighborhood income as a proxy for individual income could result in an underestimation or incorrect understanding of the pathways through which income effects outcomes. This can in turn result in ineffective interventions, or even no interventions if no effect is concluded with neighborhood income alone. Interpreting neighborhood income as a measure of the individuals’ environment is also challenging. The areas used in these measures are created from administrative units, ranging from half-block radiuses in large downtown centers to hundreds of kilometers in rural areas and do not reflect human mobility and living patterns (38,39). In cases in which researchers are interested in the effect of the physical environment on cancer outcomes, more specific exposures can be used such as distance to cancer centers or greenspace exposure (40,41).

Because neighborhood income has a small effect on survival outcomes, often nearing the null, researchers should be cautious about concluding an absence of inequalities by income when individual income is not also measured. We recommend that whenever possible, individual income measures be used in cancer studies. Neighborhood income should not be used as a proxy for individual income and should instead be interpreted as an area-level measure with the caveat that it represents different sized areas depending on rural residence. A call to data custodians to make individual measures available is required, especially for widely used cancer databases such as the Surveillance, Epidemiology, and End Results and the National Cancer Database (42,43). Data confidentiality is often used as an argument for why individual measures are not provided; however, some research centers are already providing individual measures of income. For example, Statistics Canada now provides access to T1 tax files for individuals responding to the Canadian Community Health Survey (44). For researchers who do not have access to individual measures, quantitative bias analysis using validation studies in similar populations is a relatively simple method that can be used to demonstrate the changes in effect that would occur if individual income had been available (45).

There are several limitations of this study. First, income was categorized into quintiles instead of measured as a continuous variable. Modeling continuous income would avoid arbitrary cut points and increase accuracy and efficiency of the analysis. However, because most research using administrative data categorizes income as quintiles, this study demonstrates the real-world use of income. Second, individual and neighborhood income were measured on average 5 years prior to cancer diagnosis. This should have no effect on the analysis for agreement because both indicators are measured at the same time but might introduce misclassification bias for the effect of income on survival. For example, individual income might increase or decrease over those 5 years, and neighborhood income might change if participants moved to a different neighborhood. Third, we did not have a large enough sample size to detect statistically significant effects for the interaction between individual and neighborhood income; however, because specific effect sizes were statistically significant, our conclusions do not change. Finally, these results may not be generalizable outside of a Canadian setting where neighborhoods might be defined in different ways; however, the conclusion of accurately defining and interpreting income exposures remains the same.

Understanding differences between individual and neighborhood income is becoming increasingly important as we shift from understanding inequalities to implementing interventions to address inequity. This study found very poor agreement between individual and neighborhood income quintiles and much stronger associations of individual income with survival compared with neighborhood income. Cancer researchers should avoid using neighborhood income as a proxy for individual income, especially in CRC where patients might be at a higher risk of experiencing inequalities by income. In the absence of individual income measurements, researchers can use quantitative bias analysis to demonstrate the change in effect that might have occurred if individual income was available. Future work should use validation and quantitative bias analyses to demonstrate how neighborhood income can be adjusted to reflect individual income.

Supplementary Material

djad017_Supplementary_Data

Contributor Information

Laura E Davis, Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, Canada.

Alyson L Mahar, Faculty of Health Sciences, School of Nursing, Queens University, Kingston, Canada.

Erin C Strumpf, Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, Canada; Department of Economics, McGill University, Montreal, Canada.

Funding

This study is support by grants from the Canadian Institutes of Health Research (CIHR), the Canadian Cancer Society (MEGAN-CAN), the Canadian Centre for Applied Research in Cancer Control (ARCC), and Fonds de recherche du Québec—Santé (FRQS).

Notes

Role of the funder: The funder did not play a role in the design of the study; the collection, analysis, and interpretation of the data; the writing of the manuscript; and the decision to submit the manuscript for publication.

Disclosures: The authors have no conflicts of interest.

Author contributions: Erin C. Strumpf (Conceptualization; Methodology; Supervision; Writing—review & editing), Alyson L. Mahar (Conceptualization; Methodology; Supervision; Writing—review & editing), Laura E. Davis (Conceptualization; Data curation; Formal analysis; Funding acquisition; Methodology; Visualization; Writing—original draft; Writing—review & editing).

Acknowledgements: This research was supported by funds to the Canadian Research Data Centre Network (CRDCN) from the Social Sciences and Humanities Research Council (SSHRC), the Canadian Institute for Health Research (CIHR), the Canadian Foundation for Innovation (CFI), and Statistics Canada. Although the research and analysis are based on data from Statistics Canada, the opinions expressed do not represent the views of Statistics Canada.

Data availability

The CanCHEC database is protected by Statistics Canada confidentiality policies and cannot be made publicly available. Access may be granted to those who meet prespecified criteria for confidential access available at https://www.statcan.gc.ca/en/microdata/data-centres/access.

References

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

djad017_Supplementary_Data

Data Availability Statement

The CanCHEC database is protected by Statistics Canada confidentiality policies and cannot be made publicly available. Access may be granted to those who meet prespecified criteria for confidential access available at https://www.statcan.gc.ca/en/microdata/data-centres/access.


Articles from JNCI Journal of the National Cancer Institute are provided here courtesy of Oxford University Press

RESOURCES