Markers of poverty or of low social class are associated with many diseases and potential causes of disease, but medical studies often fail to record sufficient information on socioeconomic status.1 Postcodes of individuals are, however, often available in Britain, and commercial software exists that estimates household income from the postcode alone. We assessed how informative postcode income estimates are, either about reported household income or about other characteristics related to social class in a large, population based survey of British residents.
Methods and results
The family resources survey involves personal interviews with members of private households in England, Scotland, and Wales selected by stratified clustered probability sample.2 During 1995-6, 26 445 (70%) of 37 712 eligible households gave answers to questions on socioeconomic characteristics. Reported weekly household income was taken as the sum of all sources of pretaxation income (excluding housing benefit) reported by household members. During 1985 to 1993 members of 11 million households, or about half of all households in Britain, provided information to a marketing company about annual income and gave a complete address that included a full postcode—that is, 6 or 7 characters.
This information was used to produce commercial software that estimates household incomes from postcodes. After adjustments for regional variation and for inflation in reported income levels, the pretaxation incomes of at least six households were used to calculate a weighted average income for that postcode. When there were fewer than six responses, the income information was combined with the data for respondents with neighbouring postcodes until a reliable estimate could be made. Parts of this database are updated annually. We compared household income estimates obtained by FIND (a commercially available software program) with information reported in the family resources survey. Matching of the data was carried out at the Office for National Statistics. The investigators in this study were provided with columns of numerical data without any personal identifiers.
The overall correlation coefficient between postcode estimates and reported values of weekly household income for 26 282 individuals was moderate (0.40, 99% confidence interval 0.39 to 0.42; 2P<0.0001). When households were ranked in three equal sized groups on the basis of postcode income estimates, there were substantial and highly significant differences in reported weekly income, duration of education, home ownership, membership of higher social classes, and access to various consumer goods (2P<0.0001 for each) (see table).
Comment
Postcode income estimates are easily available in Britain and can be useful markers of social class. As UK postcodes are usually shared by only 15 to 20 households,3 these estimates should more accurately predict the social class of individuals than can more aggregated data, such as information derived from enumeration districts or electoral wards,4 or, in the United States, from zip codes,1 which are usually based on several hundred households.
Consequently, postcode estimates may serve various epidemiological purposes, particularly when it is not feasible to collect detailed information on social class in large studies. Firstly, they can be used to help standardise for the effects of social class, which may be important when the occurrence of a disease and its possible risk factors are both related to poverty. Looking at the strength of an association before and after such statistical adjustment can suggest how much full adjustment for social class, if that were possible, would have modified the association. This comparison might then lead investigators to make more detailed measurements of social class in a subset of participants. Secondly, income estimates derived from postcodes might help to assess bias when a proportion of eligible people refuse to participate in a study. Most epidemiological reports show that people who fail to return questionnaires or to give blood samples are of lower social class than volunteers.5 The effects of such biases are difficult to establish, especially if surveying the non-respondents again is impracticable, but postcode income estimates can help with this and other aspects of medical research in Britain.
Table.
Characteristic | Thirds of income estimated by postcode
|
|||
---|---|---|---|---|
Low income (n=8760) | Middle income (n=8760) | High income (n=8762) | P value | |
Reported mean (SD) weekly income (£) | 229 (2) | 337 (3) | 533 (5) | <0.0001 |
Social class I-II by occupation | 690 (8) | 1773 (20) | 3553 (41) | <0.0001 |
Homeowner | 3586 (41) | 6306 (72) | 7342 (84) | <0.0001 |
Educated after age 16 | 1084 (13) | 2098 (24) | 3883 (45) | <0.0001 |
Access to: | ||||
Dishwasher | 504 (6) | 1385 (16) | 3074 (35) | <0.0001 |
Tumble dryer | 3634 (42) | 4294 (49) | 5141 (59) | <0.0001 |
Microwave oven | 5532 (63) | 6138 (70) | 6566 (75) | <0.0001 |
Washing machine | 7396 (84) | 7851 (90) | 8183 (93) | <0.0001 |
Freezer | 2488 (28) | 3436 (39) | 4296 (49) | <0.0001 |
Video | 6186 (71) | 6830 (78) | 7315 (84) | <0.0001 |
Colour television | 8329 (95) | 8464 (97) | 8552 (98) | <0.0001 |
Black and white television | 1709 (20) | 1800 (21) | 1857 (21) | <0.01 |
Telephone | 7402 (85) | 8154 (93) | 8537 (98) | <0.0001 |
Cooker | 8676 (99) | 8682 (99) | 8700 (99) | NS |
Acknowledgments
Emily Banks, Robert Clarke, Rory Collins, Jeyanthi John, and Martin Vessey commented helpfully. Charles Lound of the Office for National Statistics helped with data management, Rom Rahman of QAS Systems provided postcode estimates, and Ian Liddicoat of Market Information Consultancy provided details about “FIND” methodology.
Footnotes
Funding: JD was supported by Merton College and a Frohlich award.
Competing interests: None declared.
References
- 1.Davey Smith G, Neaton JD, Wentworth D, Stamler R, Stamler J.for the MRFIT Research Group. Mortality differences between black and white men in the USA: contribution of income and other risk factors among men screened for the MRFIT trial Lancet 1998351934–939. [DOI] [PubMed] [Google Scholar]
- 2.Semmence J, Easto V, Gault S, Hussain M, Fincham P, Hall P, et al. Family resources survey: Great Britain 1995-96. London: Stationery Office; 1997. [Google Scholar]
- 3.Gatrell AC. On the spatial representation and accuracy of address-based data in the United Kingdom. Int J Geographic Inform Syst. 1989;3:335–348. [Google Scholar]
- 4.Ben-Shlomo Y, White IR, Marmot M. Does the variation in the socioeconomic characteristics of an area affect mortality? BMJ. 1996;312:1013–1014. doi: 10.1136/bmj.312.7037.1013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Austin H, Hill HA, Flanders D, Greenburg R. Limitations in the application of the case-control methodology. Epidemiol Rev. 1994;16:65–76. doi: 10.1093/oxfordjournals.epirev.a036146. [DOI] [PubMed] [Google Scholar]