The United Kingdom prospective diabetes study (UKPDS) is seriously flawed, and its results do not justify a policy of aggressive treatment of type 2 diabetes.1,2 The study's flaws arise from changes that were made in the trial as it progressed.
Summary points
During the course of the UK prospective diabetes study the length of follow up was changed, and changes seem to have been made to the end points and the groups under analysis
These changes are not in keeping with accepted scientific principles and make the results of the trial suspect
An independent review of the trial's design and analysis is needed
The results of the study do not justify aggressive treatment of type 2 diabetes
Methods
I searched Medline (Ovid's CD version) from 1976 to March 2000, using various combinations of the keywords “UKPDS” and “prospective diabetes study” and the names of the principal authors of the study's reports. I reviewed the resulting articles for any description of the design of the UKPDS. I reviewed reference lists for other references.
Changes in the end points
The authors have presented various end points over the life of the study (box). The UKPDS grew originally out of the authors' interest in the use of basal rather than postprandial glucose in monitoring diabetes. They concluded, “A prospective controlled trial of different ways of obtaining basal normoglycaemia is needed to determine whether the improved control of mild diabetes is beneficial.”3 In the first report of the study, published in 1983, the authors set out their rationale: “If, after dietary therapy, the fasting plasma glucose continues to be raised, there is little information available to determine whether one should continue with diet alone, or add a sulphonylurea, biguanide or insulin.”4 Specific end points were not laid out in this original description but were set out the following year in a letter discussing the paper.4,5 The authors later argued (1999) that this “brief letter” had only “minor differences in wording.”6 It seems, however, that the letter was written specifically to clarify the vagueness and ambiguities of the 1983 paper: the authors wrote, “In the initial [1983] report, it was not feasible to mention all details of the study.”5
Changes in the end points and stopping points in the UK prospective diabetes study
| 19834 | 199510 |
| •No specific end points reported: “morbidity and excess mortality of the disease,” “complications including macrovascular events or retinopathy, nephropathy or peripheral neuropathy” | Main comparison and interdrug comparison |
| 19845 | • “Diabetes-related mortality—death from: heart attacks, sudden death, stroke, complications of peripheral vascular disease or amputations, renal failure, or hyperglycemic or hypoglycemic coma” |
| Main comparison and interdrug comparison | • “Diabetes-related mortality and major clinical endpoints including non-fatal myocardial infarct, clinical angina, . . . heart failure, . . . major stroke, . . . amputation, retinal photocoagulation, vitreous hemorrhage, blindness, . . . and renal failure” |
| •“Deaths from vascular events, sudden death or renal failure” | • “Total mortality” |
| •“Complication-free interval, including avoidance of death from any cause, heart attack, angina, renal failure, blindness, major stroke or amputation” | 19969 |
| 19917 (interpretation using the stopping criteria set out in tables 4 and 5 of the 1991 paper) | Main comparison and interdrug comparison |
| Main comparison and interdrug comparison | • “Diabetes-related mortality—death from: heart attacks, sudden death, stroke, complications of peripheral vascular disease or amputations, renal failure, hyperglycemic or hypoglycemic coma” |
| •“Diabetes-related deaths, ie vascular, renal, hyper- or hypoglycemia or sudden death” | • “Diabetes-related mortality and major clinical endpoints”: |
| •“Diabetes-related death and major morbitity”: myocardial infarction, angina and ischaemic heart disease, major stroke, major limb complications requiring amputation, blindness in one eye, renal failure (“death from any cause” removed) | • Macrovascular: fatal and non-fatal myocardial infarction, fatal and non-fatal stroke, ischaemic heart disease, heart failure |
| 19917 (interpretation using the aggregates set out in table 5 of the 1991 paper) | • Microvascular: fatal and non-fatal renal disease, ophthalmic (blindness, retinal photocoagulation, vitreous haemorrhage), peripheral neuropathy (amputation) |
| Main comparison and interdrug comparison | • Cataract |
| •Non-fatal end points: myocardial infarction, angina and ischaemic heart disease, major stroke, major limb complications requiring amputation of a digit or limb, blindness in one eye, renal failure (“death from any cause” removed) | • “Total mortality” |
| •“Clinical events not included in stopping criteria” (cataract extraction, vitreous haemorrhage, heart failure, photocoagulation) | 19981 |
| •Total mortality | Main comparison |
| 19938 (translated from the French) | • “Diabetes-related death (death from myocardial infarction, stroke, peripheral vascular disease, renal disease, hyperglycaemia or hypoglycaemia,sudden death)” |
| Main comparison and interdrug comparison | • “Any diabetes-related endpoint (sudden death, death from hyperglycaemia or hypoglycaemia, fatal or non-fatal myocardial infarction, angina, heart failure, stroke, renal failure, amputation, . . . vitreous hemorrhage, photocoagulation, blindness in one eye, or cataract extraction)” |
| •Diabetes related deaths: cardiovascular events, sudden death, hypoglycaemia, hyperglycaemia, renal failure | • “All-cause mortality” |
| •Diabetes related major morbidity: non-fatal cardiovascular accidents (heart attacks, strokes, and lower limb amputations), renal insufficiency, blindness | 19981 |
| •(Total mortality removed) | Interdrug comparison |
| • Main comparison end points (as above) and: | |
| • “Myocardial infarction (fatal and non-fatal) and sudden death” | |
| • “Stroke (fatal and non-fatal)” | |
| • “Amputation or death due to peripheral vascular disease” | |
| • “Microvascular complications (retinopathy requiring photocoagulation, vitreous hemorrhage, and fatal and non-fatal renal failure)” |
In 1991, after the study had been going 14 years and after numerous interim analyses,1,5 the authors restated their end points.7 They presented two possible interpretations of the end points, but neither matches the end points given in the final publications.1,2 Nevertheless, the authors repeatedly refer to the 1991 paper as setting out the final end points.1,8,9 The end points set out in 1993 are almost identical to those published in 1984.5,8 It is not until 1995 that the end points take on a form similar to those given in the final publication in 1998.1,10 The 1995 paper is also the first time that total mortality is set out as an end point and the first time that a clear distinction is made between microvascular and macrovascular end points (although this distinction is mentioned briefly in earlier reports). By 1996 the end points are, apart from differences in wording, identical to those seen in the final report—the only change is that “diabetes-related mortality and major clinical endpoints” is renamed “any diabetes-related endpoint.”1,9
The final report defines, for the first time, four additional end points to be used when comparing intensive treatments (see box).1 These end points do not, as far as I could discover, appear in any other report. Every previous publication implies or states that the end points of the main comparison were to be used in the secondary comparison among the different agents—this was stated explicitly in 1984 and 1993.5,8 In the main body of the final report the authors state that these secondary end points are to be used only when comparing different intensive regimens, not when comparing intensive and conventional regimens (although such a comparison is made in figure 4 of the report).1 Thus the authors' claim that the “intensive treatment group had a substantial, 25% reduction in the risk of microvascular endpoints” (the only one of the four additional end points that was significantly different in the retrospective comparison of conventional and intensive treatment) is not supported by their study design, and the result must be viewed simply as hypothesis generating.1
Summary of the changes
The authors made substantial changes to the end points as the study progressed. In particular, cataract extraction, vitreous haemorrhage, heart failure, and retinal photocoagulation were not included in 1984, were mentioned ambiguously (as “events,” not end points) in 1991, were not included in 1993, and then were included from 1995 on.5,7,8 In fact, none of the publications before 1995 specifically set out the end points that were used in the final analysis. It also seems that the decisions about which of the aggregations of end points to include were based on the results of the interim analyses that were available to the authors.1,5
Soft end points
A second point is that the study was not blinded. This is less an issue with the hard end points (death) than with the soft end points (cataract extraction and retinal photocoagulation), where the decision to perform a procedure might have been influenced by the degree of glucose control in a patient. This is important, as the decrease in these soft end points accounts for all of the significant beneficial results claimed in the study.1
Analysis of subgroups
The main purpose of the study was always to compare intensive and conventional (diet) treatment. This aim is consistent throughout the early reports of the study and was stated as the overall objective in the authors' 1998 review.4,7,8,11,12
Over the years, however, a subtle change was made. A question based primarily on outcome (does lowering blood sugar decrease morbidity and mortality?), with a secondary question based on mechanism (does the way in which blood sugar is lowered matter?), was changed to a question based primarily on mechanism, and the question based on outcome was simply ignored. This change is reflected in the treatments that were analysed (initially conventional versus intensive treatment, then later conventional treatment versus sulphonylurea and insulin and conventional treatment versus metformin). The authors' argument, first put forth in 1996, that separate comparisons based on mechanism should be made is interesting and worthy of study, particularly with the development of the thiazolidinedione drugs, but it does not justify the decision not to publish the study as originally designed.9
Length of follow up
The study was originally planned to end in 1992, with a median follow up of seven years.5 In 1987 an interim analysis showed negative results; the study was therefore expanded in size and also in length of follow up.7 In 1990 the study was “due to report in 1995, by which time a total of 5000 patients will have been followed for a median of 8 yr.”13 In 1991 the study was “planned to finish in 1994 with a median follow-up of 9 years.”7 In 1993 it was also planned to finish the study in 1994.8 It was not until 1995 that the authors stated, “The clinical study will end in 1997 when the 4,209 patients will have had a median time since randomization of 11 years.”10
Interim analyses were planned to occur every six months to 1985 and yearly thereafter.1,5 It seems that the authors continued the study until they obtained a result that was significant, without adjusting for repeatedly looking at the data. Although it is acceptable to extend a study for a set period and to have predetermined stopping rules, it is not acceptable to repeat interim analyses and to delay publication until a significant result is found.14
Recommendations for screening
Can the findings of this study be generalised to screening for asymptomatic diabetes? The patients in the study were aged 25-65 years, had type 2 diabetes that was newly diagnosed, and were referred by local general practitioners. No attempt was made to screen patients for diabetes, and at diagnosis 50% of the patients had evidence of diabetic tissue damage.10 The study was not designed to show a benefit from screening and would not have been powerful enough to do so even if it had been. Screening detects cases of diabetes much earlier in the course of the disease, and it is not logical to imply that beneficial treatment (if there is any) given later in the course of the disease would give the same benefit (either absolute or proportional) when given earlier. Furthermore, the ethics of screening are different in several respects from those of routine medical investigation and treatment and require that more attention be paid to issues of harm, consent, and cost.15
Are the results clinically significant?
The authors present details of their power calculations and state that the study had a good chance of detecting a 20% or 15% benefit.1,7,9,10,12 They further state, “This reduction has been accepted as being a clinically significant gain,”7 and, “A protective effect of 15% has been judged to be clinically relevant,”12 implying that lower reductions should not be considered clinically significant.
Not one of the main results of the study even approaches these numbers. The best result is in the “any diabetes-related end point,” a reduction of 12% (95% confidence interval 1% to 21%).1 The risk of diabetes related death—clinically the most important result—was not significantly reduced.1
Conclusions
In 1999 the authors restated their position that after 1981 they made no substantive changes in the design of the study (apart from those discussed in the 1998 paper).1,6,16 The ambiguities and contradictions in the various reports of the study cannot, however, be denied. Whether the authors' conclusions are supported by the data cannot be resolved by debate but only by an independent review of the study's design and analysis. Meanwhile it is not unreasonable to ask for the results to be published as outlined in 1984—that is, as a comparison between the intensive treatment and conventional treatment groups, using the two end points “deaths from vascular events, sudden death or renal failure” and “complication free interval, including avoidance of death from any cause, heart attack, angina, renal failure, blindness, major stroke or amputation.”5 It would not be unreasonable to add total mortality to this list, with the caveat that it was not included in the initial design.
If it is true that substantial changes, derived from ongoing reanalysis of the data, were made, and if reanalysis of the data according to the original design shows no significant benefit, then we must call into question the recommendations for screening and more aggressive treatment that have flowed from the publication of this study.17,18 In the interim we should return to the position that, although management of the symptoms of type 2 diabetes is reasonable (that is, with the intention of keeping blood sugar concentrations below about 11-14 mmol/l (200-250 mg/dl), a recommendation in favour of screening and more aggressive care is not supported by the evidence presented in this study.
Acknowledgments
Magdi Nour helped with French translation. Robert Wesley provided criticism of earlier drafts of the manuscript. The author previously published a shorter and less detailed version of this critique, arguing only about the end points used in the 1984 and 1991 publications and not discussing the other issues (Ewart RM. The UKPDS: what was the question? [letter] Lancet 1999;353:1882).
Footnotes
Funding: No additional funding.
Competing interests: Both RME and his institution will lose clinical income if type 2 diabetes is treated less aggressively.
References
- 1.UK Prospective Diabetes Study (UKPDS) Group. Intensive blood-glucose control with sulphonylureas or insulin compared with conventional treatment and risk of complications in patients with type 2 diabetes (UKPDS 33) Lancet. 1998;352:837–853. [PubMed] [Google Scholar]
- 2.UK Prospective Diabetes Study (UKPDS) Group. Effect of intensive blood-glucose control with metformin on complications in overweight patients with type 2 diabetes (UKPDS 34) Lancet. 1988;352:854–865. . (Erratum appears in Lancet 1998;352:1558.) [PubMed] [Google Scholar]
- 3.Holman RR, Turner RC. Diabetes: the quest for basal normoglycaemia. Lancet. 1977;i:469–474. doi: 10.1016/s0140-6736(77)91954-7. [DOI] [PubMed] [Google Scholar]
- 4.UK prospective study of therapies of maturity-onset diabetes. I. Effect of diet, sulphonylurea, insulin or biguanide therapy on fasting plasma glucose and body weight over one year. Diabetologia. 1983;24:404–411. [PubMed] [Google Scholar]
- 5.Turner RC, Mann JI, Peto R. UK prospective study of therapies of maturity-onset diabetes [letter] Diabetologia. 1984;27:419. doi: 10.1007/BF00304862. [DOI] [PubMed] [Google Scholar]
- 6.Turner R, Holman R, Butterfield J. UKPDS: what was the question? [letter] Lancet. 1999;354:600. doi: 10.1016/s0140-6736(05)77957-5. [DOI] [PubMed] [Google Scholar]
- 7.UK Prospective Diabetes Study Group. UK prospective diabetes study (UKPDS). VIII. Study design, progress and performance. Diabetologia. 1991;34:877–890. [PubMed] [Google Scholar]
- 8.Levy JC, Cull CA, Stratton IM, Holman RR, Turner RC. L'étude UKPDS sur le contrôle de la glycémie et de l'hypertension artérielle dans la diabète de type II: objectifs, structure et résultats préliminaires. Journ Annu Diabetol Hotel Dieu 1993:123-37. [PubMed]
- 9.Turner R, Cull C, Holman R.for the United Kingdom Prospective Diabetes Study Group. United Kingdom prospective diabetes study 17: a 9-year update of a randomized, controlled trial on the effect of improved metabolic control on complications in non-insulin-dependent diabetes mellitus Ann Intern Med 1996124136–145. [DOI] [PubMed] [Google Scholar]
- 10.UK Prospective Diabetes Study Group. UK prospective diabetes study 16. Overview of 6 years' therapy of type II diabetes: a progressive disease. Diabetes. 1995;44:1249–1258. [PubMed] [Google Scholar]
- 11.UK Prospective Diabetes Study Group. United Kingdom prospective diabetes study (UKPDS) 13: relative efficacy of randomly allocated diet, sulphonylurea, insulin, or metformin in patients with newly-diagnosed non-insulin dependent diabetes followed for three years. BMJ. 1995;310:83–88. [PMC free article] [PubMed] [Google Scholar]
- 12.Turner RC. The UK prospective diabetes study: a review. Diabetes Care. 1998;21(suppl 3):C35–C38. doi: 10.2337/diacare.21.3.c35. [DOI] [PubMed] [Google Scholar]
- 13.Turner RC, Holman RR. Insulin use in NIDDM. Rationale based on pathophysiology of disease. Diabetes Care. 1990;13:1011–1020. doi: 10.2337/diacare.13.9.1011. [DOI] [PubMed] [Google Scholar]
- 14.DeMets DL, Pocock SJ, Julian DG. The agonising negative trend in monitoring of clinical trials. Lancet. 1999;354:1983–1988. doi: 10.1016/S0140-6736(99)03464-9. [DOI] [PubMed] [Google Scholar]
- 15.Ewart RM. Primum non nocere and the quality of evidence: rethinking the ethics of screening. J Am Board Fam Pract. 2000;13:188–196. doi: 10.3122/15572625-13-3-188. [DOI] [PubMed] [Google Scholar]
- 16.Turner RC, Holman R, Stratton I. The UK prospective diabetes study [letter] Lancet. 1998;352:1934. [Google Scholar]
- 17.American Diabetes Association. Screening for type 2 diabetes. Diabetes Care. 2000;23(suppl 1):S20–S23. [PubMed] [Google Scholar]
- 18.American Diabetes Association. Implications of the United Kingdom prospective diabetes study. Diabetes Care. 1998;21:2180–2184. doi: 10.2337/diacare.21.12.2180. . [Also published in Clin Diabetes 1999;17:5-10.] [DOI] [PubMed] [Google Scholar]
