Skip to main content
BMJ Open logoLink to BMJ Open
. 2018 Jun 30;8(6):e020926. doi: 10.1136/bmjopen-2017-020926

Using electronic health records to quantify and stratify the severity of type 2 diabetes in primary care in England: rationale and cohort study design

Salwa S Zghebi 1,2, Martin K Rutter 3,4, Darren M Ashcroft 5, Chris Salisbury 6, Christian Mallen 7, Carolyn A Chew-Graham 7, David Reeves 1, Harm van Marwijk 8, Nadeem Qureshi 9, Stephen Weng 9, Niels Peek 10, Claire Planner 1, Magdalena Nowakowska 1,2, Mamas Mamas 11, Evangelos Kontopantelis 1,2
PMCID: PMC6042592  PMID: 29961021

Abstract

Introduction

The increasing prevalence of type 2 diabetes mellitus (T2DM) presents a significant burden on affected individuals and healthcare systems internationally. There is, however, no agreed validated measure to infer diabetes severity from electronic health records (EHRs). We aim to quantify T2DM severity and validate it using clinical adverse outcomes.

Methods and analysis

Primary care data from the Clinical Practice Research Datalink, linked hospitalisation and mortality records between April 2007 and March 2017 for patients with T2DM in England will be used to develop a clinical algorithm to grade T2DM severity. The EHR-based algorithm will incorporate main risk factors (severity domains) for adverse outcomes to stratify T2DM cohorts by baseline and longitudinal severity scores. Provisionally, T2DM severity domains, identified through a systematic review and expert opinion, are: diabetes duration, glycated haemoglobin, microvascular complications, comorbidities and coprescribed treatments. Severity scores will be developed by two approaches: (1) calculating a count score of severity domains; (2) through hierarchical stratification of complications. Regression models estimates will be used to calculate domains weights. Survival analyses for the association between weighted severity scores and future outcomes—cardiovascular events, hospitalisation (diabetes-related, cardiovascular) and mortality (diabetes-related, cardiovascular, all-cause mortality)—will be performed as statistical validation. The proposed EHR-based approach will quantify the T2DM severity for primary care performance management and inform the methodology for measuring severity of other primary care-managed chronic conditions. We anticipate that the developed algorithm will be a practical tool for practitioners, aid clinical management decision-making, inform stratified medicine, support future clinical trials and contribute to more effective service planning and policy-making.

Ethics and dissemination

The study protocol was approved by the Independent Scientific Advisory Committee. Some data were presented at the National Institute for Health Research School for Primary Care Research Showcase, September 2017, Oxford, UK and the Diabetes UK Professional Conference March 2018, London, UK. The study findings will be disseminated in relevant academic conferences and peer-reviewed journals.

Keywords: type 2 diabetes, electronic health records, primary care, hospitalisation records, cardiovascular disease, diabetes severity algorithm


Strengths and limitations of this study.

  • This is the first UK-based study to develop a diabetes severity scoring tool based on real-world electronic healthcare data to grade people with type 2 diabetes by their clinical severity.

  • The study will include a large sample size and use high-quality medical data routinely collected from general practices with access to linked national hospitalisation and cause-specific mortality data sets.

  • The association between the computed severity scores and future adverse outcomes (cardiovascular event, hospitalisation and mortality) will be used to validate the developed severity algorithm.

  • There is a possibility to miss other severity indicators not recorded in the data sources used such as pharmacy dispensing data.

  • Given that the linkage scheme is only available for consented general practices in England, the study cohort will be restricted to eligible patients registered within England (form nearly 58% of total Clinical Practice Research Datalink general practices).

Introduction

The worldwide prevalence of diabetes mellitus is increasing with WHO estimating that over 422 million adults had diabetes in 2016 with a growth in the global prevalence from 4.7% in 1980 to 8.5% in 2014.1 2 In the UK, the prevalence of type 2 diabetes mellitus (T2DM) has nearly doubled between 2004 and 2014.3 4 People with diabetes have a higher risk for morbidity and mortality when compared with individuals without diabetes,3 5 and hence the increasing prevalence presents a significant burden on healthcare resources. Diabetes management expenses are estimated to consume up to 11% of the total healthcare budget in the UK and USA.6 7 Diabetes is a complex metabolic condition, of an increasing severity,8 9 at individually varying levels, and progressive development of vascular complications and end organ damage over time. Diabetes is mainly managed in primary care in the UK,10 overall Europe,11 USA12 and Asia.13 14 In the UK, nearly 75% of the diabetes-associated costs relate to management of diabetes-related complications.6

The ‘severity’ of clinical conditions can be conceptualised as a progression of the underlying disease process. Increasing disease severity and development of associated complications lead to greater treatment complexity and clinical impact. The severity of chronic conditions, such as diabetes, has not been widely considered despite the clinical relevance and its likely impact on healthcare systems, where few studies have assessed disease severity among patients with T2DM or have quantified temporal trends or scores of severity.15 16

Severity scores could be clinically useful, particularly in T2DM; as such, a new summary score could contribute above what existing risk scores offer (eg, moving beyond risk for death) and also be highly relevant for use in six main clinical and research areas: (1) identifying complex patients with a higher need for future care; (2) identifying patients at early stages of disease (benchmarking); (3) providing information that will directly inform clinical care; (4) identifying trajectories in severity over time; (5) supporting research, for example, by serving as an important diabetes-specific covariate to consider in analyses, similarly to the Charlson Comorbidity Index, or as an outcome; and (6) providing data that will inform resource allocation in health systems, such as the National Health Service.

Here, we describe the conceptual development of a T2DM severity algorithm model, using electronic health records (EHRs) on clinical consultations and treatments. Our aim is to use routinely collected clinical and administrative data to develop a scoring tool which would quantify and grade the severity of T2DM. This model could potentially be used by clinicians to stratify patients based on disease severity and we aim to demonstrate potential advantages over similar risk scores. To validate the algorithm, our secondary aim is to examine the association of severity grades with risk of three main adverse outcomes: cardiovascular disease, hospitalisation and mortality. The developed severity stratification algorithm is anticipated to have direct impact on clinical practice and have wider implications on service planning and policy-making.

Methods and analysis

Data source

This study will use data from the Clinical Practice Research Datalink (CPRD). The CPRD is one of the world’s largest EHR with anonymised primary care records from over 650 general practices over the UK.17 Nearly 76% of all CPRD-registered patients are in England.17 CPRD data include clinical diagnoses, prescribed therapies, biochemical test results and referrals to healthcare services. The accuracy and completeness of diagnostic coding and validity of CPRD data, in clinical research, have been reported as excellent with positive predictive values (PPVs) of over 90% for nearly 14 conditions (including diabetes and cerebrovascular disease).18 PPVs were defined as the proportion of CPRD diagnoses that were validated as true cases when compared with a gold standard such as general practitioner (GP) questionnaire or primary care records.

Two metrics for research quality data, recommended to be used by researchers, are available for CPRD records.17 For patient-level data, the flag ‘acceptable’ indicates that a patient’s record has met certain quality standards such as registration status, valid age and gender and record of patient events. For practice-level data, the up-to-standard (UTS) date is used as a data quality measure. The UTS date is calculated for each CPRD general practice as the latest date at which the general practice meets minimum quality criteria based on two central concepts: the continuity of recorded data and the number of recorded deaths, in comparison to an expected national range.17 19

Nearly 75% of general practices in England (approximately 58% of all CPRD practices) have consented to the CPRD Linkage Scheme for access to a number of linked data sets and national disease registries.17 These include hospital data (including outpatient, admissions and accident and emergency (A&E) data), mortality records (held by the Office for National Statistics (ONS)), socioeconomic status and the cancer registry. The linked hospitalisation records, held by the Hospital Episode Statistics (HES), provide ethnicity data, admission and discharge dates, clinical diagnoses and procedures during hospitalisation recorded using the 10th revision of the International Classification of Diseases (ICD-10) and operating procedure codes. This study will only include patients registered with English general practices which provide UTS data and which participate in the CPRD Linkage Scheme.

Study population

The study cohort will include individuals aged 35 years or over with T2DM (with ≥1 diagnostic code as in (online supplementary table S1) between 1 April 2007 and 31 March 2017. Patients with an ever record of type 1 diabetes (see online supplementary table S2) will be excluded unless they have records of non-insulin anti-diabetic therapies. We chose the study period after 2006 as the quality of CPRD data has improved substantially to adhere with the then introduced important changes to the national incentive scheme The Quality and Outcomes Framework (QOF), an incentivisation programme for all GP surgeries in the UK.20 21 QOF exception reporting process allows general practices to exclude patients from indicators or a clinical domain based on discretionary exception codes. However, evidence on the use of exception codes has shown that they are being used appropriately by practices and overall exception rates for diabetes patients are low.22

Supplementary data

bmjopen-2017-020926supp001.pdf (478.7KB, pdf)

For this cohort construction, we will consider implementing previously validated algorithms designed to identify diabetes cases by avoiding potential misclassification in routinely collected data such as CPRD.23 24 Eligible patients will be followed up until censored at the earliest instance of any of the following event dates: patient transferred out of the practice (any cause), last collection date for the practice, the study end on 31 March 2017 or death. The main demographic and clinical characteristics of the defined diabetes cohorts (such as age, gender, geographic region within England, patient-level and general practice-level social deprivation, body mass index (BMI) and baseline glycated haemoglobin (HbA1c)) will be identified. Based on our previous studies, the expected sample size in a year will include 11 000 patients with T2DM registered with English general practices linked to the HES Admitted Patient Care data set, HES Outpatient and HES A&E data sets. A random 80% of the identified diabetes cohort (training dataset) will be used to develop the severity tool, with the remaining 20% of the cohort used as a validation data set, as described below.

Severity domains

A systematic literature search for studies that developed algorithms or models to assess and quantify the severity of diabetes was conducted to identify the domains and subdomains for T2DM severity (to be published separately). Also, expert clinical opinion from members of the research team was used to supplement the search process and identify possible omissions. Clinical members in the team include pharmacists (DMA, SSZ), GPs (CS, CM, CACG, HVm and NQ), a consultant diabetologist (MKR) and a consultant cardiologist (MM), who used their expertise to create a list of relevant clinical domains for T2DM severity. The final domains to be included in the severity model will be decided during the analysis stage. Currently, the identified clinical domains that are relevant to the degree of progression of T2DM include: patient factors (diabetes duration (the period between T2DM diagnosis and the severity score estimation) and BMI), monitoring laboratory tests (HbA1c categories (threshold of 7% [53 mmol/mol]), and blood glucose levels), type of anti-diabetic therapy, other prescribed medications (such as lipid-regulating medications and ACE inhibitors (ACEIs)), comorbidities (including diabetes-related complications, depression), hospitalisation and surgical interventions. Comorbidities will be identified using appropriate code lists and contribute to the severity score according to each score computing method. The identified domains and subdomains are described in table 1. Domains will also be reviewed by a panel of ‘experts by experience’ (people with lived experience of T2DM) to provide patient validation of the severity scoring tool.

Table 1.

The main (sub)domains identified to quantify the severity of type 2 diabetes

Severity domain Severity subdomain
1. Risk factors*
  • Duration of type 2 diabetes40

  • Body mass index (BMI)

  • Hypertension

  • Hyperlipidaemia

  • Personal/Family history of cardiovascular disease

  • Blood glucose levels

    • Glycated haemoglobin (HbA1c)35

    • Fasting blood glucose (FBG) and random blood glucose (RBG)

2. Type/pattern of anti-diabetic treatment, insulin use and other therapies
  • Anti-diabetic therapy ever;40 Therapies with cardiovascular benefits versus other; Changes in drug treatments;43 or the number of prescribed treatments42

  • Insulin use: prescription ever or within 1 year of diagnosis; Insulin initiation:15 time to initiation

  • Other therapies: ACE inhibitors (ACEI) and lipid-regulating therapies

3. Diabetes-related microvascular complications15 34
  • Neuropathy (foot ulcer, Charcot foot, gangrene, amputation)

  • Nephropathy

  • Retinopathy (laser therapy and blindness)

4. Renal disease
  • Microalbuminuria and proteinuria

  • Moderate-severe chronic kidney disease (CKD) stages 3 and 434

  • End-stage renal disease (ESRD): kidney transplant and dialysis

5. Cardiovascular and cerebrovascular disease
  • Atherosclerosis15 34

  • Myocardial infarction (MI)15 34

  • Angina15 34

  • Atrial/ventricular fibrillation (AF)/(VF)34

  • Heart valve disease

  • Heart failure (HF)34

  • Peripheral vascular disease (PVD)34

  • Transient ischaemic attack (TIA)

  • Ischaemic stroke, haemorrhagic stroke15 34

6. Cardiovascular and cerebrovascular interventions
  • Coronary artery bypass graft (CABG)

  • Coronary artery interventions (PCI/PTCA)

  • Endovascular aneurysm repair (EVAR)

  • PVD stenting and bypass procedures

  • Heart valve interventions

  • Use of defibrillator

  • Carotid artery events, stenting and bypass interventions

7. Other comorbidities
  • Anxiety

  • Depression

  • Dementia

  • Cognitive impairment

8. Hospital admissions
  • Any-cause hospital admissions

  • Diabetes-attributable admission

  • Cardiovascular disease-related admission

9. Emergency diabetes-related events
  • Hypoglycaemia

  • Hyperosmolar hyperglycaemic state (HHS)34

  • Diabetic ketoacidosis (DKA) or other coma34

*Other demographic data (such as age, gender and the level of deprivation) are important predictors for adverse outcomes and will be included in the later risk prediction analysis.

The clinical Read codes for the defined severity domains codes and the product codes for drug therapies will be identified using the (pcdsearch) Stata user command.25 The (pcdsearch) command is a search programme developed to extract code lists from typically very long lookup files associated with primary care databases using an input file containing a list of stubs for codes of interest to be searched for. For CPRD, the lookup files Medical data set is searched for all clinical Read codes and the Product lookup file, that includes unique product codes, is searched for all treatments. The corresponding ICD-10 codes will be used to identify clinical domains recorded in the hospitalisation data and ONS mortality records. The relevant CPRD operational identification entities for the laboratory tests will be identified. This is an ongoing process whereby the final lists of domains and codes will be reached by consensus among the clinical members of the team. All code lists will be available on the online clinical code repository (ClinicalCodes.org).26

Study outcomes

The adverse outcomes of interest will be the development of the first event among cardiovascular disease (myocardial infarction, stroke), future hospitalisation (any hospitalisation, diabetes-related and cardiovascular hospitalisation) and death (diabetes-related mortality, cardiovascular mortality and all-cause mortality). Secondary outcome will be future hospitalisation due to hypoglycaemia, a relevant and potentially preventable adverse outcome. The hospitalisation and death outcomes will be identified using the linked HES and ONS mortality data, respectively. Similarly with severity domains, Read codes and ICD-10 codes (available on the ClinicalCodes.org online repository) will be used to identify the outcomes as appropriate.

Statistical analyses

Diabetes severity algorithm

Using annual data bins and grouping diabetes patients in the training data set (include random 80% of the total diabetes cohort) from 1 April to 31 March between 2007 and 2017, the developed diabetes algorithm will grade the severity of T2DM using predefined (sub)domains. Our study period (after 2006) was selected to ensure very high data quality in primary care, while the addition of secondary care data will make our analyses even more robust, in terms of accurately classifying T2DM severity levels.

We will consider two approaches to derive numerical or categorical diabetes severity scores or levels. First, we will use a binary classification (severity indicator: present/absent) within each subdomain and calculate an aggregate score. The second approach, through hierarchical stratification of end organ microvascular and macrovascular complications, will involve increasing weighting within each subdomain, as severity increases where scores of 1, 2 or three on each subdomain will be assigned based on clinical input in terms of severity. Then, regression models, using death (primary outcome) and future hospitalisation (secondary outcome) as dependent variable, will be fitted from which the weights of its estimates will be used to calculate the weights for severity subdomains. The highest possible severity score will be known when the final list of included domains and subdomains is decided. Hierarchical diagrams of the clinical severity of identified diabetes-related complications domains (figure 1), cerebrovascular domains (figure 2) and cardiovascular (figures 3 and 4, online supplementary figures S1 and S2) domains are presented.

Figure 1.

Figure 1

Severity hierarchy of diabetes-related microvascular complications. CKD, chronic kidney disease; ESRD, end-stage renal disease.

Figure 2.

Figure 2

Severity hierarchy of cerebrovascular domains. TIA, transient ischaemic attack.

Figure 3.

Figure 3

Severity hierarchy of coronary and vascular domains. CABG, coronary artery bypass graft; EVAR, endovascular aneurysm repair; HF, heart failure; MI, myocardial infarction; PAD, peripheral arterial disease; PCI, percutaneous coronary intervention; PVD, peripheral vascular disease.

Figure 4.

Figure 4

Severity hierarchy of cardiovascular disease domains (heart rate, rhythm and conduction). VF, ventricular fibrillation; VT, ventricular tachycardia.

Subdomains with many measurements over time, such as HbA1c levels and BMI, will be modelled as time-varying covariates (alternative models will be considered to account for non-linear changes). Within each time window, the average of these measurements will be used, and we will also use multiple imputation approaches to account for missing data. Provisionally, we will consider using data records within look-back periods between 3 and 5 years. Different look-back windows within this period will be tested to obtain the optimal time window. Descriptive statistics of the study population and the estimated scores will be performed.

Severity algorithm validation

When we have developed a first version of the severity score tool, it will be validated statistically by quantifying the association of severity scores to future adverse outcomes. The primary outcome will be developing a cardiovascular event (myocardial infarction, stroke), future hospitalisation and death (diabetes-related mortality, cardiovascular mortality and all-cause mortality). A secondary adverse outcome will be future hospital admission due to hypoglycaemia, a relevant and potentially preventable adverse outcome. Cox proportional hazards (after assessing proportional hazards assumption) and/or competing-risks regressions will be used to perform survival analyses and assess the relationship between the severity score and outcomes controlling for all available patient characteristics (such as age, gender, ethnicity and level of social deprivation). We aim to analyse and assess how severity scores differ across different levels of these variables. As stated earlier, a random 20% of the study cohort (validation data set) will be used to validate the performance of the severity algorithm that was developed in the training data set (80% of the diabetes cohort). Using the calculated subdomains weights (derived by the regression model estimates), Cox regression analysis will be used to validate the developed severity algorithm and we will assess the model performance using relevant measures (such as area under the receiver operating characteristic curve). Using this analysis, we will also test how much the ’sophisticated' full model that includes all relevant severity domains will add in terms of predictive validity of the adverse outcomes over a simpler model that includes demographic data (age, gender and level of social deprivation) alone.

Longitudinal trends of diabetes severity

In patients newly diagnosed with T2DM, during the study period, we will assess the temporal trends of severity scores over time by calculating the time needed to progress along the quintiles of the severity scale starting from the date or year of diagnosis and identify the predictors to this progression across quintiles.

While the inclusion of some domains such as carotid artery stenting, the use of ACEIs or lipid-regulating drugs, kidney transplantation and laser therapy are clinically relevant and represent markers of greater level of disease severity, they also improve the prognosis (lessen the grade of severity), that is, raising a risk of ‘feedback’. While we will further assess their role, we may have to not include these domains in the modelling at the analysis stage. All statistical analyses will be conducted using Stata Statistical Software for Windows: Release 15 (StataCorp LP, College Station, Texas, USA).

Patient and public involvement and engagement

The public interest in the reuse of routinely collected electronic health data for research purposes has expanded over recent years. Our study presents a valuable opportunity to address the challenge of how to develop meaningful and productive patient and public involvement and engagement (PPIE) collaborations in observational studies that make use of secondary data.

We will identify suitable PPIE partners to collaborate on this study and develop creative strategies for meaningful involvement as the project develops. Participants to provide their input that will be particularly valuable for deciding what information would be most useful to patients with diabetes and for providing feedback into the developed diabetes algorithms.

In the latter stages of the study, jointly with our PPIE partners, we will plan and deliver a patients’ and carers’ workshop where we will present emerging findings and seek feedback to inform further work and the dissemination strategy. This will include an exercise around weights of domains within the algorithm. PPIE work will be reported using the GRIPP (Guidance for Reporting Involvement of Patients and Public) checklist.27

Discussion

The global and country-specific prevalence of diabetes has increased substantially.2 28–31 For T2DM, prevalence rates have doubled over the past two decades.4 32 33 Diabetes diagnosis is associated with higher risk for morbidity and mortality in comparison to people without diabetes.5 Despite the natural progression of most chronic diseases, including T2DM, and their important clinical implication on prognosis and medical resources utilisation, only a few studies have attempted to grade the severity of chronic conditions over time. Our proposed study aims to develop a contemporary algorithm to quantify the severity of T2DM, a highly prevalent condition, using routine EHR. The scoring tool will be based on clinically relevant diagnoses and treatments modelled as severity domains and subdomains. The algorithm will enable clinicians to grade patients with T2DM according to the level of their disease severity, at baseline and longitudinally, as driven by the weights of the included clinical domains.

Prior studies

Currently, there are limited data around validated severity measures that can be used in routine clinical practice managing patients with T2DM. Consequently, we are in parallel finalising a conducted systematic review for clinical-based diabetes severity models which will inform the refinement of this work. Here, we discuss three relevant diabetes severity measures that have been previously reported from countries outside the UK.

In the first study, Gini et al categorised the severity of T2DM (n=300) into four levels based on insulin use and the presence of diabetes-related complications (see online supplementary table S3).15 A validation study in a random sample of cases was performed by interviewing their GPs. We, however, aim to include more patients with T2DM and consider more clinical data such as other coexisting conditions, non-insulin therapies, statins and ACEIs that have been shown to be associated with reduced risk of adverse outcomes.

In the second study, a diabetes symptom checklist was created to measure the perceived symptom severity and assess the changes over time in 185 patients with T2DM.16 The checklist contained 34 items categorised by main clinical symptoms (see online supplementary table S3). The patterns of comorbidities and prescribed treatments were reportedly associated with significant differences in the estimated severity scores.16 The sample size was however relatively small and the checklist was based on patients' perception on diabetes symptoms severity.

Third, Young et al calculated a Diabetes Complications Severity Index (DCSI) in 4229 patients with type 1 or type 2 diabetes in one US geographic region to examine its association with adverse outcomes (risk of hospitalisation and mortality).34 In comparison to using a simple numerical count of complications, DCSI found to be a better tool to predict adverse outcomes. The authors used pharmacy (insulin use only) and laboratory data to compute the severity index and included patients from clinics with largest ethnic diversity (see online supplementary table S3). However, the DCSI missed additional domains such as diabetes duration, hyperlipidaemia and a wider range of diabetes-related and end organ damage manifestations that we aim to assess in addition to hospitalisation and mortality.

In the wider literature, although several studies have investigated the possible role of other various factors on the severity of diabetes, none have provided a severity scoring tool that uses data from various clinical domains as planned in our study that can use the wealth of information routinely collected in electronic healthcare databases. In these studies, approaches used to define diabetes severity included: comparing T2DM severity, before and after obesity surgery;35 examining the association between diabetes severity and either haematological and immunological changes,36 levels of urine citrate,37 a biomarker for adverse outcomes;38 grip strength;39 or the use of complementary medicine to manage T2DM.40

Overall, the severity of T2DM has been previously assessed using the following domains: the complexity of anti-diabetic treatment regimens35 or HbA1c levels;37 a summary severity variable that includes vascular complications;41 or a health status composite, number of comorbidities and patterns of treatments.42 43 Other putative and less clinically robust indicators or animal models were reportedly used to assess the severity of T2DM such as the effect of different patient education approaches on diabetes severity;44 evaluating the effect of parental history;45 and the role of genetic,46 47 metabolic48 or inflammatory mediators.49

In comparison to our planned study, none of the previous studies included as many routinely collected clinically relevant variables as in our more inclusive summary severity score algorithm or assessed the association of estimated scores with the various adverse outcomes included in our defined primary and secondary endpoints.

Potential strengths and limitations

Our proposed study has several potential strengths: First, it aims to present a contemporary measure to the available tools and the first UK-based study to develop an EHR-based severity scoring tool to grade patients by their T2DM severity. Second, the study uses high-quality real-world medical data routinely collected from general practices. The use of routinely collected data indicates that severity scores can be generated automatically with minimal effort. Third, the views of PPIE collaborators will be incorporated in the development of the severity tool. Fourth, we will access two linked data sets: the hospitalisation (HES) data (to maximise data capture and reduce condition misclassification), and the cause-specific mortality data to ascertain causes of death. Fifth, the sample size is expected to be large enough to drive the development and evaluation of the severity algorithm. Sixth, a statistical validation of the developed algorithm is planned and described.

As we will use data available in CPRD and HES data sets, one of the limitations we anticipate is the possibility to miss other severity indicators not recorded in used data sets. These include detailed pharmacy data such as ‘actual’ dispensing and adherence data. Also, the use of routinely collected data is associated with missing values, being collected from questionnaires, and issues around the accuracy of coding. However, we plan to use appropriate imputation methods and definite criteria to minimise the effect of coding issues. A possible limitation that should be acknowledged is underestimated and poorly represented T2DM severity levels for patients not regularly attending a general practice, people missing appointments or patients not being reliably captured in the database due to very high mobility status (eg, homelessness). This limitation aligns with QOF exception reporting that allows practices to exclude patients from indicators or a clinical domain based on discretionary exception codes. However, the use of QOF exception coding was considered appropriate and its levels were very low, especially for informed dissent.22 Another limitation is that our study will be restricted to patients registered with general practices in England, as CPRD currently only provides linkage to external national data sets for consented general practices in England. However, English general practices form the majority of all CPRD practices including nearly 76% of total registered patients.17 Finally, due to project time constraints, it is not possible to validate the developed diabetes scores using questionnaires (as reported previously15) or replicating the algorithm in a separate data set. However, we have planned a statistical validation of the developed algorithm.

Importance and the clinical implications of the severity tool

The developed algorithm and severity tool may have significant implications for primary care both in terms of disease management and resource allocation. Ideally, through future work of further validation and assessment of clinical utility, the severity tool will be of practical use in primary care through its implementation in the clinical computing systems used in the UK.50 The clinical significance of the developed severity algorithm to primary care is driven by the inclusion of highly relevant clinical domains, such as diabetes-related complications and comorbidities, mapped to routinely collected data.

Additionally, by assessing the longitudinal patterns of severity, the developed tool may be more clinically relevant than the currently used proxy (HbA1c), and thus it could be a more reliable indicator in informing practices’ remuneration for diabetes care. Categorising individuals based on their diabetes severity will be relevant for risk stratification (which may enable safer delegation of care within the clinical team), help identify individualised patient risks and will help practitioners triage patients in need of a greater clinical input which informs towards stratified medicine to reduce future life-changing diabetes-related complications. Moreover, the weights of the severity scores may inform future clinical trials as the scoring tool considers a broader range of cardiovascular conditions than in most randomised clinical trials. Given the relatively low rates of cardiovascular outcomes in some trials, identifying patients with diabetes who are at higher risk via our severity algorithm would help to power trials with overall longer-term benefit for patients. Finally, the composite severity score may serve as an important confounding factor in future research, as an example, to match diabetes cases and controls in observational studies and clinical trials.

Ethics and dissemination

Some data were presented at the annual National Institute for Health Research School for Primary Care Research (NIHR SPCR) Showcase, September 2017, Oxford, UK and at the Diabetes UK Professional Conference, March 2018, London, UK. The study findings will be disseminated in relevant academic conferences and peer-reviewed journals.

Supplementary Material

Reviewer comments
Author's manuscript

Footnotes

Contributors: EK, SSZ, MM, MKR and HvM developed the study design and data analysis plan. SSZ, MM, MKR and HvM agreed on provisional clinical code lists. SSZ prepared the first draft of the manuscript, and EK, MM, MKR and HvM critically reviewed initial versions. CP contributed to the planned PPIE work. DR, CACG, CM, NP, DMA, CS, NQ, MN and SW reviewed and critically edited the manuscript. All authors approved the final version of the protocol before submission. SSZ is the guarantor.

Funding: This study is funded by the National Institute for Health Research School for Primary Care Research (NIHR SPCR), grant number 331. This report is an independent research by the National Institute for Health Research.

Disclaimer: The views expressed in this publication are those of the authors and not necessarily those of the NHS, the National Institute for Health Research or the Department of Health.

Competing interests: EK, HvM, MM, DR, CACG, CP, CM, NP and MN declare no competing interests. SSZ reports support by the NIHR SPCR during this study. DMA has received grant funding from Abbvie and has served on advisory boards for Pfizer and GSK. MKR has received educational grant support from MSD and Novo Nordisk; has modest stock ownership in GSK; and has consulted for Roche. NQ reports grants from the NIHR SPCR during the conduct of the study. CS reports grants from NIHR SPCR during the conduct of the study; grants from NHS CLAHRC West, grants from Avon Primary Care Research Collaborative, outside the submitted work. SW serves as a member of the Clinical Practice Research Datalink Independent Scientific Committee (ISAC) at the UK Medicines and Health Regulatory Agency.

Patient consent: Not required.

Ethics approval: The CPRD’s Independent Scientific Advisory Committee (ISAC) approved this study protocol.

Provenance and peer review: Not commissioned; externally peer reviewed.

References

  • 1. World Health Organization (WHO) media centre. Diabetes. [Fact sheet] 2016. http://www.who.int/mediacentre/factsheets/fs312/en/ (cited 29 Apr 2016).
  • 2. World Health Organization (WHO). Global Report on Diabetes. 2016. http://apps.who.int/iris/bitstream/10665/204871/1/9789241565257_eng.pdf (cited 11 Feb 2017).
  • 3. Zghebi SS, Steinke DT, Carr MJ, et al. . Examining trends in type 2 diabetes incidence, prevalence and mortality in the UK between 2004 and 2014. Diabetes Obes Metab 2017;19:1537–45. 10.1111/dom.12964 [DOI] [PubMed] [Google Scholar]
  • 4. Sharma M, Nazareth I, Petersen I. Trends in incidence, prevalence and prescribing in type 2 diabetes mellitus between 2000 and 2013 in primary care: a retrospective cohort study. BMJ Open 2016;6:e010210 10.1136/bmjopen-2015-010210 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. American Diabetes Association (ADA). Standards of Medical Care in Diabetes - 2015. Diabetes Care 2015;38:S1–S49. [Google Scholar]
  • 6. Hex N, Bartlett C, Wright D, et al. . Estimating the current and future costs of Type 1 and Type 2 diabetes in the UK, including direct health costs and indirect societal and productivity costs. Diabet Med 2012;29:855–62. 10.1111/j.1464-5491.2012.03698.x [DOI] [PubMed] [Google Scholar]
  • 7. Walker BR, Colledge NR, Ralston SH, et al. . Davidson’s Principles and Practice of Medicine. China: Elsevier, 2014. (cited 30 Jun 2016). [Google Scholar]
  • 8. Walker BR, Colledge NR, Ralston SH, et al. . Davidson’s Principles and Practice of Medicine. China: Elsevier, 2014. (cited 11 May 2017). [Google Scholar]
  • 9. Nefs G, Pop VJ, Denollet J, et al. . The longitudinal association between depressive symptoms and initiation of insulin therapy in people with type 2 diabetes in primary care. PLoS One 2013;8:e78865 10.1371/journal.pone.0078865 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Murrells T, Ball J, Cookson G, et al. . Managing diabetes in primary care: how does the configuration of the workforce affect quality of care? London: National Nursing Research Unit, King’s College, 2013. Report No: Department of Health Policy Research Programme, ref. 016/0058. [Google Scholar]
  • 11. Eygen Luk V, Patricia S, Luc F, et al. . Priorities for diabetes primary care in Europe. Prim Care Diabetes 2008;2:3–8. 10.1016/j.pcd.2007.11.001 [DOI] [PubMed] [Google Scholar]
  • 12. Spann SJ, Nutting PA, Galliher JM, et al. . Management of type 2 diabetes in the primary care setting: a practice-based research network study. Ann Fam Med 2006;4:23–31. 10.1370/afm.420 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Tai TY, Chuang LM, Tsai ST, et al. . Treatment of type 2 diabetes mellitus in a primary care setting in Taiwan: comparison with secondary/tertiary care. J Formos Med Assoc 2006;105:105–17. 10.1016/S0929-6646(09)60331-4 [DOI] [PubMed] [Google Scholar]
  • 14. Mafauzy M. Diabetes control and complications in private primary healthcare in Malaysia. Med J Malaysia 2005;60:212–7. [PubMed] [Google Scholar]
  • 15. Gini R, Schuemie MJ, Mazzaglia G, et al. . Automatic identification of type 2 diabetes, hypertension, ischaemic heart disease, heart failure and their levels of severity from Italian General Practitioners' electronic medical records: a validation study. BMJ Open 2016;6:e012413 10.1136/bmjopen-2016-012413 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Grootenhuis PA, Snoek FJ, Heine RJ, et al. . Development of a type 2 diabetes symptom checklist: a measure of symptom severity. Diabet Med 1994;11:253–61. 10.1111/j.1464-5491.1994.tb00268.x [DOI] [PubMed] [Google Scholar]
  • 17. Herrett E, Gallagher AM, Bhaskaran K, et al. . Data Resource Profile: Clinical Practice Research Datalink (CPRD). Int J Epidemiol 2015;44:827–36. 10.1093/ije/dyv098 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Khan NF, Harrison SE, Rose PW. Validity of diagnostic coding within the General Practice Research Database: a systematic review. Br J Gen Pract 2010;60:128–36. 10.3399/bjgp10X483562 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Williams T, van Staa T, Puri S, et al. . Recent advances in the utility and use of the General Practice Research Database as an example of a UK Primary Care Data resource. Ther Adv Drug Saf 2012;3:89–99. 10.1177/2042098611435911 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Lester H, Campbell S. Developing Quality and Outcomes Framework (QOF) indicators and the concept of ’QOFability'. Qual Prim Care 2010;18:103–9. [PubMed] [Google Scholar]
  • 21. Calvert M, Shankar A, McManus RJ, et al. . Effect of the quality and outcomes framework on diabetes care in the United Kingdom: retrospective cohort study. BMJ 2009;338:b1870 10.1136/bmj.b1870 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Doran T, Kontopantelis E, Fullwood C, et al. . Exempting dissenting patients from pay for performance schemes: retrospective analysis of exception reporting in the UK Quality and Outcomes Framework. BMJ 2012;344:e2405 10.1136/bmj.e2405 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. de Lusignan S, Khunti K, Belsey J, et al. . A method of identifying and correcting miscoding, misclassification and misdiagnosis in diabetes: a pilot and validation study of routinely collected data. Diabet Med 2010;27:203–9. 10.1111/j.1464-5491.2009.02917.x [DOI] [PubMed] [Google Scholar]
  • 24. Wright AK, Kontopantelis E, Emsley R, et al. . Life Expectancy and Cause-Specific Mortality in Type 2 Diabetes: A Population-Based Cohort Study Quantifying Relationships in Ethnic Subgroups. Diabetes Care 2017;40:338–45. 10.2337/dc16-1616 [DOI] [PubMed] [Google Scholar]
  • 25. Olier I, Springate DA, Ashcroft DM, et al. . Modelling Conditions and Health Care Processes in Electronic Health Records: An Application to Severe Mental Illness with the Clinical Practice Research Datalink. PLoS One 2016;11:e0146715 10.1371/journal.pone.0146715 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Springate DA, Kontopantelis E, Ashcroft DM, et al. . ClinicalCodes: an online clinical codes repository to improve the validity and reproducibility of research using electronic medical records. PLoS One 2014;9:e99825 10.1371/journal.pone.0099825 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Staniszewska S, Brett J, Mockford C, et al. . The GRIPP checklist: strengthening the quality of patient and public involvement reporting in research. Int J Technol Assess Health Care 2011;27:391–9. 10.1017/S0266462311000481 [DOI] [PubMed] [Google Scholar]
  • 28. Lipscombe LL, Hux JE. Trends in diabetes prevalence, incidence, and mortality in Ontario, Canada 1995-2005: a population-based study. Lancet 2007;369:750–6. 10.1016/S0140-6736(07)60361-4 [DOI] [PubMed] [Google Scholar]
  • 29. Onat A, Hergenç G, Uyarel H, et al. . Prevalence, incidence, predictors and outcome of type 2 diabetes in Turkey. Anadolu Kardiyol Derg 2006;6:314–21. [PubMed] [Google Scholar]
  • 30. International Diabetes Federation (IDF). IDF Diabetes Atlas. 7th Edn, 2015. (cited 08 Feb 2017). [Google Scholar]
  • 31. Lin CC, Li CI, Hsiao CY, et al. . Time trend analysis of the prevalence and incidence of diagnosed type 2 diabetes among adults in Taiwan from 2000 to 2007: a population-based study. BMC Public Health 2013;13:318 10.1186/1471-2458-13-318 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. González EL, Johansson S, Wallander MA, et al. . Trends in the prevalence and incidence of diabetes in the UK: 1996-2005. J Epidemiol Community Health 2009;63:332–6. 10.1136/jech.2008.080382 [DOI] [PubMed] [Google Scholar]
  • 33. Evans JM, Barnett KN, Ogston SA, et al. . Increasing prevalence of type 2 diabetes in a Scottish population: effect of increasing incidence or decreasing mortality? Diabetologia 2007;50:729–32. 10.1007/s00125-006-0585-9 [DOI] [PubMed] [Google Scholar]
  • 34. Young BA, Lin E, Von Korff M, et al. . Diabetes complications severity index and risk of mortality, hospitalization, and healthcare utilization. Am J Manag Care 2008;14:15–24. [PMC free article] [PubMed] [Google Scholar]
  • 35. Runkel M, Müller S, Brydniak R, et al. . Downgrading of type 2 diabetes mellitus (T2DM) after obesity surgery: duration and severity matter. Obes Surg 2015;25:494–9. 10.1007/s11695-014-1419-3 [DOI] [PubMed] [Google Scholar]
  • 36. Okano K, Araki M, Yamamoto M, et al. . Exploration of hematological and immunological changes associated with the severity of type 2 diabetes mellitus in Japan. Nurs Health Sci 2008;10:65–9. 10.1111/j.1442-2018.2007.00376.x [DOI] [PubMed] [Google Scholar]
  • 37. Fram EB, Moazami S, Stern JM. The Effect of Disease Severity on 24-Hour Urine Parameters in Kidney Stone Patients With Type 2 Diabetes. Urology 2016;87:52–9. 10.1016/j.urology.2015.10.013 [DOI] [PubMed] [Google Scholar]
  • 38. Eldor R, Klieger Y, Sade-Feldman M, et al. . CD247, a novel T cell-derived diagnostic and prognostic biomarker for detecting disease progression and severity in patients with type 2 diabetes. Diabetes Care 2015;38:113–8. 10.2337/dc14-1544 [DOI] [PubMed] [Google Scholar]
  • 39. Loprinzi PD, Loenneke JP. Evidence of a link between grip strength and type 2 diabetes prevalence and severity among a national sample of U.S. Adults. J Phys Act Health 2016;13:558–61. 10.1123/jpah.2015-0241 [DOI] [PubMed] [Google Scholar]
  • 40. Nahin RL, Byrd-Clark D, Stussman BJ, et al. . Disease severity is associated with the use of complementary medicine to treat or manage type-2 diabetes: data from the 2002 and 2007 National Health Interview Survey. BMC Complement Altern Med 2012;12:193 10.1186/1472-6882-12-193 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Linzer M, Pierce C, Lincoln E, et al. . Preliminary validation of a patient-based self-assessment measure of severity of illness in type 2 diabetes: results from the pilot phase of the Veterans Health Study. J Ambul Care Manage 2005;28:167–76. [DOI] [PubMed] [Google Scholar]
  • 42. Gatlin PK, Insel KC. Severity of type 2 diabetes, cognitive function, and self-care. Biol Res Nurs 2015;17:540–8. 10.1177/1099800414557565 [DOI] [PubMed] [Google Scholar]
  • 43. Dunning T, Martin M. Type 2 diabetes: is it serious? Journal of Diabetes Nursing 1998;2:70–6. [Google Scholar]
  • 44. Windrum P, García-Goñi M, Coad H. The Impact of Patient-Centered versus Didactic Education Programs in Chronic Patients by Severity: The Case of Type 2 Diabetes Mellitus. Value Health 2016;19:353–62. 10.1016/j.jval.2016.01.014 [DOI] [PubMed] [Google Scholar]
  • 45. Svensson E, Berencsi K, Sander S, et al. . Association of parental history of type 2 diabetes with age, lifestyle, anthropometric factors, and clinical severity at type 2 diabetes diagnosis: results from the DD2 study. Diabetes Metab Res Rev 2016;32:308–15. 10.1002/dmrr.2721 [DOI] [PubMed] [Google Scholar]
  • 46. Powell DS, Maksoud H, Chargé SB, et al. . Apolipoprotein E genotype, islet amyloid deposition and severity of Type 2 diabetes. Diabetes Res Clin Pract 2003;60:105–10. 10.1016/S0168-8227(02)00281-4 [DOI] [PubMed] [Google Scholar]
  • 47. Jiang Q, Lyu XM, Yuan Y, et al. . Plasma miR-21 expression: an indicator for the severity of Type 2 diabetes with diabetic retinopathy. Biosci Rep 2017;37:BSR20160589 10.1042/BSR20160589 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Campbell-Tofte J, Hansen HS, Mu H, et al. . Increased lipids in non-lipogenic tissues are indicators of the severity of type 2 diabetes in mice. Prostaglandins Leukot Essent Fatty Acids 2007;76:9–18. 10.1016/j.plefa.2006.09.002 [DOI] [PubMed] [Google Scholar]
  • 49. García-Elorriaga G, Padilla-Reyes M, Cruz-Olivo F, et al. . [Pro-inflammatory cytokines related to severity and mortality in type 2 diabetes patients with soft tissue infection]. Rev Med Inst Mex Seguro Soc 2012;50:237–41. Citocinas, diabetes e infeccion de tejidos blandos. Su relacion con la severidad y mortalidad. [PubMed] [Google Scholar]
  • 50. Kontopantelis E, Buchan I, Reeves D, et al. . Relationship between quality of care and choice of clinical computing system: retrospective analysis of family practice performance under the UK’s quality and outcomes framework. BMJ Open 2013;3:e003190 10.1136/bmjopen-2013-003190 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary data

bmjopen-2017-020926supp001.pdf (478.7KB, pdf)

Reviewer comments
Author's manuscript

Articles from BMJ Open are provided here courtesy of BMJ Publishing Group

RESOURCES