Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2010 Apr 6.
Published in final edited form as: Arch Intern Med. 2008 Dec 8;168(22):2466–2473. doi: 10.1001/archinte.168.22.2466

A Simple Algorithm to Predict Incident Kidney Disease

Abhijit V Kshirsagar 1, Heejung Bang 1, Andrew S Bomback 1, Suma Vupputuri 1, David A Shoham 1, Lisa M Kern 1, Philip J Klemmer 1, Madhu Mazumdar 1, Phyllis A August 1
PMCID: PMC2849985  NIHMSID: NIHMS181868  PMID: 19064831

Abstract

Background

Despite the growing burden of chronic kidney disease (CKD), there are no algorithms (to our knowledge) to quantify the effect of concurrent risk factors on the development of incident disease.

Methods

A combined cohort (N = 14 155) of 2 community-based studies, the Atherosclerosis Risk in Communities Study and the Cardiovascular Health Study, was formed among men and women 45 years or older with an estimated glomerular filtration rate (GFR) exceeding 60 mL/min/1.73 m2 at baseline. The primary outcome was the development of a GFR less than 60 mL/min/1.73 m2 during a follow-up period of up to 9 years. Three prediction algorithms derived from the development data set were evaluated in the validation data set.

Results

The 3 prediction algorithms were continuous and categorical best-fitting models with 10 predictors and a simplified categorical model with 8 predictors. All showed discrimination with area under the receiver operating characteristic curve in a range of 0.69 to 0.70. In the simplified model, age, anemia, female sex, hypertension, diabetes mellitus, peripheral vascular disease, and history of congestive heart failure or cardiovascular disease were associated with the development of a GFR less than 60 mL/min/1.73 m2. A numeric score of at least 3 using the simplified algorithm captured approximately 70% of incident cases (sensitivity) and accurately predicted a 17% risk of developing CKD (positive predictive value).

Conclusions

An algorithm containing commonly understood variables helps to stratify middle-aged and older individuals at high risk for future CKD. The model can be used to guide population-level prevention efforts and to initiate discussions between practitioners and patients about risk for kidney disease.


Patients with chronic kidney disease (CKD) lose kidney function and over time are at risk of developing end-stage kidney disease (ESKD). Predicting individuals at risk for CKD is an important first step in modifying the progressive course of CKD. Early identification1,2 of CKD would provide the best opportunity to implement strategies known to decelerate the loss of kidney function.3-5

Epidemiological studies6-16 have identified independent risk factors for CKD, including diabetes mellitus (DM), hypertension, vascular disease, and advanced age. Yet, clinical experience has shown that individuals often have 1 or more concurrent risk factors. For our group's previous studies,17,18 a system was developed and validated to quantify the likelihood of prevalent CKD based on the presence of 1 or more risk factors. The development of an algorithm that predicts future CKD would be important to health care practitioners and to patients. To date, we are unaware of any studies that have quantified the cumulative effect of concurrent risk factors on the development of incident kidney disease in the general population.

Therefore, we derived and validated a simple risk score to predict incident kidney disease in a group of community-dwelling middle-aged and older adults. Incident CKD was defined as a glomerular filtration rate (GFR) less than 60 mL/min/1.73 m2, a level generally considered to reflect abnormal kidney function even in participants with expected age-related decline in GFR. As with the recent prediction rule for prevalent kidney disease,17 we attempted to retain the following 2 important characteristics for the new prediction rule: (1) the use of routinely available and minimally intrusive variables easily understood by lay persons and by health care practitioners and (2) estimation of the cumulative effect of concurrent risk factors on the likelihood of developing renal disease.

Methods

Study Design

This study analyzed subject-level data from 2 community-based, prospective, public-use data sets to ascertain the relationship between baseline characteristics and incident CKD. Risk scoring rules were developed based on data available in the event-free population.

Study Population

Data were combined from 2 nonconcurrent cohort studies, the Atherosclerosis Risk in Communities (ARIC) Study and the Cardiovascular Health Study (CHS) (http://www.nhlbi.nih.gov/resources/deca/datasets_obv.htm). Detailed descriptions of these 2 studies have been published previously.19,20

Briefly, ARIC enrolled 15 732 biracial participants aged 45 to 64 years between 1987 and 1989 (visit 1) from 4 communities and followed them up for a maximum of 4 visits, approximately 3 years apart, for a maximum follow-up of 9 years. The CHS recruited 5201 participants 65 years and older between 1989 and 1990 from 4 communities. Both studies recruited from 2 common communities of Forsyth County, North Carolina, and Washington County, Maryland. The 2 distinct recruiting regions selected by ARIC are suburban Minneapolis, Minnesota, and Jackson, Mississippi, whereas the CHS recruited from Sacramento, California, and Pittsburgh, Pennsylvania. Between 1992 and 1993, the CHS enrolled an additional 687 black subjects to increase minority participation. The CHS participants were followed up annually for up to 10 years.

Measurements

We chose comprehensive demographic and clinical variables related to CKD. These included age, sex, marital status, race/ethnicity, education level, smoking status, body mass index, hemoglobin levels and anemia, DM, hypertension, peripheral vascular disease (PVD), history of cardiovascular disease (CVD), and history of heart failure, as well as triglyceride level and concentrations of low-density lipoprotein (LDL) (calculated using the Friedwald equation) and high-density lipoprotein (HDL) cholesterol. Family history of hypertension and DM was also recorded in ARIC. Definitions of derived variables are available from the corresponding author on request.

In ARIC, serum creatinine level was measured at baseline and at visit 2 (3-year follow-up) and at visit 4 (9-year follow-up) using the modified kinetic Jaffe method. In the CHS, serum creatinine level was measured at baseline and at years 3 and 7 of the follow-up period by means of a colorimetric method (Kodak Ektachem 700 analyzer; Eastman Kodak Corporation, Rochester, New York). The black cohort in the CHS had only 2 serum creatinine measurements, while the original cohort study had 3 serum creatinine measurements.

Kidney function was quantified by estimated GFR from the 4-variable Modification of Diet in Renal Disease (MDRD) study function as follows21: GFR in milliliters per minute per 1.73 m2= 186 × (serum creatinine level in milligrams per deciliter)−1.154 × (age in years)−0.203 × (1.212 if black) × (0.742 if female). Because serum creatinine levels vary across clinical laboratories, creatinine data in our study were calibrated using published adjustment coefficients. In ARIC, serum creatinine levels were calibrated for the MDRD equation using the following constants: −0.24, −0.24, and 0.18 mg/dL for the creatinine levels at visits 1, 2, and 4, respectively.22 (To convert creatinine level to micromoles per liter, multiply by 88.4.) In the CHS, serum creatinine levels were calibrated with constants of −0.11 mg/dL (for the original cohort) and −0.04 mg/dL (for the CHS black cohort) for baseline,23 −0.04 for 3-year follow-up, and −0.11 for 7-year follow-up.16 Incident CKD was considered a GFR less than 60 mL/min/1.73 m2 occurring at any time during the follow-up period. This corresponds to stage 3 or higher CKD based on the Kidney Disease Outcomes and Quality Initiative guidelines.21

Statistical Analysis

The split-sample method was used for risk equation and score development and for validation.24,25 Eligible participants from the data set were randomly allocated to development and validation samples using a 2:1 ratio within each data set. Multiple logistic regression analysis was used to create a prediction model in the development data set.

Derivation of Prediction Models

We used 3 different strategies for multivariate modeling. First, we used continuous variables whenever available (listed in Table 1), aiming for efficient estimation of regression variables and minimal residual confounding with increased statistical power. All covariates were considered main effects. Backward elimination technique was used to reach the final model: a factor with the largest P value was deleted one at a time until all the predictors in the model were significant at P ≤ .05. After reaching the final parsimonious model, the significance of each of the deleted variables was tested to ensure that no covariate was erroneously omitted in this sequential process. Second, all the covariates in the final model were replaced with categorical versions that could be converted to a user-friendly integer risk score. Third, a simplified categorical model was constructed omitting less readily available variables. To assess discrimination, area under the receiver operating characteristic curve (AUC) was computed. Akaike and Bayesian information criteria were evaluated as model fit statistics,26,27 with lower values indicating better model fit. The Hosmer-Lemeshow goodness-of-fit test was also performed.28

Table 1. Baseline Characteristics of Participants by Incident Chronic Kidney Disease (CKD) Statusa.
Variable No CKD
(n = 12550)
CKD
(n = 1605)
Age, mean (SD), y 57 (9) 62 (9)
Female sex, % 56 59
White race/ethnicity, % 78 83
Married, %b 79 74
Education (≤ high school), % 20 25
Body mass index, mean (SD)c 27 (5) 28 (5)
Anemia, % 2 3
Laboratory values, mean (SD), mg/dL
 High-density lipoprotein cholesterol 53 (17) 51 (16)
 Low-density lipoprotein cholesterol 135 (37) 138 (40)
 Triglycerides 128 (79) 149 (94)
 Total cholesterol 215 (40) 220 (13)
Dyslipidemia, % 28 31
Diabetes mellitus, % 9 17
Hypertension, % 36 55
Blood pressure, mean (SD), mm Hg
 Systolic 122 (19) 132 (22)
 Diastolic 73 (11) 73 (12)
Peripheral vascular disease, % 4 9
Ankle-brachial index, mean (SD) 1.1 (0.1) 1.1 (0.2)
History of cardiovascular disease, % 8 17
History of heart failure, % 0.7 2.3
Current smoker, % 10 9.5
Serum creatinine level, mean (SD), mg/dL 0.8 (0.2) 1.0 (0.2)

SI conversion factors: To convert cholesterol level to millimoles per liter, multiply by 0.0259; creatinine level to micromoles per liter, multiply by 88.4; and triglyceride level to millimoles per liter, multiply by 0.0113.

a

Means represent continuous variables, and proportions represent categorical variables. Sample size is reduced for some variables because of missing data.

b

The Atherosclerosis Risk in Communities Study measured marital status only at visit 2 (ie, 3 years after baseline).

c

Calculated as weight in kilograms divided by height in meters squared.

Validation

Once the most parsimonious model was reached, diagnostic properties were tested in the validation data set that was not used for model building. Using the regression coefficients in the risk function, the probability of developing CKD was estimated, which allowed the establishment of a rule to characterize different degrees of risk based on cut points of the probability distribution. Basic scoring was by rounding up the regression coefficients from the multiple regression model to appropriate integers and by capturing the monotonicity of continuous risk (eg, for age).17

The prediction models were evaluated based on the following standard measures: AUC, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and percentage of persons who are identified to be at high risk by a specified selection strategy. The final cut-point selection for populations at high risk was determined using the index by Youden29 (sensitivity + specificity − 1).

For the sensitivity analysis, 2 additional analyses were performed to determine the robustness of the results. First, we performed an analysis with the outcome as a GFR less than 60 mL/min/1.7 m2 and at least a 10 mL/min change in GFR from baseline. Second, we used a Cox proportional hazards regression model for survival analysis to better account for censoring of follow-up data.

All analyses were performed using commercially available statistical software (SAS version 9.1; SAS Institute, Cary, North Carolina). Two-sided hypotheses tests with 5% type I error were adopted for all statistical inferences.

Results

Population Characteristics

At the baseline visit, ARIC had 15 732 participants and the CHS had 5888 participants, for a combined cohort of 21 620 individuals. A total of 1998 individuals were excluded from analysis because of missing data for creatinine level, GFR formula, or baseline renal insufficiency (150 missing data and 464 prevalence cases in ARIC and 172 missing data and 1212 prevalent cases in the CHS). In addition, 3823 individuals from ARIC and 1644 individuals from the CHS were excluded because there were no incident outcome data by the time of the last contact day. Therefore, the final study population consisted of 14155 participants (11 295 in ARIC and 2860 in the CHS). Of these individuals, 9470 participants were randomly selected from the original data set to form the development data set, and the remaining 4624 comprised the validation data set after excluding 61 individuals with missing covariate data.

The characteristics of the study population at baseline by event status are given in Table 1. On average, participants with CKD were almost 5 years older and less likely to have completed high school compared with participants without CKD. A greater proportion of white participants had CKD. Participants with CKD were also more likely to have DM, PVD, hypertension, and a history of CVD.

Prediction Model of Incident CKD

During the study period, 1605 participants developed incident CKD: 1123 of 11 295 (9.9%) from ARIC and 482 of 2860 (16.9%) from the CHS. Using continuous variables, the multiple logistic regression model in the development data set identified the following 10 predictors of incident CKD: age, DM, PVD, anemia, female sex, white race/ethnicity, systolic blood pressure, history of CVD, history of heart failure, and HDL cholesterol concentration (data not shown).

Table 2 gives the results of categorical variables in the full (best fitting) and simplified prediction models for incident CKD. The AUCs for the 2 models were similar, 0.70 vs 0.69, while the more comprehensive model demonstrated improved model fits as reflected in smaller Akaike and Bayesian information criteria values. The Hosmer-Lemeshow goodness-of-fit test showed no lack of fit for the 2 fitted models (P > .2). The 3 prediction equations were fitted to the full data set, and the results are available from the corresponding author on request.

Table 2. Multivariate Models for Chronic Kidney Disease in the Development Data Set Using Categorical Variablesa.

Covariate β Coefficient (SE) Odds Ratio (95% Confidence Interval) P Value Assigned Score
Best-Fitting Categorical Modelb
Age, y
 50-59 0.60 (0.12) 1.8 (1.4-2.3) <.001 1
 60-69 1.31 (0.12) 3.7 (2.9-4.7) <.001 2
 ≥70 1.46 (0.14) 4.3 (3.3-5.6) <.001 3
White race/ethnicityc 0.41 (0.09) 1.5 (1.3-1.8) <.001 1
Female sex 0.23 (0.07) 1.3 (1.1-1.5) .001 1
Anemia 0.58 (0.20) 1.8 (1.2-2.7) .004 1
Hypertension 0.61 (0.07) 1.8 (1.6-2.1) <.001 1
Diabetes mellitus 0.32 (0.10) 1.4 (1.1-1.7) .001 1
History of cardiovascular disease 0.25 (0.10) 1.3 (1.0-1.5) .03 1
History of heart failured 0.51 (0.25) 1.7 (1.0-2.7) .04 1
Low high-density lipoprotein cholesterol levele 0.28 (0.08) 1.3 (1.1-1.6) <.001 1
Peripheral vascular disease (circulation problem in legs) 0.42 (0.13) 1.5 (1.2-2.0) .001 1

Simplified Categorical Modelf
Age, y
 50-59 0.63 (0.12) 1.9 (1.5-2.4) <.001 1
 60-69 1.33 (0.12) 3.8 (3.0-4.8) <.001 2
 ≥70 1.46 (0.14) 4.3 (3.3-5.6) <.001 3
Female sex 0.13 (0.07) 1.1 (1.0-1.3) .05 1
Anemia 0.48 (0.20) 1.6 (1.1-2.4) .02 1
Hypertension 0.55 (0.07) 1.7 (1.5-2.0) <.001 1
Diabetes mellitus 0.33 (0.10) 1.4 (1.2-1.7) <.001 1
History of cardiovascular disease 0.26 (0.10) 1.3 (1.1-1.6) .009 1
History of heart failured 0.50 (0.25) 1.6 (1.0-2.7) .04 1
Peripheral vascular disease (circulation problem in legs) 0.41 (0.13) 1.5 (1.2-1.9) .002 1
a

The total sample comprised 9470 subjects. The development data set was formed with two-thirds randomly selected from the Atherosclerosis Risk in Communities Study and with two-thirds randomly selected from the Cardiovascular Health Study.

b

Area under the receiver operating characteristic curve, 0.70; Akaike information criterion, 6262; and Bayesian information criterion, 6355. Smaller Akaike and Bayesian information criteria indicate a better model fit.

c

In the Cardiovascular Health Study, white subjects (on average) had a longer follow-up period than black subjects.

d

Because status for only the past 2 weeks was ascertained, decreased power is expected.

e

High-density lipoprotein cholesterol level of 40 mg/dL or less (to convert cholesterol level to millimoles per liter, multiply by 0.0259).

f

Area under the receiver operating characteristic curve, 0.69; Akaike information criterion, 6295; and Bayesian information criterion, 6374.

Validation

Diagnostic characteristics of the full (best fitting) and simplified categorical models using validation data are given in Table 3. For both models, the sensitivity and NPV of the model decreased, while its specificity and PPV increased as the event rate estimated by the prediction model increased, corresponding to higher total scores derived. For example, for a score of 6 or higher in the full model, which defined about 10% of participants tested as high risk, the sensitivity was 25% and the specificity 93%, with the actual risk of CKD (ie, the PPV) being 29%. At the other (lower) end, a score of 3 or higher defined almost 74% of participants tested at high risk and yielded high sensitivity (91%) and low specificity (28%), with actual risk (PPV) of 14%. The NPV remained higher than 90% for all thresholds examined. The Youden index identified a score of 5 or higher for the full model and a score of 3 or higher for the simplified model as the cut points that yielded the highest level of test accuracy.

Table 3. Diagnostic Characteristics of the Screening Rules in the Validation Data Seta.

Screening Rule Identified, %b Sensitivity Specificity Positive Predictive Value Negative Predictive Value
Probability of chronic kidney disease categoricalc
 ≥0.17 20 42 83 24 92
 ≥0.13 30 60 70 20 93
 ≥0.12 40 68 62 19 94
 ≥0.09 50 75 53 17 94
Total score (best-fitting) categoricald
 ≥6 10 25 93 29 91
 ≥5 24 49 80 23 92
 ≥4 46 71 57 17 94
 ≥3 74 91 28 14 96
Total score simplified categoricale
 ≥5 8 20 93 28 90
 ≥4 23 45 80 22 92
 ≥3 45 69 58 17 94
 ≥2 73 89 29 14 96
a

The total sample comprised 4624 subjects. The validation data set was formed with the remaining one-third from the Atherosclerosis Risk in Communities Study and with the remaining one-third from the Cardiovascular Health Study.

b

Percentage of participants who met the condition specified in the first column.

c

Estimated probability from logistic regression model. Area under the receiver operating characteristic curve, 0.70; Akaike information criterion, 3002; and Bayesian information criterion, 3015.

e

Area under the receiver operating characteristic curve, 0.70; Akaike information criterion, 3013; and Bayesian information criterion, 3026.

e

Area under the receiver operating characteristic curve, 0.68; Akaike information criterion, 3053; and Bayesian information criterion, 3066.

Figure 1 shows the corresponding receiver operating characteristic curves from 3 different modeling strategies. Figure 2 shows the proportions of participants who progressed to CKD during the study period by total risk scores in the 2 categorical models in the development and validation data sets. The estimated event rates from the combined data set are given in a user-friendly format in Table 4.

Figure 1.

Figure 1

Receiver operating characteristic curves for the 3 prediction functions in the validation data set. Continuous, categorical, and categorical S indicate the best-fitting continuous model, best-fitting categorical model, and simplified categorical models, respectively.

Figure 2.

Figure 2

Event rate by total score in the development and validation data sets. A, Best-fitting categorical model. B, Simplified categorical model. Light grey indicates the development data set and dark grey the validation data set. Some categories were combined because of very small sample size. CKD indicates chronic kidney disease.

Table 4. Risk Prediction Table for Up to 10 Years.

Event Rate, %b

Total Scorea Best-Fitting Categorical Simplified Categorical
≤1 ≤2 ≤5
2 5 8
3 8 13
4 13 20
5 20 25
6 25 30
7 30 35
≥8 ≥40 ≥50
a

Computed from Table 2.

b

Estimated from all samples. Some event rates were rounded to more intuitive values (eg, 26.4 to 25 and 72.9 to 70).

For the sensitivity analysis, the use of a definition of incident disease that included a GFR less than 60 mL/min/1.73 m2 and at least a 10 mL/min decrease in GFR from baseline did not markedly change the approximate 0.70 AUC of the model. Cox proportional hazards regression analysis for censored data also yielded results similar to those of logistic regression analysis, including the identical scoring system (data not shown).

Comment

We developed a simple scoring algorithm that stratifies persons at risk of developing clinically significant CKD. This prediction rule, which we believe to be the first of its kind, translates a parsimonious set of medical and demographic characteristics into a mean likelihood of developing CKD among middle-aged and older adults with 4 to 9 years of follow-up. These characteristics are often present together and cumulatively affect the risk of kidney disease. Most of the characteristics—age, DM, hypertension, and CVD (divided into PVD, heart failure, and coronary artery disease)—are easily identified by health care practitioners and by the general public. Most other variables (eg, serum lipid and hemoglobin levels) are also frequently checked by health care practitioners.

Risk scores have recently been emphasized as practical tools to help stratify individuals at increased risk of having kidney disease.1,2 In previous studies,17,18 a prediction rule was developed and validated that identified individuals likely to have existing CKD. That analysis used cross-sectional data and could not predict future disease. Herein, we report a tool for prediction of kidney disease akin to the Framingham risk score, a widely used cardiovascular scoring system that is used extensively to stratify individuals and guide therapy in diverse settings. Clinicians can use our scoring system to communicate expected risk among patients and to facilitate discussions about possible preventive strategies. This algorithm also has potential public health applications. It can be posted on medical Web sites for the public to access or may be used in community settings to identify individuals who may wish to be referred to health care practitioners. The identification of high-risk individuals using this scoring system can also optimize the benefit and cost-effectiveness of targeted screening and monitoring. At the least, the prediction rule can be used in concert with other public health initiatives to increase the awareness of CKD, which has been traditionally low.30,31

The 2 categorical models, the best fitting and the simplified, were comparable in terms of discrimination (ie, distinguishing cases vs noncases), with the best-fitting model having greater statistical fit. When information on laboratory variables such as HDL cholesterol concentration is unavailable, the use of the simplified model would be appropriate. Based on the diagnostic characteristics, we recommend using a cut point of 5 with the best-fitting model and a cut point of 3 with the simplified model. Because the score predicts future events, the immediate response to a high test score would likely involve increased frequency of screening, intensive management of risk factors (DM and hypertension), and possibly lifestyle modifications, as well as further laboratory testing. In our opinion, the benefits of such interventions far outweigh their minimal risk (false-positive prediction of renal disease) and may simultaneously serve to reduce cardiovascular risk.

In this investigation, we purposefully chose to define CKD for the prediction equation using a GFR less than 60 mL/min/1.73m2 (rather than <90 mL/min/1.73m2) for several reasons. First, we wanted to minimize the detection of individuals with age-related physiologic decline in kidney function. Nevertheless, age adds the greatest predictive capability of all the variables; in fact, the other variables add about 5% additional predictive capacity to the equation. Second, it has become increasingly clear to practicing clinicians that a GFR of 60 mL/min/1.73m2 represents a practical threshold for clinical action. Third, the MDRD estimation formula, derived among individuals with a baseline GFR less than 60 mL/min/1.73m2, is most accurate for individuals with stage 3 or higher CKD.32

We excluded the baseline level of kidney function in the prediction model because of clinical and methodological concerns. Clinically, the algorithm is designed to help health care practitioners and potential patients by focusing on variables that are available without prior serologic testing. Methodologically, the use of an estimated GFR term as a potential exposure variable and as an outcome would cause colinearity, potentially introducing bias.

The finding of increased risk of CKD among white subjects is noteworthy. While there are well-documented ethnic/racial differences in incident and prevalent ESKD33 and in different stages of CKD, some racial/ethnic differences observed in the prevalence of CKD may be related to variation in the rates of progression among black subjects and white subjects. In National Health and Nutrition Examination Survey (NHANES) III and NHANES 1999-2000 data, black subjects had a lower age-adjusted prevalence of CKD than white subjects.31 In the NHANES 1999-2002 data used in this study,31 the prevalence of CKD was similarly higher among white subjects compared with black subjects. Baseline results from the renal Reasons for Geographic and Racial Difference in Stroke cohort support these findings.34 Furthermore, in the present CHS data set, fewer follow-up visits for black subjects may translate into fewer opportunities to capture incident cases of CKD. Speculatively, the competing risk of death may be greater than the risk of CKD among black subjects. Finally, different statistical weights are assigned to the 2 racial/ethnic groups in the MDRD formula.

There are several important limitations to our study. First, no qualitative or quantitative urinary indexes were available at the baseline visit. Unlike a previous study17 that predicted prevalent renal disease, this analysis could not include proteinuria as a variable. Low levels of proteinuria and hematuria are important clues to the presence of underlying kidney disease, particularly glomerulonephritis and diabetic nephropathy and sometimes autosomal dominant polycystic kidney disease. Therefore, it is possible that participants in ARIC or the CHS with preserved renal function but who had proteinuria, hematuria, or both were included in the baseline cohort of participants. Nevertheless, our scoring system would still be useful in identifying those individuals with urinary abnormalities who are at risk for deterioration in GFR (eg, those with DM, hypertension, and vascular disease).

A second limitation of the scoring system is our inability to include family history of kidney disease in the model. Most epidemiological studies (eg, ARIC, CHS, Framingham, and NHANES) do not include questions about family history of CKD. More recent studies such as the Kidney Early Evaluation Program35 have begun surveying family history among targeted populations with a high burden of ESKD. In the future, we anticipate that national or community-based surveys will add family history of kidney disease to better assess its effect on CKD. In studies36,37 of ESKD, family history has been demonstrated to modify the effect of DM and hypertension; therefore, inclusion of family history may alter our prediction model.

Third, we based our diagnosis of CKD on a single estimate of GFR, which we acknowledge tends to overestimate the incidence of kidney disease. Estimated GFR measurements exhibit a high degree of intraindividual variability and ideally require second measurements to accurately represent kidney function.38 The use of successive GFR measurements, had they been available, would likely have reduced the incidence of CKD but should not have affected the association of the predictor variables with the outcome. Furthermore, most studies of CKD, epidemiologic and interventional, use single serum creatinine measurements.

Fourth, this risk prediction model applies primarily to groups defined by a parsimonious set of clinically relevant variables rather than directly to individuals. This is a limitation common to all risk prediction models.39 Indeed, prevention of CKD and ESKD may require population-based interventions that are beyond the control of individual physicians and patients (eg, the risk contribution of CVD).40 We caution that our risk prediction rule serves only as a guideline and should not be taken as an absolute definition of high risk.

Several strengths of the study are worthy of emphasis. This analysis uses 2 well-studied and representative community-based cohorts (ARIC and CHS), albeit with somewhat different follow-up periods. In addition, the complementary age of the participants of the 2 cohorts provides an age range that mirrors the age range of most individuals who are at risk of developing CKD. Finally, to our knowledge, this is the first prediction model for incident CKD.

Reaching ESKD represents a serious health event that may be significantly delayed or even prevented.3-5 The global burden of CKD is growing,41-43 with the incidence of ESKD more than doubling in Europe and the United States during the past 2 decades.33,44 Our study demonstrates that the risk of incident CKD is predictable with good accuracy in a middle-aged or older population bearing major risk factors for CKD. In clinical practice, the use of our scoring system would allow clinicians to easily stratify the renal disease risk of individual patients. Patients may also calculate their risk scores and probabilities of developing this asymptomatic disease during a 10-year period and query their primary care practitioners about their renal function. Furthermore, this scoring system may provide guidance for policymakers by calling attention to the multiple conditions that contribute to rising CKD and ESKD prevalences. Additional studies and validation of this prediction rule in various real-world settings (eg, high vs low risk, clinical vs community settings, and among different ethnic groups) will be necessary to rigorously assess its usefulness.

Acknowledgments

Funding/Support: This study was supported by the University of North Carolina Kidney Center and by Clinical and Translational Science Award UL1-RR024996 to Drs Bang and Mazumdar.

Footnotes

Author Contributions: Study concept and design: Kshirsagar and Bang. Analysis and interpretation of data: Kshirsagar, Bang, Vupputuri, Shoham, Kern, and Mazumdar. Drafting of the manuscript: Kshirsagar, Bang, and Bomback. Critical revision of the manuscript for important intellectual content: Kshirsagar, Bomback, Vupputuri, Shoham, Klemmer, Mazumdar, and August. Statistical analysis: Bang, Vupputuri, Shoham, and Mazumdar. Obtained funding: Bang and Mazumdar. Study supervision: Kshirsagar and Bang.

Additional Contributions: Kylie Braynt, MS, provided CHS data setup, and Sean Coady, MA, assisted with data issues and clarifications. The staff and participants of ARIC and the CHS contributed valuable information for health research.

Financial Disclosure: None reported.

Publisher's Disclaimer: Disclaimer: The ARIC and the CHS are conducted and supported by the National Heart, Lung, and Blood Institute (NHLBI) in collaboration with ARIC and CHS investigators, respectively. The manuscript was prepared using limited access data sets obtained from the NHLBI, and the conclusions do not necessarily reflect the opinions or views of ARIC, the CHS, or the NHLBI.

References

  • 1.Gansevoort RT, Bakker SJL, de Jong PE. Early detection of progressive chronic kidney disease: is it feasible? J Am Soc Nephrol. 2006;17(5):1218–1220. doi: 10.1681/ASN.2006030247. [DOI] [PubMed] [Google Scholar]
  • 2.Taal MW, Brenner BM. Predicting initiation and progression of chronic kidney disease: developing renal risk scores. Kidney Int. 2006;70(10):1694–1705. doi: 10.1038/sj.ki.5001794. [DOI] [PubMed] [Google Scholar]
  • 3.Diabetes Control and Complications Trial Research Group. The effect of intensive treatment of diabetes on the development and progression of long-term complications in insulin-dependent diabetes mellitus. N Engl J Med. 1993;329(14):977–986. doi: 10.1056/NEJM199309303291401. [DOI] [PubMed] [Google Scholar]
  • 4.Writing Team for the Diabetes Control and Complications Trial/Epidemiology of Diabetes Interventions and Complications Research Group. Sustained effect of intensive treatment of type 1 diabetes mellitus on development and progression of diabetic nephropathy: the Epidemiology of Diabetes Interventions and Complications (EDIC) study. JAMA. 2003;290(16):2159–2167. doi: 10.1001/jama.290.16.2159. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Jafar TH, Schmid CH, Landa M, et al. Angiotensin-converting enzyme inhibitors and progression of nondiabetic renal disease: a meta-analysis of patient-level data. Ann Intern Med. 2001;135(2):73–87. doi: 10.7326/0003-4819-135-2-200107170-00007. [DOI] [PubMed] [Google Scholar]
  • 6.Levey AS, Coresh J, Balk E, et al. National Kidney Foundation practice guidelines for chronic kidney disease: evaluation, classification, and stratification [published correction appears in Ann Intern Med. 2003;139(7):605] Ann Intern Med. 2003;139(2):137–147. doi: 10.7326/0003-4819-139-2-200307150-00013. [DOI] [PubMed] [Google Scholar]
  • 7.McClellan WM, Flanders WD. Risk factors for progressive chronic kidney disease. J Am Soc Nephrol. 2003;14(7) 2:S65–S70. doi: 10.1097/01.asn.0000070147.10399.9e. [DOI] [PubMed] [Google Scholar]
  • 8.O'Hare AM, Glidden DV, Fox CS, Hsu CY. High prevalence of peripheral arterial disease in persons with renal insufficiency: results from the National Health and Nutrition Examination Survey 1999-2000. Circulation. 2004;109(3):320–323. doi: 10.1161/01.CIR.0000114519.75433.DD. [DOI] [PubMed] [Google Scholar]
  • 9.Muntner P, Coresh J, Smith JC, Eckfeldt J, Klag MJ. Plasma lipids and risk of developing renal dysfunction: the Atherosclerosis Risk in Communities Study. Kidney Int. 2000;58(1):293–301. doi: 10.1046/j.1523-1755.2000.00165.x. [DOI] [PubMed] [Google Scholar]
  • 10.Schaeffner ES, Kurth T, Curhan GC, et al. Cholesterol and the risk of renal dysfunction in apparently healthy men. J Am Soc Nephrol. 2003;14(8):2084–2091. doi: 10.1681/ASN.V1482084. [DOI] [PubMed] [Google Scholar]
  • 11.Fox CS, Larson MG, Leip EP, Culleton B, Wilson PW, Levy D. Predictors of new-onset kidney disease in a community-based population. JAMA. 2004;291(7):844–850. doi: 10.1001/jama.291.7.844. [DOI] [PubMed] [Google Scholar]
  • 12.Kurella M, Lo JC, Chertow GM. Metabolic syndrome and the risk of chronic kidney disease among nondiabetic adults. J Am Soc Nephrol. 2005;16(7):2134–2140. doi: 10.1681/ASN.2005010106. [DOI] [PubMed] [Google Scholar]
  • 13.O'Hare AM, Rodriguez RA, Bacchetti P. Low ankle-brachial index associated with a rise in creatinine level over time: results from the Atherosclerosis Risk in Communities study. Arch Intern Med. 2005;165(13):1481–1485. doi: 10.1001/archinte.165.13.1481. [DOI] [PubMed] [Google Scholar]
  • 14.Merkin SS, Coresh J, Roux AV, Taylor HA, Powe NR. Area socioeconomic status and progressive CKD: the Atherosclerosis Risk in Communities (ARIC) study. Am J Kidney Dis. 2005;46(2):203–213. doi: 10.1053/j.ajkd.2005.04.033. [DOI] [PubMed] [Google Scholar]
  • 15.Gelber RP, Kurth T, Kausz AT, et al. Association between body mass index and CKD in apparently healthy men. Am J Kidney Dis. 2005;46(5):871–880. doi: 10.1053/j.ajkd.2005.08.015. [DOI] [PubMed] [Google Scholar]
  • 16.Elsayed EF, Tighiouart H, Griffith J, et al. Cardiovascular disease and subsequent kidney disease. Arch Intern Med. 2007;167(11):1130–1136. doi: 10.1001/archinte.167.11.1130. [DOI] [PubMed] [Google Scholar]
  • 17.Bang H, Vupputuri S, Shoham DA, et al. Screening for Occult Renal Disease (SCORED): a simple prediction model for chronic kidney disease. Arch Intern Med. 2007;167(4):374–381. doi: 10.1001/archinte.167.4.374. [DOI] [PubMed] [Google Scholar]
  • 18.Bang H, Mazumdar M, Kern LM, et al. Validation and comparison of a novel screening guideline for kidney disease: KEEPing SCORED. Arch Intern Med. 2008;168(4):432–435. doi: 10.1001/archinternmed.2007.122. [DOI] [PubMed] [Google Scholar]
  • 19.ARIC Investigators. The Atherosclerosis Risk in Communities (ARIC) study: design and objectives. Am J Epidemiol. 1989;129(4):687–702. [PubMed] [Google Scholar]
  • 20.Fried LP, Borhani NO, Enright P, et al. The Cardiovascular Health Study: design and rationale. Ann Epidemiol. 1991;1(3):263–276. doi: 10.1016/1047-2797(91)90005-w. [DOI] [PubMed] [Google Scholar]
  • 21.National Kidney Foundation. K/DOQI clinical practice guidelines for chronic kidney disease: evaluation, classification, and stratification. Am J Kidney Dis. 2002;39(2) 1:S1–S266. [PubMed] [Google Scholar]
  • 22.Coresh J, Astor BC, McQuillan G, et al. Calibration and random variation of the serum creatinine assay as critical elements of using equations to estimate glomerular filtration rate. Am J Kidney Dis. 2002;39(5):920–929. doi: 10.1053/ajkd.2002.32765. [DOI] [PubMed] [Google Scholar]
  • 23.Weiner DE, Tighiouart H, Amin MG, et al. Chronic kidney disease as a risk factor for cardiovascular disease and all-cause mortality: a pooled analysis of community-based studies. J Am Soc Nephrol. 2004;15(5):1307–1315. doi: 10.1097/01.asn.0000123691.46138.e2. [DOI] [PubMed] [Google Scholar]
  • 24.Mazumdar M, Smith A, Bacik J. Methods for categorizing a prognostic variable in a multivariable setting. Stat Med. 2003;22(4):559–571. doi: 10.1002/sim.1333. [DOI] [PubMed] [Google Scholar]
  • 25.Schmidt MI, Duncan BB, Bang H, et al. Identifying individuals at high risk for diabetes: the Atherosclerosis Risk in Communities study. Diabetes Care. 2005;28(8):2013–2018. doi: 10.2337/diacare.28.8.2013. [DOI] [PubMed] [Google Scholar]
  • 26.Akaike H. A new look at statistical model identification. IEEE Trans Automat Contr. 1974;19:716–723. [Google Scholar]
  • 27.Schwarz G. Estimating the dimension of a model. Ann Stat. 1978;6:461–464. [Google Scholar]
  • 28.Hosmer DW, Lemeshow S. Applied Logistic Regression. New York, NY: John Wiley & Sons Inc; 2000. [Google Scholar]
  • 29.Youden WJ. Index for rating diagnostic tests. Cancer. 1950;3(1):32–35. doi: 10.1002/1097-0142(1950)3:1<32::aid-cncr2820030106>3.0.co;2-3. [DOI] [PubMed] [Google Scholar]
  • 30.Nickolas TL, Frisch GD, Opotowsky AR, Arons R, Radhakrishnan J. Awareness of kidney disease in the US population: findings from the National Health and Nutrition Examination Survey (NHANES) 1999 to 2000. Am J Kidney Dis. 2004;44(2):185–197. doi: 10.1053/j.ajkd.2004.04.023. [DOI] [PubMed] [Google Scholar]
  • 31.Coresh J, Byrd-Holt D, Astor BC, et al. Chronic kidney disease awareness, prevalence, and trends among U.S. adults, 1999 to 2000. J Am Soc Nephrol. 2005;16(1):180–188. doi: 10.1681/ASN.2004070539. [DOI] [PubMed] [Google Scholar]
  • 32.Rule AD, Larson TS, Bergstralh EJ, Slezak JM, Jacobsen SJ, Cosio FG. Using serum creatinine to estimate glomerular filtration rate: accuracy in good health and in chronic kidney disease. Ann Intern Med. 2004;141(12):929–937. doi: 10.7326/0003-4819-141-12-200412210-00009. [DOI] [PubMed] [Google Scholar]
  • 33.U.S. Renal Data System. USRDS 2006 Annual Data Report: Atlas of End-Stage Renal Disease in the United States. Bethesda, MD: National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health; 2006. [Google Scholar]
  • 34.McClellan W, Warnock DG, McClure L, et al. Racial differences in the prevalence of chronic kidney disease participants in the Reasons for Geographic and Racial Difference in Stroke (REGARDS) cohort study. J Am Soc Nephrol. 2006;17(6):1710–1715. doi: 10.1681/ASN.2005111200. [DOI] [PubMed] [Google Scholar]
  • 35.National Kidney Foundation. Kidney Early Evaluation Program. Am J Kidney Dis. 2005;45(2) 2:S1–S135. [PubMed] [Google Scholar]
  • 36.Freedman BI, Bowden DW, Rich SS, Appel RG. Genetic initiation of hypertensive and diabetic nephropathy. Am J Hypertens. 1998;11(2):251–257. doi: 10.1016/s0895-7061(97)00481-0. [DOI] [PubMed] [Google Scholar]
  • 37.Seaquist ER, Goetz FC, Rich S, Barbosa J. Familial clustering of diabetic kidney disease: evidence for genetic susceptibility to diabetic nephropathy. N Engl J Med. 1989;320(18):1161–1165. doi: 10.1056/NEJM198905043201801. [DOI] [PubMed] [Google Scholar]
  • 38.Perrone RD, Madias NE, Levey AS. Serum creatinine as an index of renal function: new insights into old concepts. Clin Chem. 1992;38(10):1933–1953. [PubMed] [Google Scholar]
  • 39.Rockhill B. The privatization of risk. Am J Public Health. 2001;91(3):365–368. doi: 10.2105/ajph.91.3.365. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Magnus P, Beaglehole R. The real contribution of the major risk factors to the coronary epidemics: time to end the “only-50%” myth. Arch Intern Med. 2001;161(22):2657–2660. doi: 10.1001/archinte.161.22.2657. [DOI] [PubMed] [Google Scholar]
  • 41.Hamer RA, El Nahas AM. The burden of chronic kidney disease. BMJ. 2006;332(7541):563–564. doi: 10.1136/bmj.332.7541.563. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Jafar TH. The growing burden of chronic kidney disease in Pakistan. N Engl J Med. 2006;354(10):995–997. doi: 10.1056/NEJMp058319. [DOI] [PubMed] [Google Scholar]
  • 43.Barsoum RS. Chronic kidney disease in the developing world. N Engl J Med. 2006;354(10):997–999. doi: 10.1056/NEJMp058318. [DOI] [PubMed] [Google Scholar]
  • 44.van Dijk PC, Jager KJ, de Charro F, et al. ERA-EDTA Registry. Renal replacement therapy in Europe: the results of a collaborative effort by the ERA-EDTA registry and six national or regional registries. Nephrol Dial Transplant. 2001;16(6):1120–1129. doi: 10.1093/ndt/16.6.1120. [DOI] [PubMed] [Google Scholar]

RESOURCES