Skip to main content
PLOS One logoLink to PLOS One
. 2021 Apr 12;16(4):e0249084. doi: 10.1371/journal.pone.0249084

Recommending encounters according to the sociodemographic characteristics of patient strata can reduce risks from type 2 diabetes

Han Ye 1,*, Ujjal Kumar Mukherjee 1, Dilip Chhajed 2, Jason Hirsbrunner 3, Collin Roloff 3
Editor: Antonio Palazón-Bru4
PMCID: PMC8041209  PMID: 33844693

Abstract

Objectives

Physician encounters with patients with type 2 diabetes act as motivation for self-management and lifestyle adjustments that are indispensable for diabetes treatment. We elucidate the sociodemographic sources of variation in encounter usage and the impact of encounter usage on glucose control, which can be used to recommend encounter usage for different sociodemographic strata of patients to reduce risks from Type 2 diabetes.

Data and methods

We analyzed data from a multi-facility clinic in the Midwestern United States on 2124 patients with type 2 diabetes, from 95 ZIP codes. A zero-inflated Poisson model was used to estimate the effects of various ZIP-code level sociodemographic variables on the encounter usage. A multinomial logistic regression model was built to estimate the effects of physical and telephonic encounters on patients’ glucose level transitions. Results from the two models were combined in marginal effect analyses.

Results and conclusions

Conditional on patients’ clinical status, demographics, and insurance status, significant inequality in patient encounters exists across ZIP codes with varying sociodemographic characteristics. One additional physical encounter in a six-month period marginally increases the probability of transition from a diabetic state to a pre-diabetic state by 4.3% and from pre-diabetic to the non-diabetic state by 3.2%. Combined marginal effect analyses illustrate that a ZIP code in the lower quartile of high school graduate percentage among all ZIP codes has 1 fewer physical encounter per six months marginally compared to a ZIP code at the upper quartile, which gives 5.4% average increase in the probability of transitioning from pre-diabetic to diabetic. Our results suggest that policymakers can target particular patient groups who may have inadequate encounters to engage in diabetes care, based on their immediate environmental sociodemographic characteristics, and design programs to increase their encounters to achieve better care outcomes.

Introduction

In the United States, it is estimated that 30.3 million people (9.4% of the U.S. population) had diabetes in 2015 [1]. The incidence rate of diabetes increased with age, and reached 25.2% among those aged 65 or older [1]. The estimated total cost, including direct medical cost and indirect cost caused by loss of productivity, due to diabetes in 2017 was $327 billion [2]. Type 2 diabetes accounts for 90% to 95% of all diabetes cases [1]. The development of type 2 diabetes is correlated with lifestyle factors such as exercise, weight, nutrition, stress, and urbanization. Type 2 diabetes requires long-term continued care, and patients’ engagement in the care process is key to disease management [3,4]. Managing type 2 diabetes requires patients to stay informed from doctors about their medical conditions and treatment practice changes, and get educated about how to control glucose levels and deal with potential complications. Additionally, patients need to routinely self-monitor their glucose levels and may need to take medications in a timely manner. The Chronic Care Model proposed by the Institute for Healthcare Improvement, an independent nonprofit organization, identifies productive encounters between prepared healthcare practice teams and informed and activated patients as the central tenet in managing chronic diabetes and reducing the population-level economic and healthcare burden from diabetes [5].

Both physical and telephonic encounters play an important role in engaging patients in the care process for type 2 diabetes [68]. During physical encounters, patients and physicians can meet and discuss patients’ medical concerns [911]. Since type 2 diabetes may progress over time, physicians can order tests and update patients’ clinical conditions during each physical encounter and adjust treatment plans. Physical encounters are also great opportunities for raising awareness of disease and self-management of disease. Telephonic encounters are helpful for patients and care providers to communicate with each other, and often play the role of follow-ups of physical encounters for checking health status and the effectiveness of treatment plans, and understanding any concerns or complications from diabetes [1214].

The scheduling of physical and telephonic encounters largely depends on patients’ clinical conditions [15]. Patients with severe conditions (high blood glucose levels) need to be monitored and meet with physicians more frequently, while patients with mild illness see doctors less often [16]. The insurance status of patients also plays a critical role when deciding encounter frequencies and clinical tests [17]. Patients who pay less out of their pocket for each encounter may tend to schedule more visits in a fixed time period. Moreover, scheduling an appointment is not the same as the actual appointment since patients’ adherence to schedule affects the actual number of encounters patients receive. The adherence to schedule may vary from patient to patient.

According to American Diabetes Association, blood sugar levels can be measured by Hemoglobin A1c (A1C), Fasting Plasma Glucose (FPG), Oral Glucose Tolerance Test (OGTT), etc. The current medical diagnosis standard classifies a patient’s glucose level into three states: Diabetic (A1C ≥ 6.5%; FPG≥126 mg/dl; OGTT≥200 mg/dl), Pre-diabetic (5.7%≤A1C < 6.5%; 100mg/dl≤FPG<126 mg/dl; 140 mg/dl≤OGTT<200 mg/dl), and Normal (A1C<5.7%; FPG<100 mg/dl; OGTT<140 mg/dl). The glucose level of a patient with type 2 diabetes may fall in any of the three states at a time point, depending on various factors such as self-management and efficacy of treatments. The change in a patient’s glucose value over a time period is often used as a measure of glucose control [1820]. The transition among the three states of glucose over time periods of a patient reflects disease progress and the effectiveness of disease management. For instance, a transition from Diabetic to Normal in a six-month time period indicates effective disease control, while a transition from Normal to Diabetic indicates disease worsening.

To treat type 2 diabetes, a better understanding of the effects of physical and telephonic encounters on glycemic control (measured by transitions of glucose state) can help policymakers to more efficiently allocate limited capacity of encounters across different patient populations [7]. Inequality of patient encounter is defined as the variation in the frequency of encounter across patients under similar health conditions. Additionally, investigating what factors contribute to the inequalities of encounters across different patient populations can help healthcare providers target specific groups with an elevated risk of high glucose levels. Increasing encounters among the targeted groups may improve their quality of care significantly.

In this article, we report findings from a study that examined sources of inequalities in physical and telephonic encounters of patients with type 2 diabetes at a multi-facility clinic in Illinois. We build two statistical models to estimate: (1) the effects of patients’ environmental sociodemographic variables on their encounter utilizations; (2) the effects of telephonic and physical encounters on patients’ glucose transitions. The results from the two models can help policymakers target specific patient groups with insufficient encounters, and better allocate encounter capacity at community health centers, clinics, and hospitals to improve diabetes outcomes. Furthermore, we demonstrate complementarities between physical and telephonic encounters, which can be used to target the specific type of encounters for specific patient groups.

Study data and methods

The data, collected at Christie Clinic in central Illinois, USA, ranges from 02/01/2013 to 12/21/2015 and includes de-identified electronic medical records (EMR) of 10,235 patients with an ICD-10 code related to Diabetes Mellitus. The data set was assessed on 04/25/2016, after the EMR data were fully anonymized. In the data set, there are two metrics that measure patients’ glycemic levels: A1C and FPG. All the patients in the sample had their FPG measure as well as A1C measure. However, FPG measures were taken on patients at regular intervals and the A1C measures, which are more robust, were taken at a less frequent interval. Therefore, we used the FPG measure to model the diabetes status of patients. We used the ICD-10 codes to select the final sample for data analysis. The primary disease codes for all types of diabetes are E08, E09, E10, E11, O24, and E13. However, we removed all patients whose disease codes are E08, E09, or E10, which indicate diabetes due to other underlying conditions (23.3% of patients), drug-induced diabetes (13.1% of patients), and Type I diabetes (4.4% of patients). Furthermore, we removed all ICD codes O24 (pre-natal diabetes) and E13 (miscellaneous condition related diabetes), totaling 8.8% of the patients. Patients with Type II diabetes due to other underlying conditions are not usually chronic, and the diabetes usually subsides after the underlying condition is cured. Furthermore, we are interested in observing the effect of continued and regular encounters, both physical and telephonic, on diabetes outcome. Therefore, we removed all patients who do not have sufficient records in the dataset. For example, many patients joined the system only towards the end of the observation period and do not have sufficient records to warrant analysis. In addition, some patients dropped out in the middle of the observation period due to deaths, mobility, or other personal causes. We retained all patients who had records for more than 80% of the observation timeframe. A total of 29.6% of the data was removed due to the non-availability of data. A total of 2124 patients (20.8% of total initial record) were retained in the final sample of patients. We checked that patients removed from the data due to unavailability of sufficient observations do not have any systematic gender or ZIP-code based variation. These 20.8% of the patients corresponds to 46.3% of the total number of encounters. The time range of the study is divided into 6-month time periods. For each patient at each time period, the data set includes the following variables: glucose (mg/dl), LDL cholesterol (mg/dl), age (years), gender, number of physical encounters, number of telephonic encounters, health insurance policy, ZIP-code of patient home.

It has been shown in extant research that a patient’s sociodemographic characteristics (such as income, education, and race) are correlated with both their engagement as well as healthcare outcome [2128]. Therefore, it is important to incorporate patients’ sociodemographic characteristics in the study, since they can be significant explanatory variables and/or confounders for encounters and healthcare outcomes. However, the clinic does not collect individual patients’ socioeconomic information. Indeed, patients’ socioeconomic context is rarely asked and documented in healthcare systems due to various privacy concerns [29]. Therefore, we collate each patient’s ZIP-code as a proxy, which partly reflects their living environment and socioeconomic status with the ZIP-code level sociodemographic information acquired from the 2015 US Census. The ZIP-code level sociodemographic variables include population, annual income, percentage of high school graduates, percentage of college graduates, and race distribution such as percentage of white and percentage of African American.

We first carry out an exploratory clustering analysis, and then build two statistical models motivated by the clustering results: one identifies factors that result in the inequality of patient encounters, while the other estimates the effect of encounters on patients’ glucose transitions. Combining the two models together, healthcare providers can identify patients who are likely to have insufficient encounters, and predict the health implications of such a lack of encounters. We elaborate on the clustering analysis and the two models below.

Exploratory analysis

We first carry out an exploratory data analysis, in which we use three sets of variables to cluster patients with K-means clustering methods [30]. Clustering of patients or locations have been widely used in clinical and healthcare literature to make broad generalizable observations and analysis [31]. The objective of the clustering analysis is to discover evidence of encounter usage inequality across ZIP codes with varying sociodemographic characteristics, after adjusting for clinical measurements. The clustering process allows easy interpretability of results and understanding broad differences between clusters. In particular, patients are clustered into: (1) two groups based on their ZIP-code level socioeconomic information including income and education, (2) two groups based on their ZIP-code level racial distribution, and (3) four groups based on their individual clinical measurements (glucose and cholesterol). In particular, socioeconomic Cluster-1 has higher income and education level than socioeconomic Cluster-2; Racial Cluster-1 has a higher white percentage and lower African American percentage than Racial Cluster-2. Clinical Cluster-4 has the most severe diabetes condition, clinical Cluster-1 has the mildest diabetes condition, while the other two clinical clusters stay in the middle, with respect to disease severity. The descriptive statistics of the variables included in the study and for these three clustering results are shown in Table 1. Hypothesis tests were performed for the descriptive statistics to initially understand the center and variation of data. Description of the tests and the associated null hypotheses is given in Table 1.

Table 1. Data description and cluster descriptions.

  Mean / Total Std. Dev T-Stat / Chi-sq Stat p-Value
Total Number of Patients 2124      
FPG measure (mg/dl) 175.5 30.6 1.65 (H0: FPG≤125) 0.0495
2.47 (H0: FPG≤100) 0.0068
LDL Cholesterol (mg/dl) 172.1 34.8 2.07 (H0: LDL≤100) 0.0192
Patient Gender        
Male 50.3%   Chi-sq = 0.004 (df = 1) 0.9522
Female 49.7%  
Patient Age (Years) 64.02 11.14    
< = 40 Years 2.01%   Chi-sq = 33.17 (df = 5) 3.46E-06
41–50 Years 8.57%  
51–60 Years 21.75%  
61–70 Years 32.02%  
71–80 Years 19.87%  
>80 Years 15.78%  
Patient Encounters        
Six month encounter (physical) 3.97 6.03    
Six month encounter (telephonic) 3.01 4.82    
Health Insurance        
13 Groups: Patients / Group 163.4 231.1 Chi-sq = 3921.4 (df = 12) < 2.2E-16
Patient Clusters based on Clinical Conditions        
4 Clusters (Between SS/Total SS)     73.9%  
Cluster 1 (Percentage of Observations) 24.8%      
Glucose 113.9 18.6    
Cholesterol 145.4 21.3    
Cluster 2 (Percentage of Observations) 26.8%      
Glucose 169.2 20.9 Hotelling T^2 = 16756.4 < 2.2E-16
Cholesterol 145.1 22.8 (Compared to Cluster 1)  
Cluster 3 (Percentage of Observations) 21.2%      
Glucose 144.7 26.4 Hotelling T^2 = 15641.2 < 2.2E-16
Cholesterol 213.2 30.8 (Compared to Cluster 2)  
Cluster 4 (Percentage of Observations) 27.2%      
Glucose 261.9 46.5 Hotelling T^2 = 9801.2 < 2.2E-16
Cholesterol 190.8 52.7 (Compared to Cluster 3)  
Community (Zip Code) Level Descriptives
Number of ZIP Codes 95      
Number of Patients / ZIP code 22.4 44.7 Chi-sq = 8405.1 (df = 94) < 2.2E-16
Population 6742 9618 Chi-sq = 1289800 (df = 94) < 2.2E-16
Annual Income ($) 55822 12695
Highschool (%) 91.02% 5.62%
College Graduate (%) 22.62% 13.65%
Race—White (%) 43.82% 3.01%
Race—African American (%) 6.26% 4.09%
Zip Code Clusters: Socio-economic Clusters        
2 Clusters (Between SS/Total SS)     64.1%  
Cluster 1 42      
Annual Income ($) 66465 9059    
Highschool (%) 95.1% 2.17%    
College Graduate (%) 28.8% 14.15%    
Cluster 2 53      
Annual Income ($) 47389 7918 Hotelling T^2 = 191.33 < 2.2E-16
Highschool (%) 87.8% 5.37%    
College Graduate (%) 17.7% 11.11%    
Zip Code Clusters: Race Based Cluster        
2 Clusters (Between SS/Total SS)     81.4%  
Cluster 1 25      
White Percentage 48.70% 1.84%    
African American Percentage 0.75% 1.94%    
Cluster 2 70      
White Percentage 42.10% 3.31% Hotelling T^2 = 132.35 < 2.2E-16
African American Percentage 8.24% 4.61%    

T-test for a population mean is performed for FPG and LDL Cholesterol (H0 is reported with the test statistic). Chi-square goodness of fit test (H0: discrete uniform distribution) is performed for Patient Gender, Patient Age, Patient Health Insurance, Zip-code number of patients, and Zip-code population. Hotelling’s t-square test for independent population mean vectors (H0: two population mean vectors are equal) is performed for cluster mean vectors.

We next show how the distribution of patient encounters varies interactively across socioeconomic clusters and clinical clusters in the left panel of Fig 1. In general, patients in socioeconomic Cluster-2 have fewer physical encounters. In other words, after controlling for patients’ clinical status, patients who live in a ZIP-code with lower income and education level exhibit fewer physical encounters for diabetes treatment. The right panel in Fig 1 shows the distribution of physical encounters by different clinical clusters and racial clusters. Patients in the same clinical cluster have fewer physical encounters when they belong to Racial Cluster-2 (with lower white percentage and higher African American percentage). These results clearly indicate potential inequalities in patient encounters across ZIP-codes with varying sociodemographic characteristics, after controlling for patients’ clinical status.

Fig 1. Significant variation exists in patient encounters based on sociodemographic factors.

Fig 1

The clustering was done with K-means clustering. The advantage of K-means clustering is that it does not require distributional assumptions and can be done in a non-parametric setting. However, K-means depends on the initial starting points, and therefore tends to suffer from lack of stability in some cases [31]. Therefore, we performed the K-means with 25 random starts and used the most frequent clustering. For the study data, all random starts provided similar clustering results with less than 1% error on average. In addition, we re-estimated the K-means clustering multiple times to ensure stability. Furthermore, in the Appendix we provide a table comparing the clustering by K-means and by Latent Class Analysis (LCA). We find that the differences between the patient clusters are insignificant.

Model 1: Understanding the inequalities in encounters across ZIP-codes with different sociodemographic characteristics

Motivated by Fig 1, we study how patient encounters vary across ZIP-codes with different sociodemographic characteristics, after controlling for patients’ clinical status. Our objective is to identify patient subgroups (e.g., from a ZIP-code with particular sociodemographic characteristics) that are likely to have inadequate encounters for their diabetes control. We build zero-inflated Poisson models [32] for patients’ physical and telephonic encounters separately. Patients’ six-month count of physical encounters is the response variable in one of the models, while the six-month count of telephonic encounters is the response variable in another model. Poisson regression is often used to model a count response variable. In our data, there is a high frequency of zero counts. Patients may have zero encounter in a six-month period for various reasons such as missed appointments and traveling, which may not be accounted for by the independent variables, and therefore zero counts can be inflated. Hence, we adopt the zero-inflated Poisson model [32] to analyze such zero-inflated data. This model employs two processes to generate count data. One process follows a binomial distribution that generates structural zero counts. The other process follows a Poisson distribution that generates encounter counts given that at least one encounter takes place in a time period. To control for patient clinical status, we include patients’ previous time period’s clinical measurements of glucose and cholesterol as explanatory variables. Besides, patients’ encounters can largely depend on their insurance policy, so we include insurance policy as an explanatory variable. We then fit a zero-inflated Poisson regression model with the Poisson generating process described as follows. The estimated coefficients of the explanatory variables tell their effects on the rate of encounters.

log(λt)=β0+β1logCholesterolt1+β2logGlucoset1+β3logAget1+β4Gender+j=24αjClinicalClusterj,t1+kτkInsuranceTypek+γ1log(population)+γ2White%+γ3AfricanAmerican%+γ4log(Income)+γ5HighSchool%+γ6College%.

In the equation, λt is the Poisson rate for the encounter counts at time period t. The coefficients βi, αj, and τi control for patients’ clinical status, insurance policy, and demographics. The coefficients γi show how ZIP-code level sociodemographic characteristics correlate with patient encounters.

To enhance the robustness of our data analysis, we then fit the above model within each clinical cluster. A different model, namely positive count model, is also fitted for robustness check. The results are essentially similar to that of the zero-inflated Poisson model, and are reported in the appendix.

Model 2: Estimate the effect of encounters on glucose transition

First, diabetic patients’ health status can be majorly reflected by their glucose measurements. Second, encounters may play an important role in controlling diabetic patients’ glycemic levels. On the one hand, more encounters mean patients are under improved oversight of their service providers and better informed about their health issues. On the other hand, fewer encounters than needed can mean less attention to healthcare, poorer information, and less education for disease control, and as a result, lead to worse health status. Finally, the ZIP-code level variables record the average values of sociodemographic characteristics of the population around a patient’s home, and thus they can partially and indirectly describe the patient’s sociodemographic characteristics.

We classify every patient’s glucose measurements in each six-month time period into three health states following the common standard: N (<100 mg/dl), P (100–125 mg/dl), and D (>125 mg/dl). These three states, as mentioned in the introduction, correspond to normal, prediabetes, and diabetes diagnoses, respectively. A patient who is already diagnosed with type 2 diabetes may have their glucose measurement varying among the three states over time. In other words, a patient in any state in time period t may transit to N, P, or D in time period t+1. The transition of glucose states tells if a patient’s clinical status is improving, worsening, or staying the same. For example, a transition from N to P means a patient’s glucose level gets worse, while a transition from D to P means an improvement in glucose level. The chance that a patient in state i (N, P, or D) at time t will transit to state j (N, P, or D) at time t+1 can be quantified by a transition probability, denoted by Pijt. The larger the transition probability Pijt, the higher the chance that the patient will transit from state i to state j, as time goes from t to t+1. Both healthcare providers and patients desire higher probabilities of the transitions that improve the glucose level (e.g., P to N, D to N, and D to P), and lower probabilities of the transitions that worsen the glucose level (e.g., N to P, N to D, and P to D).

The transition probabilities depict how the health risk changes may depend on a patient’s engagement via physical and telephonic encounters, as well as the patient’s sociodemographic characteristics, which are partially characterized by the ZIP-code level variables. We then build a multinomial logistic regression model [33] to estimate the effect of encounters on glucose status transitions, including ZIP-code level sociodemographic variables and their interactions with the encounter variables. Multinomial logistic regression is a well-developed tool for modeling transitions among a finite number of states, and has been widely adopted in various scientific fields [3436]. Using the transition of glycemic state as a dependent variable has the advantage of model flexibility such that the effect of an explanatory variable is allowed to vary by different transitions. In contrast, directly using the change in glucose value as a response variable assumes unvaried effect of an explanatory variable, regardless of a patient’s current glycemic state. The multinomial logistic regression model for glucose state transitions is stated as follows.

LogPijtPDDt=α0,ij+α1,ijPhyEncountert+α2,ijTelEncountert+α3,ijlogAge+α4,ijWhite%+α5,ijAfricanAmerican%+α6,ijlog(Income)+α7,ijHighSchool%+α8,ijCollege%+PhyEncountert*(α9,ijlogAge+α10,ijWhite%+α11,ijAfricanAmerican%+α12,ijlog(Income)+α13,ijHighSchool%+α14,ijCollege%)+TelEncountert*(α15,ijlogAge+α16,ijWhite%+α17,ijAfricanAmerican%+α18,ijlog(Income)+α19,ijHighSchool%+α20,ijCollege%).

In the above model, the left-hand side is the logarithm of the odds ratio of the transition from i to j to the transition from D to D, where i, j can be any of {D, P, N}. The coefficient α1,ij indicates the effect of a physical encounter on the log odds of transition from i to j. For example, the larger the value of α1,ij, the bigger the effect of a physical encounter on the probability of transitioning from i to j.

Study results

Identify ZIP codes with inadequate encounters

The estimates of Model 1 are shown in Table 2. According to the results, all coefficients show statistical significance when the model is fitted on the entire data to explain physical encounters. Sociodemographic characteristics of a ZIP-code significantly correlate to the number of physical encounters of patients from the ZIP code, after adjusting for patients’ clinical measures, gender, age, and insurance policy. Specifically, a smaller population (coef = −0.21, p<0.001), higher percentage of white (coef = 13.71, p<0.001), lower percentage of African Americans (coef = −7.18, p<0.001), higher income (coef = 0.43, p<0.01), higher percentage of high-school graduates (coef = 4.14, p<0.01), and higher percentage of college graduates (coef = 4.59, p<0.001) correlate to a higher number of physical encounters. Besides ZIP-code level variables, other significant factors for predicting physical encounters include dummy variables of clinical clusters 3 and 4 (coef = 0.46, p<0.01; coef = 0.58, p<0.01), log cholesterol in the previous time period (coef = 0.6, p<0.01), log age (coef = 1.96, p<0.001), and male indicator (coef = −0.26, p<0.001).

Table 2. Zero-inflated poisson model estimated for physical and telephonic encounters.

Response: Physical Encounters
All Patients Clinical Cluster 1 Clinical Cluster 2 Clinical Cluster 3 Clinical Cluster 4
Patient Level
Log(Cholesterol(t-1)) 0.6 (0.17, 1.03) ** 2.00 (0.18, 3.82) * 0.9 (0.21, 1.59) ** 0.83 (0.14, 1.52) * 1.43 (0.49, 2.37) **
Log(Glucose(t-1)) 0.34 (-0.03, 0.71) + 1.85 (-0.17, 3.87) + 0.63 (-0.21, 1.47) 1.02 (0.47, 1.57) *** 0.33 (-0.28, 0.94)
Log(PatientAge(t-1)) 1.96 (1.55, 2.37) *** 3.91 (0.87, 6.95) * 1.83 (1.14, 2.52) *** 2.46 (1.68, 3.24) *** 0.97 (0.21, 1.73) *
Patient Gender—Male -0.26 (-0.4, -0.12) *** -0.57 (-1.26, 0.12) -0.21 (-0.43, 0.01) . -0.3 (-0.52, -0.08) ** -0.35 (-0.66, -0.04) *
Clinical Cluster 2 0.35 (-0.02, 0.72) +
Clinical Cluster 3 0.46 (0.13, 0.79) **
Clinical Cluster 4 0.58 (0.17, 0.99) **
Insurance S S S S S
Community Level (Zip Code)
Log(Population) -0.21 (-0.33, -0.09) *** 0.58 (0.11, 1.05) * 0.35 (0.17, 0.53) *** 0.14 (-0.04, 0.32) 0.14 (-0.15, 0.43)
Percentage White 13.71 (7.42, 20) *** 20.96 (-13.91, 55.83) 0.49 (-9.68, 10.66) 22.97 (12.66, 33.28) *** 8.42 (-6.16, 23)
Percentage African American -7.18 (-11.32, -3.04) *** 13.02 (-9.97, 36.01) 2.49 (-4.29, 9.27) -11.54 (-18.42, -4.66) ** 1.04 (-8.68, 10.76)
Log(Income) 0.43 (0.16, 0.7) ** -1.22 (-2.69, 0.25) 0.11 (-0.36, 0.58) 0.45 (-0.02, 0.92) + 0.62 (0.09, 1.15) *
Percentage High School 4.14 (1.08, 7.2) ** 1.45 (-7.66, 10.56) 3.22 (-2.31, 8.75) 4.9 (-1.04, 10.84) 7.9 (1.53, 14.27) *
Percentage Graduate 4.58 (3.23, 5.93) *** 7.46 (0.72, 14.2) * 3.13 (0.93, 5.33) ** 5.32 (3.24, 7.4) *** 2.75 (-0.86, 6.36)
Number of Patients 2124 526 569 451 578
N-Observations 12533 3104 3357 2661 3411
Theta (Zero Inflation Factor) 0.88 0.85 0.94 0.93 0.91
Log-Lik -2.96E+04 -7.28E+03 -9.24E+03 -6.72E+03 -6.92E+03
Response: Telephonic Encounters
All Patients Clinical Cluster 1 Clinical Cluster 2 Clinical Cluster 3 Clinical Cluster 4
Patient Level
Log(Cholesterol(t-1)) -0.02 (-0.31, 0.27) 0.49 (-0.2, 1.18) -0.17 (-0.68, 0.34) 0.56 (0.07, 1.05) * 0.76 (0.07, 1.45) *
Log(Glucose(t-1)) -0.01 (-0.26, 0.24) -0.25 (-1.13, 0.63) 0.23 (-0.44, 0.9) -0.22 (-0.61, 0.17) 0.25 (-0.2, 0.7)
Log(PatientAge(t-1)) 1.15 (0.88, 1.42) *** 1.27 (0.45, 2.09) ** 1.22 (0.71, 1.73) *** 1.44 (0.97, 1.91) *** 0.65 (0.12, 1.18) *
Patient Gender—Male 0.06 (-0.04, 0.16) 0.21 (-0.12, 0.54) 0.07 (-0.11, 0.25) 0.03 (-0.13, 0.19) 0.01 (-0.19, 0.21)
Clinical Cluster 2 -0.29 (-0.53, -0.05) *
Clinical Cluster 3 -0.25 (-0.54, 0.04) +
Clinical Cluster 4 0.23 (-0.02, 0.48) +
Insurance S S S S S
Community Level (Zip Code)
Log(Population) -0.11 (-0.19, -0.03) ** -0.21 (-0.46, 0.04) + -0.28 (-0.42, -0.14) *** -0.07 (-0.21, 0.07) 0.13 (-0.07, 0.33)
Percentage White 1.19 (-3.16, 5.54) -6.62 (-21.99, 8.75) 8.51 (0.63, 16.39) * 13.39 (6.14, 20.64) *** -5.42 (-14.69, 3.85)
Percentage African American 0.2 (-2.8, 3.2) -1.27 (-11.74, 9.2) -3.43 (-8.88, 2.02) -6.24 (-11.14, -1.34) * 6.08 (-0.39, 12.55) +
Log(Income) 0.21 (0.01, 0.41) * 0.49 (-0.29, 1.27) 0.07 (-0.3, 0.44) 0.36 (0.01, 0.71) * 0.29 (-0.12, 0.7)
Percentage High School 3.22 (1.63, 4.81) *** 3.59 (-1.7, 8.88) 3.12 (0.08, 6.16) * 2.1 (-0.21, 4.41) + 6.28 (1.91, 10.65) **
Percentage Graduate 1.85 (1.01, 2.69) *** 0.97 (-1.83, 3.77) 0.95 (-0.6, 2.5) 3.48 (2.15, 4.81) *** 0.08 (-1.94, 2.1)
Number of Patients 2124 526 569 451 578
N-Observations 12533 3104 3357 2661 3411
Theta (Zero Inflation Factor) 1.21 1.17 1.21 2.01 1.24
Log-Likelihood -2.61E+04 -6.53E+03 -7.16E+03 -6.18E+03 -7.24E+03

Significance Code. 0 <’***’ < = 0.001 < ’**’ < = 0.01 < ’*’ < = 0.05 < ’.’ < = 0.1.

S: Significant at at least 0.05 level.

Parameter estimates with 95% confidence intervals of the parameter estimates are provided.

When the model is fitted within each clinical cluster, the coefficient estimates vary but the general insights remain similar for most ZIP code variables. For clinical Cluster-1 (with lower risk from disease condition) and clinical Cluster-2 (with medium risk from disease condition), the percentage of college graduates (coef = 7.46, p<0.05; coef = 3.13, p<0.01) and log population size (coef = 0.58, p<0.05; coef = 0.35, p<0.001) are both significantly positively related to physical encounters. For clinical Cluster-3 (with medium disease condition), percentage of white (coef = 22.97, p<0.001), and percentage of college graduates (coef = 5.32, p<0.001) are both significantly positively related to physical encounters, while the relationship between the percentage of African American (coef = −11.54, p<0.01) and physical encounters is significant and negative. For clinical Cluster-4 (most severe disease condition), log income (coef = 0.62, p<0.05) and percentage of high-school graduates (coef = 7.9, p<0.05) are the most significant ZIP code factors that correlate to physical encounters, and their effects are both positive. The model estimates show that there exists heterogeneity in physical encounters across ZIP codes with varying sociodemographic characteristics, after adjusting for patients’ clinical and demographic factors. The model suggests healthcare providers need to pay special attention to: (i) patients in clinical Cluster-4 (most severe disease condition) from ZIP codes with low-income and low education level, (ii) patients in clinical Cluster-3 from ZIP codes with low education level, low percentage of white, and high percentage of African American, and (iii) patients in clinical Clusters 1 and 2 from ZIP codes with small population and low education level, since these patients are likely to have fewer encounters, which may be inadequate for their disease control.

The results of the model for telephonic encounters are similar to those for physical encounters, showing that the patients from ZIP codes with a larger population, lower-income, and lower education level are likely to have fewer telephonic encounters. Moreover, log age (coef = 1.15, p<0.001) and indicator for clinical Cluster-2 (coef = −0.29, p<0.05) are significantly related to telephonic encounters. The model is also fitted within each clinical cluster. For clinical Cluster-1 (mildest condition), ZIP code variables do not show significant effects on telephonic encounters. For clinical Cluster-2 (medium condition), significant ZIP-code variables include the percentage of white (coef = 8.51, p<0.05), percentage of high school graduates (coef = 3.12, p<0.05), and log population (coef = −0.28, p<0.001). For clinical Cluster-3 (medium condition), the percentage of white (coef = 13.39, p<0.001), log income (coef = 0.36, p<0.05), and percentage of college graduates (coef = 3.48, p<0.001) all have a significant positive effect, while the percentage of African American (coef = −6.24, p<0.05) has a significant negative effect on telephonic encounters. For clinical Cluster-4 (severe condition), percentage of high-school graduates (coef = 6.28, p<0.01) is the only significant ZIP code variable. Therefore, like physical encounters, we observe similar disparity in telephonic encounters across ZIP codes with varying sociodemographic characteristics, after adjusting for patients’ clinical and demographic factors. In general, the results can help identify ZIP codes that may correlate to insufficient telephonic encounters (e.g., with high population, low percentage of white, high percentage of African American, low income, and low education level).

We also compute the marginal effect of the sociodemographic variables on patient encounters. As an illustration, we divide the patient populations based on the percentage of high school graduates. We find that patients from a ZIP code at the lower quartile of high school education have 1 fewer average encounter than those at the upper quartile. A similar analysis for income reveals that there are 1.3 fewer average encounters of patients in a ZIP code at the lower quartile than those at the upper quartile.

Effect of encounters on glucose transition

Estimates of Model 2 are shown in Table 3. Significance level and 95% confidence interval (CI) are also reported for each estimate. The first column denotes the transition of glucose level between two successive time periods. The three states of glucose level N, P, and D are going from mild to severe. Higher probabilities of transitioning to a milder state (e.g., D to P, D to N, and P to N) and lower probabilities of transitioning to a more severe state (e.g., N to P, N to D, and P to D) are desired by both healthcare providers and patients. According to the estimates, more physical encounters predict a higher probability of transitioning from D to P (coef = 0.01, CI: 0.00, 0.02), N to N (coef = 0.02, CI: 0.00, 0.04), and P to N (coef = 0.01, CI: 0.00, 0.02). The results indicate that more physical encounters, after controlling for sociodemographic factors (ZIP-code level), help patients to transit to a milder state from D to P, or P to N, or maintain a healthy glucose level (N to N). Telephonic encounters have a significant negative effect on transition probabilities of N to P (coef = −0.03, CI: −0.05, −0.01) and P to D (coef = −0.02, CI: −0.04, 0.00). It means that more telephonic encounters can help reduce the chance of transitioning to a higher-risk state (N to P and P to D), after controlling for patients’ sociodemographic factors (at ZIP-code level).

Table 3. Multinomial logistic model estimates for glucose state transition of patients.

Main Effects Model
Physical Encounter Telephonic Encounter Log(Patient Age) Percentage White Percentage African American Log(Income) HighSchool Graduate
D→N 0.00 (-0.02, 0.02) 0.02 (-0.02, 0.06) -0.24 (-0.61, 0.13) 1.99 (-1.02, 5.00) -4.68* (-7.14, -2.22) 0.18* (0.04, 0.32) 0.59 (-0.24, 1.42) -0.42 (-0.99, 0.15)
D→P 0.01* (0.00, 0.02) -0.01 (-0.03, 0.01) 0.18 (-0.08, 0.44) 3.62* (1.10, 6.14) -4.16* (-8.16, -0.16) 0.09* (0.05, 0.13) 0.31* (0.09, 0.53) 0.93* (0.52, 1.34)
N→D -0.01 (-0.03, 0.01) 0.03 (-0.29, 0.35) 0.42* (0.01, 0.83) -6.31* (-8.83, -3.79) 1.68 (-0.82, 4.18) 0.02 (-0.01, 0.05) -0.49* (-0.90, -0.08) -0.74* (-1.41, -0.07)
N→N 0.02* (0.00, 0.04) 0.02 (-0.02, 0.06) 0.54 (-0.19, 1.27) 5.78* (1.13, 10.43) 0.07 (-0.11, 0.25) 0.04* (0.01, 0.07) 4.91* (0.91, 8.91) 0.19* (-0.14, 0.52)
N→P 0.02 (-0.02, 0.06) -0.03* (-0.05, -0.01) 0.13 (-0.11, 0.37) -1.41* (-2.67, -0.15) 9.63* (0.98, 18.28) -0.27* (-0.51, -0.03) -2.70* (-4.73, -0.67) -1.56* (-2.47, -0.65)
P→D 0.01* (0.00, 0.02) -0.02* (-0.04, 0.00) 0.67* (0.04, 1.30) -2.58* (-4.96, -0.20) 4.12* (0.02, 8.24) -0.10* (-0.16, -0.04) -0.17 (-0.39, 0.05) -0.79* (-1.54, -0.04)
P→N 0.01* (0.00, 0.02) 0.02 (-0.02, 0.06) -0.13* (-0.23, -0.03) 4.04* (0.08, 8.00) -2.34* (-4.39, -0.29) 0.10 (-0.16, 0.36) 1.55 (-0.38, 3.48) 1.05* (0.58, 1.52)
P→P 0.02 (-0.02, 0.06) -0.04 (-0.10, 0.02) 0.30 (-0.19, 0.79) 3.16 (-2.45, 8.77) 0.03* (0.01, 0.05) -0.21* (-0.37, -0.05) -1.49* (-2.49, -0.49) -0.14 (-0.36, 0.08)
Interaction Effects
Interaction with Physical Encounter
D→N 0.08 (-0.04, 0.20) 1.34 (-0.45, 3.13) 2.02 (-1.27, 5.31) -0.01 (-0.03, 0.01) 0.37* (0.05, 0.69) 0.18 (-0.06, 0.42)
D→P 0.08* (0.02, 0.14) 1.32 (-0.39, 3.03) 2.12* (0.13, 4.11) -0.02 (-0.05, 0.01) 0.31* (0.05, 0.57) 0.17* (0.05, 0.29)
N→D -0.07* (-0.13, -0.01) -0.94 (-2.22, 0.34) -1.61* (-3.01, -0.21) -0.06* (-0.10, -0.02) -0.37* (-0.69, -0.05) 0.02 (-0.01, 0.05)
N→N 0.02* (0.01, 0.03) 1.86* (0.84, 2.88) 1.71* (0.11, 3.31) 0.01* (0.00, 0.02) 0.76* (0.25, 1.27) 0.09* (0.02, 0.16)
N→P -0.17* (-0.29, -0.05) -1.57* (-2.97, -0.17) -2.10* (-4.12, -0.08) -0.13* (-0.22, -0.04) -0.46 (-1.00, 0.08) 0.05 (-0.03, 0.13)
P→D 0.06 (-0.02, 0.14) -1.69* (-2.73, -0.65) -1.66* (-2.86, -0.46) -0.04* (-0.07, -0.01) -0.16 (-0.50, 0.18) -0.12* (-0.22, -0.02)
P→N -0.04 (-0.10, 0.02) 1.99* (0.97, 3.01) 1.86* (0.43, 3.29) 0.00 (-0.02, 0.02) -0.45* (-0.77, -0.13) 0.05 (-0.04, 0.14)
P→P 0.03 (-0.02, 0.08) 2.59 (-1.22, 6.40) 2.49 (-1.50, 6.48) -0.06 (-0.17, 0.05) -0.30 (-1.04, 0.44) 0.16 (-0.12, 0.44)
Interaction with Telephonic Encounters
D→N -0.18 (-0.52, 0.16) -0.21 (-0.71, 0.29) 0.63* (0.00, 1.26) 0.08 (-0.04, 0.20) 0.29 (-0.05, 0.63) 0.03 (-0.02, 0.08)
D→P 0.10* (0.04, 0.16) 1.67* (0.76, 2.58) 1.90* (0.83, 2.97) 0.00 (-0.02, 0.02) 0.21* (0.07, 0.35) 0.29* (0.09, 0.49)
N→D -0.13 (-0.26, 0.00) -0.67 (-2.12, 0.78) -0.70 (-1.95, 0.55) -0.01* (-0.02, 0.00) -0.06 (-0.16, 0.04) -0.08 (-0.22, 0.06)
N→N 0.00 (-0.03, 0.03) 0.59 (-0.13, 1.31) 1.32* (0.27, 2.37) 0.06* (0.02, 0.10) 0.33* (0.05, 0.61) 0.42* (0.04, 0.80)
N→P -0.25 (-0.47, -0.03) -1.03 (-2.85, 0.79) -1.84* (-3.29, -0.39) 0.03 (-0.02, 0.08) -0.42* (-0.80, -0.04) 0.18 (-0.08, 0.44)
P→D -0.10* (-0.19, -0.01) 0.90 (-0.75, 2.55) -1.22* (-2.35, -0.09) 0.02 (-0.01, 0.05) -0.10 (-0.24, 0.04) 0.17 (-0.01, 0.35)
P→N 0.15* (0.01, 0.29) 1.14* (0.19, 2.09) 2.33* (0.16, 4.50) 0.03 (-0.01, 0.07) -0.14 (-0.46, 0.18) 0.32 (-0.06, 0.70)
P→P 0.03 (-0.01, 0.07) -0.39 (-0.93, 0.15) 0.60 (-0.35, 1.55) 0.14* (0.03, 0.25) 0.69* (0.26, 1.12) 0.31* (0.10, 0.52)

Notes. 1. D: high glucose, P: medium glucose, N: low glucose.

2. The class transitions likelihoods are estimated using a Multinomial logistic model.

3. The numbers signify multinomial slope estimates.

4. * indicates significant at 0.05 level.

The interaction coefficient (inter coef) estimates (in Table 3) show how patients’ sociodemographic characteristics at the ZIP-code level interact with encounters to influence glucose state transitions. For example, physical encounters have significantly stronger effects on older patients in terms of improving patients’ state from D to P (inter coef = 0.08, CI: 0.02, 0.14), keeping patients in state N (inter coef = 0.02, CI: 0.01, 0.03), and preventing transitions from N to D (inter coef = −0.07, CI: −0.13, −0.01) and from N to P (inter coef = −0.17, CI: −0.29, −0.05). While we observe significant racial disparity in encounter usage, the interaction effects on transition probabilities are similar across races in terms of sign, magnitude, and significance level. Other ZIP-code level variables such as income and percentage of college graduates also have significant interactions with physical encounters on some of the transition types. In general, the significant interactions between physical encounters and the ZIP-code variables indicate that increasing encounters would have higher benefits for patients who are older, reside in a ZIP-code with a higher percentage of white, a higher percentage of African American, higher income, or higher education level. The only counterintuitive exception is the interaction effect between physical encounters and percentage of high-school graduates on the transition from P to N, which is significantly negative (inter coef = −0.45, CI: −0.77, −0.13), indicating that for patients in ZIP codes with a higher percentage of high school graduates, the effect of physical encounters is smaller in helping the transition from P to N. This is surprising since in general education should be positively associated with lower risk from diabetes. However, we feel that the causal chain is through the nature of patients’ occupations, which may be generally less physical for high-school graduates than for non-high-school graduates, and higher levels of physical activity are associated with lower diabetic risks [37]. The current data does not include occupational information of patients.

Telephonic encounters also interact with patients’ age and ZIP-code level sociodemographic variables to affect glucose transitions. For example, for older patients, the effect of telephonic encounters is significantly stronger on improving patients’ states from D to P (inter coef = 0.1, CI: 0.04, 0.16) and from P to N (inter coef = 0.15, CI: 0.01, 0.29), and preventing state worsening from N to D (inter coef = −0.13, CI: −0.26, 0.00), N to P (inter coef = −0.25, CI: −0.47, −0.03), and P to D (inter coef = −0.1, CI: −0.19, −0.01). For the race variables at the ZIP code level, the interaction between percentage of white and telephonic encounters is significant for two transitions (positive inter coef for D to P and P to N), while the interaction between percentage of African American and telephonic encounters is significant for six transitions (positive inter coef for D to N, D to P, N to N, and P to N; negative inter coef for N to P and P to D). This shows that increased telephonic encounters benefit more in the ZIP codes with higher percentage of either white or African American. In addition, the ZIP codes with high percentage of African American benefits from increased telephonic encounters for more transition types, compared to the ZIP codes with high percentage of white. In summary of the interaction effects, increasing telephonic encounters lead to more benefits among patients who are older, or live in a ZIP code with higher percentage of white, higher percentage of African American, higher income, higher percentage of high-school graduates, or higher percentage of college graduates.

We further interpret the results by computing the average marginal effects of the encounters and sociodemographic variables by computing the marginal effects at each observation in the sample, and averaging the marginal effects for all observations. As an illustration, consider the logistic regression model

pj=eXβj1+j=1J1eXβj,j=1,,J1,

pJ=11+j=1J1eXβj, where j = 1,…,J, denote the transitions, X = (x1,…,xK) are the values of the K independent variables, and βj=(βj1,,βjK) are the coefficients of the K independent variables for transition category j. For a variable xk, its marginal effect for transition category j is given by

[pjxk]X=X=eXβj(βjk+βjkj=1J1eXβj+j=1J1βjkeXβj)(1+j=1J1eXβj)2,j=1,,J1,
[pJxk]X=X=j=1J1βjkeXβj(1+j=1J1eXβj)2.

Then, the average marginal effect of variable xk on transition j is 1Nn=1N[pjxk]X=Xn, where Xn = (xn1,…,xnK) denotes the nth individual in the data. We used the R package margins (https://cran.r-project.org/web/packages/margins/) to compute the average marginal effects of the multinomial transition models. The average marginal effect of physical encounters for all patients for the transition from diabetic to the pre-diabetic stage is 4.3%. This indicates that one additional encounter on average would increase the likelihood of patients’ transition D to P by 4.3%. Similarly, the average marginal effect of one additional physical encounter for the transition P to N is 3.2%. This indicates that encounters with doctors, and nurses increase the likelihood of transition from a higher risk level to a lower risk level. To interpret the effect of encounters for different characteristics of patient populations we compute the average marginal effects for different quartiles of patients. The average marginal effect of one additional physical encounter on the transition from diabetic to pre-diabetic for patient groups falling in the upper quartile of the percentage of high school education is 1.1%, while the corresponding average marginal effect for patient groups falling in the lower quartile of the percentage of high school education is 7.9%. This indicates that the effect of encounters for patients with lower levels of high school education is higher than for patients with higher levels. A similar observation is made for the median income and percentage of college graduates.

These results not only identify significant variation in the effect of physical and telephonic encounters on the health risks from diabetes across patient groups, but also indicate that physical encounters work better for improving disease states (such as increasing chances of the good transitions: D to P, N to N, and P to N), while telephonic encounters work better for preventing disease state worsening (such as reducing chances of the bad transitions: N to P, and P to D). These results, therefore, indicate the prevalence of complementarities between the two types of encounters, which can be used by providers to focus strategically on specific encounter types.

We also combine the results of the two models to interpret the effects of sociodemographic factors on encounters, and the effect of the encounters on the final diabetes outcome. As an illustrative example, we observe that for ZIP-codes which are in the lower quartile of the distribution on the variable percentage of high school graduates, there is a marginal Poisson rate of 1 fewer encounter (from zero-inflated Poisson models) compared to the ZIP-codes at the upper quartile of percentage of high school graduates, which when plugged into the multinomial regression model gives an 5.4% average increase in the probability of transitioning from pre-diabetic to diabetic. The corresponding estimates for median zip code income are 1.3 and 7.3%, respectively.

Discussion

Inequalities in healthcare outcome often stem from disparities in access to healthcare resources, such as encounters. We show inequalities exist in physical and telephonic encounters across patients’ ZIP-codes with heterogeneous sociodemographic characteristics, after controlling for patients’ clinical, demographic, and insurance variables. We also show the implication of physical and telephonic encounters for patients’ glucose transition. Furthermore, we demonstrate that the two encounter types act differently on patient subpopulations with different sociodemographic status. Therefore, the two encounter types can be used in a complementary manner, differentially on different patient subpopulations to improve overall risk levels from diabetes under capacity constraints of patient encounters, especially when there is an increasing push towards more inclusiveness of patients from all sociodemographic backgrounds into the purview of managed care under government regulatory and legislative initiatives.

The results may help healthcare providers to target patient subgroups from specific ZIP-codes according to ZIP-code sociodemographic characteristics, who may have inadequate encounters to engage in diabetes care. Interventions are supported by extant research to target those patients with inadequate encounters [38]. Governmental and social support in terms of providing better access to healthcare resources, particularly preventative and primary care resources is important in improving overall healthcare measure and reducing healthcare inequality. Programs to engage targeted patient groups may be designed for improving encounters and engagement. For instance, education events may be provided in the ZIP-codes with inadequate encounters to raise the awareness of disease prevention and active control; nurses and case managers may arrange more telephonic encounters for consultation, education, appointment reminders, etc.

Since the total encounter capacity is often limited, increasing encounters among particular patient subgroups means that the number of encounters of some other patients needs to be reduced. Model 1 can be used to identify patient subgroups who have a high number of encounters, after controlling for their clinical status. Appropriately reducing encounters for the patients who have more than sufficient encounters may not have negative effects on their diabetes control. Indeed, proper reallocation of encounters can help reduce the inequality in limited healthcare resources, and lead to better outcomes of the entire patient population. In order to address such inequality in encounters, especially in physical encounters, telemedicine and online communication can be used to reduce the burden of the “super-utilizers” [39,40].

Patients’ socioeconomic contexts have been shown in extant research to be significant explanatory variables or predictors for healthcare outcome and disease risks [41,42]. Nevertheless, healthcare providers have seldom collected such information. There are concerns about the value, feasibility, and efficiency of collecting such patient data at the individual level [29]. Our study supplements the unavailable individual socioeconomic data with the ZIP-code socioeconomic data from the US census, and shows that it explains a significant amount of variation in patient encounter utilizations as well as glucose control outcomes. This supports the use of population data from geographic areas to infer individual patients’ socioeconomic context, and can be helpful to other studies that may need yet lack individual patient socioeconomic data.

Although the study is based on the data from a regional clinic in Illinois, the methods can be carried over to analyze data collected at other clinics as well. The insights from the study may be carried over to other regions where patients share similar sociodemographic and clinical features. Besides identifying patient subgroups based on ZIP-codes that require more encounters to enhance engagement, the models 1 and 2 can also be used to predict glucose measurement and encounter utilization for individual patients. Engagement programs may be tailored to individual patients to achieve the highest efficiency in encounter utilization and reduce the inequality in encounters and healthcare outcomes.

Conclusion

In closing, there exist inequalities in physical and telephonic encounter utilizations across ZIP-code areas with varying sociodemographic characteristics, after adjusting for patients’ clinical status, demographics, and insurance policy. The inequalities in encounter utilizations may lead to disparity in diabetes care outcomes. Policymakers should consider actions such as increasing healthcare capacity or designing programs for targeted patient groups with inadequate encounters to mitigate such inequality in encounter utilization, and ultimately improve the efficiency of care and the healthcare outcome of the entire served population. Also, healthcare providers and policymakers have the opportunity to consider complementarities between encounter types while planning chronic disease care which requires continued and repeated encounters of patients with the healthcare systems. The results of this study demonstrate the importance of designing healthcare systems that are able to recommend encounter frequency to the type for patient subpopulations characterized by specific sociodemographic characteristics.

Supporting information

S1 Appendix. Estimates for positive count model of physical and telephonic encounters.

(DOCX)

Data Availability

There are legal restrictions on sharing the data set. Christie Clinic does not release clinical data or patient information without a written agreement to the party that will receive it. Data access queries may be directed to Collin Roloff, Director, Information Management & Analytics at Christie Clinic (contact via Croloff@christieclinic.com). The zip-code level sociodemographic variables were obtained from the US Census Bureau (https://data.census.gov/cedsci/).

Funding Statement

The author(s) received no specific funding for this work.

References

  • 1.Centers for Disease Control and Prevention. National Diabetes Statistics Report [Internet]. 2018 [cited 2019 Jan 9]. Available from: https://www.cdc.gov/diabetes/data/statistics/statistics-report.html.
  • 2.American Diabetes Association. Economic Costs of Diabetes in the U.S. in 2017. Diabetes Care [Internet]. 2018. May 1 [cited 2019 Jan 9];41(5):917–28. Available from: http://www.ncbi.nlm.nih.gov/pubmed/29567642. 10.2337/dci18-0007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Graffigna G, Barello S, Bonanomi A, Menichetti J. The Motivating Function of Healthcare Professional in eHealth and mHealth Interventions for Type 2 Diabetes Patients and the Mediating Role of Patient Engagement. J Diabetes Res [Internet]. 2016. Jan 5 [cited 2019 Aug 7];2016:1–10. Available from: http://www.hindawi.com/journals/jdr/2016/2974521/. 10.1155/2016/2974521 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Rodriguez KM. Intrinsic and Extrinsic Factors Affecting Patient Engagement in Diabetes Self-Management: Perspectives of a Certified Diabetes Educator. Clin Ther [Internet]. 2013. Feb 1 [cited 2018 Jun 27];35(2):170–8. Available from: https://www.sciencedirect.com/science/article/pii/S014929181300009X. 10.1016/j.clinthera.2013.01.002 [DOI] [PubMed] [Google Scholar]
  • 5.Wagner EH. What Will It Take To Improve Care for Chronic Illness? Eff Clin Pract. 1998. [PubMed] [Google Scholar]
  • 6.Fenton JJ, Von Korff M, Lin EHB, Ciechanowski P, Young BA. Quality of preventive care for diabetes: effects of visit frequency and competing demands. Ann Fam Med. 2006;4(1):32–9. 10.1370/afm.421 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Carman KL, Dardess P, Maurer M, Sofaer S, Adams K, Bechtel C, et al. Patient And Family Engagement: A Framework For Understanding The Elements And Developing Interventions And Policies. Health Aff [Internet]. 2013. February 1 [cited 2018 Feb 22];32(2):223–31. Available from: http://www.ncbi.nlm.nih.gov/pubmed/23381514. [DOI] [PubMed] [Google Scholar]
  • 8.Fenwick EK, Xie J, Rees G, Finger RP, Lamoureux EL. Factors associated with knowledge of diabetes in patients with type 2 diabetes using the diabetes knowledge test validated with Rasch analysis. PLoS One. 2013. 10.1371/journal.pone.0080593 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Patel NK, Parchman ML. The chronic care model and exercise discussions during primary care diabetes encounters. J Am Board Fam Med. 2011;24(1):26–32. 10.3122/jabfm.2011.01.100137 [DOI] [PubMed] [Google Scholar]
  • 10.Kanter M, Martinez O, Lindsay G, Andrews K, Denver C. Proactive office encounter: a systematic approach to preventive and chronic care at every patient encounter. Perm J. 2010;14(3):38. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Kaplan SH, Greenfield S, Ware JE Jr. Assessing the effects of physician-patient interactions on the outcomes of chronic disease. Med Care. 1989;S110–S127. 10.1097/00005650-198903001-00010 [DOI] [PubMed] [Google Scholar]
  • 12.Sarkar U, Handley MA, Gupta R, Tang A, Murphy E, Seligman HK, et al. Use of an interactive, telephone-based self-management support program to identify adverse events among ambulatory diabetes patients. J Gen Intern Med. 2008;23(4):459–65. 10.1007/s11606-007-0398-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Holman H. Chronic disease—the need for a new clinical education. Jama. 2004;292(9):1057–9. 10.1001/jama.292.9.1057 [DOI] [PubMed] [Google Scholar]
  • 14.Ramos—Rivers C, Regueiro M, Vargas EJ, Szigethy E, Schoen RE, Dunn M, et al. Association between telephone activity and features of patients with inflammatory bowel disease. Clin Gastroenterol Hepatol. 2014;12(6):986–94. 10.1016/j.cgh.2013.11.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Karter AJ, Parker MM, Moffet HH, Ahmed AT, Ferrara A, Liu JY, et al. Missed Appointments and Poor Glycemic Control. Med Care [Internet]. 2004. February [cited 2018 Feb 22];42(2):110–5. Available from: http://www.ncbi.nlm.nih.gov/pubmed/14734947. 10.1097/01.mlr.0000109023.64650.73 [DOI] [PubMed] [Google Scholar]
  • 16.Morrison F, Shubina M, Turchin A. Encounter frequency and serum glucose level, blood pressure, and cholesterol level control in patients with diabetes mellitus. Arch Intern Med. 2011. September 26;171(17):1542–50. 10.1001/archinternmed.2011.400 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Nordin JD, Solberg LI, Parker ED. Adolescent primary care visit patterns. Ann Fam Med [Internet]. 2010. November 1 [cited 2019 Aug 7];8(6):511–6. Available from: http://www.ncbi.nlm.nih.gov/pubmed/21060121. 10.1370/afm.1188 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Williams GC, Freedman ZR, Deci EL. Supporting autonomy to motivate patients with diabetes for glucose control. Diabetes Care. 1998;21(10):1644–51. 10.2337/diacare.21.10.1644 [DOI] [PubMed] [Google Scholar]
  • 19.Group JDRFCGMS, others. Effectiveness of continuous glucose monitoring in a clinical care environment: evidence from the Juvenile Diabetes Research Foundation continuous glucose monitoring (JDRF-CGM) trial. Diabetes Care. 2010;33(1):17–22. 10.2337/dc09-1502 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Jin C, Chen S, Vaidya A, Wu Y, Wu Z, Hu FB, et al. Longitudinal change in fasting blood glucose and myocardial infarction risk in a population without diabetes. Diabetes Care. 2017;40(11):1565–72. 10.2337/dc17-0610 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Ellis DA, McQueenie R, McConnachie A, Wilson P, Williamson AE. Demographic and practice factors predicting repeated non-attendance in primary care: a national retrospective cohort analysis. Lancet Public Heal [Internet]. 2017. [cited 2019 Aug 7];2(12):e551–9. Available from: http://www.ncbi.nlm.nih.gov/pubmed/29253440. 10.1016/S2468-2667(17)30217-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Williams DR, Priest N, Anderson NB. Understanding associations among race, socioeconomic status, and health: Patterns and prospects. Heal Psychol [Internet]. 2016;35(4):407–11. Available from: http://doi.apa.org/getdoi.cfm?doi=10.1037/hea0000242. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Fisher-Hoch SP, Vatcheva KP, Rahbar MH, McCormick JB. Undiagnosed diabetes and pre-diabetes in health disparities. PLoS One. 2015. 10.1371/journal.pone.0133135 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Negandhi PH, Ghouri N, Colhoun HM, Fischbacher CM, Lindsay RS, McKnight JA, et al. Ethnic differences in glycaemic control in people with type 2 diabetes mellitus living in Scotland. PLoS One. 2013. 10.1371/journal.pone.0083292 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Cheng YJ, Kanaya AM, Araneta MRG, Saydah SH, Kahn HS, Gregg EW, et al. Prevalence of Diabetes by Race and Ethnicity in the United States, 2011–2016. JAMA—J Am Med Assoc. 2019. Dec 24;322(24):2389–98. 10.1001/jama.2019.19365 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Choi SH, Terrell JE, Fowler KE, McLean SA, Ghanem T, Wolf GT, et al. Socioeconomic and Other Demographic Disparities Predicting Survival among Head and Neck Cancer Patients. Langevin SM, editor. PLoS One [Internet]. 2016. Mar 1 [cited 2018 Feb 22];11(3):e0149886. Available from: http://dx.plos.org/10.1371/journal.pone.0149886. 10.1371/journal.pone.0149886 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Gaskin DJ, Thorpe RJ, McGinty EE, Bower K, Rohde C, Young JH, et al. Disparities in Diabetes: The Nexus of Race, Poverty, and Place. Am J Public Health [Internet]. 2014. Nov [cited 2018 Feb 22];104(11):2147–55. Available from: http://www.ncbi.nlm.nih.gov/pubmed/24228660. 10.2105/AJPH.2013.301420 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Sambamoorthi U, McAlpine DD. Racial, ethnic, socioeconomic, and access disparities in the use of preventive services among women. Prev Med (Baltim). 2003;37(5):475–84. [DOI] [PubMed] [Google Scholar]
  • 29.Moscrop A, Ziebland S, Roberts N, Papanikitas A. A systematic review of reasons for and against asking patients about their socioeconomic contexts. Int J Equity Health [Internet]. 2019. Dec 23 [cited 2019 Aug 7];18(1):112. Available from: https://equityhealthj.biomedcentral.com/articles/10.1186/s12939-019-1014-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Hastie T, Tibshirani R, Friedman J. The Elements of Data Mining. Math Intell. 2009. [Google Scholar]
  • 31.Grant RW, McCloskey J, Hatfield M, Uratsu C, Ralston JD, Bayliss E, et al. Use of Latent Class Analysis and k-Means Clustering to Identify Complex Patient Profiles. JAMA Netw open. 2020;3(12):e2029068—e2029068. 10.1001/jamanetworkopen.2020.29068 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Lambert D. Zero-inflated poisson regression, with an application to defects in manufacturing. Technometrics. 1992. [Google Scholar]
  • 33.Bohning D. Multinomial logistic regression algorithm. Ann Inst Stat Math [Internet]. 1992. Mar [cited 2019 Aug 7];44(1):197–200. Available from: http://link.springer.com/10.1007/BF00048682. [Google Scholar]
  • 34.Paton L, Troffaes MCM, Boatman N, Hussein M, Hart A. Multinomial logistic regression on Markov chains for crop rotation modelling. In: International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems. 2014. p. 476–85. [Google Scholar]
  • 35.Bizzotto R, Zamuner S, De Nicolao G, Karlsson MO, Gomeni R. Multinomial logistic estimation of Markov-chain models for modeling sleep architecture in primary insomnia patients. J Pharmacokinet Pharmacodyn. 2010;37(2):137–55. 10.1007/s10928-009-9148-2 [DOI] [PubMed] [Google Scholar]
  • 36.Augustin NH, Cummins RP, French DD. Exploring spatial vegetation dynamics using logistic regression and a multinomial logit model. J Appl Ecol. 2001;991–1006. [Google Scholar]
  • 37.Kuwata H, Okamura S, Hayashino Y, Tsujii S, Ishii H. Higher levels of physical activity are independently associated with a lower incidence of diabetic retinopathy in Japanese patients with type 2 diabetes: A prospective cohort study, Diabetes Distress and Care Registry at Tenri (DDCRT15). PLoS One. 2017. 10.1371/journal.pone.0172890 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Peek ME, Cargill A, Huang ES. Diabetes Health Disparities: A Systematic Review of Health Care Interventions. Med Care Res Rev [Internet]. 2007;64(5 suppl):101S–156S. Available from: http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2367214&tool=pmcentrez&rendertype=abstract. 10.1177/1077558707305409 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Dixon RF. Enhancing Primary Care Through Online Communication. Health Aff [Internet]. 2010. Jul 2 [cited 2019 Aug 9];29(7):1364–9. Available from: http://www.healthaffairs.org/doi/10.1377/hlthaff.2010.0110. [DOI] [PubMed] [Google Scholar]
  • 40.Lieberman J. How telemedicine is aiding prompt ECG diagnosis in primary care. Br J Community Nurs [Internet]. 2008. Mar 27 [cited 2019 Aug 9];13(3):123–6. Available from: http://www.magonlinelibrary.com/doi/10.12968/bjcn.2008.13.3.28676. [DOI] [PubMed] [Google Scholar]
  • 41.Pickett KE, Pearl M. Multilevel analyses of neighbourhood socioeconomic context and health outcomes: a critical review. J Epidemiol Community Health. 2001;55(2):111–22. 10.1136/jech.55.2.111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Hu J, Gonsahn MD, Nerenz DR. Socioeconomic status and readmissions: evidence from an urban teaching hospital. Health Aff. 2014;33(5):778–85. 10.1377/hlthaff.2013.0816 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Antonio Palazón-Bru

23 Nov 2020

PONE-D-20-27788

Customizing Patient Encounters for Different Socioeconomic and Demographic Strata Can Reduce Risks from Type 2 Diabetes

PLOS ONE

Dear Dr. Ye,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Jan 07 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Antonio Palazón-Bru, PhD

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2.Thank you for your submission to PLOS ONE.

We note that your manuscript states that your IRB approved this research, however your ethics statement states that the research activities described in this application meet the criteria for exemption at 45CFR46.101(b)(4). Please clarify whether the IRB approved your study or whether you obtained a waiver from ethics approval from the IRB due to exempt status.

In addition, please clarify whether exemption was for ethics approval and informed consent, or whether written informed consent was obtained from subjects.

Please also provide the name of the multi-facility clinic in central Illinois.

Also, please include the date(s) on which you accessed the databases or records to obtain the data used in your study.

3.We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions.

In your revised cover letter, please address the following prompts:

a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially identifying or sensitive patient information) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent.

b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. Please see http://www.bmj.com/content/340/bmj.c181.long for guidelines on how to de-identify and prepare clinical data for publication. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories.

We will update your Data Availability statement on your behalf to reflect the information you provide.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: No

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: No

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: No

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: My thanks to the editor and authors for the opportunity to review this manuscript. This manuscript reports the results of a retrospective study of the effect of social and demographic variables on blood glucose among persons with diabetes. It is an important topic and this manuscript fills an important gap in the research literature. Below I provide some comments and suggestions to the editor and authors that also serve as rationale for my recommendation. Overall, I do not think the manuscript is appropriate for publication but the authors might consider a separate and new analysis taking into account some of these critiques.

Minor: I do not see what “customizing” has to do with the hypothesis being tested in this manuscript. This study does not seem to be about customization, but rather recommendations for care utilization patterns.

Abstract:

• The abstract begins with a description of the importance of physician encounters and “the inverse law of patient encounters” which is seems to be mostly conventional wisdom rather than established theory. I recommend removing the first to sentences and providing a short and clear description of the rationale for the paper, grounded in literature on socioeconomic status and health care utilization. A more straightforward statement of the primary hypothesis would be clearer than the current presentation in the abstract.

• The abstract includes unnecessary stylistic language (e.g. “last but not least”) that detracts from the message.

• Description of “do correlate” should be accompanied be correlation coefficient or other statistic that illustrates this finding.

• The abstract seems to confuse segregation (measured as African Americans in a zip code) with socioeconomic status. The authors need to be accurate in their description of concepts.

Manuscript

• The body of the manuscript has some grammar problems. I have only mentioned the first few to appear. The authors should edit and revise their work more carefully. For example:

o Pg 3 ln. 60 is improperly worded. The prevalence did not increase with age, the prevalence is higher among older persons.

o Pg. 3 ln 62 “in 2017 is $327 billion” should be “was”

• Pg 5 ln 103 “glucose transitions” is not defined or referenced until much later in the paper and most readers will not understand what is being referred to here. The transition approach is elaborate and does not appear to have been reported or described elsewhere. The authors likely need to establish the validity of this approach in a separate manuscript.

• Pg 5 which IRB approved the study?

• Pg 5 How were the data collected? Chart abstraction? Electronic medical record? What procedures were used?

• Given that patients have A1C values, why use FPG as the outcome instead of A1C?

• Were A1C and FPG point of care or from a venous blood draw in the lab?

• Zip codes are large and potentially immensely heterogeneous. Why not geocode to the census tract or block group to get more fine grained information? Tools like tidycensus and sociome provide the opportunity for higher resolution, neighborhood level analyses that do not suffer from the same degree of weakness as zip codes.

• Major: The manuscript is presented as though all patients had complete data, but in my experience that is rarely the case. How many patients were excluded due to incomplete data?

• Major: The methods section does not appear to describe inclusion/exclusion criteria.

• What are the tests for which p values are reported in Table 1? Are there hypotheses tied to these tests?

• Table 1 refers to “clusters” but insufficent explanation in the table is given about what is meant by “cluster” How did the authors decide to create these clusters?

• The state transition framework for office measurements of FPG is somewhat innovative and potentially useful.

• Too little information is provided about the number and frequency of measurements of FPG per patient.

• It seems that patients with insufficient follow-up visits would be excluded because they would have no transition data. This is a potential limitation of the design. Doesn’t this mean that Model 2 should come first? Wouldn’t the probability of having repeated visits influence the frequency of measurement and thus the potential to observe a glucose transition?

• IT does not seem to make sense that the cluster analysis is embedded in model 2. Should not the cluster analysis be presented as having its own analytic plan and hypotheses? What was the rationale for this step other than data reduction? How do these locally, empirically defined clusters compare to other approaches of transforming zip code and clinical measurement data?

• K-means cluster analysis is notorious for having problems with reproducibility. This is a serious limitation of the analysis. The authors should recognize this limitation and consider or propose future working using more sophisticated techniques like latent class analysis or factor mixture models.

• Major weakness: Detailed results of the cluster analysis are largely omitted from the manuscript.

• There are many comparisons made in Table 2.

• Table 2 should have a sample size. No mention is made of whether there was any adjustment for multiple comparison, drawing into question the validity of any of the bolded “significant” findings. The authors need to address for multiple comparison in this type of table, or if they have done so, should say that they did. Significance tests are a poor and possibly invalid approach here anyhow (see American Statistical Association recent public statements on this matter) and 95% confidence intervals for each estimate would be much preferred.

• Table 2 seems a little bit more like an all by all table of raw output rather than a presentation of results. With so many interaction effects the results presentation is both bewildering and suspect.

• All of the interpretations about results surrounding the larger percentage of African Americans in a zip code are suspect. This is especially concerning given that “Cluster 2” has just 8% African American population. Are we to believe that there is something special about zip codes with 8% vs. 4% African Americans as it relates to diabetes care? What theory in the literature would suggest this is important for us to look at?

• Although there are good reasons to examine socioeconomic status and glucose control, and the authors appear to have assembled a useful regional data resource, the analytic plan is insufficiently described and poorly justified. Critical details are omitted from the manuscript (missing data, inclusion/exclusion criteria). The analytic methods and reported results, especially the clustering, the multiple comparisons in Table 2 and the many inferences made about small observed magnitude differences render the results and discussion unable to convince this reviewer of the accuracy, relevance or salience of any of the study findings.

Reviewer #2: 1. The study design was clearly laid out for the reader. Variables of interest were well defined for the reader.

2. The statistical methods were clearly explained in the paper and appropriate to the research question.

3. The statistical methods were appropriate to analyze the variables of interest and the rationale for the researcher(s) choice(s) of statistical methods was explained.

4. The subject matter was relevant to diabetes care and to more general patient care areas.

5. The subject matter of the study is an important area of concern regarding diabetes care outcomes in varying sociodemographic settings.

6. Allocation of medical care is based on a number of variables. The variables of interest in the study are some of the variables identified in other previous studies that examine the role of social determinants of health in population care outcomes.

7. This study adds to the body of knowledge in the subject area of diabetes care outcomes by identifying another aspect of care models and delivery of health care services to address areas of greater need.

8. Contentions of the author(s) were supported by the data.

The article was a very technical read in terms of data analyses and discussion of results. It contributes further to the body of knowledge in the subject area.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Pamela Phares PhD

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Decision Letter 1

Antonio Palazón-Bru

11 Mar 2021

Recommending Encounters According to the Sociodemographic Characteristics of Patient Strata Can Reduce Risks from Type 2 Diabetes

PONE-D-20-27788R1

Dear Dr. Ye,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Antonio Palazón-Bru, PhD

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

All the reviewers' concerns have been correctly addressed.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #2: 1. Reviewing comments of Reviewer #1 and responses from the authors, they appear to have addressed the problematic issues with Kmeans clustering algorithms, data source, data interpretation and data presentation (from the edited version). Explanations for the weaknesses in Kmeans algorithms and their use of multiple iterations using different starting points as well as using LCA analysis and comparing these results to the Kmeans clustering data appear to address some of the inherent problems with the use of Kmeans algorithms. Use of zip-code level data is commonly used with data sets that are restricted for privacy reasons and are useful to the extent of their limitations.

2. Again, going point by point in their analyses and conclusions I do not see any major errors.

4. Yes, the data appear to be complete except for those that they are not privy to for confidentiality reasons.

5. I think the editorial revisions that I reviewed have made this a more coherent paper with better grammatical composition overall.

I recommend publication of the paper. The major issues brought forth regarding the paper appear to be addressed satisfactorily. The study is a good starting point for additional studies analyzing the relationship among healthcare usage, healthcare outcomes and geographic areas that are underserved by healthcare delivery systems and public health initiatives.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: No

Acceptance letter

Antonio Palazón-Bru

15 Mar 2021

PONE-D-20-27788R1

Recommending Encounters According to the Sociodemographic Characteristics of Patient Strata Can Reduce Risks from Type 2 Diabetes

Dear Dr. Ye:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Antonio Palazón-Bru

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Appendix. Estimates for positive count model of physical and telephonic encounters.

    (DOCX)

    Attachment

    Submitted filename: Response to Reviewers.docx

    Data Availability Statement

    There are legal restrictions on sharing the data set. Christie Clinic does not release clinical data or patient information without a written agreement to the party that will receive it. Data access queries may be directed to Collin Roloff, Director, Information Management & Analytics at Christie Clinic (contact via Croloff@christieclinic.com). The zip-code level sociodemographic variables were obtained from the US Census Bureau (https://data.census.gov/cedsci/).


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES