Skip to main content
. Author manuscript; available in PMC: 2022 Feb 1.
Published in final edited form as: Med Care. 2021 Feb 1;59(2):118–122. doi: 10.1097/MLR.0000000000001478

Table 2.

Agreement between the 1511 social network analysis (SNA) communities identified from 2015 100% Texas data and communities identified from other datasets

Testing data Number of communities# Recall§ (%) 1-purity§ (%) F-measure§ (%)
Mean ± SD, Median (IQR)
2015, with primary care services area (PCSA) boundary 1525 80.8 ± 33.0 1.4 ± 6.6 83.4 ± 29.8
100.0 (66.7–100.0) 0.0 (0.0–0.0) 100.0 (80.0–100.0)
2015, 20%, ≥ 6 shared patients** 1339 93.9 ± 16.4 9.3 ±16.4 90.9 ± 15.8
100.0 (100.0–100.0) 0.0 (0.0–16.7) 100.0 (85.7–100.0)
2015, 20%, ≥ 11 shared patients 1155 73.8 ± 29.9 1.1 ± 6.2 80.3 ± 25.3
80.0 (50.0–100.0) 0.0 (0.0–0.0) 88.9 (66.7–100.0)
#

The communities shown here are the community that contained the largest number of providers appearing in both testing and reference community. When multiple reference communities matched to one testing community, we calculated the average agreement measures for this testing community.

§

Recall: For each predicted cluster C, the recall quantifies the percentage coverage by C of the validated partition, P, in which C has the most elements. In our study, we calculated among the primary care providers (PCPs) in the reference community (identified from 100% data) the percent of those PCPs also in the testing community.

1-purity: Purity quantifies degree of homogeneous predicted clusters, in the sense of whether a predicted cluster contains elements of only one validated partition. A purity of 1 indicates that a predicted cluster contains individuals from only one validated partition. In our study, we compared to the PCPs in the reference community (identified from 100% data) the percent of additional PCPs in the testing community who were not found in the reference community.

F-measure is the harmonic mean of the above two quantities.

**

It was not feasible to survey 1511 practices identified from 100% statewide data. Therefore, we selected the 8 practices which were surveyed in our previous study and had at least 3 PCPs, and then compared those identified from 100% data and 20% data. We found 6 out of 8 practices with a recall measure of > 90% and a purity measure of 100%.