Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2022 Sep 23;12:15843. doi: 10.1038/s41598-022-20295-4

Development and testing of diagnostic algorithms to identify patients with acromegaly in Southern Italian claims databases

Salvatore Crisafulli 1, Andrea Fontana 2, Luca L’Abbate 3, Valentina Ientile 4, Daniele Gianfrilli 5, Alessia Cozzolino 5, Maria Cristina De Martino 6, Marta Ragonese 7, Janet Sultana 8, Francesco Barone-Adesi 9,10, Gianluca Trifirò 3,
PMCID: PMC9508179  PMID: 36151305

Abstract

Acromegaly is a rare disease characterized by an excessive production of growth-hormone and insulin-like growth factor 1, typically resulting from a GH-secreting pituitary adenoma. This study was aimed at comparing and measuring accuracy of newly and previously developed coding algorithms for the identification of acromegaly using Italian claims databases. This study was conducted between January 2015 and December 2018, using data from the claims databases of Caserta Local Health Unit (LHU) and Sicily Region in Southern Italy. To detect acromegaly cases from the general target population, four algorithms were developed using combinations of diagnostic, surgical procedure and co-payment exemption codes, pharmacy claims and specialist’s visits. Algorithm accuracy was assessed by measuring the Youden Index, sensitivity, specificity, positive and negative predictive values. The percentage of positive cases for each algorithm ranged from 7.9 (95% CI 6.4–9.8) to 13.8 (95% CI 11.7–16.2) per 100,000 inhabitants in Caserta LHU and from 7.8 (95% CI 7.1–8.6) to 16.4 (95% CI 15.3–17.5) in Sicily Region. Sensitivity of the different algorithms ranged from 71.1% (95% CI 54.1–84.6%) to 84.2% (95% CI 68.8–94.0%), while specificity was always higher than 99.9%. The algorithm based on the presence of claims suggestive of acromegaly in ≥ 2 different databases (i.e., hospital discharge records, copayment exemptions registry, pharmacy claims and specialist visits registry) achieved the highest Youden Index (84.2) and the highest positive predictive value (34.8; 95% CI 28.6–41.6). We tested four algorithms to identify acromegaly cases using claims databases with high sensitivity and Youden Index. Despite identifying rare diseases using real-world data is challenging, this study showed that robust validity testing may yield the identification of accurate coding algorithms.

Subject terms: Endocrinology, Health care, Epidemiology

Introduction

Acromegaly is a chronic and progressive endocrine rare disease characterized by an excessive production of growth-hormone (GH) and insulin-like growth factor 1 (IGF-1), which typically results from a GH-secreting pituitary adenoma1. It has been estimated that acromegaly globally affects around 6 per 100,000 persons2. Epidemiological studies conducted in Italy reported a prevalence of acromegaly ranging from 6.9 to 9.7 cases per 100,000 persons35. The onset of acromegaly is usually slow and insidious, and the clinical conditions associated with it, such as arthritis, diabetes mellitus, hypertension, sleep apnea and metabolic dysfunction are common in general population6,7. As a result, acromegaly is often diagnosed late, thus influencing the long-term disease prognosis8 and leading to the development of complications and to increased mortality9. Disease prognosis is also conditioned by genetic and epigenetic variants10. The recommended diagnostic test for acromegaly consists in the assessment of age/sex normalized IGF-1 levels and GH levels during an oral glucose load in patients with the above-mentioned comorbidities. Nowadays, typical features of acromegaly can also be detected by multimedia tools11, useful systems for a modern diagnostic approach. Diagnoses based on laboratory tests should be confirmed thereafter by magnetic resonance imaging (MRI) (or computed tomography scan if MRI is contraindicated or unavailable) which are able to visualize adenoma size and appearance12. A multimodal therapeutic approach comprising neurosurgery, medical therapy and radiotherapy is often required to attain biochemical control and reduction of disease-related morbidity and mortality13. Surgery is generally recommended as the first-line therapy, while in patients with persistent disease following surgery medical treatment as adjuvant therapy is recommended14. There are three pharmacological classes currently available for acromegaly management, to be used as monotherapy or in combination, namely somatostatin analogues (i.e., octreotide, lanreotide and pasireotide), the GH receptor antagonist pegvisomant and dopamine agonists (e.g., cabergoline), which are off-label used15,16.

Although acromegaly is a rare disease, the clinical, economic and health-related quality of life burden is considerable due to the wide spectrum of comorbidities and the need for lifelong management17. Achieving biochemical control is associated with improvements in cost, quality of life, and mortality, albeit not to the level of the general population18.

Claims databases are widely used in pharmacoepidemiology to provide real-world evidence on disease burden, treatment patterns, healthcare resource utilization, benefit-risk profile of drugs and costs19. Italy is rich in electronic healthcare data at loco-regional and national level, and these data have been increasingly used in the last decade for research purposes. Large scale distributed database networks have also been set up in Italy, with the potential to substantially increase statistical power when studying rare diseases/outcomes20. However, the identification of rare diseases in claims databases remains challenging and requires validated coding algorithms.

Several epidemiologic studies applied different coding algorithms for acromegaly ascertainment in claims databases, but their accuracy has never been assessed, mostly due to the lack of a gold standard cohort with true positive cases.

The aim of this study was to newly develop and revise existing coding algorithms for the identification of acromegaly cases and to compare their accuracy in detecting such patients using claims databases from Caserta Local Health Unit (LHU) and Sicily Region, in Southern Italy.

Methods

Data sources

This Italian, retrospective, population-based study was conducted in the period January 2015–December 2018, using data extracted from the fully-anonymized administrative record linkage databases of Caserta Local Health Unit (LHU), with an average of 1,060,904 inhabitants. Moreover, data from the fully-anonymized administrative health databases of Sicily Region in the period January 2011–December 2018, with an average of 5,031,655 inhabitants, were used as a testing dataset to evaluate the performance of the proposed algorithms and to substantiate the estimates of acromegaly cases identified in Caserta LHU. Both databases contain demographic and medical data that is collected through services provided by the Italian National Health Service (NHS). They include information on demographics of residents in each catchment area, outpatient pharmacy claims, hospital discharges, exemptions from co-payment, referrals for outpatient diagnostic tests and specialist’s visits database. The content and context of such NHS claims databases have been described in detail elsewhere19. Claims databases from Caserta LHU also comprise the electronic therapeutic plans (ETP) database. In Italy, ETPs are directly filled by the specialists, who provide information on the exact brand name, number of dispensed packages, and indication for use. The dispensed drugs were coded using the Anatomical Therapeutic Clinical (ATC) classification system and the Italian Marketing Authorization Code (AIC), while comorbidities and indications of use were coded through the ninth revision of the International Classification of Diseases—Clinical Modification (ICD-9 CM).

Algorithm definition

To detect the presence of acromegaly in the target population, four different algorithms were proposed and developed based on a systematic review of published articles of acromegaly epidemiology using claims database-based algorithms2. Specifically, each algorithm was developed using a combination of acromegaly-related ICD-9 CM diagnostic codes (253.0), ICD-9 CM surgical procedure codes (07.6x and 92.3x), co-payment exemption codes (001 and 253.0), specialist visits, laboratory exams and pharmacy claims for somatostatin analogues (ATC: H01CB02, H01CB03, H01CB05) or pegvisomant (ATC: H01AX01). “Algorithm 1” was developed by Caputo et al. in 20184, “Algorithm 2” consisted of a revision of the “Algorithm 1”, which included further acromegaly-related codes, while the “Algorithm 3” and the “Algorithm 4” were developed through different combinations of the acromegaly-related codes used in published articles3,4,2123. For each algorithm, as proposed by Caputo et al.4, pharmacy claims for drugs approved for acromegaly treatment were not taken into account if (i) patients had received less than three separate drug prescriptions for the treatment of acromegaly (occasional drug users) during the observation period; (ii) the medications were not long-acting release (LAR) formulations; (iii) patients taking octreotide or lanreotide had at least one hospitalization with a diagnosis that, as reported in the summary of product characteristics, is one of the indications for use of these drugs [i.e., malignant neoplasms (ICD-9 CM: 140–209, 230–239), liver disorders (ICD-9 CM: 570–573), gastrointestinal bleeding (ICD-9 CM: 578), esophageal varices (ICD-9 CM: 42), Cushing’s disease (ICD-9 CM: 255; 255.0)]; (iv) patients had a co-payment exemption code for Cushing’s disease (code: 032). Inclusion and exclusion criteria are reported in Table 1.

Table 1.

Inclusion and exclusion criteria for each proposed algorithm for the identification of acromegaly cases. *Italian coding system.

Algorithm 1 Inclusion criteria
Subjects who had claims suggestive of acromegaly in ≥ 2 of these databases:
1. Hospital discharge records (ICD-9 CM code: 253.0)
2. Exemption from co-payment (exemption code: 001)
3. Prescription or dispensing of any of these drugs: octreotide (ATC: H01CB02), lanreotide (ATC: H01CB03), pegvisomant (ATC: H01AX01), pasireotide (ATC: H01CB05)
4. Prescriptions for any of the following tests: facial bone nuclear magnetic resonance (88.91.3–88.91.4), cranial computed tomography (87.03–87.03.1)
Algorithm 2 Inclusion criteria
Subjects who had claims suggestive of acromegaly in ≥ 2 of these databases
1. Hospital discharge records (ICD-9 CM code: 253.0)
2. Exemption from co-payment (exemption codes: 001, 253.0)
3. Prescription or dispensing of any of these drugs: octreotide (ATC: H01CB02), lanreotide (ATC: H01CB03), pegvisomant (ATC: H01AX01), pasireotide (ATC: H01CB05)
4. Prescriptions for any of the following tests: facial bone nuclear magnetic resonance (88.91.3–88.91.4), cranial computed tomography (87.03–87.03.1), 88.97, 90.35.1 (somatotropic hormone measurement), 90.40.6 (IGF-1 levels measurement)
Algorithm 3 Inclusion criteria
1. Hospital discharge records: ≥ 2 diagnostic codes for acromegaly (ICD-9 CM: 253.0) OR
2. Exemption from co-payment: ≥ 1 exemption code for acromegaly (exemption codes: 001, 253.0) OR
3. ≥ 1 prescription or dispensing of any of these drugs: octreotide (ATC: H01CB02), lanreotide (ATC: H01CB03), pegvisomant (ATC: H01AX01), pasireotide (ATC: H01CB05) AND ≥ 1 surgical procedure code for acromegaly (07.6*: Hypophysectomy; 92.3*: Stereotactic radiosurgery) OR
4. ≥ 1 prescription or dispensing of any of these drugs: octreotide (ATC: H01CB02), lanreotide (ATC: H01CB03), pegvisomant (ATC: H01AX01), pasireotide (ATC: H01CB05) AND ≥ 1 prescription for any of the following tests: facial bone nuclear magnetic resonance (88.91.3–88.91.4), cranial computed tomography (87.03–87.03.1), 88.97, 90.35.1 (somatotropic hormone measurement), 90.40.6 (IGF-1 levels measurement)
Algorithm 4 Inclusion criteria
1. Hospital discharge records: ≥ 1 diagnostic code for acromegaly (ICD-9 CM: 253.0) OR
2. Exemption from co-payment: ≥ 1 exemption code for acromegaly (exemption codes: 001, 253.0) OR
3. ≥ 1 prescription or dispensing of any of these drugs: octreotide (ATC: H01CB02), lanreotide (ATC: H01CB03), pegvisomant (ATC: H01AX01), pasireotide (ATC: H01CB05) AND ≥ 1 surgical procedure code for acromegaly (07.6*: Hypophysectomy; 92.3*: Stereotactic radiosurgery) OR
4. ≥ 1 prescription or dispensing of any of these drugs: octreotide (ATC: H01CB02), lanreotide (ATC: H01CB03), pegvisomant (ATC: H01AX01), pasireotide (ATC: H01CB05) AND ≥ 1 prescription for any of the following tests*: facial bone nuclear magnetic resonance (88.91.3–88.91.4), cranial computed tomography (87.03–87.03.1), 88.97, 90.35.1 (somatotropic hormone measurement), 90.40.6 (IGF-1 levels measurement)
Exclusion criteria (for all algorithms)
Pharmacy claims were not taken into consideration if:
1. Patients had received less than three separate drug prescriptions for the treatment of acromegaly (occasional drug users)
2. The medications were not long-acting release formulations
3. Patients taking octreotide or lanreotide had a hospitalization with a diagnosis different from acromegaly, among those for which there is an indication for the use these drugs, as reported in the summary of product characteristics [malignant neoplasms (ICD-9 CM: 140–209, 230–239), liver disorders (ICD-9 CM: 570–573), gastrointestinal bleeding (ICD-9 CM: 578), esophageal varices (ICD-9 CM: 42), Cushing’s disease (ICD-9 CM: 255; 255.0)]
4. Patients had an exemption code for Cushing’s disease (code: 032)

Gold standard cohort definition

In Caserta LHU, gold-standard cases were defined as subjects which had at least one registration in the ETP database with at least one ICD-9 CM code for acromegaly during the study period. Gold-standard non-cases were defined as all remaining subjects (i.e. registered in Caserta LHU with no ICD-9 CM codes for acromegaly in ETP database).

In Sicily Region, gold standard cases were defined as patients with a confirmed diagnosis of acromegaly in the Endocrinology Unit of the University Hospital of Messina, a province that lies in the Northeastern part of Sicily, during the study period. Data concerning non-cases were not available.

Statistical analysis

For each proposed algorithm, the accuracy was assessed by the sensitivity (SE) and specificity (SP) measures, along with their 95% confidence intervals (CIs) calculated using the exact Clopper–Pearson method for a binomial proportion. Moreover, the Youden Index (i.e. a summary statistic that balances both SE and SP)24 was computed and the algorithm that achieved the highest accuracy (i.e. Youden Index) was considered as the preferred one with respect to the others. Furthermore, the positive predictive value (PPV) and the negative predictive value (NPV) along with their 95% CIs, were also estimated to assess the algorithm’s accuracy. To visually assess the overlapping number of acromegaly cases detected by each coding algorithm with respect to different data sources, Venn diagrams were produced.

The four proposed algorithms were also applied to patients diagnosed with acromegaly in the Endocrinology Unit of the University Hospital of Messina through probabilistic record linkage and the proportion of true positives among these cases was compared with the SE achieved for each algorithm. In addition, to provide an estimate of the percentage of acromegaly cases in the Sicilian population, only the algorithm with the highest discriminatory power was performed in the administrative data of the Sicily Region (testing dataset).

Finally, for the algorithm with the highest Youden Index a network plot was produced to show the number of acromegalic patients identified in Caserta LHU and Sicily Region, respectively, to further substantiate the chronological occurrence of any acromegaly-specific code identified in each claims database. In particular, for each claims database, all the possible pathways by which the subjects were identified over time were represented. Each pathway is made of the chronological sequence by which every acromegaly-specific code occurred for each patient. This can be considered a proxy of the patient journey25.

All statistical analyses were performed using R Foundation for Statistical Computing (R Development Core Team 2008, Vienna, Austria, version: 4.0.3, packages: caret and PropCIs).

Ethics approval

Analyses were conducted in accordance with the ethical standards of the institutional and national research committee and with the 1964 Helsinki Declaration and its later amendments. This study was approved by the Ethics Committee of the Azienda Ospedaliera Universitaria Integrata of Verona (Protocol number 55986, 27th September 2021).

Consent to participate

Informed consent was obtained from all individual patients from the Endocrinology Unit of the University Hospital of Messina included in the study.

Results

In Caserta LHU, during the period 2015–2018, a total of 84, 92, 116 and 146 individuals were identified as having acromegaly according to “Algorithm 1”, “Algorithm 2”, “Algorithm 3” and “Algorithm 4”, respectively. The estimated percentage of positive cases in Caserta LHU (i.e., the number of detected cases over the total number of subjects) for each proposed algorithm ranged from 7.9 (95% CI 6.4–9.8) to 13.8 (95% CI 11.7–16.2) per 100,000 inhabitants (Table 2).

Table 2.

Number of subjects identified by each proposed algorithm as having acromegaly (cases) in Caserta Local Health Unit (period January 2015–December 2018) and in Sicily Region (period January 2011–December 2018). CI = confidence interval calculated using the exact Clopper–Pearson method for a binomial proportion.

Algorithm 1 Algorithm 2 Algorithm 3 Algorithm 4
Caserta local health unit (study period: 2015–2018)
N. identified cases/total population 84/1,060,904 92/1,060,904 116/1,060,904 146/1,060,904
N. identified cases per 100,000 inhabitants (95% CI) 7.9 (6.4–9.8) 8.7 (7.1–10.6) 11.0 (9.1–13.1) 13.8 (11.7–16.2)
Sicily region (study period: 2011–2018)
N. identified cases/total population 394/5,031,655 533/5,031,655 768/5,031,655 824/5,031,655
N. identified cases per 100,000 inhabitants (95% CI) 7.8 (7.1–8.6) 10.6 (9.7–11.5) 15.3 (14.2–16.4) 16.4 (15.3–17.5)

Seven overlapping cases were detected at all data sources by “Algorithm 1” and “Algorithm 2”, whereas 4 overlapping cases were detected in all data sources by “Algorithm 3” and “Algorithm 4”, respectively (Fig. 1).

Figure 1.

Figure 1

Frequency distribution (Venn diagrams) of the number subjects identified by each proposed algorithm as having acromegaly, with respect to different data sources (i.e. databases of Caserta Local Health Unit and Sicily Region) during the study period.

Of the 1,060,904 subjects registered in Caserta LHU, 81,388 (7.7%) had least one registration in the ETP database. Among them, 38 individuals with at least one ICD-9 CM code for acromegaly (i.e. gold-standard cases) were identified, yielding an estimated prevalence of 3.6 (95% CI 2.6–4.9) cases per 100,000 inhabitants. The mean age of gold-standard cases was 54.5 ± 13.9 years and 20 (52.6%) of them were females. Concerning the 1,060,866 gold-standard non-cases, their mean age was 26.6 ± 20.7 years and 538,086 (50.7%) of them were females.

For each proposed algorithm, the number of true and false positive/negative counts as well as diagnostic measures are reported in Table 3.

Table 3.

Number of subjects with at least one diagnosis code for acromegaly in the electronic therapeutic plans databases from Caserta Local Health Unit (cases) and other subjects registered in Caserta Local Health Unit (non-cases) in the period January 2015–December 2018, as well as number of true and false positive/negative counts, diagnostic and predictive accuracy measures with respect to each proposed algorithm. ^PPV and NPV were computed taking into account that the disease prevalence in Caserta Local Health Unit was 3.61 cases per 100,000 persons (i.e. 38 true cases over a total sample of 1,051,943 subjects); °CI calculated using the exact Clopper–Pearson method for a binomial proportion; §CI calculated using the standard logit confidence intervals. Predictive measures are dependent on disease prevalence; *The Youden Index is computed as the sum of SE (%) and SP (%) minus 100% and denote the accuracy of each considered algorithm; #The algorithm with the highest accuracy. TP True Positive, FP False Positive, FN False Negative, TN True Negative, SE sensitivity, SP specificity, PPV Positive Predictive Value, NPV Negative Predictive Value, CI confidence interval.

Counts Accuracy measures (95% CI°) Youden index (%)* Predictive accuracy measures (95% CI§)
TP FP FN TN Cases Non-cases Positive Negative SE (%) SP (%) PPV (%)^ NPV (%)^
(TP + FN) (FP + TN) (TP + FP) (TN + FN)
Algorithm 1 27 57 11 1,060,809 38 1,060,866 84 1,060,820 71.05 (54.10–84.58) 99.995 (99.99–100.00) 71.047 32.1 (25.4–39.7) 99.999 (99.999–100)
Algorithm 2 32 60 6 1,060,806 38 1,060,866 92 1,060,812 84.21 (68.75–93.98) 99.994 (99.99–100.00) 84.205# 34.8# (28.6–41.6) 99.999 (99.999–100)
Algorithm 3 30 86 8 1,060,780 38 1,060,866 116 1,060,788 78.95 (68.68–90.45) 99.992 (99.99–100.00) 78.939 25.9 (21.1–31.3) 99.999 (99.999–100)
Algorithm 4 32 114 6 1,060,752 38 1,060,866 146 1,060,758 84.21 (68.75–93.98) 99.989 (99.99–100.00) 84.200 21.9 (18.2–26.1) 99.999 (99.999–100)

SE ranged from 71.1% (95% CI 54.1–84.6%) for “Algorithm 1” to 84.2% (95% CI 68.8–94.0%) for “Algorithm 2” and “Algorithm 4”, while SP was higher than 99.9% for each proposed algorithm. The “Algorithm 2” achieved both the highest accuracy (i.e. Youden Index of 84.2%) and the highest accuracy (PPV of 34.8; 95% CI 28.6–41.6) and it was designated to be the preferred one.

The application of the coding algorithms on the claims databases of Sicily Region yielded a total of 394, 533, 768 and 824 acromegaly cases identified by “Algorithm 1”, “Algorithm 2”, “Algorithm 3” and “Algorithm 4”, respectively, with an estimated percentage of detected cases ranging from 7.8 (95% CI 7.1–8.6) to 16.4 (95% CI 15.3–17.5) per 100,000 inhabitants (Table 2). Only twenty-six and sixty-three overlapping cases were detected at all data sources by the “Algorithm 1” and “Algorithm 2”, respectively (Fig. 1). When assessed on the 76 (46 females and 30 males) confirmed acromegaly cases in the Endocrinology Unit of the University Hospital of Messina (Sicily), the percentage of true positives detected by “Algorithm 2” was 82.9% (95% CI 72.5–90.6), which was quite consistent with SE estimate in the Caserta LHU database. The mean age at diagnosis of these patients was 47.7 ± 17.4 years.

Patient journey maps showed that, for the majority of acromegalic patients identified by the “Algorithm 2” in Sicily Region, the first acromegaly-related code occurred in pharmacy claims database (188/533; 35.3%), while in Caserta LHU the first acromegaly-identifying code was mostly found in specialist’s visits database (58/92; 63.1%). The pathway of chronological occurrence of acromegaly-related codes in each claims database is shown in Fig. 2.

Figure 2.

Figure 2

Network plots showing the chronological occurrence of each acromegalic-specific code included in the algorithm 2 in both Caserta Local Health Unit and Sicily Region. Each node represents each claims database (i.e., diagnostic codes, co-payment exemption codes, specialist’s visits and pharmacy claims). Edges define the chronological direction to follow to identify a specific pathway. In panel (a) the frequencies (defined with respect to the number of patients in the start node) of different criteria of acromegalic patients’ identification are reported along the edges. For instance, starting from the specialist visit records in Caserta Local Health Unit, 12 patients (20.7%) having both a first acromegaly diagnosis recorded in that dataset plus a later additional acromegaly-specific code in the co-payment exemptions database were counted. In panel (b) the corresponding median lag time (days) among different criteria of acromegalic patients’ identification is reported for each pathway identified. For instance, starting from the specialist visit records in Caserta Local Health Unit, for the 12 patients (20.7%), a median time of 567 necessary days to be identified also by a later additional acromegaly-specific code in the co-payment exemptions database was observed.

Discussion

To our knowledge, this is the first population-based study that assessed and compared the accuracy of different coding algorithms for the identification of patients with acromegaly using claims databases. Generating epidemiological evidence on rare diseases like acromegaly is crucial to evaluate their impact in terms of burden of disease and to identify unmet clinical needs.

All the four algorithms tested were able, to different extent, to identify true acromegalic patients in claims databases. Based on this, we found that both “Algorithm 2” and “Algorithm 4” provided the most reliable results matching the majority of true acromegalic patients (84.2%). In particular, the preferred “Algorithm 2” achieved the highest discriminatory accuracy, as shown by the Youden Index and PPV, and yielded an estimated prevalence comparable to the global one estimated in the meta-analysis2. Furthermore, as compared to the “Algorithm 4”, “Algorithm 2” yielded a better performance also according to principle of parsimony, since it required less acromegaly-identifying codes to be computed, achieving the same SE and higher SP. This algorithm was developed from the one proposed by Caputo et al.4 (“Algorithm 1”) by adding further codes for specific laboratory exams, such as the codes for somatotropic hormone measurement and IGF-1 levels measurement, and searching for acromegaly-specific co-payment exemptions also using ICD-9 codes. Indeed, in Italy, it is not uncommon that disease-specific co-payment exemptions are registered using the ICD-9 code rather than the specific exemption code. As compared to “Algorithm 1”, the addition of these codes led to considerable improvements in their accuracy.

The estimated percentages of positive acromegaly cases identified using both Caserta LHU and Sicily Region claims databases were slightly higher than the global prevalence estimated in our meta-analysis of observational studies [5.9 (95% CI 4.4–7.9) cases per 100,000 persons]2, but they were comparable to those reported in the Italian epidemiological studies, conducted by Gatto et al.3 and Caputo et al.4. This corroborates the fact that prevalence estimates based on the proposed algorithms are consistent when applied to different databases of different Italian geographical areas.

One of the main obstacles in using real-world data sources, especially for rare diseases, is related to disease coding and to the availability of specific diagnostic codes. In many cases, rare diseases do not have specific codes (e.g. muscular dystrophies) nor co-payment exemption codes20. However, as compared to other rare diseases, acromegaly diagnostic (ICD-9 CM: 253.0) and co-payment exemption (001) codes are rather specific, even if they also refer to gigantism, a clinical condition due to GH overproduction during childhood and adolescence, specifically. Therefore, such codes are reliable enough to identify acromegaly in claims databases. This also holds true for the drugs used for the management of acromegalic patients. The GH receptor antagonist pegvisomant is exclusively approved for the treatment of acromegaly in patients who have had an inadequate response to surgery and/or radiation therapy and/or other medical therapies; although somatostatin analogues are also approved for indications other than acromegaly, they are reliable proxies of acromegaly, especially when used in combination with other acromegaly-related codes.

Patient journey maps showed that most of the cases were primarily identified through pharmacy claims and specialists’ visit. Accordingly, the frequency of those healthcare services is in general much higher than hospitalizations, surgical procedures and exemption codes and, as compared to other services, are more likely to be more accurately identified in claims databases.

One of the main strengths of this study is that it is a large-scale population-based study, for a total of more than 6,000,000 patients from both Caserta LHU and Sicily Region. This is especially relevant for research in the field of rare diseases, where the number of affected patients is very small. However, some limitations warrant caution. Considering that the database used was claims-based, this study presents some limitations associated to this kind of data sources, such as potential misclassification due to potentially inconsistent and inaccurate coding practice. Moreover, since claims data are primarily collected for administrative purposes (e.g., reimbursement), the proposed coding algorithms may yield different performances in populations with different administrative practices and coding patterns. Finally, given that complete remission from the disease is achieved in around 50% of surgically-treated patients2630 and that medical therapy is mainly indicated in those patients who failed to achieve remission after surgery, the extraction of the gold standard cohort from the ETP database may not be fully representative of acromegaly cases. Nevertheless, this has unlikely affected the comparison of the accuracy of different coding algorithms for acromegaly identification.

Conclusions

In this study, we have developed and tested four algorithms for the identification of acromegaly using claims databases from Caserta LHU and Sicily Region, achieving highly consistent diagnostic accuracy measures. Despite identifying a rare disease such as acromegaly using real-world data is challenging, this study showed that robust validity testing may yield the identification of accurate coding algorithms.

Author contributions

S.C.: writing the original draft and interpretation of data. A.F., L.L., V.I.: data analysis, visualization and interpretation of data. D.G., A.C., M.C.D.M., M.R., J.S., F.B.A.: revising the manuscript critically for important intellectual content. G.T.: conception and design of the study, interpretation of data and supervision. All authors contributed to and approved the final manuscript.

Funding

This study was conducted in the context of the “INSPIRE” Progetti di Ricerca di Interesse Nazionale (PRIN) project, which received a grant from the Italian Ministry of Education, University and Research (2017N8CK4K).

Data availability

The individual patient-level dataset generated and/or analyzed during the current study is not publicly available due to the agreement with data provider, but aggregated data can be shared from the corresponding author on reasonable request.

Competing interests

G.T. has served in the last three years on advisory boards/seminars funded by SANOFI, Eli Lilly, AstraZeneca, Abbvie, Servier, Mylan, Gilead, Amgen; he was the scientific director of a Master program on pharmacovigilance, pharmacoepidemiology and real-world evidence which has received non-conditional grant from various pharmaceutical companies; he coordinated a pharmacoepidemiology team at the University of Messina until October 2020, which has received funding for conducting observational studies from various pharmaceutical companies (Boehringer Ingelheim, Daichii Sankyo, PTC Pharmaceuticals). He is also scientific coordinator of the academic spin-off "INSPIRE srl" which has received funding for conducting observational studies from contract research organizations (RTI Health Solutions, Pharmo Institute N.V.). None of these listed activities are related to the topic of the manuscript. The other authors have no other relevant affiliations or financial involvement with any organization or entity with a financial interest in or financial conflict with the subject matter or materials discussed in the manuscript apart from those disclosed.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Colao A, et al. Acromegaly. Nat. Rev. Dis. Primers. 2019;5(1):20. doi: 10.1038/s41572-019-0071-6. [DOI] [PubMed] [Google Scholar]
  • 2.Crisafulli S, et al. Global epidemiology of acromegaly: A systematic review and meta-analysis. Eur. J. Endocrinol. 2021;185(2):251–263. doi: 10.1530/EJE-21-0216. [DOI] [PubMed] [Google Scholar]
  • 3.Gatto F, et al. Epidemiology of acromegaly in Italy: Analysis from a large longitudinal primary care database. Endocrine. 2018;61:533–541. doi: 10.1007/s12020-018-1630-4. [DOI] [PubMed] [Google Scholar]
  • 4.Caputo M, et al. Use of administrative health databases to estimate incidence and prevalence of acromegaly in Piedmont Region, Italy. J. Endocrinol. Investig. 2019;42(4):397–402. doi: 10.1007/s40618-018-0928-7. [DOI] [PubMed] [Google Scholar]
  • 5.Cannavò S, et al. Increased prevalence of acromegaly in a highly polluted area. Eur. J. Endocrinol. 2010;163(4):509–513. doi: 10.1530/EJE-10-0465. [DOI] [PubMed] [Google Scholar]
  • 6.Fleseriu M, et al. Prevalence of comorbidities and concomitant medication use in acromegaly: Analysis of real-world data from the United States. Pituitary. 2022;25(2):296–307. doi: 10.1007/s11102-021-01198-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Ferraù F, Albani A, Ciresi A, Giordano C, Cannavò S. Diabetes secondary to acromegaly: Physiopathology, clinical features and effects of treatment. Front. Endocrinol. (Lausanne) 2018;9:358. doi: 10.3389/fendo.2018.00358. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Lavrentaki A, Paluzzi A, Wass JA, Karavitaki N. Epidemiology of acromegaly: Review of population studies. Pituitary. 2017;20(1):4–9. doi: 10.1007/s11102-016-0754-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Pivonello R, et al. Complications of acromegaly: Cardiovascular, respiratory and metabolic comorbidities. Pituitary. 2017;20(1):46–62. doi: 10.1007/s11102-017-0797-7. [DOI] [PubMed] [Google Scholar]
  • 10.Colao A, Ferone D, Marzullo P, Lombardi G. Systemic complications of acromegaly: Epidemiology, pathogenesis, and management. Endocr. Rev. 2004;25(1):102–152. doi: 10.1210/er.2002-0022. [DOI] [PubMed] [Google Scholar]
  • 11.Prencipe N, et al. ACROSCORE: A new and simple tool for the diagnosis of acromegaly, a rare and underdiagnosed disease. Clin. Endocrinol. (Oxf). 2016;84(3):380–385. doi: 10.1111/cen.12959. [DOI] [PubMed] [Google Scholar]
  • 12.Katznelson L, et al. Acromegaly: An endocrine society clinical practice guideline. J. Clin. Endocrinol. Metab. 2014;99(11):3933–3951. doi: 10.1210/jc.2014-2700. [DOI] [PubMed] [Google Scholar]
  • 13.Giustina A, et al. Multidisciplinary management of acromegaly: A consensus. Rev. Endocr. Metab. Disord. 2020;21(4):667–678. doi: 10.1007/s11154-020-09588-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Fleseriu M, et al. A Pituitary Society update to acromegaly management guidelines. Pituitary. 2021;24:1–13. doi: 10.1007/s11102-020-01091-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Grottoli S, et al. ACROSTUDY: The Italian experience. Endocrine. 2015;48(1):334–341. doi: 10.1007/s12020-014-0393-9. [DOI] [PubMed] [Google Scholar]
  • 16.Filopanti M, et al. Growth hormone receptor variants and response to pegvisomant in monotherapy or in combination with somatostatin analogs in acromegalic patients: A multicenter study. J. Clin. Endocrinol. Metab. 2012;97(2):E165–E172. doi: 10.1210/jc.2011-1769. [DOI] [PubMed] [Google Scholar]
  • 17.Ben-Shlomo A, Sheppard MC, Stephens JM, Pulgar S, Melmed S. Clinical, quality of life, and economic value of acromegaly disease control. Pituitary. 2011;14(3):284–294. doi: 10.1007/s11102-011-0310-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Whittington MD, Munoz KA, Whalen JD, Ribeiro-Oliveira A, Campbell JD. Economic and clinical burden of comorbidities among patients with acromegaly. Growth Horm. IGF Res. 2021;59:101389. doi: 10.1016/j.ghir.2021.101389. [DOI] [PubMed] [Google Scholar]
  • 19.Trifirò G, et al. The role of European Healthcare Databases for post-marketing drug effectiveness, safety and value evaluation: Where does Italy stand? Drug Saf. 2019;42(3):347–363. doi: 10.1007/s40264-018-0732-5. [DOI] [PubMed] [Google Scholar]
  • 20.Crisafulli S, et al. Role of healthcare databases and registries for surveillance of orphan drugs in the real-world setting: The Italian case study. Expert Opin. Drug Saf. 2019;18(6):497–509. doi: 10.1080/14740338.2019.1614165. [DOI] [PubMed] [Google Scholar]
  • 21.Broder MS, Chang E, Cherepanov D, Neary MP, Ludlam WH. Incidence and prevalence of acromegaly in the United States: A claims based analysis. Endocr. Pract. 2016;22(11):1327–1335. doi: 10.4158/EP161397.OR. [DOI] [PubMed] [Google Scholar]
  • 22.Burton T, Le Nestour E, Neary M, Ludlam WH. Incidence and prevalence of acromegaly in a large US health plan database. Pituitary. 2016;19(3):262–267. doi: 10.1007/s11102-015-0701-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Placzek H, Xu Y, Mu Y, Begelman SM, Fisher M. Clinical and economic burden of commercially insured patients with acromegaly in the United States: A retrospective analysis. J. Manag. Care Spec. Pharm. 2015;21(12):1106–1112. doi: 10.18553/jmcp.2015.21.12.1106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Youden WJ. Index for rating diagnostic tests. Cancer. 1950;3(1):32–35. doi: 10.1002/1097-0142(1950)3:1<32::AID-CNCR2820030106>3.0.CO;2-3. [DOI] [PubMed] [Google Scholar]
  • 25.Trebble TM, Hansi N, Hydes T, Smith MA, Baker M. Process mapping the patient journey: An introduction. BMJ. 2010;341:c4078. doi: 10.1136/bmj.c4078. [DOI] [PubMed] [Google Scholar]
  • 26.Jane JA, Jr, et al. Endoscopic transsphenoidal surgery for acromegaly: Remission using modern criteria, complications, and predictors of outcome. J. Clin. Endocrinol. Metab. 2011;96(9):2732–2740. doi: 10.1210/jc.2011-0554. [DOI] [PubMed] [Google Scholar]
  • 27.Abe T, Lüdecke DK. Recent primary transnasal surgical outcomes associated with intraoperative growth hormone measurement in acromegaly. Clin. Endocrinol. (Oxf). 1999;50(1):27–35. doi: 10.1046/j.1365-2265.1999.00591.x. [DOI] [PubMed] [Google Scholar]
  • 28.Campbell PG, Kenning E, Andrews DW, Yadla S, Rosen M, Evans JJ. Outcomes after a purely endoscopic transsphenoidal resection of growth hormone-secreting pituitary adenomas. Neurosurg. Focus. 2010;29(4):E5. doi: 10.3171/2010.7.FOCUS10153. [DOI] [PubMed] [Google Scholar]
  • 29.Gittoes NJ, Sheppard MC, Johnson AP, Stewart PM. Outcome of surgery for acromegaly—The experience of a dedicated pituitary surgeon. QJM. 1999;92(12):741–745. doi: 10.1093/qjmed/92.12.741. [DOI] [PubMed] [Google Scholar]
  • 30.Giustina A, et al. Criteria for cure of acromegaly: A consensus statement. J. Clin. Endocrinol. Metab. 2000;85(2):526–529. doi: 10.1210/jcem.85.2.6363. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The individual patient-level dataset generated and/or analyzed during the current study is not publicly available due to the agreement with data provider, but aggregated data can be shared from the corresponding author on reasonable request.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES