Skip to main content
AMIA Annual Symposium Proceedings logoLink to AMIA Annual Symposium Proceedings
. 2008;2008:732–736.

Identification of Documented Medication Non-Adherence in Physician Notes

Alexander Turchin a,b,c, Holly I Wheeler d, Matthew Labreche d, Julia T Chu c, Merri L Pendergrass b,c, Jonathan S Einbinder a,b,c
PMCID: PMC2655985  PMID: 18998827

Abstract

Medication non-adherence is common and the physician’s awareness of it may be an important factor in clinical decision making. Few sources of data on physician awareness of medication non-adherence are available. We have designed an algorithm to identify documentation of medication non-adherence in the text of physician notes.

The algorithm recognizes eight semantic classes of documentation of medication non-adherence. We evaluated the algorithm against manual ratings of 200 randomly selected notes of hypertensive patients. The algorithm detected 89% of the notes with documented medication non-adherence with specificity of 84.7% and positive predictive value of 80.2%. In a larger dataset of 1,000 documents, notes that documented medication non-adherence were more likely to report significantly elevated systolic (15.3% vs. 9.0%; p = 0.002) and diastolic (4.1% vs. 1.9%; p = 0.03) blood pressure. This novel clinically validated tool expands the range of information on medication non-adherence available to researchers.

Introduction

Physician behavior is an important determinant of quality of care and clinical outcomes1, 2. It is, in turn, influenced by many factors3. In order to improve quality of care by influencing physician behavior it is important to understand these factors and how they affect patient care.

Medication non-adherence is common among patients4 and its association with adverse clinical outcomes is well documented5. Physician surveys show that awareness of the patient’s non-adherence may change the treatment they prescribe to the patients6. However, the magnitude of this effect is not well studied.

One of the reasons for the paucity of research in this area is the difficulty in obtaining information on physician awareness of their patients’ medication adherence. Many studies of medication adherence draw upon the data on timing of prescription refills to estimate adherence7. However, this information is not routinely available to physicians and consequently may not be optimal for modeling their behavior.

Physicians are expected to record all of the information obtained during a patient encounter, including their perception of medication adherence, in their notes. As electronic medical records become more common, notes are frequently available in digital format that is amenable to computational analysis. In recent years a number of successful systems for analysis of narrative medical documents have been reported811; however, none of them analyzes the text for documentation of patient compliance. In our previous work analyzing text of physician notes we successfully identified documentation of patient non-adherence to provider recommendations (including treatment, follow-up schedule and others) and found that it was associated with an increased rate of emergency department visits12. In this paper we describe the design of the algorithm to detect documentation of medication non-adherence in the text of provider notes. We report on the algorithm’s validation with respect to the accuracy of the text analysis and relationship to clinical outcomes.

Materials and Methods

The goal of this project was to develop software that could identify documentation of non-adherence to the medication regimen in the text of narrative notes in the electronic medical record. For the purpose of this study non-adherence was defined as lack of compliance with provider recommendations with regards to medications. It included patients discontinuing medications permanently or temporarily as well as using at an incorrect dose or frequency. Self-discontinuation of medications due to side effects was included if this was the first time the provider found out about it. Self-discontinuations of medications due to side effects in remote past were excluded. Discontinuations of medications as directed by a physician were excluded as well.

Algorithm

The algorithm used for identification of documentation of non-adherence in the text of the notes is schematically represented in Figure 1. The software takes plain text files containing multiple documents as input. The documents are split into sentences and three classes of semantic heuristics are sequentially applied to each sentence: a) continuous heuristics b) discontinuous heuristics and c) discontinuous heuristics with negative qualifiers. Continuous heuristics specify every word in the phrase to be detected (e.g. “does not take”), discontinuous heuristics specify some but not all of the words (e.g. “refuses [any words] and to take”), and discontinuous heuristics with negative qualifiers also provide a counter-heuristic that must not match (e.g. “stopped her medication” but not “I stopped her medication”). The order of application of the heuristics was chosen based on the expected relative precision: while discontinuous heuristics are more flexible, they inherently have a degree of unpredictability that lowers their specificity. The software implements a total of 87 heuristics that can be classified into eight semantic classes (Table 1). Multiple non-overlapping heuristics can be matched to a single sentence.

Figure 1.

Figure 1

Algorithm for Identification of Documentation of Medication Non-Adherence

Table 1.

Semantic Classes of Heuristics Representing Documentation of Medication Non-Adherence

Semantic Class Does not take Refuses Inconsistent Non-Compliant
Examples Has not been taking his glyburide.
Unable to afford her medications.
Hesitant to try insulin.
He is unwilling to take beta-blockers.
She takes her Lipitor only rarely. Poor medication compliance.
Semantic Class Uncertain Difficulty Counseling Not Tolerated
Examples He is not sure what medications he should be taking. Patient struggles with taking her insulin. I emphasized the importance of taking his blood pressure medications Unable to tolerate lisinopril due to cough.

Evaluation

We evaluated the accuracy of the software on a dataset of 200 primary care physicians’ notes from the electronic medical record system of two large academic hospitals. In order to ensure high event frequency necessary to achieve statistical power required for the evaluation, the notes were randomly selected from a set of documents that recorded a significantly elevated blood pressure (> 150/95 mm Hg) and no anti-hypertensive medication intensification (based on physician surveys that indicate that patient non-adherence is a common reason not to intensify treatment6). Blood pressure values and anti-hypertensive medication intensification were computationally abstracted from the notes using a previously validated technique13. Each note was manually analyzed by two independent reviewers (senior pharmacy students), who did not participate in the development of the algorithm, to identify all sentences that represented documentation of the patient’s non-adherence to their medication regimen. Sentences where the reviewers did not reach an agreement were re-analyzed to determine the consensus designation. The software output was compared to the consensus rating to determine sensitivity (recall), specificity and positive predictive value (precision) of the software for detection of notes with documented medication non-adherence..

In order to validate the software output clinically, we analyzed a set of 1,000 randomly selected primary care physicians’ notes on hypertensive diabetic patients with documented blood pressure. It has been shown that medication non-adherence identified from other data sources is associated with higher incidence of elevated blood pressure14. We therefore analyzed the relationship of documented medication non-adherence identified by the software in the note and the frequency of significantly elevated blood pressure (> 150/95) recorded in the same note to test the hypothesis that the data generated by our software conforms to this previously established association.

Statistical Analysis

Cohen’s kappa was used to assess inter-reviewer agreement. Normal approximation was used to calculate confidence intervals for sensitivity, specificity, and positive predictive value. Fisher exact test was used to estimate the significance of differences in fractions of visits with significantly elevated blood pressure between notes with and without documentation of medication non-adherence.

IRB

The study protocol was reviewed and approved by Partners Human Research Committee.

Results

The final consensus of the manual review identified a total of 82 notes with documented medication non-adherence among the 200 evaluated notes. The kappa statistic for agreement between the two reviewers at the note level was 0.673 (95% CI 0.565 – 0.781) indicating good agreement. In these 82 notes manual review consensus identified 150 sentences (range 1–6 per note) that recorded documentation of medication non-adherence. Eighty (53.3%) of these sentences were identified by both reviewers in the initial analysis

Sensitivity, specificity and positive predictive value of the software results are shown in Table 2. The algorithm correctly identified 89% of the notes with documented medication non-adherence. Examples of phrases correctly and incorrectly rated by the algorithm are listed in Table 3. Many of the missed phrases included misspellings and information distributed over several sentences or requiring the broad context of the note for correct interpretation. Common false-positives included documentation of patient not taking their medication due to physician instructions or not taking medications that were not necessarily ordered / recommended by a medical provider.

Table 2.

Accuracy of Identification of Notes with Documented Medication Non-Adherence

Sensitivity 89.0% (± 6.0%)
Specificity 84.7% (± 5.5%)
Positive Predictive Value 80.2% (± 6.3%)
Negative Predictive Value 91.7 (± 4.7%)

95% confidence interval is given in parentheses

Table 3.

Examples of Phrases Correctly and Incorrectly Rated by the Algorithm

True Positives He has run out of pain medications.
With regard to hyperkalemia, he did not increase the Kayexalate as instructed.
He may have missed a few doses.
False Positives She felt warm but has not taken her temperature.
She does not take any medications for relief of her hernia symptoms.
Last TSH was very high 10/99, reflecting being off of Synthroid for her scan.
False Negatives Takes meds but none today.
I had asked her to increase her lisinopril dosing to 40 mg a day. She presents today stating that she has not done this.
Withut meds today.
Also compliance may be an issue.

In order to establish clinical validity of the data generated by the algorithm we analyzed 1,000 notes with any documented blood pressure level to test the hypothesis that patients with documented non-adherence would be more likely to have elevated blood pressure. None of these notes were previously analyzed in the development of the algorithm. The algorithm detected a total of 633 phrases in 365 notes documenting medication non-adherence. The frequency of phrases documenting medication non-adherence per note was distributed exponentially (Figure 2). Phrases stating that the patient does not take the medication were the most frequent (Figure 3) followed by phrases that referred to the patient refusing a change in their medication regimen. In the entire set of 1,000 notes, the average blood pressure documented in the note was 130.5 mm Hg systolic and 74.1 mm Hg diastolic. Notes that had documented medication non-adherence identified by the software were more than 1.5-fold likely to have systolic blood pressure >150 mm Hg (15.3% vs. 9.0%; p = 0.002) and more than two-fold likely to have diastolic blood pressure >95 mm Hg (4.1% vs. 1.9%; p = 0.03) recorded.

Figure 2.

Figure 2

Distribution of Frequency of Phrases Documenting Medication Non-Adherence per Note

Figure 3.

Figure 3

Distribution of Non-Adherence Heuristic Classes in 1,000 patient notes

Discussion

Few means to study patient medication adherence information are currently available to researchers. Most published studies have used physician surveys6, 15, 16 which do not allow large-scale investigations. In this project we have investigated feasibility of using computational analysis of the text of physician notes as a novel source of data on physician awareness of patient medication adherence.

Our analysis has shown that documentation of medication non-adherence is lexically diverse and is represented by multiple semantic classes. Nevertheless the algorithm we designed was able to achieve high sensitivity detecting nearly 9 in 10 notes with documented medication non-adherence while maintaining acceptable specificity and precision. Many of the instances of documentation of non-adherence the algorithm missed required advanced semantic analysis including information distributed across multiple sentences or interpretation of the context (e.g. a record of “non-compliance” without specifically referring to medications but located in a paragraph discussing pharmacologic treatment of the patient’s hypertension). Common false positives included documentation of patients not taking over the counter medications or discontinuing medications according to their provider’s recommendations. Crucially, we were able to show that the data generated by the algorithm correlates well with expected patient outcomes (higher frequency of elevated blood pressure) providing a level of clinical validation in addition to factual validation against the human interpretation of the text (which can also be fallible).

Data obtained using the algorithm could be used for several purposes. Researchers could utilize it for large-scale studies evaluating, for example, how physicians ascertain medication non-adherence or the effect of physicians’ perception of patient medication adherence on their clinical decision making. Operationally, if no other sources of information on patient medication adherence are available (as is frequently the case), it could be used to identify high-risk patients who could potentially benefit from more intensive clinical interventions (e.g. a phone call from a pharmacist).

Our study has several limitations. The data we analyzed came from primary care practices of two academic medical centers in Massachusetts and may not be applicable to other healthcare settings. The inter-reviewer agreement for the manual review to identify documentation of medication non-adherence was less than what could have been expected. This may confound the interpretation of the evaluation of the algorithm’s accuracy. However, lower levels of inter-reviewer agreement are common for medical documents17 and likely reflect the complex nature of these documents. Per design requirements, our algorithm identified documentation of past medication non-adherence and medication discontinuation due to side effects, but other researchers may require different definitions which our algorithm in its current form is not able to provide.

Conclusion

We have designed and validated the first, to our knowledge, algorithm that identifies documentations of medication non-adherence in the text of physician notes. The algorithm showed high accuracy and was also validated clinically. This is an important step in enhancing the armamentarium of health services researchers with novel tools that let them utilize previously inaccessible sources of data.

Acknowledgments

This research was supported in part by funding from Diabetes Action Research and Education foundation and Agency for Healthcare Research and Quality (R18 HS017030).

References

  • 1.Phillips LS, Branch WT, Cook CB, et al. Clinical inertia. Ann Intern Med. 2001 Nov 6;135(9):825–834. doi: 10.7326/0003-4819-135-9-200111060-00012. [DOI] [PubMed] [Google Scholar]
  • 2.DiMatteo MR. Variations in patients’ adherence to medical recommendations: a quantitative review of 50 years of research. Med Care. 2004 Mar;42(3):200–209. doi: 10.1097/01.mlr.0000114908.90348.f9. [DOI] [PubMed] [Google Scholar]
  • 3.Turchin A, Shubina M, Chodos AH, Einbinder JS, Pendergrass ML. Effect of board certification on antihypertensive treatment intensification in patients with diabetes mellitus. Circulation. 2008 Feb 5;117(5):623–628. doi: 10.1161/CIRCULATIONAHA.107.733949. [DOI] [PubMed] [Google Scholar]
  • 4.Dunbar-Jacob J, Mortimer-Stephens MK. Treatment adherence in chronic disease. J Clin Epidemiol. 2001 Dec;54(Suppl 1):S57–60. doi: 10.1016/s0895-4356(01)00457-7. [DOI] [PubMed] [Google Scholar]
  • 5.Pladevall M, Williams LK, Potts LA, Divine G, Xi H, Lafata JE. Clinical outcomes and adherence to medications measured by claims data in patients with diabetes. Diabetes Care. 2004 Dec;27(12):2800–2805. doi: 10.2337/diacare.27.12.2800. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Safford MM, Shewchuk R, Qu H, et al. Reasons for not intensifying medications: differentiating “clinical inertia” from appropriate care. J Gen Intern Med. 2007 Dec;22(12):1648–1655. doi: 10.1007/s11606-007-0433-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Steiner JF, Prochazka AV. The assessment of refill compliance using pharmacy records: methods, validity, and applications. J Clin Epidemiol. 1997 Jan;50(1):105–116. doi: 10.1016/s0895-4356(96)00268-5. [DOI] [PubMed] [Google Scholar]
  • 8.Friedman C, Shagina L, Lussier Y, Hripcsak G. Automated encoding of clinical documents based on natural language processing. J Am Med Inform Assoc. 2004 Sep-Oct;11(5):392–402. doi: 10.1197/jamia.M1552. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Meystre S, Haug PJ. Automation of a problem list using natural language processing. BMC Medical Informatics and Decision Making. 2005;5(30) doi: 10.1186/1472-6947-5-30. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Hazlehurst B, Frost HR, Sittig DF, Stevens VJ. MediClass: A system for detecting and classifying encounter-based clinical events in any electronic medical record. J Am Med Inform Assoc. 2005 Sep-Oct;12(5):517–529. doi: 10.1197/jamia.M1771. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Hahn U, Romacker M, Schulz S. MEDSYNDIKATE - a natural language system for the extraction of medical information from findings reports. Int J Med Inform. 2002;67(1–3):63–74. doi: 10.1016/s1386-5056(02)00053-9. [DOI] [PubMed] [Google Scholar]
  • 12.Turchin A, Kolatkar NS, Pendergrass ML, Kohane IS. Computational analysis of non-adherence and non-attendance using the text of narrative physician notes in the electronic medical record. Med Inform Internet Med. 2007 Jun;32(2):93–102. doi: 10.1080/14639230601135323. [DOI] [PubMed] [Google Scholar]
  • 13.Turchin A, Kolatkar NS, Grant RW, Makhni EC, Pendergrass ML, Einbinder JS. Using regular expressions to abstract blood pressure and treatment intensification information from the text of physician notes. J Am Med Inform Assoc. 2006 Nov-Dec;13(6):691–695. doi: 10.1197/jamia.M2078. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Rose AJ, Berlowitz DR, Orner MB, Kressin NR. Understanding uncontrolled hypertension: is it the patient or the provider? J Clin Hypertens (Greenwich) 2007 Dec;9(12):937–943. doi: 10.1111/j.1524-6175.2007.07332.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Cotton A, Aspy CB, Mold J, Stein H. Clinical decision-making in blood pressure management of patients with diabetes mellitus: an Oklahoma Physicians Resource/Research Network (OKPRN) Study. J Am Board Fam Med. 2006 May-Jun;19(3):232–239. doi: 10.3122/jabfm.19.3.232. [DOI] [PubMed] [Google Scholar]
  • 16.Oliveria SA, Lapuerta P, McCarthy BD, L’Italien GJ, Berlowitz DR, Asch SM. Physician-related barriers to the effective management of uncontrolled hypertension. Arch Intern Med. 2002 Feb 25;162(4):413–420. doi: 10.1001/archinte.162.4.413. [DOI] [PubMed] [Google Scholar]
  • 17.Kukafka R, Bales ME, Burkhardt A, Friedman C. Human and automated coding of rehabilitation discharge summaries according to the International Classification of Functioning, Disability, and Health. J Am Med Inform Assoc. 2006 Sep-Oct;13(5):508–515. doi: 10.1197/jamia.M2107. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from AMIA Annual Symposium Proceedings are provided here courtesy of American Medical Informatics Association

RESOURCES