Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2021 Apr 1.
Published in final edited form as: Am J Prev Med. 2020 Jan 23;58(4):591–595. doi: 10.1016/j.amepre.2019.10.020

A Comparison of Smoking History in the Electronic Health Record With Self-Report

Nikhil Patel 1, David P Miller Jr 2, Anna C Snavely 3, Christina Bellinger 4, Kristie L Foley 5, Doug Case 3, Malcolm L McDonald 1, Youssef R Masmoudi 1, Ajay Dharod 2
PMCID: PMC7533103  NIHMSID: NIHMS1578547  PMID: 31982229

Abstract

Introduction:

Knowing patients’ smoking history helps guide who may benefit from preventive services such as lung cancer screening. The accuracy of smoking history in electronic health records remains unclear.

Methods:

This was a secondary analysis of data collected from a portal-based lung cancer screening decision aid. Participants of an academically affiliated health system, aged 55–76 years, completed an online survey that collected a detailed smoking history including years of smoking, years since quitting, and smoking intensity. Eligibility for lung cancer screening was defined using the Centers for Medicare and Medicaid Services criteria. Data analysis was performed May–December 2018, and data collection occurred between November 2016 and February 2017.

Results:

A total of 336 participants completed the survey and were included in the analysis. Of 175 participants with self-reported smoking intensity, 72% had packs per day and 62% had pack-years recorded in the electronic health record. When present, smoking history in the electronic health records correlated well with self-reported years of smoking (r =0.78, p≤0.0001) and years since quitting (r =0.94, p≤0.0001). Self-reported smoking intensity, including pack-years (r =0.62, p<0.0001) and packs per day (r =0.65, p≤0.0001), was less correlated. Of those participants eligible for lung cancer screening by self-report, only 35% met criteria for screening by electronic health records data alone. Others were either incorrectly classified as ineligible (23%) or had incomplete data (41%).

Conclusions:

The electronic health records frequently misses critical elements of a smoking history, and when present, it often underestimates smoking intensity, which may impact who receives lung cancer screening.

INTRODUCTION

Tobacco is the leading cause of morbidity and mortality in the U.S., responsible for more than 400,000 yearly deaths.13 A patient’s smoking status affects recommendations for cholesterol treatment, aspirin prophylaxis, immunizations, and screening for vascular disease and cancer.48 Therefore, obtaining an accurate smoking history in the electronic health record (EHR) is essential for delivering high- quality care.

Medicare “Meaningful Use” criteria require health systems to document smoking status in the EHR,9 and EHRs contain structured fields to capture granular smoking history (e.g., current status, start year, quit year, and packs per day [PPD]). Unfortunately, this is difficult in a busy clinical practice. How well EHRs capture smoking history is unclear. Most studies have only examined whether EHRs accurately identify current or former smokers and have reported sensitivities ranging from 78% to 95%.1012 One study examined accuracy of pack-years but was limited to patients referred for lung cancer screening (LCS).13 None of these studies examined the accuracy of years since quitting, an important eligibility criterion for LCS. The EHR’s ability to sufficiently guide LCS screening remains unknown.

To investigate the accuracy of the smoking history captured in EHRs, data are analyzed from a pragmatic trial of an LCS decision aid that captured participants’ self-reported smoking history and EHR-recorded smoking history.14

METHODS

The study was done in an academic health system network of 70 community-based primary care clinics in North Carolina. Details of the trial and decision aid have been published.14,15 Briefly, between November 2016 and February 2017, the EHR was queried weekly to identify individuals aged 55–77 years who were scheduled to see a network primary care provider within the next 4 weeks and had no prior history of lung cancer or other disease predicting short life expectancy. Those identified as never smokers in the EHR were excluded. The first 1,000 identified individuals were sent an invitation via the patient portal to visit a web-based LCS decision aid called mPATH-Lung. On visiting mPATH-Lung, participants answered 5 questions about their smoking history to determine their eligibility for LCS based on Centers for Medicare and Medicaid Services criteria.16

Of the 1,000 individuals who received an invitation, 404 visited mPATH-Lung and 349 participants entered self-reported smoking data. Of those, 13 participants were never smokers, leaving 336 participants with self-reported smoking data included in the analysis.

This study was approved by the Wake Forest Baptist Health IRB (IRB00036974).

Self-reported smoking history was defined as data collected from participants using mPATH-Lung. EHR-recorded smoking history was defined as data collected by a healthcare provider during an in-person encounter. As smoking history may be recorded at multiple encounters, the encounter date that was closest to the date the patient used mPATH-Lung was chosen. The following elements were examined: (1) smoking status (current, former, or never), (2) smoking years (years of smoking and years since quitting), and (3) smoking intensity (pack-years and PPD).

Individuals were defined as eligible for LCS using published Centers for Medicare and Medicaid Services criteria (age 55–77 years, current smoker or quit within the last 15 years, and smoked at least 30 pack-years).16 Patients with missing key data elements were considered indeterminant for LCS. Frequency tables were constructed to summarize eligibility for LCS using both self-reported and EHR-recorded data, and the sensitivity and specificity of the EHR data were examined using the self-reported data as the gold standard. When both self-reported and EHR-recorded data were available, Spearman correlation coefficients were assessed to determine the monotonic association between the 2 smoking metrics and Wilcoxon signed-rank tests were used to assess the difference between the measures. Extreme outliers (e.g., 10 PPD) were considered missing for this analysis. Smoking history was considered accurate under the following parameters: years of smoking (± 3 years), years since quitting (± 3 years), PPD (± 0.25 packs), and pack-years (± 5 years). Statistical analysis was performed using SAS, version 9.4.

RESULTS

Nonresponders were more likely to be male and were slightly younger but were otherwise similar to participants (Table 1). Participants reported a smoking mean of 30.5 (SD=14.0) years and almost 1 PPD (×=0.96, SD=0.56). Among the 264 former smokers in the sample, mean years since quitting was 19.8 (SD=13.3) years.

Table 1.

Demographic Characteristics of Responders and Nonresponders (n=336)

Participant characteristics Nonresponders,a mean (SD) or n (%), n=651 Responders (current and former smokers),b mean (SD) or n (%), n=336

Age, years 63.8 (6.0) 64.7 (5.8)
Sex
 Male 348 (53.5%) 141 (42.0%)
 Female 303 (46.5%) 195 (58.0%)
Race
 White or Caucasian 550 (84.5%) 284 (84.5%)
 Black or African American 90 (13.8%) 46 (13.7%)
 Asian 2 (0.3%) 2 (0.6%)
 Other 9 (1.4%) 4 (1.2%)
Ethnicity
 Hispanic or Latino 5 (0.8%) 4 (1.2%)
 Not Hispanic or Latino 645 (99.1%) 332 (98.8%)
 Refused to answer 1 (0.1%) 0 (0%)
Insurance status
 Commercial 303 (46.5%) 141 (42.0%)
 Medicare 322 (49.5%) 184 (54.8%)
 Medicaid 13 (2.0%) 6 (1.7%)
 Uninsured 13 (2%) 5 (1.5%)
Smoking status in EHR
 Current 115 (17.7%) 70 (20.8%)
 Former 536 (82.3%) 266 (79.2%)
 Unknown 0 (0%) 0 (0%)
Packs per day present in EHR 385 (59.1%) 213 (63.3%)
a

Nonresponder defined as someone who did not complete the smoking history items in the mPATH-Lung decision aid.

b

Thirteen responders indicated they never smoked and are excluded from the table above and all analyses.

EHR, electronic health record; mPATH, mobile patient technology for health.

Table 2 compares participants’ self-reported smoking history with the EHR. Smoking status (current or former smoker) was accurately reported in the EHR for >90% of participants; however, measures of smoking years and smoking intensity were missing or inaccurate for more than half of participants. Data on pack-years were present and accurate only 20% of the time in the EHR. The EHR under-reported participants’ smoking years and smoking intensity (Table 3). Self-reported years since quitting had the highest correlation with the EHR (r=0.94, p<0.0001) (Appendix Figure 1, available online).

Table 2.

Smoking History Availability and Accuracy, Self-Reported Versus EHR-Recorded

Self-reported smoking history Data available in EHR, n (%) Data available and accurate in EHR,a n (%)

Current smoker, n=72 72 (100) 67 (93.0)
Former smoker, n=264 264 (100) 261 (98.9)
Years smoked (±3 years), n=336 210 (62.5) 71 (21.2)
Years since quitting (±3 years), n=336 200 (59.5) 149 (44.4)
Packs per day (±0.25 packs), n=175b 126 (72.0) 70 (40.0)
Pack-years (±years), n=175b 109 (62.3) 35 (20.0)
a

Missing in the EHR is counted as inaccurate.

b

mPATH-Lung only asks for smoking intensity from those who are current smokers or quit within the last 15 years.

EHR, electronic health record; mPATH, mobile patient technology for health.

Table 3.

Self-Reported Versus EHR-Recorded Smoking History Comparison in Participants With Both Self-Reported and EHR-Recorded Data Available

Variable Na EHR-recorded, median (IQR) Self-reported, median (IQR) Difference, p-valueb Spearman correlation (p-value) % under-reported by EHR

Years smoked 210 25.0 (24.0) 31.5 (22.0) <0.001 0.79 (<0.001) 75.7
Years since quitting 200 19.4 (23.4) 20.0 (22.0) 0.088 0.94 (<0.001) 43.0
Pack-years 109 25.0 (32.0) 34.5 (26.5) <0.001 0.62 (<0.001) 74.3
Packs per dayc 126 1.0 (0.5) 1.0 (0.5) 0.006 0.65 (<0.001) 38.1

Note: Boldface indicates statistical significance (p<0.05).

a

Number of observations with EHR-recorded and self-reported data.

b

Wilcoxon signed rank test.

c

The median and IQR are the same between EHR-recorded and self-reported; however, the distribution of packs per day differs.

EHR, electronic health record.

Smoking history recorded in the EHR identified 35% of patients who met Medicare criteria for LCS. Among those eligible for screening, the EHR incorrectly classified 23% as being ineligible, and 41% were missing data needed to determine eligibility (8% missing years since quitting only, 23% missing pack-years only, and 10% missing both). A total of 227 patients had enough data in the EHR to determine LCS eligibility. The sensitivity and the specificity of EHR data were 60% and 97%, respectively.

DISCUSSION

The EHR accurately classifies more than 90% of patients as ever smokers or never smokers; however, detailed elements of smoking history (e.g., smoking years and smoking intensity) are often missing or underestimated. Overall, the EHR identifies only one third of patients who qualify for LCS.

Annual LCS is recommended for patients aged 55–80 years who have a 30–pack-year smoking history and either currently smoke or have quit smoking in the past 15 years.16 This study has significant implications, as clinicians may make faulty decisions if they rely on inaccurate EHR documentation of smoking.17 Additionally, health systems using EHR data to identify patients who may benefit from LCS may miss potentially eligible patients.

Social desirability bias and financial incentives may affect the accuracy of EHR-recorded smoking history. When asked by a healthcare provider, patients may underestimate their smoking or focus on current attempts to cut down. Tobacco users have up to 50% increases in premiums compared with nonusers, encouraging under-reporting.9 In one study, 7.7% of those who reported current or past smoking on a questionnaire subsequently reported never having smoked at a later date.18 Additionally, others have shown that patients’ EHR-calculated pack-years fluctuate over time as patients cut back on smoking.17

Limitations

This study has limitations. The study sample is limited to those who use the Internet in a single health system. However, another study found a similar sensitivity and specificity of the EHR for determining LCS eligibility, suggesting these estimates may be stable.19 Recall bias could affect results if time elapsed between when smoking history was captured in mPATH-Lung and in the EHR, although analyses limiting the sample to participants with self-reported data collected within 60 days of the EHR-recorded data yielded similar results.

CONCLUSIONS

Across all measures of smoking years and intensity, incomplete documentation of smoking history in the EHR was found. Continuous validation efforts are needed to understand the quality of smoking documentation. Health systems would benefit from studying and developing strategies to ensure accurate EHR smoking history, such as developing mechanisms that allow patients to update smoking history characteristics themselves.

Supplementary Material

supp

ACKNOWLEDGMENTS

The authors would like to acknowledge the efforts of Don Babcock who programmed the mPATH-Lung WebApp. The authors received funding from the Wake Forest University Comprehensive Cancer Center (NCI CCSG P30CA012197) and the Wake Forest Clinical and Translational Science Institute (NCATS UL1TR001420).

Dr. David Miller and Dr. Ajay Dharod are the co-inventors of mPATH. Dr. David Miller, Dr. Ajay Dharod, and Wake Forest University Health Sciences have an ownership interest in the mPATH application.

Footnotes

No other financial disclosures were reported by the authors of this paper.

SUPPLEMENTAL MATERIAL

Supplemental materials associated with this article can be found in the online version at https://doi.org/10.1016/j.amepre.2019.10.020.

REFERENCES

  • 1.GBD 2015 Tobacco Collaborators. Smoking prevalence and attributable disease burden in 195 countries and territories, 1990–2015: a systematic analysis from the Global Burden of Disease Study 2015. Lancet. 2017;389(10082):1885–1906. 10.1016/S0140-6736(17)30819-X. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Yanbaeva DG, Dentener MA, Creutzberg EC, Wesseling G, Wouters EF. Systemic effects of smoking. Chest. 2007;131(5):1557–1566. 10.1378/chest.06-2179. [DOI] [PubMed] [Google Scholar]
  • 3.Edwards R The problem of tobacco smoking. BMJ. 2004;328(7433): 217–219. 10.1136/bmj.328.7433.217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Stone NJ, Robinson JG, Lichtenstein AH, et al. 2013 ACC/AHA guideline on the treatment of blood cholesterol to reduce atherosclerotic cardiovascular risk in adults: a report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines. J Am Coll Cardiol. 2014;63(25 Pt B):2889–2934. 10.1016/j.jacc.2013.11.002. [DOI] [PubMed] [Google Scholar]
  • 5.Fox CS, Golden SH, Anderson C, et al. Update on prevention of cardiovascular disease in adults with type 2 diabetes mellitus in light of recent evidence: a scientific statement from the American Heart Association and the American Diabetes Association. Diabetes Care. 2015;38(9):1777–1803. 10.2337/dci15-0012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Pham H, Geraci SA, Burton MJ. Adult immunizations: update on recommendations. Am J Med. 2011;124(8):698–701. 10.1016/j.amjmed.2010.07.032. [DOI] [PubMed] [Google Scholar]
  • 7.Wender R, Fontham ET, Barrera E, et al. American Cancer Society lung cancer screening guidelines. CA Cancer J Clin. 2013;63(2):107–117. 10.3322/caac.21172. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Preventive US Services Task Force. Screening for abdominal aortic aneurysm: recommendation statement. Ann Intern Med. 2005; 142(3):198–202. 10.7326/0003-4819-142-3-200502010-00011. [DOI] [PubMed] [Google Scholar]
  • 9.Friedman AS, Schpero WL, Busch SH. Evidence suggests that the ACA’s tobacco surcharges reduced insurance take-up and did not increase smoking cessation. Health Aff (Millwood). 2016;35(7):1176–1183. 10.1377/hlthaff.2015.1540. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Calhoun PS, Wilson SM, Hertzberg JS, et al. Validation of Veterans Affairs electronic medical record smoking data among Iraq- and Afghanistan-era veterans. J Gen Intern Med. 2017;32(11):1228–1234. 10.1007/s11606-017-4144-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.McGinnis KA, Brandt CA, Skanderson M, et al. Validating smoking data from the Veteran’s Affairs Health Factors dataset, an electronic data source. Nicotine Tob Res. 2011;13(12):1233–1239. 10.1093/ntr/ntr206. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Chen LH, Quinn V, Xu L, et al. The accuracy and trends of smoking history documentation in electronic medical records in a large managed care organization. Subst Use Misuse. 2013;48(9):731–742. 10.3109/10826084.2013.787095. [DOI] [PubMed] [Google Scholar]
  • 13.Modin HE, Fathi JT, Gilbert CR, et al. Pack-year cigarette smoking history for determination of lung cancer screening eligibility. comparison of the electronic medical record versus a shared decision-making conversation. Ann Am Thorac Soc. 2017;14(8):1320–1325. 10.1513/AnnalsATS.201612-984OC. [DOI] [PubMed] [Google Scholar]
  • 14.Dharod A, Bellinger C, Foley K, Case LD, Miller D. The reach and feasibility of an interactive lung cancer screening decision aid delivered by patient portal. Appl Clin Inform. 2019;10(1):19–27. 10.1055/s-0038-1676807. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Bellinger C, Pinsky P, Foley K, Case D, Dharod A, Miller D. Lung cancer screening benefits and harms stratified by patient risk: information to improve patient decision aids. Ann Am Thorac Soc. 2019;16(4):512–514. 10.1513/AnnalsATS.201810-690RL. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Moyer VA U.S. Preventive Services Task Force. Screening for lung cancer: U.S. Preventive Services Task Force recommendation statement. Ann Intern Med. 2014;160(5):330–338. 10.7326/M13-2771. [DOI] [PubMed] [Google Scholar]
  • 17.Tarabichi Y, Kats DJ, Kaelber DC, Thornton JD. The impact of fluctuations in pack-year smoking history in the electronic health record on lung cancer screening practices. Chest. 2018;153(2):575–578. 10.1016/j.chest.2017.10.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Soulakova JN, Hartman AM, Liu B, Willis GB, Augustine S. Reliability of adult self-reported smoking history: data from the Tobacco Use Supplement to the Current Population survey 2002–2003 cohort. Nicotine Tob Res. 2012;14(8):952–960. 10.1093/ntr/ntr313. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Palmer EL, Higgins J, Hassanpour S, et al. Assessing data availability and quality within an electronic health record system through external validation against an external clinical data source. BMC Med Inform Decis Mak. 2019;19(1):143 10.1186/s12911-019-0864-2. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

supp

RESOURCES