Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 May 1.
Published in final edited form as: Epidemiology. 2022 May 1;33(3):383–385. doi: 10.1097/EDE.0000000000001460

Concordance of data about sex from electronic health records and the National Death Index: Implications for transgender populations

John R Blosnich 1,2, Taylor L Boyer 2
PMCID: PMC8986558  NIHMSID: NIHMS1768887  PMID: 35067566

Abstract

Background:

Transgender individuals have greater health risks than cisgender individuals, which may bode for greater mortality. However, research is limited by lack of gender identity information at the time of death. Novel opportunities to combine administrative data with National Death Index (NDI) data may facilitate mortality research about transgender populations, but binary measures of sex and gender may pose problems for analyses. This study explored differences in sex recorded in Veterans Health Administration (VHA) electronic health record (EHR) and NDI data between transgender and cisgender decedents.

Methods:

We used VHA EHR data from fiscal years 2000-2016 to identify deaths among a sample of transgender and cisgender patients. We cross-tabulated sex recorded in the NDI with EHR-based sex from VHA EHR data. We extracted data in 2018 and conducted analyses in 2020.

Results:

Death occurred for 1,109 transgender patients and 7,757 cisgender patients. For cisgender decedents, EHR-based sex and NDI-based sex were 100% concordant. For transgender decedents, 46 (4%) were discordant between data sources. Of transgender decedents with female EHR-based sex (n = 259), 17% were indicated as male in NDI data; of those with male EHR-based sex (n = 850), 0.2% were indicated as female in NDI data.

Conclusions:

Data linkage between EHR and the NDI can facilitate transgender mortality research, but examining mortality specific to various transgender identities remains difficult. Improved documentation of sex and gender is needed within US mortality surveillance.

Keywords: transgender, mortality, electronic health records, death certificates

INTRODUCTION

Transgender individuals have greater risks to their health (e.g., homelessness, violence) than cisgender individuals,1 which bodes for greater risk of death.2 For example, homicide among transgender individuals (specifically transgender women of color), prompted the American Medical Association to issue a policy statement advocating for infrastructure, research, and practice to reduce violence against people who are transgender.3 However, mortality studies are strongly limited by lack of gender identity information at the time of death.4 Currently, only binary data about sex (male or female) are available in the National Death Index (NDI).

Matching electronic health record (EHR) with NDI data presents opportunities to learn about mortality among transgender populations,5 ideally if EHR data contain self-identified gender identity data. Unfortunately, collection of gender identity data in EHR is currently rare.6 International Classification of Disease (ICD) codes for gender identity disorder (GID), which is often (but not always) required for transgender individuals to access gender affirming therapies, can identify a subset of transgender populations (i.e., those with GID diagnosis codes).7

The Veterans Health Administration (VHA) linked EHR and NDI data through 2017 for basic mortality data, including sex documented on the death certificate. However, it is unclear how something often construed as a basic demographic – the sex of an individual – becomes complex for transgender individuals when cross-referenced with administrative data. The objective of this brief report was to examine concordance of sex denoted between VHA EHR and NDI. We hypothesize that cisgender patients would be 100% concordant between the two data sources, but transgender patients would be less than 100% concordant.

METHODS

This secondary analysis used data from a previous study that included all VHA patients between fiscal years 2000-2016 who ever had one of four ICD 9 (e.g., 302.85 GID in adolescents or adults) or one of seven 10 GID-related codes (e.g., F64.9 GID, unspecified), creating the group of transgender patients (n=8,977). A complete list of these ICD codes has been previously published.7 In the absence of self-reported gender identity in VHA data,8 ICD codes have been used to define transgender status with documented validity from clinical text notes. For example, among patients with a GID-related code, nearly 90% had clinical text notes that included text suggesting the patient’s gender identity (e.g., “patient is transgender man”).7 For each transgender patient, three patients were randomly chosen based on not having GID-related ICD codes and, to account for facility and temporal factors, had to have an outpatient visit in the same facility within +/− 5 days from the date of the transgender patient’s GID diagnosis. This group was referred to as cisgender patients (n=26,907). Data older than 5 years from the time of the study were necessary due to the rarity of GID-related codes and the need to create the most comprehensive sample of transgender patients. From this existing study data, we focused the present analysis only on individuals who had died during the observation period pursuant to this brief report’s focus on mortality.

NDI is considered the gold standard for mortality data in the United States,9 aggregating information about cause and date of death gathered from offices of vital statistics from all US states, Washington DC, and US territories. Sex data in the NDI are compiled by the National Center for Health Statistics from death certificate data reported from 57 jurisdictions (e.g., U.S. states, the District of Columbia, Puerto Rico); death certificate data typically include information input from medical professionals, funeral directors, and next-of-kin contacts.10

VHA contracts with the NDI to conduct probabilistic matching on all patients to ascertain vital status and mortality data for deceased patients currently through 31 December 2017 and includes information such as date, cause of death, and sex documented on the death certificate. We categorized deaths according to National Vital Statistics System (NVSS) conventions.8 At the time of the study, NDI data for VHA patients were available only through 2017.

VHA EHR include a single data field for sex of the patient, which can only be populated as either male or female (i.e., EHR-based sex). VHA has a formal process through which patients may request changes to their EHR-based sex.8 Thus, it is unclear if EHR-based sex represents a patient’s sex designated at birth or indicates their current gender identity. For all patients in our cohort, we extracted EHR-based sex from their most currently available patient data as the closest approximation of sex prior to death.

Sex recorded in NDI from the death certificate was cross-tabulated with the EHR-based sex from VHA administrative data for overall causes of death. The institutional review board of the VA Pittsburgh Healthcare System approved this study.

RESULTS

Over the observation period, there were 1,109 deaths among transgender patients and 7,757 deaths among non-transgender patients. For cisgender decedents, there was 100% concordance between the EHR-based sex and the sex documented in NDI data (Table). For transgender decedents, 46 (4%) were discordant between data sources. Among transgender decedents whose EHR-based sex indicated female (n=259), 17% were indicated as male in NDI data; among those whose EHR-based sex indicated male (n=850), 0.2% were indicated as female in NDI data.

Table.

Comparisons of sex recorded at last medical visit and sex recorded in the National Death Index for all causes of death

Decedents without GID ICD code (i.e., cisgender) (n=7,757) Decedents with GID ICD code (i.e., transgender) (n=1,109)
EHR-based Sex a
Male Female Male Female
(n=7,527) (n=230) (n=850) (n=259)
NDI-based Sex   n (%)  n (%)  n (%) n (%)

  Male 7,527 (100) 0 (0) 848 (99.8) 44 (17)
  Female 0 (0) 230 (100) 2 (0.2) 215 (83)
a

Sex documented in patients’ most recently available medical data

EHR=electronic health record; GID=gender identity disorder; ICD=International Classification of Disease; NDI=National Death Index

DISCUSSION

Because US mortality surveillance does not include gender identity4 and few EHR systems include gender identity information,6 mortality research about transgender individuals currently requires proxy-based procedures. However, as this first study illustrated with linked mortality and EHR data for a subsample of transgender individuals (i.e., those with GID diagnoses), discordant information emerged about the decedent’s sex, which supported the study hypotheses.

The discordance among transgender patients is difficult to interpret. For example, among the 44 transgender decedents with female EHR-based sex who were documented as male in NDI, it is unclear if these decedents may be misgendered transgender women (EHR valid, NDI invalid), misgendered transgender men (EHR invalid, NDI valid), or some combination. Thus, although data linkage can facilitate mortality research for transgender people generally, challenges remain for learning about transgender women, transgender men, and especially non-binary individuals who may specifically be lost in solely binary sex and gender data.

This brief report was a first step in exploring this heretofore undocumented issue in data systems. Further research should explore if other structured data (e.g., Current Procedural Terminology [CPT] codes) or unstructured data (e.g., clinical text notes; chart reviews) of gender affirming surgeries or hormones, which have been used in recent studies to try to determine sex assigned at birth, 11,12 may clarify gender identity for mortality studies. However, because VHA currently does not cover gender affirming surgeries8 and only 58% of transgender veterans receive hormones through VA,13 data may be limited compared to other systems that do cover affirming surgeries (e.g., Medicare), which may have implications for deaths involving autopsies and consequently sex data recorded on death certificates.

Because mortality is a key metric for health services research, the ramifications of these results for transgender populations require bold future directions. The breadth of NVSS data facilitates identification of disparities by socio-demographics, such as race, ethnicity, age, and sex.14 Transgender populations are invisible in these data. Piecemeal efforts (e.g., EHR linkage) cannot replace surveillance powered by the >2.5 million death records processed annually in the NVSS.14 Widespread implementation of self-identified gender identity in EHR may help but remains vexed by disparities in access to care for transgender individuals.15 There are efforts to improve data quality for the National Violent Death Reporting System (NVDRS) by training death investigators to document decedent gender identity as they do any other demographic characteristic.4 However this effort is predicated on the existence of a NVDRS data field for “transgender status,” which is, itself, problematic but is a start.16 The NDI does not include a data field for gender identity. Expanding data fields on death certificates is no small task, including variability in professionals involved in completing them (e.g., medical examiners, coroners, funeral directors). However, it has been done before, as when Hispanic ethnicity and educational attainment were added in 1989.17

We note several limitations. Using ICD codes to define transgender status invites misclassification bias; not all patients who self-identify as transgender need or have a GID diagnosis and not all patients with a GID diagnosis may self-identify as transgender. The sample of veterans utilizing VHA care may limit generalizability to non-veteran populations or veterans who do not utilize VHA care.

Transgender individuals face several challenges jeopardizing their health (e.g., violence, homelessness, discrimination),1 and without mortality data, research cannot identify patterns and correlates to guide prevention. Although linking EHR and NDI data provide opportunities to learn about transgender mortality,18 the limitations of these methods – and the fact that such data gymnastics are needed in the first place – reify the very disparity of transgender communities, i.e., second-rate research methods for a population treated like second-class citizens.15 This is not to say progress cannot happen with creative methods with administrative data; indeed it has, but incrementally. To identify, understand, and ultimately address health disparities for transgender communities, US mortality surveillance must reckon with the nuances, challenges, and improvements around sex and gender identity data.

Sources of Funding:

The creation of data for this project were initially supported by a U.S. Department of Veterans Affairs Health Services Research & Development career development award to JRB (CDA-14-408). JRB was also supported by a research award by the National Institute of Mental Health (1R21MH125360-01).

Footnotes

Disclosure: The views expressed are those of the authors and do not necessarily reflect the position or policy of the institutions, National Institutes of Health, U.S. Department of Veterans Affairs or the United States Government.

Data and Computing Code Availability: The data and computing code are not generally available for replication because they reside within the U.S. Department of Veterans Affairs VA Informatics and Computing Infrastructure (VINCI), requiring employment or official affiliation (e.g., Interagency Personnel Agreement, Without Compensation Appointment) with VA for access.

REFERENCES

  • 1.Reisner SL, Poteat T, Keatley J, et al. Global health burden and needs of transgender populations: a review. The Lancet. 2016;388(10042):412–436. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Blosnich JR, Brown GR, Shipherd P, Jillian C, Kauth M, Piegari RI, Bossarte RM. Prevalence of gender identity disorder and suicide risk among transgender veterans utilizing Veterans Health Administration care. American Journal of Public Health. 2013;103(10):e27–e32. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.American Medical Association. AMA adopts new policies on first day of voting at 2019 Annual Meeting. https://www.ama-assn.org/press-center/press-releases/ama-adopts-new-policies-first-day-voting-2019-annual-meeting. Published 2019. Accessed October 9, 2020.
  • 4.Haas AP, Lane AD, Blosnich JR, Butcher BA, Mortali MG. Collecting sexual orientation and gender identity information at death. American journal of public health. 2019;109(2):255–259. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Mays VM, Cochran SD. Challenges and opportunities for modernizing the National Violent Death reporting system. In: American Public Health Association; 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Grasso C, McDowell MJ, Goldhammer H, Keuroghlian AS. Planning and implementing sexual orientation and gender identity data collection in electronic health records. Journal of the American Medical Informatics Association. 2019;26(1):66–70. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Blosnich JR, Cashy J, Gordon AJ, et al. Using clinician text notes in electronic medical record data to validate transgender-related diagnosis codes. Journal of the American Medical Informatics Association. 2018;25(7):905–908. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Burgess C, Kauth MR, Klemt C, Shanawani H, Shipherd JC. Evolving sex and gender in electronic health records. Federal Practitioner. 2019;36(6):271. [PMC free article] [PubMed] [Google Scholar]
  • 9.Lash TL, Silliman RA. A comparison of the National Death Index and Social Security Administration databases to ascertain vital status. Epidemiology. 2001;12(2):259–261. [DOI] [PubMed] [Google Scholar]
  • 10.NIH Collaboratory. Living textbook of pragmatic clinical trials, Section 4: Using death as an endpoint. https://rethinkingclinicaltrials.org/chapters/design/choosing-specifying-end-points-outcomes/using-death-as-an-endpoint/. Published 2020. Accessed October 16, 2021.
  • 11.Jasuja GK, de Groot A, Quinn EK, et al. Beyond gender identity disorder diagnoses codes: an examination of additional methods to identify transgender individuals in administrative databases. Medical Care. 2020;58(10):903–911. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Wolfe HL, Reisman JI, Yoon S, et al. Validating Data-driven Methods to Identify Transgender Individuals in the Veterans Affairs. American Journal of Epidemiology. 2021. [DOI] [PubMed] [Google Scholar]
  • 13.Wong DH, Vimalananda VG, Reisman JI, et al. Patient Characteristics Associated With the Receipt of Hormone Therapy Among Transgender Patients in the Veterans Health Administration. Journal of the Endocrine Society. 2021;5(Supplement_1):A794–A794. [Google Scholar]
  • 14.Heron M Deaths: Leading causes for 2017. National Vital Statistics Reports; vol 68 no 6. Hyattsville, MD: 2019. [PubMed] [Google Scholar]
  • 15.Grant JM, Motter LA, Tanis J, Harrison J, Herman JL, Keisling M. Injustice at every turn: A report of the national transgender discrimination survey. Washington, DC: 2011. [Google Scholar]
  • 16.Centers for Disease Control and Prevention. National Violent Death Reporting System (NVDRS) Coding Manual Revised [Online] 2016 National Center for Injury Prevention and Control, Centers for Disease Control and Prevention (producer). https://www.cdc.gov/violenceprevention/pdf/nvdrs/NVDRS-WebCodingManual.pdf. Published 2019. Accessed September 23, 2020. [Google Scholar]
  • 17.Freedman MA, Gay GA, Brockert JE, Potrzebowski PW, Rothwell CJ. The 1989 revisions of the US standard certificates of live birth and death and the US standard report of fetal death. American Journal of Public Health. 1988;78(2):168–172. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Reisner SL, Deutsch MB, Bhasin S, et al. Advancing methods for US transgender health research. Current opinion in endocrinology, diabetes, and obesity. 2016;23(2):198. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES