Skip to main content
Journal of the American Medical Informatics Association : JAMIA logoLink to Journal of the American Medical Informatics Association : JAMIA
. 2021 Sep 6;29(2):271–284. doi: 10.1093/jamia/ocab136

Transgender data collection in the electronic health record: Current concepts and issues

Clair A Kronk 1,, Avery R Everhart 2,3, Florence Ashley 3,4, Hale M Thompson 5, Theodore E Schall 6, Teddy G Goetz 7, Laurel Hiatt 8, Zackary Derrick 9, Roz Queen 10, A Ram 11, E Mae Guthman 3,12, Olivia M Danforth 13, Elle Lett 3,14, Emery Potter 15, Simón(e) D Sun 3,16,17, Zack Marshall 18, Ryan Karnoski 3,19
PMCID: PMC8757312  PMID: 34486655

Abstract

There are over 1 million transgender people living in the United States, and 33% report negative experiences with a healthcare provider, many of which are connected to data representation in electronic health records (EHRs). We present recommendations and common pitfalls involving sex- and gender-related data collection in EHRs. Our recommendations leverage the needs of patients, medical providers, and researchers to optimize both individual patient experiences and the efficacy and reproducibility of EHR population-based studies. We also briefly discuss adequate additions to the EHR considering name and pronoun usage. We add the disclaimer that these questions are more complex than commonly assumed. We conclude that collaborations between local transgender and gender-diverse persons and medical providers as well as open inclusion of transgender and gender-diverse individuals on terminology and standards boards is crucial to shifting the paradigm in transgender and gender-diverse health.

Keywords: gender and sexual minorities, transgender persons, electronic health records, bioethics

INTRODUCTION

Thirty-three percent of transgender persons have had negative experiences with healthcare providers and 23% describe avoiding needed medical care due to fear of mistreatment.1 Reasons for avoiding care include experiences of misgendering, pathologization, and medicalization.1–4 These forms of discrimination have been codified in electronic health records (EHRs), as well as in diagnostic and billing codes and criteria including the DSM, Systematized Nomenclature of Medicine Clinical Terms (SNOMED-CT), and International Classification of Diseases (ICD), becoming the de facto language of treatment. It is especially concerning that terms like “sodomy,” “surgically transgendered transsexual,” and “transvestic fetishism” persist in some terminologies. These terms, formed from alleyways of systemic discrimination,5 continue to perpetuate anti-LGBTQ violence, despite protest against their usage from affected groups.6–10 As feminist scholar Sara Ahmed noted in a 2018 talk: “When those who try to stop a culture from being reproduced are stopped, a culture is being reproduced,”11 thereby continuing an already ever-present cycle of transphobic rhetoric in medical standards and terminologies.3,12–16

For the past 3 centuries,22 medical professionals have diagnosed being transgender as schizophrenia,23–26 paraphilia,27,28 multiple personality disorder,29,30 borderline personality disorder,31,32 narcissistic personality disorder,33 obsessive-compulsive disorder,34 or autism,40,41 if analyzed in terms of contemporary understandings of the term transgender (see Definitions and Concepts). However, it is of note that a number of cultures have gender systems that are not binary and have varying degrees of gender diversity (as judged by contemporary Eurocentric/binary standards), in general and including rituality some have compared with hormone replacement therapy and gender-affirming surgery; further, some of these cultures have histories of gender diversity stretching back far further than a few centuries, up to thousands of years.17–21 While it is crucial to avoid pathologization of trans experiences (ie, suggesting that transness itself is a mental disorder or condition), it is important to acknowledge that a number of neurodiverse people identify as trans and gender diverse.35,36 Oftentimes, such neurodiversity is inappropriately used as a mechanism to deny access to potentially life-saving gender-affirming care.37,38 For an in-depth exploration of the intersections between neurodiversity and gender diversity, see Adams and Liang.39

Even in contemporary literature, pathological terms such as “gender dysphoria syndrome,”42 “transsexual syndrome,”43 “gender identity disorder,”44 “transsexualism,”16,45 and “transgenderism”16,46,47 still exist. “Transgender” continues to appear on patients’ problem lists.48–50 While this is partially due to vocabulary systems like SNOMED CT and ICD, this is also part of a larger systemic problem concerning how a “problem list” is defined. Oftentimes, there is nowhere else to indicate that a patient is transgender, forcing even affirming providers to use it. Replacement terminology or another area provided for such indications to not be labeled as “problems” is vital to preventing pathologization and medicalization of transness.

Prejudice and discrimination against transgender people in biomedical research and medical practice are well established.1,51–66 The misunderstanding and erasure of gender diversity has led to poor study design, additional barriers accessing care, unethical disclosure of transgender status in medical contexts, and in some cases, patient death. Three prominent cases of avoidable patient death due to negligence include that of Lou Sullivan, who died in 1991 after being denied care at an HIV/AIDS clinic67; Tyra Hunter, who died in 1995 when emergency medical technicians discovered she was transgender while treating her after a car accident and refused to save her life68; and Alejandra Monocuco, who died from COVID-19 (coronavirus disease 2019) after emergency medical technicians learned of her HIV status in 2020.69 Little has changed in terms of either medical discrimination against transgender patients, or data collection and usage with transgender patients in the period spanning their deaths.22,70,71 To move past barriers to proper health care for transgender people, both fields must undergo paradigm shifts.

Sexual orientation and gender identity (SOGI) EHR data collection policies are currently implemented without adequate knowledge of the needs of transgender populations, without provider education, and often only in limited care contexts (such as psychiatric, endocrinological, and gender identity clinics).72,73 Additionally, many policies have been rolled back as the result of political pressures,70,71,74 despite a call to action for supporting transgender scientists during the epidemic.75 In one case involving the COVID-19 National Institutes of Health Researchers Survey SOGI questions, 2 of this piece’s authors were told via personal communication that a 2-step question, like the one proposed later in this perspective, concerning assigned sex at birth was omitted due to concern “about the number of demographic questions,” despite its validated efficacy.74,75 As a result, transgender populations are, and will not be, adequately described in biomedical contexts.

Ethical data collection for transgender and gender-diverse communities in health care is needed now more than ever. Medical providers and health informaticians have a duty to address the ongoing systemic mistreatment of transgender and gender-diverse people within healthcare institutions.76,77 Accurate research is crucial to redressing inequities. To policymakers, what is not documented might as well have never happened.

DEFINITIONS AND CONCEPTS

Accurate and equitable clinical practice and research requires discussion of operational standards.78–80 Transgender populations have often been defined in terms of relationships to diagnostics ,81–83 procedures,44,84 or assigned sex at birth (ASAB) or assigned gender at birth (AGAB) vs gender identity data (Table 1).94,95 The differences in terminology are important to discuss, not only because they have impact on clinical care,96 but also because lack of appropriate terminological frameworks in research studies impacts the usability, applicability, and reproducibility of results.

Table 1.

Selected definitions of the term “transgender” and its variants, 1965-2021, showcasing how the term continues to evolve and change over time, oftentimes making results of studies utilizing the term (or similar terms) nonreproducible85

Year Definition Reference
1965 Where the compulsive urge reaches beyond female vestments, and becomes an urge for gender (“sex”) change, transvestism becomes “transsexualism.” The term is misleading; actually, “transgender[ism]” is what is meant, because sexuality is not a major factor in primary transvestism. 86
1978 [Transgenderists] are people who have adopted the exterior manifestations of the opposite sex on a full-time basis but without any surgical intervention. 87
1988 It used to be that a “transgender[ist]” was a person who could express him or herself comfortably in both masculine and feminine terms… Recently, however, the term “transgender[ist]” has come more and more to mean a person of one sex living entirely in the gender role generally considered appropriate for the opposite sex (cross-living). Most people who consider themselves to be transgender[ist]s do not want or need sexual reassignment surgery, and do not identify with “transvestite.” 88
1998 Originally, transgender referred to the group of people, also know[n] as full-time cross-dressers or nonsurgical transsexuals, who live and work as the opposite gender continuously and for always. Now it more often refers to the group of all people who are inclined to cross the gender line, including both cross-dressers and gender-benders, the “umbrella definition” that covers everyone. 89
2008 An umbrella term for people whose gender identity and/or gender expression differs from the sex they were assigned at birth. The term may include but is not limited to transsexuals, cross-dressers, and other gender-variant people. Transgender people may identify as female-to-male (FTM) or male-to-female (MTF). Use the descriptive term (transgender, transsexual, cross-dresser, FTM, or MTF) preferred by the individual. Transgender people may or may not decide to alter their bodies hormonally and/or surgically. 90
2011 Relating to or being a person whose gender identity does not conform to that typically associated with the sex to which they were assigned at birth. 91
2018 Often shortened as “trans.” An umbrella term for people whose gender and/or expression does not match their birth assignment. Transgender includes many of the terms in this list and may or may not include transsexuals, cross dressers, drag kings/queens, and others who defy what society tells them their gender should be. How people identify with this term depends on the individual and their relationship with their gender. 92
2021 A term describing a person’s gender identity that does not necessarily match their assigned sex at birth. Transgender people may or may not decide to alter their bodies hormonally and/or surgically to match their gender identity. This word is also used as an umbrella term to describe groups of people who transcend conventional expectations of gender identity or expression—such groups include, but are not limited to, people who identify as transsexual, genderqueer, gender variant, gender diverse, and androgynous. 93

In this work, we use Ashley’s gender modality framework to discuss the term transgender as specifically representing an incongruence between one’s AGAB and one’s gender identity.97 Occasionally, transgender is shortened as trans.92 We also sparingly use the term transgender and genderdiverse, which is meant to be inclusive of gender diversity that does not specifically fall under the gender modality framework .

Within this framework, we would first like to clarify the meanings of 2 terms: sex and gender. In international legal systems, these terms are often defined as being interchangeable, given that a number of languages do not have a gender-sex distinction linguistically.98–101 This necessitates usage of qualifying terms such as assigned sex or gender identity.105 For instance, “gender identity” may be rendered as “sex identity” in some languages.102–104 Yet, while gender identity is widely understood as the gender one identifies as, assigned sex (at birth) often is not. Owing to confusion individuals may have over the phrasing, there is often an additional clause added when asking for ASAB, namely clarifying that what is meant is the gender marker on an individual’s birth certificate or other birth record.106–108 It is at this point that ASAB in theory becomes AGAB in practice. This is because birth certificates (and by extension, birth records) are legal documents109–111 containing a marker that is a culturally dependent and socially constructed signifier (and therefore correctly called a gender marker).112,113 A list of known culturally dependent AGAB values is linked in the additional supplementary tables in Supplementary Appendix A. Owing to translation difficulties, it is entirely possible that “assigned gender at birth” may be rendered as “assigned sex at birth” in some jurisdictions and languages. In terms of clinical practice (and therefore in medical standards), this will likely not affect care. However, it is important to realize that these are fundamentally different concepts and relate different information, which may become an issue in terms of cultural competency in some regions, that is, “sex” implies a fundamentally biological component in English (even if this is not always the case with “assigned sex at birth”), which can insinuate that a provider believes that a trans woman patient’s “real” sex is “male.”

PERSPECTIVES ON DATA COLLECTION

From the clinical perspective

When a transgender person enters a clinic, health care is rarely their sole concern. Will the staff use their name and pronouns? Will their privacy be respected? Will the clinic provide adequate services or services at all? These questions burden transgender people as they navigate the medical system, and they are often answered quickly through implicit and explicit cues.

Because providers rarely understand their trans patients’ lived realities, their unique needs and questions are seldom prioritized. Expecting boards comprised mostly or entirely cisgender medical professionals or EHR vendors to understand what transgender patients need without close collaboration is impractical, especially given the dearth of LGBTQ health education in most medical training (Figure 1).116,117,126–130 (LGBTQ is the abbreviated form of lesbian, gay, bisexual, transgender, and queer or questioning. Note that LGBTQIA+114 or 2SLGBTQIA+115 may be more appropriate if the curriculum also actively includes Two-Spirit, intersex, and asexual or aromantic persons, as well as other marginalized or minoritized sexual- or gender-based groups.) Up to half of transgender people must educate their providers to receive appropriate care.1,131 This structural incompetency translates into significant pitfalls in SOGI data collection.132–135

Figure 1.

Figure 1.

The current state of trans healthcare education, wherein patients are the primary educators of providers. (A) The percentage of medical students among 365 Canadian medical schools who felt knowledgeable enough to care for transgender patients.116 (B) Percentage of transgender patients who reported having a negative experience with a healthcare provider in the 2015 U.S. Transgender Survey (USTS).1 (C) The number of LGBTQ education hours117 and estimated percentage of transgender education hours,116,118 based on percentage hours in other disciplines,119 compared with hours in nutrition and microbiology.120,121 The total number of hours was conservatively estimated based on 1100 hours of instruction in the first 2 years of medical school. We estimate that approximately 18.3 minutes of medical education is spent on transgender-related care, care which is potentially life-saving for millions of people.122–125

SOGI data collection often begins with intake forms. Institutions and providers do not always understand how to ask for one’s gender identity. Gender identities can include female (woman), male (man), and nonbinary, among numerous others. In EHR systems, gender identity answers often include “FTM” (short for female-to-male), “MTF” (short for male-to-female), “transgender male,” “transgender female,” and simply “transgender.”136–139 These options are antiquated and problematic, in part owing to the massive shift in the conceptualization of what these terms have meant over the past 8 decades (Table 1). The first 2 (FTM and MTF) have largely fallen out of use among transgender people and in transgender-related literature because they fundamentally assume a “change” in gender—that a transgender person was “female” and now is “male” (FTM), typically after some sort of medicalization process (hormone replacement therapy and/or gender-affirming surgery).137,141,149–151 This contradicts how many trans people understand their identity and can confuse patients. Note that distaste toward “FTM” and “MTF” is not universal; many transgender men have centered much of their North American community around the usage of FTM (in organizations like the FTM Foundation and FTM International, for instance), whereas the usage of “MTF” is more heavily tied to the pornography industry and is actively utilized to discriminate against transgender women.140–144 Two other terms that have become more popular in the last decade to describe “directionality” of transness, being “transfeminine” and “transmasculine.” While these terms are marginally more effective than terms like “FTM” and “MTF” and often are used to describe an individual’s gender identity,145 we would like to emphasize that their usage tends to conflate access to medical care with unspoken gender expression requirements, that is, insisting that trans women must be feminine or that trans men must be masculine to be considered as having “succeeded” in transitioning.146,147 Further, these terms often binarize nonbinary persons unnecessarily.147,148 In medical contexts, using assigned gender at birth alongside gender identity already achieves most of what terms like “transfeminine” and “transmasculine” attempt to elucidate. Therefore, we use terms like “assigned male at birth” or “assigned female at birth” where necessary in this article. All 5 options (being “FTM,” “MTF,” “transgender male,” “transgender female,” and “transgender”) center cisgender people on intake forms, reserving unqualified “male” and “female” for them while trans people are segregated into the separate, qualified categories of “MTF,” “FTM,” “transgender male,” “transgender female,” and “transgender.” Fundamentally, this implies trans women and trans men are deviant and not actually female or male, respectively, especially when the selection is ordered as “male,” “female,” “MTF,” and“FTM.”156–158 Usage of “transgender male” and “transgender female” may be confusing as well, as the terms have varied over time. For instance, some sources use “transgender male” or “transsexual male” to refer to a trans man,152,153 while others used it to refer to a trans woman.154,155 Further, by separating these groups in the first place, “male” and “female” are presented as the norm while “transgender male” and “transgender female” are othered.138,156–159 The separation can insinuate that transgender people are not their stated gender and that separation is cited heavily in transphobic and exclusionary actions.160–162 Additionally, it has been noted that terms like “FTM” and “MTF” are likely to be misinterpreted and confused within medical systems, with anecdotal evidence supporting the notion that terms like “FTM top surgery,” “MTF bottom surgery,” etc. being misunderstood because the actual procedure is unclear from those phrases alone. In addition to the cited sources regarding the separation of binary trans identities from unqualified terms, one of the authors (C.A.K.) ran an informal 24-hour Twitter poll and received 401 responses, with 84% noting that the separation implied that “trans woman”/”trans man” was deviant and “woman”/”man” was normal. Further, just under 14% of trans men and trans women elected to de-emphasize their trans status via write-in on the 2015 U.S. Transgender Survey, even when the question asked was not considering gender identity, but instead considering general identity terms. For the data methodology used to analyze the U.S. Transgender Survey data, see Supplementary Appendix A, Supplement B. In practice, many binary transgender persons indicated they would answer “female” over “MTF” or “transgender female.”

For nonbinary people, being relegated to an “other” category is often experienced as a literal “othering” whereas the inclusion of a “nonbinary” category communicates inclusion.136,138,158,163 Unfortunately, intake forms guided by the Veteran Affairs’ “Self-Identified Gender Identity” initiative employ such exclusionary linguistic practices. The initiative enforces a distinction between transgender and cisgender individuals by providing patients the options of “male,” “female,” “transman,” “transwoman,” “other,” and “the individual chooses not to answer.”168 Additionally, the inclusion of “transwoman” and “transman” is not best practice or grammatically correct. The othering effect this has is another subtle linguistic turnoff to transgender patients. “Trans” is an adjective and the correct wording would be “trans woman” or “trans man” with a space.164–167 EHRs should accurately record transgender people’s gender based on their gender identity as male, female, nonbinary, etc. such as that presented in Figure 3 (see also the additional supplementary tables for further discussions of possible regional extensions).

Figure 3.

Figure 3.

Examples of 2 ways to ask for identification of transgender identity: direct method (1-step) and indirect method (2-step); indicates different options that may be used to similar effect in certain circumstances. In general, it should be clear that this information is not a proxy for karyotype or organ inventory, which should be ascertained independently (see “Additional Considerations”). However, it is important to include for providers, researchers, and patients, as patients might not have karyotype- or organ inventory–based knowledge and tests may be unaffordable, unavailable, or unnecessarily invasive. Note 1: For neonates and infants for which gender identity has not yet developed, a medical provider could enter “unknown,” “uncertain,” “undifferentiated,” or “none.” Note 2: This may only be appropriate in some jurisdictions as the original birth certificate may no longer officially exist (such as in Germany). It does resolve ambiguity in a situation wherein a patient has been issued a new birth certificate with an updated gender marker. Additionally, there may be values and options for this question that differ significantly by jurisdiction. Note 3: “X” is allowed in some jurisdictions, such as in New York City, Washington DC, New Jersey, California, etc. and has been recently approved for federal documentation in the U.S.170 While occasionally intersex is considered a gender identity or as an assigned gender at birth, this is commonly thought of as inappropriate, because intersex people can have many different gender identities or assigned genders at birth (some intersex people do identify their gender identity as “intersex” or “hermaphrodite” [typically as a form of reclamation], but these identifications can adequately be covered by “A gender not listed [please specify]” in most cases).171 For more information on intersex-inclusive question design, see “Intersex Data Collection: Your Guide to Question Design.”172 It is also important to note that birth certificates may change to not include gender markers or may not be consistent from jurisdiction to jurisdiction.111,173,174 Therefore, a two-step is simply a better proxy to gender- and/or sex-related information than a one-step, which may be replaced in the future with more accurate models incorporating information like hormonal milieau.

In addition to avoiding derogatory and othering terms when referring to transgender people’s gender, EHRs should encode their correct name and pronouns. Accurately recording person’s name and pronouns without labelling either as preferred or chosen is necessary to improve transgender health.176 English-language pronoun sets have begun to be codified in some EHRs.175 For instance, LOINC provides an answer list for reported personal pronouns (https://loinc.org/90778-2/), and the HL7 Gender Harmony Project has announced inclusion of pronouns per their current balloting process (for more information on pronouns and for example use cases, see https://www.mypronouns.org/what-and-why and https://pronoun.is/). Usage of the correct name and pronouns is linked to lower risk of depression, suicidal ideation, and suicidal behavior among trans people.59,96,177

Adding pronoun-related information in EHRs can easily be modeled as part of the software, with example usage sentences provided using an automated system, some of which are open source.178–180 A potential setup for asking a person their pronouns is shown in Figure 2 but giving one’s pronouns and asking for the patients’ during all clinical interactions should be made routine in all patient care. Pronoun information can also be included with a list of utilized titles or honorifics. An algorithm connected to a person’s selected pronouns or titles can be used to automatically set the appropriate terms in the provider’s notes.

Figure 2.

Figure 2.

A potential question to ask an individual their pronouns. Individuals may use or be comfortable with multiple sets of pronouns, hence “choose all that apply.” Neopronouns were added based on the 2 most recent Gender Censuses (2020, 2021), in which over 24 000 and 44 000 nonbinary people, respectively, were queried about their terminological usages.181,182 Pronoun sets could be shifted in or out based on usage statistics on a local level over a period of time. Having a pronoun not listed could alert the medical provider about pronoun usage prior to the encounter. Selecting “prefer not to respond; prefer not to disclose” or the like could trigger a series of potential privacy options, wherein the patient could determine pronouns for various kinds of healthcare encounters depending on safety, presentation, etc.

While it is essential for the medical provider to have current and accurate information, it is also necessary for frontline staff to have sufficient training to conduct intake without misgendering or distressing transgender patients. Misgendering patients fosters distrust when the healthcare system is already perceived as unsafe by many transgender people and can lead to poor health outcomes.59,96 The healthcare system must also undergo transformational change, and we ask for more than just brief intervention-based training, but rather for a system in which transgender care knowledge is integrated into the pedagogy of healthcare disciplines.130

All gender-related demographic information should be modifiable by patients without a provider’s consent. Requiring a medical provider to change the data may (1) unnecessarily out the patient to the provider, (2) lead to potentially transphobic or wrong information being entered given the state of LGBTQ medical education, or (3) pathologize and medicalize one’s status as transgender. Regardless of how EHR data is recorded, transgender patients must be assured of their autonomy, privacy, and confidentiality throughout the process.183 As one trans person put it, “If you can’t even go to the bathroom without fearing for your safety, then why would you feel safe disclosing your [identity] in a… medical setting?”184

From the researcher’s perspective

Used appropriately, EHRs can improve the quality of care and research on transgender patient populations by facilitating consistency and comprehensiveness in population definition. Transgender prevalence estimates can differ significantly based on case definition; therefore, patient self-identification is the gold standard.85,185 Usage of methodologies such as ICD codes or free text linked primarily to procedures or diagnostics considered trans-related prioritizes White, middle-class transgender persons because these individuals showcase higher utilization of gender-affirming hormone therapies and surgeries,1,186,187 higher rates of insurance coverage,188 lower poverty rates,1,189 and lower mortality rates.190,191

There are 2 common methods for patient self-identification (Figure 3).136,138,156,157,192 Note that the answers provided in Figure 3 are placeholders and reference the set of tables derived in the additional supplementary tables, which provide further notes and international implications/considerations..

Both methods have significant advantages for researchers collecting data on transgender persons. In the first version, all individuals who responded “yes” in their most recent intake forms have self-identified as transgender.

The second casts a wider net. If a person selects “Male” for both questions, they are considered a cisgender man, whereas a person who selects “Female” as a gender identity then “Male” as their AGAB would be considered a transgender woman.192,193 This method also captures nonbinary persons who may not consider themselves transgender. It also includes persons who have undergone transition-related processes and no longer consider themselves transgender. It is crucial to emphasize that the gender identity question should be asked first, as asking for a person’s AGAB first implies that AGAB is an individual’s “real” gender, denying the individual’s lived experience.194

The 2-step methodology has the distinct advantage that we can split our transgender population into “assigned male at birth” or “assigned female at birth” (or other gender assignment designations), as well as into “trans women” (answering “Female,” then “Male”), “trans men” (answering “Male,” then “Female”), and “nonbinary persons” (if using umbrella terminology, all additional answers which differ and do not fit into binary categorizations). This disaggregation is useful in data collection because it accurately denotes the patient’s gender as indicated by self-report. It also enables providers to offer medically relevant testing and services based on AGAB (eg, pap smears and prostate exams) and organ inventory (see Additional Considerations). Best practice recommendations for screening of transgender patients should be individualized based on organ history and cumulative exposure to hormones.195 Implementation of these goals is facilitated by accurate SOGI documentation in the EHR, but administrators should be cautious of being overreliant on automated screening rules in any setting.196

The 2-step process leads to an increased ascertainment of transgender status.183,197 Cisgender and transgender persons understand the 2-step thoroughly and in a manner that makes for more complete and more accurate data analytics.192,193,198–200 The UCSF Center of Excellence for Transgender Health, Fenway Health in Boston, the Mayo Clinic, and the U.S. Centers for Disease Control and Prevention all recommend using some form of the 2-step process.201 Anecdotally, it has been mentioned that some individuals dislike the 2-step process. However, in the authors’ experience, this is usually because the reasoning behind asking the questions has not be adequately explained to the patient.193 In one author’s experience (C.A.K.), discussion of accuracy of data collection and what that data collection could be used for (for instance, to indicate correct reference ranges for medical testing), followed by discussion of privacy standards, ameliorated any distrust of the 2-step process.

To highlight the limitations of current EHR data capture, we examined the Medical Information Mart for Intensive Care-III (MIMIC-III) database, a de-identified database of the EHRs of over 46 000 emergency department patients.202 “Gender” in the system is defined as the “genotypical sex of the patient,” which can be analyzed as meaning AGAB.203 This is inaccurate, as (1) the patients were not mass genotyped and (2) investigation of clinical notes regarding persons labeled as transgender revealed that this was the correct AGAB about 54% of the time. Identification by ICD–Ninth Revision–Clinical Modification codes in the diagnostics table reveals that only 46% of patients identified as transgender in notes were identified as such by ICD codes.204–207 This may be due to a history (and current reality) of health insurance denials for trans persons, meaning that providers adapted to using non–trans-related codes to avoid treatments being flagged as trans care.208

To find out which patients were transgender men, transgender women, or nonbinary persons was difficult, as it required manual full review of all patients’ charts. No nonbinary persons were detected, but this could reflect patient discomfort disclosing their identity or the decision to report a binary trans identity to ease access to desired care.183 Alternatively, it might indicate that whoever entered the notes failed to correctly codify disclosed identities.183 If we assume that all transgender persons in the MIMIC-III sample were trans men or trans women, correct pronouns were used <40% of the time in notes.204,205 It is extremely likely that many transgender persons were not identified at all, as many do not disclose their status as transgender.1

ADDITIONAL CONSIDERATIONS

A number of proposals related to transgender data collection protocols include sections calling for organ inventories and karyotyping.209–211 However, there is little discussion considering wide implementation of such systems.210 Karyotyping can be performed relatively easily, but is omitted from most extant SOGI vocabularies, with atypical karyotype results tracked under disease or disorder hierarchies or on a “problem list.”

Organ inventories using a presence/absence system are effective in showcasing differences in anatomy quickly and can display more medically salient information than a person’s gender identity or AGAB alone.212 Some implementations permit addition of free-text comments to document pertinent anatomical details.213 All patients can benefit from usage of organ inventories, regardless of transgender status.

The difficulty with clinical decision making based on purely presence- or absence-based organ inventories lies in the plethora of possible permutations and health risk profiles associated with anatomical features, of which medical providers might not be aware. For instance, organs and tissues are oftentimes not clearly demarcated (eg, in ovotesticular tissue), some anatomical features may develop unexpectedly, some organs or tissues might be partially present in certain situations, and some organ combinations might be unanticipated. For instance, persons assigned female at birth persons have shown some evidence of prostatic tissue growth following extended testosterone therapy.214 There may also be difficulties representing chest cancer risks following many forms of chest reconstruction or contouring in individuals assigned female at birth.215,216 This can be a serious issue if an individual has a BRCA mutation or other familial cancer syndrome.217 Procedures like penis-preserving vaginoplasty or vagina-preserving phalloplasty218 may contradict binarist systems that do not consider it possible for an individual to have a penis and vagina simultaneously.

Therefore, usage of a modifiable presence and absence system with integrated anatomical terminology (such as that seen in SNOMED CT) offers a reasonable entry point. The patient would be provided a list of common organs to consider, with presence, absence, and something else options for each. At the bottom of the list, there would be a place to clarify the “something else” selections and to indicate any additional presence and absence data not included in the list. Following this, a provider or administrator would enter the information into the EHR, which could then be transformed into an image of the body with present and absent organs highlighted appropriately and with text notes included in full. Such information could be assessed quickly visually and be integrated easily into existing clinical workflows. A brief discussion of how this workflow could be tested and rolled out is included in Supplementary Appendix A, Supplement A.

Gender identity and AGAB questions are important to preserve alongside organ inventory, as organ status cannot always easily be ascertained. There are cases wherein a patient’s privacy should be respected, and cases in which invasive procedures may be required to “check” for particular organ statuses. Patients may not remember particular surgical and organ histories, be aware of them, or necessarily understand the associated vocabulary. This is pertinent to consider when caring for intersex persons, who may have had medically unnecessary surgeries performed on them without their knowledge,219–222 as well as when considering general language barriers in health care.223–225 Having both systems side by side reduces the chance for medical error.

In addition to organ inventories, karyotyping (and related genetic laboratory techniques) may be important to showcase when considering the impacts of certain chromosomal conditions.226–228 However, while karyotyping can be useful in select cases, it is ultimately less informative than other inventories that have been proposed. Karyotyping should only be performed or recorded at the discretion of the patient, involve full privacy protections, and should only be considered alongside other data for prognostics and diagnostics.

DISCUSSION AND CONCLUSION

In part owing to the absence of trans voices leading trans research teams, cisgender researchers doing trans research often fail to consider the ethical and material implications of their study designs, methods, and underlying assumptions.51,60,66,77,229–240 These researchers, and the informaticists who codify their assumptions, have contributed and continue to contribute to the epidemics of transphobic violence we see today.1,3,10,12,14,52,55,241–245

The 2-step validation process proposed above meets the needs of patients, clinicians, and researchers, and is preferred by all 3 groups.138,157,192 We therefore ask the American Medical Informatics Association and its members to endorse the 2-step method presented in Figure 3 as a starting point for collecting gender-related information in EHRs. It is important to carefully select the options provided in coordination with local gender-diverse communities to avoid anything inaccurate, confusing, pathologizing, transphobic, or intersexphobic. This involvement is vital to ensure locally relevant data collection, but also to afford community members the opportunity to be active participants in their own health. An outline of some identities to consider as a starting point is available as part of the additional supplementary tables. We also call for more research into ethical implementation of organ inventories, hormone inventories, and karyotyping in general medical settings and in EHR systems.

Further, cultural considerations should inform procedural aspects of data collection, including physical location and employee training. Financial support for transgender participation in coordinated, multidisciplinary community advisory boards and employee-based medical coding standards boards within healthcare-affiliated organizations to center the needs of trans persons in the EHR and improve clinical decision making is a critical next step to achieve data-driven, gender-affirming care.

FUNDING

TS was supported by grant number T32HS000029 from the Agency for Healthcare Research and Quality. HT was supported by the Agency for Healthcare Research & Quality Grant Number K12-HS-026385 and the National Institute on Drug Abuse grant number 5R25DA035692-08. The content is solely the responsibility of the authors and does not necessarily represent the official views of the Agency for Healthcare Research and Quality and those of the National Institute on Drug Abuse. The other authors received no specific funding for this work.

AUTHOR CONTRIBUTIONS

CK conceived the research idea and was the primary author of the manuscript. AE, FA, HT, TS, TG, LH, ZD, RQ, AR, EG, OD, EL, EP, SS, and RK contributed several critical revisions and provided significant feedback on discussed systems. AE performed the single analysis regarding the U.S. Transgender Survey data in footnote 10, covered under the auspices of her Institutional Review Board at the University of Southern California. EG designed Figure 1. SS designed Figures 2 and 3. All authors agree to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. CK performed analyses using the MIMIC-III database, which were published in prior work and did not specially require Institutional Review Board approval (all records therein are de-identified). A CITI training course (entitled “Data or Specimens Only Research”) was required and taken in order to gain access to the database.

SUPPLEMENTARY MATERIAL

Supplementary material is available at Journal of the American Medical Informatics Association online .

DATA AVAILABILITY STATEMENT

No new data were generated or analyzed in support of this piece. Requests for 2015 United States Transgender Survey data can be made here: https://www.ustranssurvey.org/data-requests. Information about requesting access to MIMIC-III records can be found at https://mimic.physionet.org/gettingstarted/access/.

CONFLICT OF INTEREST STATEMENT

None declared.

Supplementary Material

ocab136_Supplementary_Appendix_A

ACKNOWLEDGMENTS

This document adheres to the principles outlined in the Declaration of Helsinki. Special thanks to the following individuals for their critical assistance in this work: Judith W. Dexheimer, Laur Bereznai, and Os Keyes.

REFERENCES

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

ocab136_Supplementary_Appendix_A

Data Availability Statement

No new data were generated or analyzed in support of this piece. Requests for 2015 United States Transgender Survey data can be made here: https://www.ustranssurvey.org/data-requests. Information about requesting access to MIMIC-III records can be found at https://mimic.physionet.org/gettingstarted/access/.


Articles from Journal of the American Medical Informatics Association : JAMIA are provided here courtesy of Oxford University Press

RESOURCES