Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2025 Oct 4.
Published before final editing as: Psychol Methods. 2024 Apr 4:10.1037/met0000652. doi: 10.1037/met0000652

Will All Youth Answer Sexual Orientation and Gender-Related Survey Questions? An Analysis of Missingness in a Large US Survey of Adolescents and Young Adults

Sabra L Katz-Wise 1,2,3, Lynsie R Ranker 1,4, R Korkodilos 1, Jennifer Conti 4, Kimberly M Nelson 4,5, Ziming Xuan 4, Allegra R Gordon 4
PMCID: PMC11450101  NIHMSID: NIHMS2013655  PMID: 38573664

Abstract

Some researchers and clinicians may feel hesitant to assess sexual orientation and gender-related characteristics in youth surveys because they are unsure youth will respond to these questions or are concerned the questions are too sensitive and will cause discomfort or offense. This can result in missed opportunities to identify LGBTQ+ youth and address health inequities among this population. The aim of this study was to examine prevalence and sociodemographic patterns of missingness among survey questions assessing current sexual orientation, gender identity and expression (SOGIE) and past change in sexual orientation (sexual fluidity) among a diverse sample of US youth. Participants (N=4,245, ages 14-25 years; 95% cisgender, 70% straight/heterosexual, 53% youth of color), recruited from an online survey panel, completed the Wave 1 survey of the longitudinal Sexual Orientation Fluidity in Youth (SO*FLY) Study in 2021. Current SOGIE, past sexual fluidity, and sociodemographic characteristics were assessed for missingness. Overall, 95.7% of participants had no missing questions, 3.8% were missing one question, and 0.5% were missing ≥2 questions. Past sexual fluidity and assigned sex were most commonly missing. Sociodemographic differences between participants who skipped the SOGIE questions and the rest of the sample were minimal. Missingness for the examined items was low and similar across sociodemographic characteristics, suggesting that almost all youth are willing to respond to survey questions about SOGIE. SOGIE and sexual fluidity items should be included in surveys and clinical assessments of youth to inform clinical care, policy-making, interventions, and resource development to improve the health of all youth.

Keywords: adolescents, gender identity, gender expression, missingness, sexual fluidity, sexual orientation, young adults


The population of LGBTQ+ (lesbian, gay, bisexual, transgender, queer, and all sexual and gender minority identities) youth in the US is significant, representing an estimated 9.5% of youth, ages 13-17 years (Conron, 2020). The proportion of LGBTQ+ youth in the US has increased over time (Jones, 2022); for example, 14.3% of youth who responded to the Youth Risk Behavioral Surveillance Survey in 2017 identified as a sexual minority vs. 7.3% of youth in 2009 (Raifman et al., 2020). LGBTQ+ youth are more likely than their heterosexual and/or cisgender peers to report adverse mental, behavioral, and physical health outcomes (Hafeez et al., 2017). Given the health inequities and discrimination LGBTQ+ youth face, it is necessary to understand the breadth of identities and identity trajectories in youth. Improved identity measurement will allow researchers, practitioners, and advocates to accurately identify and address specific needs to improve the health of LGTBQ+ and all youth (Spock et al., 2022).

Health surveys that assess sexual orientation, gender identity, and gender expression (SOGIE) are necessary to inform legislation, government funding, interventions, and resource development (Streed et al., 2020). Despite its importance, research on LGBTQ+ youth health has been limited by a lack of standardized and consistently collected measures of SOGIE constructs (National Academies of Sciences, 2022). Additionally, when sexual orientation is assessed, it is often unidimensional, not considering the three primary dimensions of sexual orientation: identity, attractions, and behavior (Ruberg & Ruelos, 2020), which may not be congruent (Ybarra et al., 2019), and are uniquely linked to health (Dawson et al., 2022; Matthews et al., 2014).

Current research supports sexual orientation assessment in adult surveys (Brabete et al., 2020; Case et al., 2008; Jans et al., 2015; VanKim et al., 2010). An analysis of New Mexico health survey data found low nonresponse rates for sexual orientation questions, suggesting adults are willing to answer these questions, and recommending that future health surveys should assess sexual orientation (VanKim et al., 2010). Another study examining response rates for sexual orientation questions in the Nurses’ Health Study II found these questions did not prompt participants to leave the study (Case et al., 2008). Similarly, an analysis of demographic patterns of missing sexual orientation data in adults in the Canadian Community Health Survey found a low nonresponse rate for sexual orientation questions (Brabete et al., 2020). These findings suggest collecting this information among adults is feasible.

Despite existing research examining sexual orientation nonresponse in adult surveys (Brabete et al., 2020; Case et al., 2008; Jans et al., 2015; VanKim et al., 2010), little research has examined missingness among gender identity questions among either adults or youth. One exception, a recent study examining missingness of both sexual orientation and gender identity questions among adults in the Behavioral Risk Factor Surveillance System, found gender identity was less likely than sexual orientation identity to be missing and that predictors of missingness varied considerably (Jesdale, 2021). Data collection methods, particularly around demographics, often fail to capture complexities of LGBTQ+ identities and experiences (Ruberg & Ruelos, 2020). For example, sexual orientation questions may be imbued with binary ideas of sexuality and gender (Brabete et al., 2020), which may create difficulty for individuals who have past changes in sexual orientation identity and attractions (i.e., sexual fluidity) and/or are gender fluid. Surveys that only allow participants to select one identity do not capture the multiple terms that some participants use to describe their identities, leaving researchers with an incomplete or inaccurate understanding of LGBTQ+ experiences.

Gaps in literature around missingness of SOGIE and sexual fluidity survey questions are even more pronounced for youth. The US National Academies of Science, Engineering, and Medicine made recommendations about SOGIE measures for adults; however, the report stopped short of reviewing measures for youth, noting a need for further work in this area (National Academies of Sciences, 2022). One study assessing question nonresponse in eight school-based surveys administered from 1986-1999 found that nonresponse for sexual orientation was similar to other sexual-focused questions (Saewyc et al., 2004). However, this research has several limitations. The reliance on school-based samples misses youth who are not in school, either because they matriculated or left for another reason. Furthermore, these data were collected in the 1980’s and 1990’s; due to increasing public acceptance and awareness of LGBTQ+ identities, these findings are likely not representative of the likelihood of today’s youth answering such questions. The majority of prior research has also failed to examine missingness among gender identity and expression survey questions in youth, and missingness in sexual fluidity questions in any age group. Collecting SOGIE and sexual fluidity data is necessary to create systems to improve the health of all youth, particularly LGTBQ+ youth.

The aim of this study was to examine prevalence and sociodemographic patterns of missingness among SOGIE and sexual fluidity survey items administered to youth. Based on previous adult studies (Brabete et al., 2020; Case et al., 2008; Jans et al., 2015; Jesdale, 2021; VanKim et al., 2010), we hypothesized the prevalence of SOGIE missingness in this youth sample would be low. Due to lack of prior evidence to inform specific hypotheses, no hypotheses were proposed about sociodemographic patterns of missingness.

Method

Participants

Participants were 4,245 US youth, ages 14-25 years, who completed Wave 1 of the Sexual Orientation Fluidity in Youth (SO*FLY) Study (Katz-Wise et al., 2022). Eligibility criteria for SO*FLY included being age 14-25 years and living in the US. Eligible individuals were recruited from a participant panel from Prodege, an online survey firm, which recruits from TV, radio, and online advertisements. Potential participants were identified based on sociodemographic information previously provided as part of their participation in the Prodege panel, including sexual orientation, gender identity, and race/ethnicity. The full panel was 29% age 13-25 years, 65% female, 71% white, 15% Latina/o/x and/or Hispanic, 9% Asian, 10% Black or African American, and 10% another race/ethnicity. For the SO*FLY Study, LGBTQ+ youth and youth of color were oversampled and participants were sampled from across the US to enable subgroup analyses. Sociodemographic characteristics are in Table 1.

Table 1.

Sample Sociodemographic Characteristics (N=4,245)

SOGIE Characteristics n % Missing n (%)
Sexual orientation identity 17 (0.4)
  Straight/heterosexual 2,972 70.3
  Bisexual 624 14.8
  Gay or lesbian 181 4.3
  Pansexual 188 4.5
  Queer 43 1.0
  Asexual 76 1.8
  Not sure 112 2.7
  Another identity/identities 32 0.8
Attractions 11 (0.2)
  Girls/women only 1,509 35.6
  Boys/men only 1,750 41.3
  Nonbinary people only 32 0.8
  Another gender identity only 41 1.0
  Multiple gender identities 902 21.3
Sex assigned at birth 28 (0.7)
  Female 2,630 62.4
  Male 1,587 37.6
Gender identity 10 (0.2)
  Girl/woman 2,457 58.0
  Boy/man 1,602 36.6
  Nonbinary 140 3.3
  Another gender identity 3 0.1
  Selected multiple gender identities 33 0.8
Transgender status 17 (0.4)
  Do not identify as transgender 4,003 94.7
  Identify as transgender 116 2.7
  Not sure 91 2.2
  Don’t know what question is asking 18 0.4
Gender expression 16 (0.4)
  Very feminine 663 15.7
  Mostly feminine 994 23.5
  Somewhat feminine 522 12.3
  Equally feminine and masculine 543 12.8
  Somewhat masculine 358 8.5
  Mostly masculine 693 16.4
  Very masculine 456 10.8
Past change in sexual orientation identity
  Yes 698 16.6 51 (1.2)
  No 3,496 83.4
Past change in attractions 34 (0.8)
  Yes 1,386 32.9
  No 2,825 67.1
Sociodemographic Characteristics
Age group 0 (0)
 14-17 years 440 10.4
 18-21 years 1,734 40.9
 22-25 years 2,071 48.8
Gender identity (constructed)a 38 (0.9)
 Cisgender girl/woman 2,438 58.0
 Cisgender boy/man 1,540 36.6
 Transgender girl/woman 6 0.1
 Transgender boy/man 48 1.1
 Nonbinary/additional identity or identities 175 4.2
Race/ethnicity 21 (0.5)
 American Indian or Alaska Native 51 1.2
 Asian 378 9.0
 Black or African American 701 16.6
 Latina/o/x and/or Hispanic 206 4.9
 Native Hawaiian or Other Pacific Islander 12 0.3
 White 2,003 47.4
 Another race 23 0.5
 Selected multiple races/ethnicities 849 20.1
Urban-rural categoryb 5 (0.1)
 Large central metro 1,333 31.4
 Large fringe metro 1,051 24.8
 Medium metro 885 20.9
 Small metro 419 9.9
 Micropolitan 338 8.0
 Non-core 214 5.1
a

Variable constructed from sex assigned at birth and gender identity.

b

County determined by reported zip code, based on the National Center for Health Statistics urban-rural county categories, 2013.

Measures

Sociodemographic Characteristics

The following sociodemographic characteristics were used to describe the sample and examine missing data patterns: age in years, current gender identity, current sexual orientation identity, race/ethnicity, and rurality. Race and ethnicity were assessed with two separate items: “Are you Latina/o/x and/or Hispanic?” (Response options: yes, no) and “Which race(s) best describe you?” Participants could select all that applied from: American Indian or Alaska Native, Asian, Black or African American, White, Native Hawaiian or Other Pacific Islander, another race: (write-in). These items were combined into one race/ethnicity variable for analyses. Rurality was assessed by coding participants’ zip codes into categories (large central metro, large fringe metro, medium metro, small metro, micropolitan, non-core) (Ingram & Franco, 2012).

Current SOGIE and Past Sexual Fluidity

Missingness was examined for current sexual orientation identity, current attractions, current gender identity, sex assigned at birth, transgender status, current gender expression, and past change in sexual orientation identity and attractions.

Sexual orientation identity was assessed with one item adapted from prior studies (Centers for Disease Control and Prevention, 2019; Human Rights Campaign, 2018; Meyer, 2020): “Sexual orientation’ describes who you are attracted to and how you identify yourself based on those attractions. Sexual orientation may change over the course of people’s lives. Which of the following best describes your current sexual orientation?” Response options: straight/heterosexual, bisexual, gay or lesbian, pansexual, queer, asexual, not sure, another identity/identities (write in). Participants could select one response option.

Attractions were assessed with one item adapted from prior studies (Centers for Disease Control and Prevention, 2019; Human Rights Campaign, 2018; Meyer, 2020): “‘Attraction’ describes sexual or romantic feelings toward another person. Attractions may change over the course of people’s lives. Who are you currently attracted to?”. Response options: girls/women, boys/men, nonbinary people (e.g., genderqueer, gender non-conforming, another nonbinary identity), people of another gender identity: (write-in). Participants could select all that applied. The following mutually exclusive categories were created for analyses: girls/women only, boys/men only, nonbinary people only, people of another gender identity only, and multiple gender identities (attraction to ≥2 identities).

Gender identity was assessed with one item adapted from prior research (Bauer et al., 2017): “Which of the following best describes your current gender identity?” Response options: girl/woman; boy/man; nonbinary, e.g., genderqueer, gender non-conforming, another nonbinary identity; another gender identity (write-in). Participants could select all that applied. Sex assigned at birth was assessed with one recommended item (Williams Institute Scholars, 2020): “What sex were you assigned at birth, on your original birth certificate?” Response options: female, male. Gender identity categories were created from these items as follows: cisgender girl/woman (sex assigned at birth: female, gender identity: girl/woman), cisgender boy/man (sex assigned at birth: male, gender identity: boy/man), transgender girl/women (sex assigned at birth: male, gender identity: girl/woman), transgender boy/man (sex assigned at birth: female, gender identity: boy/man), nonbinary (gender identity: nonbinary with or without another identity or a nonbinary identity written in with or without girl/woman or boy/man).

Transgender status was assessed with one item adapted from prior research (Johns et al., 2022): “Do you identify as transgender?” Response options: no, I do not identify as transgender; yes, I do identify as transgender; I am not sure if I am transgender; I do not know what this question is asking. Participants could select one option.

Gender expression was assessed with one item adapted from prior research (Wylie et al., 2010): “A person’s appearance, style, dress, or the way they walk or talk may affect how people describe them. On average, how do you think other people would describe you?” Response options: Very feminine, mostly feminine, somewhat feminine, equally feminine and masculine, somewhat masculine, mostly masculine, very masculine. Participants could select one option.

Past sexual fluidity was assessed with two items adapted from prior research (Katz-Wise, 2015): 1) “Have you ever experienced a change in your sexual orientation identity? For example, identifying one way, then identifying another way.” 2) “Have you ever experienced a change in your attractions to others over time? For example, feeling attracted to only one gender, then feeling attracted to a different gender or more than one gender.” Response options for both items: yes, no.

Procedure

Participants completed Wave 1 surveys online via the Prodege survey platform in August 2021. Eligible individuals from the Prodege panel were presented with the survey length and incentive amount on a Prodege dashboard accessed either via an app or through a website. After clicking on the survey, participants saw the following introductory text: “Welcome! The following questions will ask about your sexual orientation and attraction(s). If you are not sure what a term means, please answer to the best of your ability. All responses will remain confidential.” Then participants provided informed consent and continued with the survey. Surveys assessed sociodemographic characteristics, including current SOGIE and past sexual fluidity (see Measures). Upon survey completion, all participants received mental health and support resources, and an incentive within Prodege’s structure of panel currency. Study procedures were approved by the Boston Children’s Hospital Institutional Review Board.

Statistical Analysis

We first examined sociodemographic characteristics of the overall sample (N=4,245), reporting the frequency and proportions for dichotomous and categorical variables. We then reported the proportion of data missing across individual variables. Finally, we qualitatively compared the proportion of missingness (for SOGIE variables with missing n ≥20) for specific sociodemographic subgroups to the full sample. Statistical tests of difference were not conducted due to small subgroup sample sizes. Analyses were conducted using SAS 9.4. This study was not preregistered. The data and study materials can be made available upon request.

Results

Sociodemographic Characteristics

Descriptive characteristics of the analytic sample are in Table 1. Participants were age 14-25 years; 48.8% were age 22-25 years. Participants were 47.4% white, 16.6% Black, 9.0% Asian, 4.9% Latina/o/x, and 1.2% American Indian or Alaska Native; 20.1% identified with multiple racial/ethnic groups. Over half lived in a large central (31.4%) or large fringe metro (24.8%) county; micropolitan and non-core (more rural counties) represented 13.1% of the sample.

Sexual orientation identities included: 70.3% straight/heterosexual, 14.8% bisexual, 4.5% pansexual, and 4.3% gay or lesbian. Most participants reported attraction to one gender (41.3% boys/men only, 35.6% girls/women only, 0.8% nonbinary only, and 1.0% to another gender); 21.3% reported attraction to multiple genders. Most participants identified as a cisgender girl/woman or cisgender boy/man with a little over 5% identifying as transgender (girl/woman, boy/man), nonbinary, or another identity/identities.

Prevalence of Missingness

Overall, 95.7% (4,063/4,245) of participants had complete data (no missing variables); 3.8% (160/4,245) were missing only one question. Only 22 (0.5%) were missing ≥2 questions (range of missing questions 0-4). The most commonly missing questions were past change in sexual orientation identity (n=51 missing), past change in attractions (n=34 missing), and sex assigned at birth (n=28 missing, Table 1). Missingness increased for questions toward the end of the survey (Table 2).

Table 2.

Proportion Missing Response by Survey Question Order (N=4,245)

Variable Survey Order Missing Count % Missing
Age 1 0 0.0
Race/ethnicitya 2 21 0.5
Urban-rural categoryb 3 5 0.1
Gender identity 4 10 0.2
Sex assigned at birth 5 28 0.7
Transgender status 6 17 0.4
Gender expression 7 16 0.4
Attractions 8 11 0.3
Current SOI 9 17 0.4
Past change in attractions 10 34 0.8
Past change in SOI 11 51 1.2
a

Variable constructed from two consecutive questions: Are you Latina/o/x and/or Hispanic? (n=13 missing) and What race(s) best describe you? (n=23 missing).

b

County was determined based on reported zip code, using the National Center for Health Statistics urban-rural county categories, 2013.

Sociodemographic Patterns of Missingness

We examined sociodemographic patterns of missing responses to the most commonly missing questions (Table 3). Qualitatively, there was limited variation in the sociodemographic characteristics of those missing the sexual orientation identity change question (n=51) compared to the overall sample, with small overrepresentation of: age 18-21 years, cisgender girls/women, participants identifying as Latina/o/x and/or Hispanic, and participants living in large central metro or small metro areas. Limited sociodemographic variability was also found among those missing the attraction change question compared to the overall sample, with small overrepresentation of age 18-21 years; cisgender boys/men; straight/heterosexual participants; American Indian or Alaska Native, Black or African American, or Latina/o/x and/or Hispanic participants; and participants living in large central metro or micropolitan/non-core areas. Those missing the sex assigned at birth question were similar to the overall sample with a slight overrepresentation of individuals identifying as pansexual. However, sociodemographic variations were small across these three questions, suggesting minimal differences between those who skipped these questions and the full sample.

Table 3.

Sociodemographic Differences in Missingness in Sexual Orientation and Past Sexual Fluidity (N=4,245)

Overall Sample
(N=4,245)
Missing Sex Assigned
at Birth (N=28)
Missing Attraction
Change (N=34)
Missing Sexual
Orientation Identity
Change (N=51)
n % n % n % n %
Age group
 14-17 years 440 10.4 2 7.1 4 11.8 7 13.7
 18-21 years 1,734 40.9 12 42.9 18 52.9 28 54.9
 22-25 years 2,071 48.8 14 50.0 12 35.3 16 31.4
 Missing 0 0.0 0 0.0 0 0.0 0 0.0
Gender identity (constructed)b
 Cisgender girl/woman 2,438 57.4 - 20 58.8 31 60.8
 Cisgender boy/man 1,540 36.3 - 14 41.2 19 37.3
 Transgender girl/woman 6 0.1 - 0 0.0 0 0.0
 Transgender boy/man 48 1.1 - 0 0.0 0 0.0
 Nonbinary/additional identity or identities 175 4.1 - 0 0.0 1 2.0
 Missing 38 0.2 28 100.0 0 0.0 0 0.0
Sexual orientation identity
 Straight/heterosexual 2,972 70.0 20 71.4 25 73.5 34 66.7
 Bisexual 624 14.7 3 10.7 3 8.8 7 13.7
 Gay or lesbian 181 4.3 1 3.6 2 5.9 3 5.9
 Pansexual 188 4.4 3 10.7 0 0.0 0 0.0
 Queer 43 1.0 0 0 0 0.0 0 0.0
 Asexual 76 1.8 0 0 0 0.0 0 0.0
 Not sure 112 2.6 1 3.6 0 0.0 2 3.9
 Another identity/identities 32 0.8 0 0 0 0.0 0 0.0
 Missing 17 0.4 0 0 4 11.8 5 9.8
Race/ethnicity
 American Indian or Alaska Native 51 1.2 0 0 3 8.8 1 2.0
 Asian 378 8.9 1 3.6 2 5.9 2 3.9
 Black or African American 701 16.5 5 17.9 8 23.5 7 13.7
 Latina/o/x and/or Hispanic 206 4.9 1 3.6 7 20.6 7 13.7
 Native Hawaiian or Other Pacific Islander 12 0.3 0 0 0 0.0 1 2.0
 White 2,003 47.2 14 50.0 12 35.3 17 33.3
 Another race 23 0.5 0 0 0 0.0 1 2.0
 Selected multiple races/ethnicities 849 20.0 6 21.4 2 5.9 12 23.5
 Missing 21 0.5 1 3.6 0 0.0 3 5.9
Urban-rural categoryb
 Large central metro 1,333 31.4 9 32.1 13 38.2 19 37.3
 Large fringe metro 1,051 24.8 7 25.0 4 11.8 8 15.6
 Medium metro 885 20.9 7 25.0 5 14.7 10 19.6
 Small metro 419 9.9 3 10.7 5 14.7 7 13.7
 Micropolitan 338 8.0 2 7.1 5 14.7 5 9.8
 Non-core 214 5.0 0 0 2 5.9 2 3.9
 Missing 5 0.1 0 0 0 0.0 0 0.0
a

Variable constructed from sex assigned at birth and gender identity.

b

County was determined based on zip code, using the National Center for Health Statistics urban-rural county categories, 2013.

Discussion

The aim of this study was to examine prevalence and sociodemographic patterns of missingness among SOGIE and sexual fluidity survey questions in a large sample of US youth. This study represents, to our knowledge, the first exploration of missingness patterns across a broad range of SOGIE-related survey items and the first to explore these missingness patterns among adolescents. Further, it fills a gap in the literature regarding missingness in survey questions assessing gender expression and provides the first exploration of missingness in sexual fluidity survey questions in any age group.

Consistent with previous adult literature (Brabete et al., 2020; Case et al., 2008; Jans et al., 2015; Jesdale, 2021; VanKim et al., 2010), our study found low rates of missingness (<5%) in SOGIE and sexual fluidity questions. These findings directly challenge the idea that SOGIE-related questions are too sensitive to include in youth surveys. Considering that most of our sample identified as heterosexual (70.3%) and cisgender (94.6%), our findings also challenge assumptions that these youth will not answer SOGIE-related questions due to discomfort or offense. This is consistent with prior research on patient and clinician perspectives on collecting SOGIE data (Maragh-Bass et al., 2017).

In contrast with a prior adult study (Brabete et al., 2020), we found only small sociodemographic differences in missingness patterns. This may be due in part to small subgroup sample sizes, likely making proportions sensitive to small sociodemographic differences across questions; results should be interpreted with this limitation in mind. We also did not assess income or education levels, which have previously been cited alongside increased age as potential factors in data missingness patterns (Brabete et al., 2020). Further research is needed to understand the influence of these sociodemographic characteristics on SOGIE response probabilities in youth.

To our knowledge, no existing literature has assessed youth willingness to answer questions regarding past sexual fluidity. Importantly, we found the number of individuals missing sexual fluidity data was small (n=51 missing identity change and n=34 missing attraction change out of 4,245, respectively), and appeared to be influenced by survey fatigue (i.e., these questions were toward the end of the survey) rather than hesitance to answer. This suggests that youth are open to answering questions about sexual fluidity, providing support for their inclusion in surveys.

While we were unable to conduct statistical tests of differences in sexual fluidity missingness among sociodemographic subgroups due to small sample sizes, missingness was more common among cisgender individuals and those residing in rural areas. This may be a function of experience; some research has found prevalence of sexual fluidity among cisgender individuals is lower than among transgender and nonbinary individuals (Katz-Wise et al., 2016, 2022). It is possible that cisgender individuals are more likely to skip these questions because they lack these experiences or are unsure how to respond. Similarly, individuals in rural areas may be less familiar with the concept of sexual fluidity (and other aspects of sexual orientation) than individuals in urban areas (Page, 2017), and may be more likely to skip these questions. More research is needed to understand why these groups may be slightly less likely to respond to sexual fluidity questions.

Study Strengths and Limitations

Our study has several strengths. Our sample included responses from a large, racially/ethnically and gender diverse sample of US youth, ages 14-25 years. Previous studies of missingness typically collected data from adults aged 18 years and older; our findings provide new insight into the unique experiences of adolescents. Additionally, our measures of sexual orientation identity, attractions, and gender identity were inclusive of nonbinary identities, allowed for the selection of multiple response options, and included an open-ended option for self-identification.

Online data collection is a strength, contributing to the diversity of our sample. However, the willingness of individuals to respond to SOGIE questions anonymously and online may not reflect the willingness of individuals to respond to similar questions in-person or in other non-anonymized settings (e.g., clinics or schools) that may compromise safety or confidentiality (Meckler et al., 2006; Robertson et al., 2018; Temkin et al., 2017). In addition, the current study oversampled LGBTQ+ youth and recruitment materials described the study as focusing on sexual orientation and attractions. Thus, individuals who chose to participate in this study may be more willing to answer SOGIE and sexual fluidity questions. Use of a convenience sample from an existing online participant panel has limitations. The Prodege panel may have specific unknown characteristics that are not shared by all US youth. For example, individuals are recruited to participate in the Prodege panel via advertisements on TV, radio, and online. Although most US youth have access to the internet, particularly via smartphones (Vogels et al., 2022), the panel does not include youth who were not exposed to Prodege advertisements on one of these platforms. Thus, our findings may not be generalizable to the broader US youth population.

Our survey was US-based and only available in English. Given the potential influence of location, language, and cultural background on societal norms and understandings of SOGIE and LGBTQ+ identities (Jans et al., 2015; Michaels et al., 2017; Reisner et al., 2014), more research is needed to assess missingness patterns in other geographic and cultural contexts. Finally, the measure used to assess attraction did not assess degrees of attraction to each gender, which did not allow participants who were attracted to more than one gender to indicate that they were more or less attracted to each gender group (e.g., being mostly attracted to one gender, but sometimes attracted to other genders). Future research could assess attraction with greater nuance to capture a wider diversity of youths’ experiences with attraction.

The SO*FLY study is a prospective cohort study with seven waves of data collection with the same participants. Thus, we are well-positioned to conduct future analyses examining prospective changes in SOGIE, which can play a key role in improving inclusivity of SOGIE questions. The language individuals use to identify their sexual orientation and gender identity is vast and can change over time (Eliason & Streed, 2017; Russell et al., 2009). Prospectively tracking these changes with open-ended questions may lead to faster implementation of questions with response options that better reflect participants’ identities. Doing so may increase likelihood of answering these questions, though to our knowledge, this has not been examined.

Implications for Research and Practice

Relatively low levels of missingness in this study suggest youth are willing to respond to questions about SOGIE and sexual fluidity. Thus, SOGIE questions should be included in youth surveys, as well as in clinical practice with youth. The Youth Risk Behavior Surveillance System includes some SOGIE items (e.g., sexual orientation identity and gender identity) but could be improved by adding items assessing attractions and sexual fluidity (Centers for Disease Control, 2022). Given the prevalence of sexual fluidity among youth, it is pertinent to assess these questions at multiple time points. Asking SOGIE and sexual fluidity questions in research and clinical settings acknowledges the multidimensional, dynamic nature of sexual identity, behavior, and attraction, and improves our understanding of how these factors develop and shift over time (Ruberg & Ruelos, 2020; Suen et al., 2020). It also allows researchers and clinicians to identify and address the unique needs of LGBTQ+ individuals and the health inequities experienced within this population (National Academies of Sciences, 2020).

Findings from this study also have implications for methods used to address missingness in survey data, particularly the use of multiple imputations (MI). While MI is a common method to address bias due to missingness, one should consider the magnitude of missingness, type of missingness, and variables with missingness for MI use with SOGIE and sexual fluidity questions. First, MI may not be necessary if the levels of missingness are already low among these questions. Although guidance varies, MI provides negligible benefit with less than 5% missingness (Schafer, 1999). However, when there are large amounts of missingness, MI can introduce bias not present in a complete case analysis. Second it can be challenging to assume that data in SOGIE and sexual fluidity questions is missing at random (MAR) as the missingness only depends on observed data. The use of multiple imputation for these questions may not fully address non-response bias if they are missing not at random (MNAR), even after conditioning on observed data (Li et al., 2015). Third, missingness on the type of variables (exposure, confounders, covariates, or outcome) also affects the role of missingness on bias and efficiency, thus influencing MI’s performance in improving them (Lee & Carlin, 2012). Finally, in addition to empirical considerations, there may be ethical concerns related to imputing data for social identity-related questions (Brown et al., 2021; Randall et al., 2021; Woods et al., 2023). We encourage researchers who are considering use of MI for missingness of SOGIE and sexual fluidity questions to first address these considerations.

Conclusions

Our findings represent a significant contribution to literature exploring SOGIE and sexual fluidity. Missingness for the examined questions was low in this sample and differences in patterns of missingness across sociodemographic subgroups were small. Higher levels of missingness found toward the end of the survey suggests survey fatigue, rather than lack of willingness to respond. SOGIE-related questions should be included in surveys and clinical assessments to inform clinical care, policy-making, interventions, and resource development to improve the health of all youth.

Acknowledgments

The authors would like to thank Liam Keohane and Neeki Parsa for their contributions to this work, the Harvard Sexual Orientation and Gender Identity and Expression Health Equity Research Collaborative (Harvard SOGIE), and the SO*FLY Study participants who contributed data to this project.

This research was supported by the National Institute on Minority Health and Health Disparities (R21MD015838). Sabra Katz-Wise and Allegra Gordon were also funded by the Maternal and Child Health Bureau, Health Resources and Services Administration (Leadership Education in Adolescent Health project 6T71-MC00009). Allegra Gordon is supported by the National Institute on Drug Abuse (K01DA054357). Jennifer Conti, Allegra Gordon, and Kimberly Nelson were also funded by EY (Ernst & Young) on a separate research project at the time this study was conducted. The study sponsors did not have any role in study design; collection, analysis, and interpretation of data; writing the report; or the decision to submit the report for publication.

This study was not preregistered. The data and study materials can be made available upon request.

Findings from this research were presented at the annual meeting of the Society for Research on Adolescence in San Diego, CA in 2023.

Footnotes

Sabra Katz-Wise is a diversity consultant for Paramount Global. The other authors do not have any conflicts of interest to disclose.

References

  1. Bauer GR, Braimoh J, Scheim AI, & Dharma C (2017). Transgender-inclusive measures of sex/gender for population surveys: Mixed-methods evaluation and recommendations. PLOS ONE, 12(5), e0178043. 10.1371/JOURNAL.PONE.0178043 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Brabete AC, Doray-Demers P, & Vissandjée B (2020). Where sexual orientation has no name: Sexual orientation missing data in the Canadian Community Health Survey (2005-2014). The Journal of Sex Research, 57(8), 997–1004. 10.1080/00224499.2020.1772704 [DOI] [PubMed] [Google Scholar]
  3. Brown KS, Su Y, Jagganath J, Rayfield J, & Randall M (2021). Ethics and empathy in using imputation to disaggregate data for racial equity: Landscape scan findings. https://www.urban.org/sites/default/files/publication/104678/ethics-and-empathy-in-using-imputation-to-disaggregate-data-for-racial-equity.pdf [Google Scholar]
  4. Case P, Austin SB, Hunter DJ, Willett WC, Malspeis S, Manson JE, & Spiegelman D (2008). Disclosure of sexual orientation and behavior in the Nurses’ Health Study II. 10.1300/J082v51n01_02, 51(1), 13–31. [DOI] [PubMed] [Google Scholar]
  5. Centers for Disease Control. (2022). Youth Risk Behavior Survey (YRBS) 2023 Standard Questionnaire Item Rationale. https://www.cdc.gov/healthyyouth/data/yrbs/pdf/2023/2023_Standard_YRBS_Item_Rationale.pdf
  6. Centers for Disease Control and Prevention. (2019). 2019 National Youth Risk Behavior Survey. https://www.cdc.gov/healthyyouth/data/yrbs/pdf/2019/2019_YRBS-National-HS-Questionnaire.pdf.
  7. Conron K. (2020). LGBT youth population in the United States. https://williamsinstitute.law.ucla.edu/publications/lgbt-youth-pop-us/ [Google Scholar]
  8. Dawson CA, Ehlke SJ, Lewis RJ, Amerson R, Braitman AL, Shappie AT, & Heron KE (2022). A latent class analysis of sexual identity, attraction, and behavior among young sexual-minority women. Psychology of Sexual Orientation and Gender Diversity. 10.1037/sgd0000594 [DOI] [Google Scholar]
  9. Eliason MJ, & Streed CG (2017). Choosing “Something Else” as a Sexual Identity: Evaluating Response Options on the National Health Interview Survey. Https://Home.Liebertpub.Com/Lgbt, 4(5), 376–379. 10.1089/LGBT.2016.0206 [DOI] [PubMed] [Google Scholar]
  10. Hafeez H, Zeshan M, Tahir MA, Jahan N, & Naveed S (2017). Health care disparities among lesbian, gay, bisexual, and transgender youth: A literature review. Cureus, 9(4). 10.7759/CUREUS.1184 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Human Rights Campaign. (2018). Growing Up LGBT in America: HRC Youth Survey Report Key Findings. . https://www.hrc.org/resources/about-the-survey [Google Scholar]
  12. Ingram DD, & Franco SJ (2012). NCHS urban-rural classification scheme for counties. Vital and Health Statistics. Series 2, Data Evaluation and Methods Research, 154, 1–65. https://europepmc.org/article/med/22783637 [PubMed] [Google Scholar]
  13. Jans M, Viana J, Grant D, Cochran SD, Lee AC, & Ponce NA (2015). Trends in sexual orientation missing data over a decade of the California health interview survey. American Journal of Public Health, 105(5), e43–e50. 10.2105/AJPH.2014.302514 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Jesdale BM (2021). Sources of Missing Sexual Orientation and Gender Identity Data in the Behavioral Risk Factor Surveillance System. American Journal of Preventive Medicine, 61(2), 281–290. 10.1016/j.amepre.2021.02.027 [DOI] [PubMed] [Google Scholar]
  15. Johns MM, Lowry R, Andrzejewski J, Barrios LC, Demissie Z, McManus T, Rasberry CN, Robin L, & Underwood JM (2022). Transgender Identity and Experiences of Violence Victimization, Substance Use, Suicide Risk, and Sexual Risk Behaviors Among High School Students — 19 States and Large Urban School Districts, 2017. MMWR. Morbidity and Mortality Weekly Report, 68(3), 67–71. 10.15585/MMWR.MM6803A3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Jones JBM (2022). LGBT Identification in U.S. Ticks Up to 7.1%. https://news.gallup.com/poll/389792/lgbt-identification-ticks-up.aspx [Google Scholar]
  17. Katz-Wise SL (2015). Sexual fluidity in young adult women and men: associations with sexual orientation and sexual identity development. Psychology & Sexuality, 6(2), 189–208. 10.1080/19419899.2013.876445 [DOI] [Google Scholar]
  18. Katz-Wise SL, Ranker LR, Gordon AR, Xuan Z, & Nelson K (2022). Sociodemographic patterns in retrospective sexual orientation identity and attraction change in the Sexual Orientation Fluidity in Youth Study. Journal of Adolescent Health. 10.1016/j.jadohealth.2022.10.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Katz-Wise SL, Reisner SL, Hughto JW, & st. Amand C (2016). Differences in sexual orientation diversity and sexual fluidity in attractions among gender minority adults in Massachusetts. The Journal of Sex Research, 53(1), 74–84. 10.1080/00224499.2014.1003028 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Lee KJ, & Carlin JB (2012). Recovery of information from multiple imputation: a simulation study. Emerging Themes in Epidemiology, 9(1), 3. 10.1186/1742-7622-9-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Li P, Stuart EA, & Allison DB (2015). Multiple Imputation. JAMA, 314(18), 1966. 10.1001/jama.2015.15281 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Maragh-Bass AC, Torain M, Adler R, Schneider E, Ranjit A, Kodadek LM, Shields R, German D, Snyder C, Peterson S, Schuur J, Lau B, & Haider AH (2017). Risks, Benefits, and Importance of Collecting Sexual Orientation and Gender Identity Data in Healthcare Settings: A Multi-Method Analysis of Patient and Provider Perspectives. Https://Home.Liebertpub.Com/Lgbt, 4(2), 141–152. 10.1089/LGBT.2016.0107 [DOI] [PubMed] [Google Scholar]
  23. Matthews DD, Blosnich JR, Farmer GW, & Adams BJ (2014). Operational Definitions of Sexual Orientation and Estimates of Adolescent Health Risk Behaviors. LGBT Health, 1(1), 42–49. 10.1089/lgbt.2013.0002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Meckler GD, Elliott MN, Kanouse DE, Beals KP, & Schuster MA (2006). Nondisclosure of sexual orientation to a physician among a sample of gay, lesbian, and bisexual youth. Archives of Pediatrics & Adolescent Medicine, 160(12), 1248–1254. 10.1001/ARCHPEDI.160.12.1248 [DOI] [PubMed] [Google Scholar]
  25. Meyer IH (2020). Generations: A Study of the Life and Health of LGB People in a Changing Society, United States, 2016-2019. https://www.icpsr.umich.edu/web/DSDR/studies/37166 [Google Scholar]
  26. Michaels S, Milesi C, Stern M, Viox MH, Morrison H, Guerino P, Dragon CN, & Haffer SC (2017). Improving Measures of Sexual and Gender Identity in English and Spanish to Identify LGBT Older Adults in Surveys. Https://Home.Liebertpub.Com/Lgbt, 4(6), 412–418. 10.1089/LGBT.2016.0168 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. National Academies of Sciences, E. and Medicine. (2020). Understanding the well-being of LGBTQI+ populations (Patterson CJ, Sepúlveda M-J, & White J, Eds.). National Academies Press. 10.17226/25877 [DOI] [PubMed] [Google Scholar]
  28. National Academies of Sciences, E. and Medicine. (2022). Measuring sex, gender identity, and sexual orientation. In Measuring Sex, Gender Identity, and Sexual Orientation. National Academies Press. 10.17226/26424 [DOI] [PubMed] [Google Scholar]
  29. Page ML (2017). From awareness to action: Teacher attitude and implementation of LGBT-inclusive curriculum in the English language arts classroom. SAGE Open, 7(4). 10.1177/2158244017739949/ASSET/IMAGES/LARGE/10.1177_2158244017739949-FIG2.JPEG [DOI] [Google Scholar]
  30. Raifman J, Charlton BM, Arrington-Sanders R, Chan PA, Rusley J, Mayer KH, Stein MD, Austin SB, & McConnell M (2020). Sexual Orientation and Suicide Attempt Disparities Among US Adolescents: 2009–2017. Pediatrics, 145(3). 10.1542/peds.2019-1658 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Randall M, Stern A, & Su Y (2021). Five ethical risks to consider before filling missing race and ethnicity: Data workshop findings on the ethics of data imputation and related methods. https://www.urban.org/sites/default/files/publication/103830/five-ethical-risks-to-consider-before-filling-missing-race-and-ethnicity-data-workshop-findings_0.pdf [Google Scholar]
  32. Reisner SL, Biello K, Rosenberger JG, Austin SB, Haneuse S, Perez-Brumer A, Novak DS, & Mimiaga MJ (2014). Using a Two-Step Method to Measure Transgender Identity in Latin America/the Caribbean, Portugal, and Spain. Archives of Sexual Behavior, 43(8), 1503–1514. 10.1007/s10508-014-0314-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Robertson RE, Tran FW, Lewark LN, & Epstein R (2018). Estimates of Non-Heterosexual Prevalence: The Roles of Anonymity and Privacy in Survey Methodology. Archives of Sexual Behavior, 47(4), 1069–1084. 10.1007/s10508-017-1044-z [DOI] [PubMed] [Google Scholar]
  34. Ruberg B, & Ruelos S (2020). Data for queer lives: How LGBTQ gender and sexuality identities challenge norms of demographics. Big Data & Society, 7(1), 205395172093328. 10.1177/2053951720933286 [DOI] [Google Scholar]
  35. Russell ST, Clarke TJ, & Clary J (2009). Are Teens “Post-Gay”? Contemporary Adolescents’ Sexual Identity Labels. Journal of Youth and Adolescence, 38(7), 884–890. 10.1007/s10964-008-9388-2 [DOI] [PubMed] [Google Scholar]
  36. Saewyc EM, Bauer GR, Skay CL, Bearinger LH, Resnick MD, Reis E, & Murphy A (2004). Measuring sexual orientation in adolescent health surveys: Evaluation of eight school-based surveys. Journal of Adolescent Health, 35(4), 345.e1–345.e15. 10.1016/J.JADOHEALTH.2004.06.002 [DOI] [PubMed] [Google Scholar]
  37. Schafer JL (1999). Multiple imputation: a primer. Statistical Methods in Medical Research, 8(1), 3–15. 10.1177/096228029900800102 [DOI] [PubMed] [Google Scholar]
  38. Spock A, Popkin R, & Barnhart C (2022). Strategies to Improve Measurement of Sexual Orientation and Gender Identity Among Youth. Journal of Adolescent Health. 10.1016/j.jadohealth.2022.09.009 [DOI] [PubMed] [Google Scholar]
  39. Streed CG, Grasso C, Reisner SL, & Mayer KH (2020). Sexual orientation and gender identity data collection: Clinical and public health importance. 10.2105/AJPH.2020.305722, 110(7), 991–993. 10.2105/AJPH.2020.305722 [DOI] [Google Scholar]
  40. Suen LW, Lunn MR, Katuzny K, Finn S, Duncan L, Sevelius J, Flentje A, Capriotti MR, Lubensky ME, Hunt C, Weber S, Bibbins-Domingo K, & Obedin-Maliver J (2020). What Sexual and Gender Minority People Want Researchers to Know About Sexual Orientation and Gender Identity Questions: A Qualitative Study. Archives of Sexual Behavior, 49(7), 2301–2318. 10.1007/S10508-020-01810-Y/TABLES/6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Temkin D, Belford J, McDaniel T, Stratford B, & Parris D (2017). Improving Measurement of Sexual Orientation and Gender Identity Among Middle and High School Students. Child Trends, 22, 1–64. [Google Scholar]
  42. VanKim NA, Padilla JL, Lee JGL, & Goldstein AO (2010). Adding sexual orientation questions to statewide public health surveillance: New Mexico’s experience. American Journal of Public Health, 100(12), 2392–2396. 10.2105/AJPH.2009.186270 [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Vogels EA, Gelles-Watnick R, & Massarat N (2022). Teens, social media and technology 2022. [Google Scholar]
  44. Williams Institute Scholars. (2020). Sexual orientation and gender identity (SOGI) adult measures recommendations FAQs. [Google Scholar]
  45. Woods AD, Gerasimova D, Van Dusen B, Nissen J, Bainter S, Uzdavines A, Davis-Kean PE, Halvorson M, King KM, Logan JAR, Xu M, Vasilev MR, Clay JM, Moreau D, Joyal-Desmarais K, Cruz RA, Brown DMY, Schmidt K, & Elsherif MM (2023). Best practices for addressing missing data through multiple imputation. Infant and Child Development. 10.1002/icd.2407 [DOI] [Google Scholar]
  46. Wylie SA, Corliss HL, Boulanger V, Prokop LA, & Austin SB (2010). Socially Assigned Gender Nonconformity: A Brief Measure for Use in Surveillance and Investigation of Health Disparities. Sex Roles, 63(3–4), 264–276. 10.1007/s11199-010-9798-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Ybarra ML, Price-Feeney M, & Mitchell KJ (2019). A cross-sectional study examining the (in)congruency of sexual identity, sexual behavior, and romantic attraction among adolescents in the US. The Journal of Pediatrics, 214, 201–208. 10.1016/j.jpeds.2019.06.046 [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES