Skip to main content
International Journal of Developmental Disabilities logoLink to International Journal of Developmental Disabilities
. 2023 Apr 10;70(8):1343–1366. doi: 10.1080/20473869.2023.2193490

The influence of Spanish language on autism spectrum disorder screening: a systematic review

Casie H Morgan 1,, Monica Abdul-Chani 1, Gabriela M Sherrod 1, Kathryn P King 1, Kristi Carter Guest 1
PMCID: PMC11660367  PMID: 39713504

Abstract

Objective

In the United States, autism spectrum disorder (ASD) is significantly less diagnosed in Spanish-speaking populations compared to other racial/ethnic groups. Structural barriers such as transportation, insurance, and time have been documented within this population. However, many families also report language as a barrier. The present systematic literature review aims to evaluate research findings on the influence of Spanish language and culture on the ASD screening process.

Method

Five databases were searched using key words relating to the aim of the paper to identify potential articles to review. Inclusion criteria included: (1) sample of participants diagnosed with ASD/being evaluated for ASD, (2) Latinx populations that included Spanish-speaking participants, (3) a reference to language included as a research aim or study outcome, and (5) a focus on the screening process or measures for ASD. A total of 132 articles were initially identified.

Results

After reviewing for inclusion criteria, 22 articles were included. In total, six countries were represented across all included papers. Major themes emerged regarding barriers to care, cultural awareness, translation barriers, intervention, knowledge/awareness, and screener evaluation.

Discussion

Findings indicate that language and sociocultural differences can affect knowledge development, communication within healthcare settings, and result in a limited number of translated resources available for the heterogeneous population of Spanish-speakers.

Keywords: Autism, Spanish language, assessment, evaluation, review

Introduction

Autism spectrum disorder (ASD) is a neurodevelopmental disorder characterized by impairments in social communication and the presence of stereotyped behaviors, interests, and activities (American Psychiatric Association (APA) 2013). In the United States (U.S.), the average age of ASD diagnosis is four years old (Christensen et al. 2016); however, early diagnosis, which can occur as early as 18 months of age, is critical for prognostic outcomes, as it allows access to interventions targeting language and behavioral development (MacDonald et al. 2014). Consequently, ASD screening procedures have become an important facet of developmental surveillance. The American Academy of Pediatrics (AAP) recommends formal ASD screening at ages 18 and 24 months (Hyman et al. 2020), and an estimated 63% of pediatricians report using developmental screening tools in their practice (Lipkin et al. 2020).

ASD has a prevalence rate of 1 in 44 children, and prevalence is similar across racial and ethnic groups (Maenner et al. 2021). However, Latinx, individuals with Latin American heritage, children with ASD are more likely to be under-identified (Maenner et al. 2021) or misdiagnosed (Magaña et al. 2013, Christensen et al. 2016) compared with non-Latinx Black and White children. Furthermore, Latinx children receive ASD diagnoses at an even later age than non-Latinx Black and White children (Jo et al. 2015). Without timely and appropriate diagnostic identification, beginning with appropriate screening, Latinx children are at a disadvantage with respect to accessing treatment and services, consequently impacting health equity for this population.

A plurality of factors may contribute to this differential rate of diagnosis, including different symptom presentation patterns (Magaña and Smith 2013) and socio-structural barriers (Bornstein and Cote 2004). Latinx families voice that limited ASD information, stigma, poverty, lack of empowerment to utilize services, dismissal of parental concern by providers, and length of diagnostic process are predominant barriers to seeking ASD-related care (Zuckerman et al. 2014). Families report that these barriers have led to dismissal of concerns, normalization of red flag behaviors, and perpetuation of mistrust in the medical system (Zuckerman et al. 2014).

The literature has traditionally ignored language when explaining factors influencing ASD detection among primarily Spanish-speaking individuals and Latinos, preferring instead to focus on the above cultural factors (Troxel et al. 2018). However, evidence suggests that language plays a central role in health outcomes. For example, a population-based study of Latinx individuals demonstrated those with high levels of acculturation (i.e. effective English communication and endorsing mainstream ideas related to U.S. current events) reported greater ease in accessing care, understanding ASD, and receiving ASD treatment compared to Latinx individuals with low levels of acculturation (Voelkel et al. 2013). The present study aims to address this gap by evaluating the influence of language in the ASD screening process within the context of cultural factors. Therefore, the goal of this review is to strengthen the understanding of how the Spanish language and Spanish culture combined may impact ASD screening with Spanish-speaking families. As screening represents a critical first step in the pathway to early diagnosis and intervention, it is essential to understand the ways in which use of Spanish language interacts with and affects the screening process for both families and providers. The present review provides an evaluation of Spanish language from a sociocultural and relational perspective rather than an examination of linguistics like grammar, or syntax. Specifically, this systematic review aims to synthesize research findings on the influence of Spanish language on the ASD screening process. This review is exploratory in nature; however, the following research questions were considered: 1) How does Spanish language impact the relationship between patients and providers? 2) What are some of the barriers related to screening for ASD in Spanish? 3) What current measures exist for screening for ASD in Spanish, how are they administered, and how is the quality of those screeners? 4) How can we improve ASD screening in the U.S. for Spanish-speaking populations? These questions were addressed by summarizing evaluative studies of Spanish-translated ASD screeners including psychometrics, strengths and weaknesses, and bias risk. Although a major aim of the study includes improving ASD screening services for Spanish-speaking families in the U.S., studies were included from other countries to increase representation and broaden interpretation of current Spanish screeners. This review also addresses the clinical implications related to Spanish language on ASD screening and provides recommendations for use of these screeners in clinical settings. Finally, research on the steps of translating, adapting, and validating pre-existing ASD screeners for use in the heterogeneous Spanish-speaking population both in the U.S. and internationally is discussed.

Method

Search strategy

The search strategy was developed in collaboration with the lead author and a medical research librarian. The search strategy included the following databases: PubMed, Scopus, PsycINFO, and PsycArticles. The final search was completed on February 1st, 2023. The Cochrane Database of Systematic Reviews (Wiley) was explored for existing systematic reviews and protocols. Records were deduplicated using EndNote and then loaded into Covidence for further screening of duplicates. The review adhered to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines (Moher et al. 2009) and was registered before data extraction on the international prospective register of systematic reviews PROSPERO website (registration number CRD42021266041). Detailed search terms are included in the supplementary materials.

Inclusion and exclusion criteria

Inclusion criteria included (1) sample of participants diagnosed with ASD/being evaluated for ASD, (2) Latinx populations that included Spanish-speaking participants, (3) a reference to language included as a research aim or study outcome, and (5) a focus on the screening process or measures for ASD. Conference abstracts, dissertations/theses, and book chapters, were not considered due to them not being peer reviewed in nature. Systematic reviews and meta-analyses were excluded as well; however, these papers served as a tool to find any additional studies for the present review. In addition, papers were excluded if they used non-Spanish-speaking bilingual samples (i.e. German-English bilingual groups, etc.), or if they focused on Spanish cultural themes without discussing Spanish language. Based on the inclusion criteria, some studies were excluded if they focused on an ASD screening tool in Spanish but did not include examination of language in the study or if they instead focused on race, ethnicity, and culture without mention of language as a primary or secondary aim. Additionally, studies examining parent or provider perspectives of a Spanish screening tool were excluded if language was not included as an aspect of the evaluation.

Study selection

Literature search results were downloaded and organized in EndNote and exported to Covidence. The first and second authors each independently screened the titles and abstracts from the initial search results using inclusion criteria. Full-text screening was also conducted independently by the first two authors. Discrepancies were resolved through mediated discussion by the first author and second author.

Risk of bias and quality assessment

The risk of bias was assessed using standardized risk of bias assessment tools for qualitative and case control studies. The Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2; Whiting et al. 2011) was used to evaluate risk of bias and applicability concerns related to patient selection, index tests, reference standard, and flow and timing in cross-sectional studies evaluating ASD screening measures.

Data extraction and synthesis

The third and fourth authors extracted data independently from each eligible study using the Covidence extraction form developed by reviewers before formal data extraction (Appendix A). Discrepancies were resolved through mediated discussion by the first, third, and fourth author. For each article, data were identified for the following variables: screener, reliability and validity data, administration language, translation methodology, sample size, inclusion/exclusion criteria, study location, gender distribution, racial and ethnic distributions, diagnoses, age data, advantages of the screeners, limitations of the screeners, and clinical implications.

Data analysis

Analyses were conducted in three steps. First, interrater reliability was conducted and evaluated during the screening process between the first and second authors. Second, descriptive statistics, such as frequency data and measures of central tendency, were compiled to assess screener psychometrics and other clinically-relevant details. Third, a brief thematic review modeled from Braun and Clarke (2006) characterized the clinical implications of each study into specific themes. Given the brevity of the data included for the thematic analysis, all themes were established via discussion between the first and second author. The first two authors completed the first three steps of the brief thematic review: (1) review the main findings of each included paper a minimum of three times to improve understanding of the data, (2) identify preliminary themes that best characterized the findings, and (3) refined and finalize the themes that best represent the observed clinical implications. The final step was completed by the first author and involved pulling the main findings from each article, identifying specific examples that best represent elements of the themes, interpreting the themes in the results and discussion sections of this paper, and organizing the themes in a table.

All reported results including strengths and limitations are summaries of the information provided by the authors of each cited article. Therefore, when describing these components in the results section, the current authors are providing an overview of the information listed in each article. Elaboration on these summaries is provided in the discussion section of this article in which the authors of the current study provide reflection on the results.

Results

As shown in the PRISMA Flow Chart (Figure 1), the initial search yielded 132 results. Following deduplication, 112 unique articles remained. Study selection via title and abstract screening and full text review yielded 24 articles that met inclusion criteria. Of these 24 studies, only 22 were included in the final analysis based on data availability. Figure 1 details the process of screening records, retrieving reports, and eligibility determination according to PRISMA guidelines. Table 1 summarizes study characteristics for all 22 articles included in this review, described in more detail below. The results section is organized into three parts. First, we describe the characteristics of the reviewed studies. Second, we report on the results of the Spanish-translated screeners. In this section, we discuss psychometrics, strengths and limitations of the screeners, barriers and facilitators to implementation, and clinical implications. Within the strengths and limitations subsection, we further separate each type of screener to clearly highlight specific strengths and weaknesses attributable to the screener. Third, we document the QUADAS-2 bias and quality assessment across the papers.

Figure 1.

Figure 1.

PRISMA flow diagram.

From: Page et al. (2021).

Table 1.

Study sample characteristics.

Study Sample description Sample Size (N) Age Mean(SD) Gender (% Female) Race/Ethnicity (%) Country Inclusion and Exclusion Criteria
Albores-Gallo et al. (2012) Two groups of Mexican children and their caregivers: 1) typically developing and 2) ASD diagnoses 456 Children: 4.46 years
Mothers: 31.79 years
Fathers: 35.79 years
41.9 Not reported Mexico
  1. Children from both sexes between 18 to 72 months of age

  2. Presumptive diagnosis of ASD

  3. Children from the general community with the same age range than the clinical group

  4. Excluded if children had comorbid chronic conditions that would bias the MM-CHAT scores

Atwood et al. (2015) Peruvian children with documented developmental disabilities and challenging behaviors and their parent/caregiver
Diagnoses included: ASD, Down Syndrome, Cerebral Palsy, cognitive deficiencies, and genetic syndromes
71 dyads Children: 41 months
Parents: Not reported
36.6 Not reported Peru
  1. Some inclusion criteria cited from Mayo-Ortega et al. (2012)

  2. Participation in prior mass screening project

  3. Parental consent to participate in follow-up study

  4. No exclusion criteria reported

Bölte et al. (2018) Representatives from 21 European countries involved in COST-ESSEAa 16 language groupsb Not reported Not reported Not reported 21 European Countries c
  1. Country involvement in COST- ESSEA

  2. No exclusion criteria included

Canal-Bedia et al. (2011) Stage 1: Children aged 18–36 months living in Salamanca and Zamora and high-risk children ages 18–24
Stage 2: Children ages 18–36 months living in Madrid
Stage 1: 2480 children
Stage 2: 2055 children
Not reported Stage 1:
47.5
Stage 2:
46.2
Not reported Spain Stage 1:
  1. High risk children with developmental age 18–24 months, maximum chronological age 48 months, living in Salamanca and Zamora

  2. No exclusion criteria reported


Stage 2:
  1. Children ages 18–36 months, living and receiving medical care in Madrid

  2. No exclusion criteria reported

Coelho-Medeiros et al. (2019) Chilean children aged 16–30 months, recruited from the UC Christus Health network 120 children 22.47 months 35.8 Not reported Chile
  1. For the ASD group, children needed to present with high clinical suspicion of ASD after an evaluation by specialists

  2. Exclusion criteria included severe medical conditions or mental health concerns

Cuesta-Gómez et al. (2016) Toddlers living in Buenos Aires 420 children Not reported Parents:
77.6
Children:
44.8
Not reported Argentina
  1. Consumers of most visited medical centers

  2. Representative of CABA neighborhoods

  3. No exclusion criteria reported

DuBay et al. (2021a) Spanish-speaking primary caregivers 25 caregivers Parents:
33.2 years
Children: 11.8 months
Parents:
96.0
Children:
44.0
Hispanic
100
96.0 born in Latin America
1.0 born in US
United States
  1. No inclusion criteria reported

  2. No exclusion criteria reported

DuBay et al. (2021b) Community sample of caregivers who took the M-CHAT-R either in Spanish or in English
3724 records reviewed
Risk Analyses
n = 2591
Missingness Analyses
n = 1024
Risk Analyses
Parents: not reported
Children:
21.9 months
Missingness Analyses
Parents: not reported
Children:
22.1 months
Risk
Analyses
Parents: not reported
Children:
48.75
Missingness Analyses
Parents: not reported
Children:
49.55
74.3 (English-speaking)
25.7 (Spanish-speaking)
United States
  1. Records excluded if patients outside of the 16–30 month age range or did not contain age data

  2. Records excluded if not English or Spanish

  3. Records excluded if used the older M-CHAT-R

  4. Records excluded if a duplicate child

DuBay et al. (2022) US-based Latin American Spanish-speaking caregivers of children 8–16 months of age 380 Parents:
31 years
Children:
11.3 months
Parents:
93.6
Children:
48.3
Range of countries of origin, including almost all Latin American countries where Spanish is considered an official languaged United States
  1. Spanish-speaking families

  2. No exclusion criteria reported

Eugenin et al. (2015) Caregivers and their toddlers representative of medium-low and low-income residents of a major Chilean city 200 dyads Mother-SJ: 32 years
Mother-LP: 27 years
Father-SJ:
34 years
Father-LP:
29 years
Child-SJ: 19.9 months
Child-LP:
22.1 months
Parents:
Not reported
Children-SJ:
39.8
Children-LP:
47.6
Not reported Chile
  1. No inclusion criteria reported

  2. No exclusion criteria reported

Fombonne et al. (2012) Children with/without diagnosis of Pervasive Developmental Disorder 563 children 8(2.3) years Parents:
91.3
Children:
32.9
Not reported Mexico
  1. Diagnosis of PDD-NOS from CLIMA clinic (case group)

  2. Age- and gender-match to case (control group)

  3. No exclusion criteria included

Kimple et al. (2014) Toddlers seeking primary care Not reportede Not reported Not reported Not reported United States
  1. All children who presented for 18- or 24-month well-visits between July 2010 and July 2012

  2. No exclusion criteria

Kuhn et al. (2021) Low-income, racial/ethnic minority parents and their child who screened positive on the M-CHAT-R/F 309 parents and children Child age at screening
21 months
Parent age at screening
30 years
Children:
30.4
Parents:
Not reported
Child race/ethnicity not reported
Parents:
55.8 = Black, non-Hispanic
29.2 = Hispanic, any race
8.6 = Other, non-Hispanic
6.3 = White, non-Hispanic
United States
  1. Needed to screen positive on the M-CHAT-R/F

  2. No exclusions were based on language or comorbid conditions

Lugo-Marín et al. (2019) Spanish native adults N = 294
46 adults with ASD
41 ASD relatives
17 adults with SSDf
190 non clinical adults
36.77 years 51.3 All participants were born in Spanish territory Spain
  1. ASD group – needed a prior evaluation and ASD diagnosis; below average IQ was exclusion criteria

  2. No inclusion/exclusion criteria reported for ASD-Relatives group

  3. SSD group – needed a diagnosis of SSD and could not have acute psychotic symptoms, intellectual difficulties, or ASD

  4. Non-ASD group – not having received an ASD diagnosis, nor any other psychiatric disorder diagnosis prior to the evaluation

Magán-Maganto et al. (2020) Children in Salamanca, Zamora, and Valladolid, Spain presenting for Well Baby Check-ups 6625 children (months)
14–22 range: 18.2(.72)
23–36 range: 24.5(1.23)
14–22 rangeg:
47.31
23–36 range:
48.56
Not reported Spain
  1. Children born in study catchment areas between data collection dates and who attended routine 18- and 24-month well baby screenings

  2. Excluded if caregiver did not complete the FUI or evaluation due to communication

Miller et al. (2011) Toddlers with a 2006 date of birth, between the ages of 14 and 24 months at the start of the screening period and between 21 and 32 months of age at the end. Diagnoses include: ASD or non-ASD 796 children Not reported Not reported hBlack (3.2)
Pacific Islander (4)
Asian (4.2)
Native American (2)
White Non-Hispanic (55)
Other (0.2)
United States
  1. Child between 14 and 30 months of age seeking care from community pediatric practice during 6-month screening period

  2. No exclusion criteria included

Morales-Hidalgo et al. (2017a) Spanish sample of children from community and clinical settings between the ages of 4–12 years 1476 7.51 years 40.17 Autochthonous (86.32) Spain
  1. No inclusion criteria reported

  2. No exclusion criteria reported

Reyes et al. (2021) Mothers of school-aged children (Kindergarten through Grade 12) who spoke either English or Spanish
Children were either at high or low risk for internalizing, externalizing, or tic disorders. Diagnoses include: ASD, non-ASD
199 mothers
English: 104
Spanish: 95
Spanish Male:
10.6 years
Spanish Female: 10.52 years
English
Male:
10.66 years
English Female: 11.04 years
Spanish:
46.32
English:
43.27
Hispanic/Latino (58.3)
African American (11.7)
White Non-Hispanic (14.6)
Mixed (11)
Asian American (2)
Native American (1.5)
Other (.5)
United States
  1. A random sample of school-aged children enrolled in PLAY-MHi whose teachers completed a BASC-2 and whose caregiver agreed to a follow-up interview

  2. Excluded if follow-up interview was completed by someone other than biological mother

Rosenberg et al. (2018) Children between the ages of 30 and 68 months of age enrolled in the SEEDj Study, recruited via clinical/educational sources due to previous developmental concerns (CE) or birth certificates to represent the general population (BC)
Diagnoses included: ASD, DD, no ASD/DD
2,557 children
CE: 1653
BC: 904
(months)
CE Children: 55.8(7.2)
BC Children: 55.6(7.6)
Parents:
99
Children:
33.9
Maternal Race: Black (17.7)
White (71.2)
Other (1.2)
Maternal Ethnicity: Hispanic (11.5)
Non-Hispanic (88.5)
United States
  1. Born between September 1, 2003 and August 31, 2006 in a study catchment area

  2. Child resided with caregiver at least 18 years of age and who consistently cared for child since they were ≤ 6 months

  3. Caregiver able to speak English (or, in California and Colorado, English or Spanish)

Scarpa et al. (2013) Toddlers in Southwest Virginia presenting for well-child visits 447 toddlers Not reported Parents:
86.4
Children:
45.2
Hispanic (27.1)
Black (25.3)
White (25.1)
Mixed (8.9)
Asian (1.8)
Native American (0.7)
Not reported (11.2)
United States
  1. Caregiver and their child attending 18- or 24-month well visit

  2. Between ages 16 and 32 months

  3. Excluded if outside of age range and/or if primary language of caregiver was neither English nor Spanish

Windham et al. (2014) Primarily low-income Latinx families covered by government-funded insurance 1760 families Parents not reported Parent gender not reported
Children:
50.1
Not reported United States
  1. Child between 14 and 30 months of age

  2. Seeking care from community pediatric practice during 6-month screening period

  3. No exclusion criteria included

Zuckerman et al. (2013) Primary Care Pediatricians (PCPs) 267 PCPs Not reported Not reported Not reported United States
  1. Excluded if provided invalid contact information

a

COST-ESSEA: European Cooperation in Science and Technology; (www.cost.eu) action ‘Enhancing the Scientific Study of Early Autism’ (ESSEA; for details see www.cost-essea.com) participated. COST-ESSEA is a network of over 60 scientists from 23 European countries, including three LAMI countries (Romania, FYR Macedonia, and Turkey), aiming to improve function in early autism research.

b

These 16 language/cultural groups in practice typically use the same versions of adapted diagnostic tools: Austria/Germany/Switzerland (German), Belgium/The Netherlands (Dutch), Czech Republic, Finland, France (including information on French-Canadian adaptations of tools), Hungary, Iceland, Italy, Israel, Macedonia, Poland, Portugal (including information on Brazilian adaptations of tools), Spain, Sweden/Norway (Scandinavia), Romania, and UK/Ireland (English; even US versions of instruments included here).

c

21 European Countries.

d

Countries of origin include: Mexico (N = 80), United States (N = 64), El Salvador (N = 48), Honduras (N = 33), Guatemala (N = 28), Nicaragua (N = 4), Costa Rica (N = 2), Panama (N = 1), Puerto Rico (N = 20), Cuba (N = 13), Dominican Republic (N = 8), Colombia (N = 28), Venezuela (N = 11), Peru (N = 10), Bolivia (N = 8), Chile (N = 3), Argentina (N = 1), Ecuador (N = 1), and Paraguay (N = 1).

e

only analyzed screener results from medical record, 338 questionnaires from pre-transition and 251 from post transition to WHS (589).

f

Schizophrenia spectrum disorders.

g

these values are (n) and not (%).

h

only reported for 524 of the 796 children screened.

i

PLAY-MH: Project to Learn About Youth-Mental Health, a multi-site (Colorado, Florida, Ohio, South Carolina) study aiming to estimate the prevalence, treated prevalence, and co-occurrence of internalizing, externalizing, and tic disorders in school-aged youth (ages 5 to 18).

j

SEED: Study to Explore Early Development – Phase I, a multi-site (California, Colorado, Georgia, Maryland, North Carolina, Pennsylvania), case-control study exploring phenotypes and determinants of ASD.

Characteristics of reviewed studies

Of the 22 included studies, 11 were conducted in the United States (DuBay et al. 2021a, DuBay et al. 2021b, DuBay et al. 2022, Kimple et al. 2014, Kuhn et al. 2021, Miller et al. 2011, Reyes et al. 2021, Rosenberg et al. 2018, Scarpa et al. 2013, Windham et al. 2014, Zuckerman et al. 2013), and the other 11 were conducted in Latin American (Albores-Gallo et al. 2012, Atwood et al. 2015, Coelho-Medeiros et al. 2019, Cuesta-Gómez et al. 2016, Eugenin et al. 2015, Fombonne et al. 2012, Lugo-Marín et al. 2019, Morales-Hidalgo et al. 2017a) or European countries (Bölte et al. 2018, Canal-Bedia et al. 2011, Magán-Maganto et al. 2020). In total, six countries were represented across all included papers.

A total of 7 studies involved parents and their children (Albores-Gallo et al. 2012, Atwood et al. 2015, Eugenin et al. 2015, Kuhn et al. 2021, Reyes et al. 2021, Rosenberg et al. 2018, Windham et al. 2014). The remainder of the studies (n = 15) involved either only parents (n = 4), only children (n = 8), or only non-parent adults (n = 3; e.g. primary care pediatricians) and reported the demographics for each group, respectively. Of the 22 included studies, only 14 studies included participant data related to age (Albores-Gallo et al. 2012, Atwood et al. 2015, Coelho-Medeiros et al. 2019, DuBay et al. 2021a, DuBay et al. 2021b, DuBay et al. 2022, Eugenin et al. 2015, Fombonne et al. 2012, Kuhn et al. 2021, Lugo-Marín et al. 2019, Magán-Maganto et al. 2020, Morales-Hidalgo et al. 2017b, Reyes et al. 2021, Rosenberg et al. 2018). Of the studies that included age data, the reported age range for children was 11.3 months to 18 years), while the reported age range for adults was 26 years to 37 years. Specific details on participant racial and ethnicity data were frequently omitted; however, race and ethnicity was often broadly discussed when describing the study setting to suggest the majority of samples were Spanish-speaking and from Latin American countries of origin. Only eight studies provided specific breakdowns of race and ethnicity (DuBay et al. 2021a, DuBay et al. 2021b, Kuhn et al. 2021, Lugo-Marín et al. 2019, Miller et al. 2011, Morales-Hidalgo et al. 2017a, Reyes et al. 2021, Rosenberg et al. 2018). When gender was reported, most studies reported relatively even distributions (50% females and 50% males). Yet, four studies did not provide any gender details (Bölte et al. 2018, Kimple et al. 2014, Miller et al. 2011, Zuckerman et al. 2013).

A total of 14 studies did not identify diagnoses present before or after screening (Bölte et al. 2018, Canal-Bedia et al. 2011, Cuesta-Gómez et al. 2016, Dubay et al. 2021a, DuBay et al. 2021b, DuBay et al. 2022, Eugenin et al. 2015, Kimple et al. 2014, Magán-Maganto et al. 2020, Morales-Hidalgo et al. 2017a, Reyes et al. 2021, Scarpa et al. 2013, Windham et al. 2014, Zuckerman et al. 2013), and three studies did not clearly identify inclusion and exclusion criteria (Dubay et al. 2021a, Eugenin et al. 2015, Morales-Hidalgo et al. 2017a). Two studies did not explicitly evaluate a Spanish-translated screener, but rather analyzed the potential impact of demographic factors, including Spanish maternal language and appropriate application of interpreters, on screening results (Kuhn et al. 2021, Rosenberg et al. 2018).

Spanish-translated screener results

The Modified Checklist for Autism in Toddlers (M-CHAT) was the most frequently studied screener (n = 14; Albores-Gallo et al. 2012, Bölte et al. 2018, Canal-Bedia et al. 2011, Coelho-Medeiros et al. 2019, Cuesta-Gómez et al. 2016, Dubay et al. 2021b, Eugenin et al. 2015, Kimple et al. 2014, Kuhn et al. 2021, Magán-Maganto et al. 2020, Miller et al. 2011, Scarpa et al. 2013, Windham et al. 2014, Zuckerman et al. 2013), followed by the Ages and Stages Questionnaire (ASQ; n = 2; Windham et al. 2014, Zuckerman et al. 2013), First Year Inventory version 3.1 (FYIv.3.1; n = 2; DuBay et al. 2021a, DuBay et al. 2022), and the Social Communication Questionnaire (SCQ; n = 2; Reyes et al. 2021, Rosenberg et al. 2018).

Translation procedures differed across studies, with some studies utilizing native Spanish-speakers or fluent Spanish-speakers for translation or administration (n = 8; Atwood et al. 2015, DuBay et al. 2021a, DuBay et al. 2021b, Eugenin et al. 2015, Fombonne et al. 2012, Lugo-Marín et al. 2019, Morales-Hidalgo et al. 2017b, Windham et al. 2014) and others hiring formal interpreters or translators (n = 9; Canal-Bedia et al. 2011, Coelho-Medeiros et al. 2019, Cuesta-Gómez et al. 2016, DuBay et al. 2022, Eugenin et al. 2015, Kuhn et al. 2021, Magán-Maganto et al. 2020, Scarpa et al. 2013, Windham et al. 2014). Many studies applied or evaluated the forward-back translation process (n = 7; Canal-Bedia et al. 2011, Cuesta-Gómez et al. 2016, DuBay et al. 2021a, DuBay et al. 2021b, Fombonne et al. 2012, Magán-Maganto et al. 2020, Morales-Hidalgo et al. 2017b), some used an informal translation approach (n = 2; Albores-Gallo et al. 2012, Atwood et al. 2015), and one applied a translation with cultural adaptation (TCA) approach (Lugo-Marín et al. 2019). Additionally, many studies also used a cognitive interview or pilot interview stage in the translation process to help further clarify any confusion and interpretation challenges among caregivers (n = 7; Canal-Bedia et al. 2011, Cuesta-Gómez et al. 2016, DuBay et al. 2021a, DuBay et al. 2021b, DuBay et al. 2022, Eugenin et al. 2015, Magán-Maganto et al. 2020). Moreover, three studies did not describe a translation process (Bölte et al. 2018, Miller et al. 2011, Zuckerman et al. 2013) and five studies used already translated versions of screeners and did not provide explicit details on the original translation procedure (Kimple et al. 2014, Kuhn et al. 2021, Scarpa et al. 2013, Windham et al. 2014).

Of the 22 included studies, seven screeners were administered in written format (Albores-Gallo et al. 2012, Coelho-Medeiros et al. 2019, Fombonne et al. 2012, Kimple et al. 2014, Miller et al. 2011, Scarpa et al. 2013, Zuckerman et al. 2013), six were administered in oral format (Atwood et al. 2015, Canal-Bedia et al. 2011, Cuesta-Gómez et al. 2016, DuBay et al. 2021a, Reyes et al. 2021, Rosenberg et al. 2018), and 12 included a combination of written and oral administration (Albores-Gallo et al. 2012, Bölte et al. 2018, Canal-Bedia et al. 2011, Cuesta-Gómez et al. 2016, DuBay et al. 2021b, Eugenin et al. 2015, Kuhn et al. 2021, Lugo-Marín et al. 2019, Magán-Maganto et al. 2020, Miller et al. 2011, Morales-Hidalgo et al. 2017b, Windham et al. 2014). In summary, a variety of translation approaches and administration approaches were applied across the included studies.

Psychometric properties

Not all studies included reliability or validity statistics, and it was unclear in some studies whether a comparison screener was used to validate Spanish screeners. Cronbach’s alpha statistics were provided for 10 studies (Atwood et al. 2015, Coelho-Medeiros et al. 2019, Cuesta-Gómez et al. 2016, Fombonne et al. 2012, Lugo-Marín et al. 2019, Magán-Maganto et al. 2020, Morales-Hidalgo et al. 2017b, Reyes et al. 2021, Rosenberg et al. 2018, Scarpa et al. 2013). Factors related to education and income levels appeared to impact Cronbach’s alpha for Scarpa et al. (2013), such that Cronbach’s alpha was lower in the lower income sample. Further, Rosenberg et al. (2018) found that Cronbach’s alpha was lower among a sample recruited from the general population compared to a sample recruited from sources with previous developmental concerns. Additionally, test-retest statistics were obtained for three studies (Canal-Bedia et al. 2011, Lugo-Marín et al. 2019, Windham et al. 2014) and internal reliability or consistency was assessed for four studies (Albores-Gallo et al. 2012, Fombonne et al. 2012, Morales-Hidalgo et al. 2017a, Coelho-Medeiros et al. 2019).

Many studies used a range of validity statistics: concurrent (Coelho-Medeiros et al. 2019), discriminant (Albores-Gallo et al. 2012, Coelho-Medeiros et al. 2019, Fombonne et al. 2012), convergent (Albores-Gallo et al. 2012, Lugo-Marín et al. 2019, Morales-Hidalgo et al. 2017b), criterion (Albores-Gallo et al. 2012), and construct validity (Albores-Gallo et al. 2012). Additionally, sensitivity and specificity data were reported for seven studies (Canal-Bedia et al. 2011, Coelho-Medeiros et al. 2019, Lugo-Marín et al. 2019, Magán-Maganto et al. 2020, Morales-Hidalgo et al. 2017b, Rosenberg et al. 2018). Each study incorporated these statistics into their findings slightly differently. For example, Albores-Gallo et al. (2012) evaluated convergent validity with Spearman correlations, assessed discriminant validity with t-tests and chi-square analyses, and established criterion validity with kappa coefficients. Their findings indicated that the Mexican-M-CHAT showed strong discriminant validity between typically developing and autism populations and moderate internal consistency and convergent validity with the CBCL and ADI-R. Further, criterion validity for the Mexican-M-CHAT demonstrated a higher concordance with the nonverbal dimension on the ADI-R.

Some studies did not report on reliability and validity but instead reported on false positives, completion rates, missingness, and measurement variance (DuBay et al. 2021b, DuBay et al. 2022). Studies that reported on false positives, completion rates, and missingness often demonstrated significant differences in positive screenings between English-speaking and Spanish-speaking samples (DuBay et al. 2021b) as well as differences in translation approaches (DuBay et al. 2022). Measurement and item-specific variance findings also provided details on specific items that may be more or less challenging for Spanish-speaking families to comprehend (DuBay et al. 2021b, DuBay et al. 2022). Together, these psychometric statistics enhance the need to adequately translate and culturally adapt ASD screeners for Spanish-speaking families. See Table 2 for a more detailed breakdown of the psychometric properties for the relevant studies.

Table 2.

Measure/screener characteristics.

Study Study Design Name of Screener Translation Process Psychometric Statistics Risk of Bias/QUADAS-2
Albores-Gallo et al. (2012) Case control Mexican Modified Checklist for Autism in Toddlers (MM-CHAT)a
  1. Spanish translation of each item

  2. Adding minor cultural adjustments

  1. Internal consistency of MM-CHAT total (MM-CHAT-T) using Kuder-Richardson coefficient (KR) = .76

  2. Internal consistency of MM-CHAT for 6 critical items (MM-CHAT-6ci) using KR = .70

  3. Convergent validity using Spearman correlation between MM-CHAT-T, MM-CHAT-6ci, CBCL subscales, and the ADI-R varied and ranged from .18-.86.

  4. Discriminant validity: MM-CHAT-T was significantly higher for the ASD group compared to the TD group overall, in the younger age group, and in the older age group.

  5. Construct validity: Kappa coefficient (κ) was calculated with the following criteria: MM-CHAT 2/6 or greater. In the ASD group, κ between the MM-CHAT and the ADI-R dimensions was .17.

  1. Patient Selection Bias = Low

  2. Index Bias = Low

  3. Reference Standard Bias = Low

  4. Patient Flow Bias = Low

Atwood et al. (2015) Cross sectional study Communication Complexity Scale (CCS)
  1. Spanish translation

  2. Written translation

  3. Native Spanish speakers from Peru involved in translation

  1. Inter-rater reliability during measureb development (ICC = .98)

  2. 92% scores from

  3. independent raters fell within 1 point of each other

  4. Kappa values for each collection site = .44, .77, and .78.

  5. SPANISH TRANSLATION CCS correlates with CSBS (r = .685, p<.000)

  1. Patient Selection Bias = High

  2. Index Bias = Low

  3. Reference Standard Bias = Low

  4. Patient Flow Bias = High

Bölte et al. (2018) Cross sectional study CBCL, Q-CHAT/M-CHAT, SCQ, CDI, SRSc
  1. Translated versions from each respective country’s language

Not reported No Risk of Bias analysis
Canal-Bedia et al. (2011) Cross sectional study Modified Checklist for Autism in Toddlers (M-CHAT)
  1. Spanish translated version for use in Spain

  2. Written translation

  3. Three interpreters/translators included

  1. Sensitivity = 1

  2. Specificity = 0.98

  3. PPV = 0.19

  4. NPV = 1

  1. Patient Selection Bias = Low

  2. Index Bias = Low

  3. Reference Standard Bias = Low

  4. Patient Flow Bias = Low

Coelho-Medeiros et al. (2019) Case control study Modified Checklist for Autism in Toddlers – Revised (M-CHAT-R) Chilean Version; Modified Checklist for Autism in Toddlers – Revised with Follow-up (M-CHAT-R/F) Chilean Versiond Phase 1 of this study:e
  1. Cultural adaptation from the M-CHAT-R/F in its European Spanish version

  2. A content equivalence, semantic, conceptual, and technical comprehension analysis was completed in a sample from the UC Health network users

  1. Cronbach’s alpha = 0.889

  2. Concurrent validity comparing the results of the M-CHAT-R/F and ADOS-2 = r = 0.849, p = 0.0001
    1. Sensitivity = 100%
  3. Specificity = 83.3%

  • Discriminant validity estimated by comparing high-risk clinical diagnosis of ASD with the M-CHAT-R/F = 100%

  1. Patient Selection Bias = Medium

  2. Index Bias = Low

  3. Reference Standard Bias = Low

  4. Patient Flow Bias = Medium

Cuesta-Gómez et al. (2016) Cross sectional study M-CHAT
  1. Spanish translation

  2. Written translation

  1. Test-retest Kappa index = 1

  2. Cronbach’s alpha = 0.76

  3. Father-mother Kappa = 0.79

  1. Patient Selection Bias = Low

  2. Index Bias = Low

  3. Reference Standard Bias = Unclear

  4. Patient Flow Bias = Low

DuBay et al. (2021a) Qualitative research First year inventory (FYI) 3.1 (FYv.3.1)
  1. Spanish translation

  2. Clinicians spoke the language fluently and did not use interpretation services

  3. Written translation

  4. 3 clinicians included during forward translation, 2 during back translation

Not reported
  1. Patient Selection Bias = Low

  2. Index Bias = Low

  3. Reference Standard Bias = Low

  4. Patient Flow Bias = Low

Dubay et al. (2021b) Retrospective Medical Chart Review M-CHAT-R Spanish-Western Hemisphere Version (SWH) Translated specifically for Latin American Spanish-speaking populations
  1. 1 forward translator

  2. 1 back translator

  3. additional translators for same- language comparison and revisions


Translators recruited to represent multiple Spanish dialects
No reliability or validity statistics reported
Risk-Score Analyses
  1. Spanish-speakers (SS) had significantly higher total risk scores

  2. English-speakers (ES) had a significantly higher floor rate

  3. ES had a significantly lower initial screen-positive rate

  4. Six items remained significantly different in proportions of at-risk responses with SS more likely to endorse the at-risk response for 5 of the 6 items


Missingness Analyses
  1. SS left more items blank

  2. SS were more likely to contain missing data for 7 items

  1. Patient Selection Bias = Low

  2. Index Bias = Low

  3. Reference Standard Bias = Low

  4. Patient Flow Bias = Low

Dubay et al. (2022) Mixed Methods FYIv3.1e Forward-Back
  1. One professional translator translated the tool into Spanish

  2. Another translator translated the Spanish version back to English

  3. An expert in ASD then compared the English versions and items with notable differences were re-translated by the professional translator


Translation with Cultural Adaptation
Developed using multiple translators from a range of backgrounds
  1. Preliminary pre-testing

  2. Back translations

  3. Additional pre-testing

  4. Consensus meetings


Cognitive Interviews
  1. Participants were shown both translations (beginning with their randomization)

  2. Read items aloud, rephrased it, and explained their response

  3. Misinterpretations were clarified by interviewers

  4. Shown alternative translation – asked to identify differences between the versions and semantic impact of those differences

  5. Chose the version they found easiest to understand

Completion Rates
Those who responded to the TCA version were more likely to discontinue the survey prior to completion.
Missingness
Not significant different between the groups
Variability
More evenly distributed selection of response among the TCA group
Measurement Invariance
Non-invariance was identified at the first level of invariance testing suggesting that the item factor loadings were not equivalent between the two groups
No reliability or validity statistics reported
  1. Patient Selection Bias = Low

  2. Index Bias = Low

  3. Reference Standard Bias = Low

  4. Patient Flow Bias = Low

Eugenin et al. (2015) Quantitative and
cross sectional
study
M-CHAT
  1. Spanish translation

  2. Clinicians spoke the language fluently and did not use interpretation services

Not reported
  1. Patient Selection Bias = Low

  2. Index Bias = Low

  3. Reference Standard Bias = Low

  4. Patient Flow Bias = Low

Fombonne et al. (2012) Case control
study
Social Responsiveness Scale (SRS)
  1. Spanish translation

  2. Clinicians spoke the language fluently and did not use interpretation services

  1. Internal consistency: Parent SRS Cronbach’s alpha was 0.97, Teacher SRS Cronbach’s alpha was 0.97

  2. Interrater reliability: Moderate correlation between parent-teacher reports (r = 0.49)

  3. Discriminant validity: Parent report AUC =0.96, teacher report AUC = 0.96

  4. No difference in AUC for males/females or younger/older children

  1. Patient Selection Bias = Low

  2. Index Bias = Low

  3. Reference Standard Bias = Low

  4. Patient Flow Bias = Low

Kimple et al. (2014) Cross sectional study M-CHAT
  1. English version

  2. Spain Spanish Version (SS)

  3. Western Hemisphere Spanish Version (WHS)

  4. Written translation

Not reported
  1. Patient Selection Bias = Low

  2. Index Bias = Low

  3. Reference Standard Bias = Low

  4. Patient Flow Bias = Low

Kuhn et al. (2021) Randomized Control Trial M-CHAT-R/F
  1. Research staff systematically administered the M-CHAT-R/F by telephone to every family within 1 week of study referral

  2. Interpreters were employed as needed

  3. 3 staff who administered the M- CHAT-R/F used a phone interpreter for all non-English languages

  4. 1 conducted Spanish administrations using a previously translated versions

Not reported No Risk of Bias analysis
Lugo-Marín et al. (2019) Case control study Spanish Autism-Spectrum Quotient (AQ-Short)f
  1. Translated into Spanish after obtaining permission from the original author

  2. Translated completed by a native Spanish speaker

  3. Translator was blind to the construct that was intended to be measured

  4. Translator was told to translate with the consideration of semantic, linguistic, and cultural equivalence

  5. First translation was revised by the first author of the article to verify items corresponded to the items of the original version

  6. Bilingual psychologist then translated the Spanish version back into English

  7. Panel of 4 bilingual experts compared the original version and the translated version, discussed points of discrepancy and reached agreements for corrections

  1. Cronbach’s alpha = .79 to .88

  2. Test-retest analysis = .90 to .97

  3. Convergent validity = r = .734, significant

  4. Accuracy = 71.77%

  5. Sensitivity = .98

  6. Specificity = .84

  1. Patient Selection Bias = Low

  2. Index Bias = Low

  3. Reference Standard Bias = Low

  4. Patient Flow Bias = Low

Magán-Maganto et al. (2020) Cross sectional study M-CHAT-R/F (revised/follow-up)
  1. Spain Spanish Version (SS) version

  2. Written translation

  1. Cronbach’s alpha = 0.62 for 14 to 22 months

  2. Cronbach’s alpha = .72 for 23 to 36 months

  3. Sensitivity = 0.79

  4. Specificity = 0.99

  1. Patient Selection Bias = Low

  2. Index Bias = Low

  3. Reference Standard Bias = Low

  4. Patient Flow Bias = High

Miller et al. (2011) Cross sectional study M-CHAT and the Infant Toddler Checklist (ITC)
  1. English version

  2. Spain Spanish Version (SS)

  3. Written translation

Not reported
  1. Patient Selection Bias = Low

  2. Index Bias = Low

  3. Reference Standard Bias = High

  4. Patient Flow Bias = Low

Morales-Hidalgo et al. (2017b) Cross sectional study Spanish version of the Childhood Autism Spectrum Test (CAST)g; Full version = all items; Reduced version = 3 developmental items removedg
  1. Back-translation method
    1. First translated from English into Spanish and then back into English
  • Two native speakers served as translators

  • Some items adapted for culture

Full CAST
  1. Sensitivity = 83.9%

  2. Specificity = 92.5%

  3. Positive Predictive Value = 0.63

  4. Cronbach’s alpha = 0.826


Reduced CAST
  1. Sensitivity = 85.7%

  2. Specificity = 91.16%

  3. Positive Predictive Value = 0.61

  4. Cronbach’s alpha = 0.839


Convergent validity was high between the full and reduced versions and the ADI-R, ADOS-2, and CBCL scales.
  1. Patient Selection Bias = Low

  2. Index Bias = Low

  3. Reference Standard Bias = High

  4. Patient Flow Bias = Low

Reyes et al. (2021) Cross sectional study Social Communication Questionnaire (SCQ)
  1. English version

  2. Spanish version of SCQ

  1. Cronbach’s alpha = .70 for English group

  2. Cronbach’s alpha = .83 for Spanish group

  3. DIF analysis showed non-uniform responses on 5 items

  1. Patient Selection Bias = Low

  2. Index Bias = Low

  3. Reference Standard Bias = Unclear

  4. Patient Flow Bias = Low

Rosenberg et al. (2018) Case control study SCQ
  1. English Version

  2. Method of Spanish translation not reportedh

  1. Cronbach’s alpha = 0.89 for CEi group

  2. Cronbach’s alpha = 0.77 for BC group

  3. Sensitivity = .88 for CE group

  4. Specificity = .74 for CE group

  5. Sensitivity = .78 for BC group

  6. Specificity = .95 for BC group

  1. Patient Selection Bias = Low

  2. Index Bias = Low

  3. Reference Standard Bias = Unclear

  4. Patient Flow Bias = Low

Scarpa et al. (2013) Cross sectional study M-CHAT
  1. English version

  2. Western Hemisphere Spanish Version (WHS)

  3. Written translation

  4. 2 translators (back-translated)

Reliability by education groups:
  1. >12th grade education (alpha = .729)

  2. 12th grade/GED education (alpha = .320)

  3. <12th grade/no GED (alpha = .428)

Reliability by income groups:
  1. <$20,000/year (alpha = .585)

  2. >$20,000/year (alpha = .284)

  1. Patient Selection Bias = Low

  2. Index Bias = Low

  3. Reference Standard Bias = Low

  4. Patient Flow Bias = Low

Windham et al. (2014) Cross sectional study M-CHAT; ASQ
  1. Spanish translated M-CHAT for Mexican-Spanish speakers

  2. Spanish version of ASQ

  3. Clinicians spoke the language fluently and did not use interpretation services

Rescreened children:
  1. 40 % scored positive on the

  2. M-CHAT a second time

  3. Another 11 % were only ASQ-positive


Re-screening children who initially passed both screeners:
  1. Additional 95 (25%) scored positive on at least one measure

  1. Patient Selection Bias = Low

  2. Index Bias = Low

  3. Reference Standard Bias = Unclear

  4. Patient Flow Bias = Low

Zuckerman et al. (2013) Observational, cross-sectional study ASQ, PEDS, Denver, CDI, MCHAT, ASSQ, SCQj
  1. English version

  2. Spanish version

Not reported
  1. Patient Selection Bias = Low

  2. Index Bias = Unclear

  3. Reference Standard Bias = Unclear

  4. Patient Flow Bias = Low

a

Other measures evaluated in this study include: The Autism Diagnostic Interview-Revised (ADI-R) and the Child Behavior Checklist (CBCL).

b

The original paper evaluating the screener obtains reliability and validity statistics (Brady et al. 2012).

c

Q-CHAT: Quantitative Checklist for Autism in Toddlers. Other measures evaluated in this study include: ADOS, ADI-R, CBCL, Social Communication Questionnaire (SCQ), Children’s Depression Inventory (CDI), Development and Well-Being Assessment (DAWBA), Vineland Adaptive Behaviour Scales (VABS), Infant Behavior Questionnaire-Revised (IBQ-R), Early Childhood Behavior Questionnaire-Revised (ECBQ-R), Social Responsiveness Scale (SRS), Infant Toddler Sensory Profile (ITSP), Repetitive Behavior Scale-Revised (RBS-R), Mullen Scales of Early Learning (MSEL).

d

Other measures evaluated in this study include: Autism Diagnostic Observation Schedule (ADOS-2).

e

Phase 1 of this study includes details on the translation process for this screener. Refer to those papers for additional details (Coelho-Medeiros et al. 2019, Robins et al. 2017).

f

Refer to Dubay et al. (2021a) for a detailed description of the translation process.

g

Other measures include a WISC-IV, ADI-R, CBCL, ADOS-2.

h

Families for whom Spanish was the primary language only comprised of 3.8% of the total sample. Further, only the California and Colorado sites were able to recruit families who primarily spoke Spanish. Therefore, the authors do not describe how the materials given to these 3.8% of participants were translated.

i

The majority of SCQ’s given in this sample were in English (<4% were given in Spanish). These are not sensitivity/specificity metrics for the Spanish-translated SCQ.

j

ASQ, Parents’ Evaluation of Developmental Status (PEDS), Denver Developmental Screening Test Autism Spectrum Screening Questionnaire (ASSQ).

Strengths and limitations

Among studies evaluating Spanish-translated screeners (n = 16; Albores-Gallo et al. 2012, Atwood et al. 2015, Canal-Bedia et al. 2011, Cuesta-Gómez et al. 2016, Coelho-Medeiros et al. 2019, DuBay et al. 2021a, DuBay et al. 2021b, DuBay et al. 2022, Fombonne et al. 2012, Kimple et al. 2014, Kuhn et al. 2021, Lugo-Marín et al. 2019, Magán-Maganto et al. 2020, Morales-Hidalgo et al. 2017b, Reyes et al. 2021, Scarpa et al. 2013), strengths and limitations were characterized for the following screeners: Spanish version of the Childhood Autism Spectrum Test (CAST), Communication Complexity Scale (CCS), First Year Inventory (FYI) 3.1, M-CHAT (Spain Spanish Version [SS], Western Hemisphere Spanish Version [WH], Argentinian Version, Mexican Version), M-CHAT-Revised (M-CHAT-R) or M-CHAT-R with Follow-Up [M-CHAT-R/F]), Social Responsiveness Scale (SRS), and Social Communication Questionnaire (SCQ). See Table 3 for a more detailed analysis of each study’s main findings categorized by theme.

Table 3.

Main findings, clinical implications, and themes addressed by included studies.

Study Main findings Themes
Albores-Gallo et al. (2012)
  1. The MM-CHAT demonstrated overall moderate to strong psychometrics.

  2. In concordance with other international studies, some parents did not understand all of the MM-CHAT questions and additional cultural adaptations were needed to assist parents in understanding items on the M-CHAT.

  3. Parenting and social behavior styles could be influencing these cultural biases.

Screener evaluation
Cultural awareness
Translation barriers
Atwood et al. (2015)
  1. The CCS may represent a new tool to assess pre-symbolic communication in Spanish speaking children with intellectual disability for researchers outside of the United States.

Intervention
Screener evaluation
Bölte et al. (2018)
  1. The limited access to evidence-based diagnostic and screening tools among middle-income European countries and/or countries with small populations/rarely spoken languages may lower the rates of ASD awareness, identification, and screening/diagnosis for these individuals.

  2. Inequalities persist regarding access to translated and well-validated screeners and diagnostic tools for autism across Europe (especially middle and low income countries).

  3. Several Spanish translated measures exist; few have been evaluated for reliability and validity or culturally adapted for Spain/Spanish-speakers.

Knowledge/awareness
Barriers to care
Screener evaluation
Translation barriers
Cultural awareness
Canal-Bedia et al. (2011)
  1. Specifically when using Spain M-CHAT, the false-positive rate may be lowered if a positive screen was deemed following 5 failed items.

  2. The M-CHAT can be integrated into primary care settings during a critical period for ASD diagnosis and evaluation.

Screener evaluation
Intervention
Coelho-Medeiros et al. (2019)
  1. The M-CHAT-RF Chilean Spanish version demonstrated adequate internal consistency, strong discriminant validity, and high levels of sensitivity and specificity with the ADOS-2.

  2. The work in this study has led to the implementation of training for health professionals in primary care to implement this screener in the Well-Child Care Program for 18-month-old children at risk for ASD.

Screener evaluation
Intervention
Knowledge/awareness
Cuesta-Gómez et al. (2016)
  1. The Argentinian version of the M-CHAT facilitates earlier ASD detection among children in Buenos Aires.

  2. May provide an easily accessible and adaptable ASD screener for pediatricians in Argentina.

  3. Some language and comprehension concerns may need to be clarified for mothers not from Argentina.

Screener evaluation
Intervention
Knowledge/awareness
Translation barriers
Cultural awareness
DuBay et al. (2021a)
  1. Rigorous translation methods are essential when translating screeners; translation and comprehension errors can present during various stages of translation.

  2. The First Year Inventory (FYI) 3.1 may serve as a screener for ASD in children 8 to 16 months with Spanish-speaking caregivers living in the United States, following careful revision.

Screener evaluation
Translation barriers
Intervention
DuBay et al. (2021b)
  1. The M-CHAT-R SWH translation does not perform the same way as the original English M-CHAT-R.

  2. Authors suggest that the differences are a reflection of the potential cultural factors not addressed adequately in the traditional translation procedures.

Screener evaluation
Translation barriers
Cultural awareness
DuBay et al. (2022)
  1. Qualitative interviews with participants demonstrated that participants chose the translated term in the TCA version 67% of the time and the term used in the FB version 33% of the time.

  2. Word order also made a difference and led to a preference of the TCA version 67% of the time.

  3. Participants reported a preference of items to be short, simple, and direct.

  4. Some noted a need for additional details to help understand the question.

Screener evaluation
Translation barriers
Cultural awareness
Eugenin et al. (2015)
  1. Caregivers with lower education levels and/or from non-Western cultures may benefit from revised versions of the M-CHAT.

  2. Suggestions such as pairing M-CHAT items with pictograms may increase understanding and comprehension of behaviors.

  3. Latinx cultures may interpret implications of social communication deficits differently (e.g. social challenges may only apply to strangers rather than mothers).

  4. The importance of family among Latinx patients must not be underestimated by providers.

  5. The new version of the M-CHAT-R – modifies the two particularly problematic questions on this M-CHAT – and/or screening only at-risk children may increase accessibility and feasibility.

  6. Prior to screening, it is necessary to consider whether there are available services to accommodate and treat ASD should it be identified – especially whether services are available in the patient’s primary language.

Screener evaluation
Barriers to care
Knowledge/awareness
Translation barriers
Cultural awareness
Intervention
Fombonne et al. (2012)
  1. SRS is valid and reliable for screening in Spanish-speaking populations.

  2. Parent- and teacher-report is not always necessary to administer given that there is limited additional benefit noted for the teacher report. Consider using parent-only if the circumstances suggest difficulty obtaining a teacher-report.

Screener evaluation
Intervention
Barriers to care
Kimple et al. (2014)
  1. This screening tool may not be as effective or culturally appropriate across ethnic groups.

  2. The M-CHAT-SS is abnormal more often than the M-CHAT in English, despite lower national prevalence rate of autism in Latinx populations.

  3. The M-CHAT-SS and M-CHAT-WH demonstrated similar patterns of discrepancies.

Screener evaluation
Translation barriers
Cultural awareness
Kuhn et al. (2021)
  1. Hispanic families were significantly less likely to receive both an evaluation and a diagnosis.

  2. Children screened directly in the parent’s preferred language (no interpreter) were more likely to receive an ASD diagnosis.

  3. These findings support previous research indicating problems with M-CHAT-R/F accuracy among minority populations.

  4. Potential issues with the telephonic language interpretation process for parent checklist screening approaches – greater reliance on word-for-word translation which leads to increased confusion and misunderstanding, resulting in poorer positive predictive power of the screening process.

  5. Authors emphasize the importance of hiring and training native speakers to directly administer screening assessments in families’ preferred languages.

Translation barriers
Cultural awareness
Barriers to care
Lugo-Marín et al. (2019)
  1. The AQ-Short is a reliable instrument for the screening of ASD in Spanish native-speaker adults.

  2. It shows very good internal structure and a good convergence with diagnostic ASD gold-standard measures.

  3. It clearly differentiates ASD from other specific populations (SSD, ASD-relatives), and is thus a useful instrument in the screening stage.

  4. Clinical groups (ASD, SSD) are less coherent in their responses, suggesting the need to identify the possible difficulties of these groups when understanding the items.

Screener evaluation
Cultural awareness
Magán-Maganto et al. (2020)
  1. The M-CHAT-R/F is an effective tool for screening young children in the general Spanish-speaking population.

  2. In order to improve early detection, it is necessary to train pediatricians to identify early signs of ASD and provide them with a standardized tool to screen further.

Screener evaluation
Intervention
Knowledge/awareness
Miller et al. (2011)
  1. Partnerships between pediatric providers and ASD providers are important for making progress in developing and adapting formal screening procedures and facilitating earlier and more accurate identification.

  2. Inclusion of both English and Spanish screeners is feasible in a pediatric primary care setting as long as there is relevant expertise involvement. This integrated approach may successfully identify more children at risk than traditional clinic observation or reported parent concern.

  3. Screening children for ASD at sick visits rather than only well-child visits may further improve early identification.

Intervention
Knowledge/awareness
Cultural awareness
Barriers to care
Morales-Hidalgo et al. (2017a)
  1. The CAST was demonstrated to be a reliable and valid test for screening for ASD among Spanish school-aged children.

  2. Boys showed higher scores than girls on the CAST. The effect size was medium.

  3. Questions on general development did not appear to make the screener more readily understandable or improve the psychometric properties.

  4. The authors propose a reduced Spanish version of the CAST.

Screener evaluation
Cultural awareness
Reyes et al. (2021)
  1. No differences in SCQ scores were found between the English and Spanish versions in a nonclinical sample.

  2. Spanish-speaking mothers with less educational attainment reported more ASD symptoms compared to Spanish-speaking mothers with more educational attainment.

  3. Spanish-speaking mothers exhibited different response patterns on 5 SCQ items.

  4. Among the English-speaking sample only, teacher BASC-2 scores predicted higher SCQ scores.

  5. Among the Spanish-speaking sample only, decreased maternal education and teacher BASC-2 scores predicted higher SCQ scores.

Screener evaluation
Translation barriers
Rosenberg et al. (2018)
  1. Spanish maternal language did not have a significant impact on SCQ score.

  2. Male child sex, lower household income, Black and Other maternal race, and lower maternal education were associated with higher SCQ scores among both children with pre-identified developmental concerns (CE group) and the general population (BC group).

  3. Using a cut-point of 11, the SCQ may be more sensitive in detecting ASD among young children with pre-identified developmental concerns, but more specific in rejecting ASD for children in the general population.

Cultural awareness
Knowledge/awareness
Scarpa et al. (2013)
  1. Modifications of the Spanish M-CHAT may be necessary to improve internal consistency and accuracy across ethnic and educational groups in low SES rural areas.

  2. Alternative administrations may be considered for low SES parents (e.g. read aloud, items explicitly reviewed for comprehension, paired with pictures).

  3. The Spanish M-CHAT is not recommended until further research better evaluates its properties in the United States.

Screener evaluation
Translation barriers
Cultural awareness
Intervention
Windham et al. (2014)
  1. Additional resources, beyond screeners, will be required to successfully assess children at-risk for ASD from underserved backgrounds.

  2. Similar to Kimple et al. (2014), a higher screen positive rate was detected in predominantly Latinx groups compared to non-Latinx groups despite lower prevalence rates of ASD.

Intervention
Cultural awareness
Zuckerman et al. (2013)
  1. Provider-related factors – such as bias, limited Spanish language resources, or limited exposure – may contribute to some diagnostic disparities among Latinx and Spanish-speaking children.

  2. Primary care providers identified the following barriers to ASD care for Latinx children: limited access to specialists, language and cultural differences between providers and patients, and limited access to care.

  3. Two suggestions are made to address these barriers: increase training for working with this population and increase connections between developmental/ASD specialists and primary care pediatricians.

Barriers to care
Translation barriers
Cultural awareness
Intervention
Spanish version of the Childhood Autism Spectrum test (CAST)

The CAST demonstrated to be a reliable and valid test for ASD screening in Spanish school-aged children. In particular, the screener showed high values of sensitivity and specificity that align closely with previous research (Scott et al. 2002). Notably, both a full and reduced version of the CAST was evaluated by Morales-Hidalgo et al. (2017b). The authors report that the reduced version possesses many strengths, including better psychometric properties and higher correlations with the ADI-R, ADOS-2 and CBCL specific scales. All limitations of this screener pertain to the full version, which identified a few items that showed low discriminative poor between children with ASD and without ASD. Similarly, the general development questions included in the screener are considered a limitation due to evidence demonstrating that these items did not enhance comprehension or improve the screener’s psychometric properties. Therefore, the authors recommend the reduced version as a solution to the limitations evident in the full version. Ultimately, findings support application of the adapted full Spanish version and the reduced version of the CAST for screening for milder or subtler manifestations of ASD in a Spanish population.

Communication Complexity Scale (CCS)

Strong, positive correlations with the Communication and Symbolic Behavior Scales Developmental Profile Infant-Toddler Checklist (CSBS-ITC) and limited reliance on parent report were considered strengths of the Spanish-translated CCS (Atwood et al. 2015). Some limitations include that item comprehension appeared to depend on the syntax or pronunciation used by the administrator, and that translation may reflect cultural/linguistic differences across cities and regions rather than generalize to Peru (the origin country). Further, the validation sample was not balanced between gender, and the measure has only been validated in Peruvian children, thereby limiting generalizability. Overall, findings support the CCS as a valid screening measure of communication behaviors among native Spanish-speaking children in Peru.

First years Inventory (FYI) v. 3.1

A few studies evaluated the FYI v. 3.1. Across all studies, the Spanish-translated FYI’s strengths included a rigorous translation and cultural adaptation process. Limitations for one study included a lack of representation for some Latin American Spanish dialects or cultures among both translators and participants as well as a disparity in administration of the screener, such that it was delivered as an interview rather than independent parent-report (DuBay et al. 2021a). Alternatively, in a follow-up paper (DuBay et al. 2022), representation was achieved across a wide range of countries of origin despite the study taking place in the United States. Further, DuBay et al. (2022) successfully garnered strengths and limitations reported by caregivers completing the surveys and discerned important nuances in item recognition, length of item descriptions, and language preferences (e.g. ‘hijo/hija’ instead of ‘hijo’ only to represent a child). The translated measure is praised for its rigorous translation and adaptation approach, and although not a limitation of the screener, it is acknowledged that this level of rigor is challenging, time consuming, and resource intensive to develop. These barriers increase risk of error, which could unintentionally impact the reliability of the screeners.

Modified Checklist for Autism in Toddlers (M-CHAT)

Many studies included the M-CHAT. We first discuss the studies that examined its dialectic variations, then we separately review the study that evaluated the Argentinian M-CHAT as unique strengths and limitations emerged from that research, and finally we discuss a revised version of the M-CHAT.

When considering strengths and limitations of the M-CHAT, it is necessary to review and compare the studies that examined its dialectic variations. Five studies (Albores-Gallo et al. 2012, Canal-Bedia et al. 2011, Eugenin et al. 2015, Kimple et al. 2014, Magán-Maganto et al. 2020) evaluated the M-CHAT in this manner. The following strengths were described: high values of sensitivity and specificity that were similar to the original M-CHAT (Coelho-Medeiros et al. 2019, Magán-Maganto et al. 2020), flexibility with phone-administration that may increase access to screening (Canal-Bedia et al. 2011), minimal evidence of differences between the M-CHAT SS and the M-CHAT WH when used with Mexican/Central American caregivers in the United States (Kimple et al. 2014), simplicity and ease of answering across SES levels (Magán-Maganto et al. 2020), incorporation of implicit references to developmental milestones for teaching purposes (Eugenin et al. 2015), and inclusion of questions that resemble and require reflection of daily activities familiar to Spanish-speaking parents (Eugenin et al. 2015). Limitations included difficulties comprehending questions (Albores-Gallo et al. 2012, Eugenin et al. 2015) – especially regarding repetitive or unusual behaviors (Eugenin et al. 2015), increased time and financial resources for follow-up (Canal-Bedia et al. 2011, Magán-Maganto et al. 2020), lower positive predictive value than the original M-CHAT (Canal-Bedia et al. 2011), potential barriers to in-person screening (Canal-Bedia et al. 2011), and difficulty with blinding and preventing caregiver bias when evaluating the efficacy of the screeners (Coelho-Medeiros et al. 2019).

Second, the study reporting on the development of the Argentinian M-CHAT identified the following strengths: robust translation and methodology to support content, semantic, and technical equivalence, high test-retest reliability and item-response agreement when comparing mothers and fathers, efficiency as a screening tool for pediatricians, and inclusive, elementary-level reading for parents with lower literacy (Cuesta-Gómez et al. 2016). These strengths emphasize the utility of this translated screener when considering its ease of use for providers and reporters, and its similarities to the original screener. However, this study was conducted in Buenos Aires, which may limit generalizability to a specific subset of the Argentinian population. An additional limitation includes that a pediatrician must discern the screener’s administration method based on the parent’s literacy level (i.e. self-administered or administered with assistance). Further, the locational specificity of the screener comes at the cost of reduced clarity and relatability for users not originally from or currently living in Argentina. These factors limit widespread utility given the practical challenges and more limited scope of the translation.

Last, the M-CHAT-R and M-CHAT-R/F are revised and updated Spanish versions of the previous M-CHAT (Coelho-Medeiros et al. 2019, DuBay et al. 2021b, Magán-Maganto et al. 2020). Various versions of the M-CHAT-R and M-CHAT-R/F exist with multiple different Spanish translations. Strengths of the M-CHAT-R/F included accurate detection of ASD, reduction of follow up appointments, and adequate sensitivity and specificity scores. Limitations highlight that psychometric properties may be especially dependent on the age of the child and their education level. More specifically, more accurate results were noted for younger children and for parents with higher education levels. These limitations are important when considering AAP recommends formal ASD screening at ages 18 and 24 months (Hyman et al. 2020). Additionally, internal consistency may be on the lower end of acceptability. Further, concerns remain regarding how the translated versions compare to the English versions: DuBay et al. (2021b) demonstrated that the M-CHAT-R Spanish Western Hemisphere translation does not perform the same way as the original English M-CHAT-R and instead is more sensitive to the ASD diagnosis and positive screening of ASD symptoms in Spanish-speaking populations in the United States.

Social Responsiveness Scale (SRS). The Spanish-translated SRS demonstrated strengths in both its teacher and parent forms (Fombonne et al. 2012). For both forms, excellent discriminant validity was documented when discerning children diagnosed with Pervasive Developmental Disorder (PDD) from typically developing children that did not receive a PDD or ASD-related diagnosis. This is a key finding for increased detectability of Spanish-speaking children showing signs of ASD, which can reduce diagnostic gaps and improve access to early intervention. Interrater reliability statistics were also strong, evidenced by the moderately high correlations observed between parent and teacher ratings. Noted limitations include lengthy administration time, limited age range for validation (ages 4–13), lack of data determining concurrent validity with other screening measures, and lack of discriminant validity data differentiating ASD from other clinical presentations among Spanish-speaking populations. These limitations indicate that use of this screener may impede early identification of ASD (i.e. before age 4) among Spanish-speaking children. Additionally, the Spanish-translated SRS was studied in 2012 before the changes for an autism diagnosis were released in the DSM. Therefore, the Spanish SRS may be suited as a general screener for potential autism-specific behaviors until restudied utilizing the current DSM-5 criteria.

Social Communication Questionnaire (SCQ). Strengths of the Spanish-translated SCQ include its ease and low cost of implementation, as well as its ability for use across a wide age range. This may be especially relevant when screening for ASD symptoms among school-aged children who may have immigrated to the United States past the age when developmental surveillance measures intended for younger children may have flagged them. A limitation of this measure is its lack of comparison norms for Spanish-speaking populations. Reyes et al. (2021) noted other limitations of the Spanish-translated SCQ when used in a nonclinical sample. Translated items may not reflect the intended meaning of the original measure, resulting in the potential misinterpretation of items reduced reliability and validity. Moreover, SCQ results may reflect co-occurring behavior and emotional problems masked as ASD symptoms rather than ASD symptoms themselves, further reducing its potential validity.

Barriers and facilitators to implementation and interpretation

Additional studies focused on assessing successes and barriers to implementation of screeners (n = 6; Bölte et al. 2018, Kimple et al. 2014, Reyes et al. 2021, Scarpa et al. 2013, Windham et al. 2014, Zuckerman et al. 2013). The findings from these studies highlight important differences in screener performance between low- and higher-income or low- and high-education Spanish-speaking families (Bölte et al. 2018, Reyes et al. 2021, Scarpa et al. 2013, Windham et al. 2014, Zuckerman et al. 2013). Screeners were less effective when presented to Spanish-speaking families with lower income levels and lower education histories. Reyes et al. (2021) found that Spanish-speaking mothers with less education attainment reported more ASD symptoms compared to Spanish-speaking mothers with more education attainment. Additionally, income level affects screener accessibility (Bölte et al. 2018, Windham et al. 2014, Zuckerman et al. 2013). For example, middle-income European countries are at the highest disadvantage for availability of translated measures compared to high- or low-income European countries (Bölte et al. 2018). Further, Spanish-speaking families in low-income communities are underserviced due to the high costs of translated screening tools in predominantly English-speaking communities or countries like the U.S. (Zuckerman et al. 2013).

When examining differences in education level, Spanish-speaking respondents with fewer years of completed education reported more ASD symptoms among their children compared to Spanish-speaking respondents with higher educational attainment, despite there being no differences in overall symptom reporting between English- and Spanish-speaking respondents (Reyes et al. 2021). Additionally, respondents with lower education attainment demonstrated more difficulty with screeners due to literacy and comprehension challenges (Scarpa et al. 2013). Further, slight differences were noted between Spanish-translated screeners of different Latinx dialects (Kimple et al. 2014), demonstrating the heterogeneity of Latinx culture and Spanish language and the complexity of translated screening tools. In particular, Kimple et al. (2014) measured response differences when comparing the M-CHAT SS and M-CHAT WH, which indicated distinctions in cultural interpretation rather than translation differences among the Spanish-speaking population. Although there were limited differences in abnormal screening rates for the M-CHAT SS and M-CHAT WH, both measures produced higher abnormal screening rates for Spanish-speakers compared to English-speakers using the original M-CHAT. This finding further highlights the importance of proper Spanish translation and interpretation in ASD screening. See Table 3 for a more detailed breakdown of each included study’s main findings.

Clinical implication themes

All studies offered clinical implications relevant to the study question. In this section, the current authors summarize the clinical implications provided by the authors of the cited articles. In order to provide broader conceptualizations of the implications of utilizing Spanish screeners for ASD, the authors of the current study read and grouped implications listed in each article into themes using a brief thematic analysis protocol (Braun and Clarke 2006): barriers to care (n = 6), cultural awareness (n = 15), translation barriers (n = 12), intervention (n = 12), knowledge/awareness (n = 7), and screener evaluation (n = 16). Regarding barriers to care, authors discussed issues related to limited access to primary or specialty care providers for Spanish-speaking individuals, especially families from low-to-middle income countries or rural communities (Bölte et al. 2018, Scarpa et al. 2013, Zuckerman et al. 2013). Additional barriers to care included price and opportunity cost related to time for follow-up visits and interviews for some screeners (Eugenin et al. 2015, Fombonne et al. 2012, Miller et al. 2011), and availability of post-screening services for Spanish-speaking families (Eugenin et al. 2015, Zuckerman et al. 2013).

Themes of cultural awareness and translation barriers centered around challenges associated with comprehending differences between cultural groups within the Latinx community (Albores-Gallo et al. 2012, Cuesta-Gómez et al. 2016, Eugenin et al. 2015, Reyes et al. 2021, Scarpa et al. 2013, Zuckerman et al. 2013). For example, generalizability concerns were noted for the Argentinian M-CHAT, as non-Argentinian mothers might need assistance understanding certain items (Cuesta-Gómez et al. 2016). Additionally, for the Mexican M-CHAT, it was noted that the translation process involved making small cultural adjustments to the screener, such as describing the ‘peek-a-boo’ game, since Mexican mothers do not know the explicit name for it. Further, issues with patient/provider interactions, such as miscommunication between patients and providers due to language barriers, highlighted the importance of multicultural training and teaching for providers working with Spanish-speaking populations (Zuckerman et al. 2013). The intervention theme consisted of recommendations for screener implementation outside of the U.S. (Atwood et al. 2015), screener integration into primary care clinics (Miller et al. 2011, Zuckerman et al. 2013), and adaptation of validated screeners for appropriate age and cultural groups (DuBay et al. 2021a, Scarpa et al. 2013, Windham et al. 2014). The importance of incorporating Latinx families in the screening and diagnostic process was also considered (Eugenin et al. 2015).

Within the knowledge/awareness theme, a general lack of ASD knowledge and awareness was noted in Spanish-speaking or Latinx populations, thus emphasizing the importance of developing these screeners as tools for improved parental education and awareness for early detection. Last, the screener evaluation theme captured interpretations of the screener included in the study (e.g. Kimple et al. 2014) and suggestions for future more rigorous evaluation of future screeners (e.g. Scarpa et al. 2013). See Table 3 for a more detailed breakdown of each study’s main findings categorized by theme.

QUADAS-2 bias and quality assessment

Mixed results emerged regarding risk of bias and quality assessment. Most studies included were cross sectional study designs (n = 12; Atwood et al. 2015, Bölte et al. 2018, Canal-Bedia et al. 2011, Cuesta-Gómez et al. 2016, Fombonne et al. 2012, Kimple et al. 2014, Magán-Maganto et al. 2020, Miller et al. 2011, Morales-Hidalgo et al. 2017a, Reyes et al. 2021, Scarpa et al. 2013, Windham et al. 2014, Zuckerman et al. 2013), some were case control designs (n = 5; Albores-Gallo et al. 2012, Coelho-Medeiros et al. 2019, Fombonne et al. 2012, Lugo-Marín et al. 2019, Rosenberg et al. 2018), one was a qualitative design (DuBay et al. 2021b), and one was a mixed methodology design (DuBay et al. 2022). Risk of bias was considered relatively low, but a few studies had higher risk of bias due to lack of clarity regarding selection bias, detection bias, and reporting bias (Table 2). No study was excluded or considered high risk due to the risk of bias analysis. An adapted version of the QUADAS-2 evaluated bias risk for the 16 screener evaluation studies (Albores-Gallo et al. 2012, Atwood et al. 2015, Canal-Bedia et al. 2011, Cuesta-Gómez et al. 2016, Coelho-Medeiros et al. 2019, DuBay et al. 2021a, DuBay et al. 2021b, DuBay et al. 2022, Fombonne et al. 2012, Kimple et al. 2014, Kuhn et al. 2021, Lugo-Marín et al. 2019, Magán-Maganto et al. 2020, Morales-Hidalgo et al. 2017b, Reyes et al. 2021, Scarpa et al. 2013). Quality assessment results paralleled risk bias outcomes and demonstrated similar, moderate concern for bias. Concerns typically noted lack of information on participant flow, diagnostic knowledge and screening process, index text psychometrics, and identification of a comparison test (Table 2).

Discussion

This review of 22 studies serves to synthesize known information on the influence of Spanish language on ASD screening. Primarily Spanish-speaking individuals experience unique barriers in detection and screening for ASD. The studies included in the present review provide evidence to support the use of ASD screeners in Spanish but demonstrate some limitations, including challenges of generalizability due to variability in Spanish dialects across populations and insufficient psychometrics for some screeners. The included studies also provide insight on the implementation of Spanish screeners in healthcare settings and emphasize the relevance of client characteristics in validation of screeners (e.g. income level and education history), importance of multicultural training for providers working with Spanish-speaking clients, and the general lack of ASD knowledge and awareness noted in Spanish-speaking or Latinx populations.

However, there were limitations that must be addressed. Most evaluative studies did not report standardized translation processes, reliability and validity statistics, or factors related to accessibility and receipt of services outside of linguistic barriers. This missing information complicates replication, adoption, and dissemination of the screeners. Future research should gather the appropriate psychometric data, replicate the findings, and examine the feasibility of dissemination in settings that foster access to care. A second limitation refers to the heterogeneity of Latinx culture, which suggests that validation techniques utilized may not apply to all Latinx populations and limits generalizability. Future research must take this heterogeneity into account. With additional studies examining more representative samples, the procedures may gain the necessary empirical support for widespread use. Additionally, results regarding risk of bias and quality of assessment were mixed. Future research would benefit from the application of more rigorous research designs, like Randomized Controlled Trials and diagnostic accuracy studies, to address concerns related to bias, psychometric interpretations, and generalizability. Moreover, rigorous research designs will not have the intended impact if best-practice methods are not used to ensure cultural and linguistic relevance. The translation, adaptation, and validation (TAV) method is recommended as it differs from common translation and validation techniques (i.e. forward-back translation) and includes adaptation (DuBay et al. 2021). Without adaptation, direct translation of materials normed in English-speaking cultures will not be relevant to the heterogeneous Spanish-speaking population. A final limitation refers more to the methodological approach of the review. Although it is common to exclude grey literature (Paez 2017) in a systematic review, given that much of the work is new and less studied, some research may be captured in dissertations or conference presentations rather than published, peer-reviewed papers. Future research on this topic should consider including grey literature, especially if completing a more comprehensive analysis (e.g. a meta-analysis).

Clinical implications

The Latinx population is a diverse population with many different sub-cultural traditions, levels of multilingualism, and, in the U.S., differing layers of acculturation. Several steps can be utilized to begin moving towards linguistically and culturally-competent care for Latinx, Spanish-speaking populations. First, researchers should identify specific ASD screening and assessment measures and linguistic and cultural adaptations that have been shown to be useful and effective for this population. This review provides a first step in this process, yet much of the necessary details to build on this research was notably missing. An additional, recently published review by Alonso Esteban et al. (2020) further supports the need for a concentrated and collaborative effort to develop reliable and valid instruments for the early identification of ASD in Spanish-speaking populations at an international level. Part of this process includes simply identifying and reporting culture and language backgrounds of participants included in their research. This is necessary for informing the creation of specific Spanish-language screening and assessment measures as well as evaluation and treatment procedures included later on in the ASD diagnostic process. Second, studies evaluating the efficacy and validity of screeners must report psychometric data relevant to the validity, generalizability, and reliability of the measure. In order to provide appropriate ASD screenings to individuals, it is necessary to report suitable statistics to support those claims. Third, when examining cultural and linguistic differences in Latinx and non-Latinx populations, research recruitment must focus on building a representative sample of Latinx individuals to increase the field’s dialectical and cultural knowledge.

Conclusion

Overall, this systematic literature review demonstrates the important role Spanish language plays in the ASD screening process. Namely, language differences – especially considering the indelible relationship between language and culture – may: affect knowledge and awareness of development and developmental milestones due to a lack of available resources and supports that are linguistically- and culturally-relevant for the wide range of populations within the Latinx culture; level and ease of communication between families and providers; and result in translated resources validated for use in specific sub-groups of the heterogeneous population of Spanish-speakers potentially impacting generalizability. Additional measures may be needed to capture all concerns and symptoms. The findings from this systematic literature review provide support to improve the current ASD screening process for Spanish-speaking families. The current review provides an ideal foundation for the growing body of Latinx ASD literature and offers a source of clinically applicable information for both ASD providers and Spanish-speaking families.

Supplementary Material

Supplemental Material

Appendix A.

Data Extraction Sheet

  • Title of Article

  • Authors (last name, first/middle initials)

  • Year of publication

  • Journal

  • Volume/Issue #

  • Page numbers

  • Lead author contact details

  • Country in which the study conducted

  • Language(s) in which the paper was published

  • Aim of study

  • Study design

  • Start date

  • End date

  • Study funding sources

  • Possible conflicts of interest for study authors

  • Name of screener(s) evaluated

  • Reliability Statistics

  • Validity Statistics

  • Administration of screener (e.g. English version, Spanish version, translated version?)

  • Translation methods

  • Population description

  • Participant Type(s)

  • Inclusion criteria

  • Exclusion criteria

  • Country of Study Location (if possible, include geographical region [i.e. US, Midwest])

  • Method of recruitment of participants

  • Total number of participants

  • Were parents included?
    • Total number of parents
    • Age range of parents: Youngest age
    • Age range of parents: Oldest age
    • Mean age of parents
    • SD age of parents
    • Sex: Number of females (parents)
    • Sex: Number of males (parents)
    • Race – parents (n, % [if possible]; please indicate if not reported)
    • Ethnicity – parents (n, % [if possible]; please indicate if not reported)
  • Were children included?
    • Total number of children
    • Age range of children: Youngest age
    • Age range of children: Oldest age
    • Mean age of children
    • SD age of children
    • Sex: Number of females (children)
    • Sex: Number of males (children)
    • Diagnoses of children included in the study
    • Race – children (n, % [if possible]; please indicate if not reported)
    • Ethnicity – children (n, % [if possible]; please indicate if not reported)
  • Total number of interpreters/translators

  • Education level of sample

  • Advantages of Screeners

  • Limitations of Screeners

  • Clinical implications

Funding Statement

This work was funded by National Institute of Child Health and Human Development; Center for Clinical and Translational Science, University of Alabama at Birmingham; Department of Health and Human Services Administration for Children and Families; National Institute of Child Health and Human Development; National Institutes of Health training grant through the Heart, Lung and Blood Disease Institute; Dwight David Eisenhower Transportation Fellowship Program.

Disclosure statement

No potential conflict of interest was reported by the authors.

References

  1. Albores-Gallo, L., Roldán-Ceballos, O., Villarreal-Valdes, G., Betanzos-Cruz, B. X., Santos-Sánchez, C., Martínez-Jaime, M. M., Lemus-Espinosa, I. and Hilton, C. L.. 2012. M-CHAT Mexican version validity and reliability and some cultural considerations. International Scholarly Research Notices, 2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Alonso Esteban, Y., Hedley, D., Uljarevié, M., Barbaro, J., Canal Bedia, R. A., and Marín, F.. 2020. Screening instruments for early detection of autism spectrum disorder in Spanish speaking communities. Psicothema, 32, 245–252. [DOI] [PubMed] [Google Scholar]
  3. American Psychiatric Association (APA) . 2013. Diagnostic and statistical manual of mental disorders: DSM-5. Vol. 10. Washington, DC: American Psychiatric Assocation. [Google Scholar]
  4. Atwood, E., Brady, N. C. and Esplund, A.. 2015. Measuring early communication in spanish speaking children: The Communication Complexity Scale in Peru. International Journal of Speech & Language Pathology and Audiology, 3, 9–19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bölte, S., Tomalski, P., Marschik, P. B., Berggren, S., Norberg, J., Falck-Ytter, T., Pokorska, O., Jones, E.J., Charman, T., Roeyers, H., and Kostrzewa, E.. 2018. Challenges and inequalities of opportunities in European psychiatry research: The example of psychodiagnostic tool availability in research on early autism identification. European Journal of Psychological Assessment, 34, 270–277. [Google Scholar]
  6. Bornstein, M. H. and Cote, L. R.. 2004. “Who is sitting across from me?” immigrant mothers’ knowledge of parenting and children’s development. Pediatrics, 114, 557–564. [DOI] [PubMed] [Google Scholar]
  7. Brady, N. , Fleming, K., Thiemann-Bourque, K., Olswang, L., Dowden, P., Saunders, M. and Marquis, J. 2012. Development of the communication complexity scale. American Journal of Speech-Language Pathology, 21, 16–28. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Braun, V. and Clarke, V. 2006. Using thematic analysis in psychology. Qualitative Research in Psychology, 3, 77–101. [Google Scholar]
  9. Canal-Bedia, R., García-Primo, P., Martín-Cilleros, M. V., Santos-Borbujo, J., Guisuraga-Fernández, Z., Herráez-García, L., del Mar Herráez-García, M., Boada-Muñoz, L., Fuentes-Biggi, J., and Posada-de La Paz, M.. 2011. Modified checklist for autism in toddlers: Cross-cultural adaptation and validation in Spain. Journal of Autism and Developmental Disorders, 41, 1342–1351. [DOI] [PubMed] [Google Scholar]
  10. Christensen, D. L., Baio, J., Van Naarden Braun, K., Bilder, D., Charles, J., Constantino, J. N., Daniels, J., Durkin, M. S., Fitzgerald, R.T., Kurzius-Spencer, M., Lee, L., Pettygrove, S., Robinson, C., Schulz, E., Wells, C., Wingate, M. S., Zahorodny, W. and Yeargin-Allsopp, M.. 2016. Prevalence and characteristics of autism spectrum disorder among children aged 8 years – Autism and developmental disabilities monitoring network, 11 sites, United States, 2012. Morbidity and Mortality Weekly Report (MMWR) Surveillance Summary, 65, 1–23. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Coelho-Medeiros, M. E., Bronstein, J., Aedo, K., Pereira, J. A., Arraño, V., Perez, C. A., Valenzuela, P. M. and Bedregal, P.. 2019. M-CHAT-R/F validation as a screening tool for early detection in children with autism spectrum disorder. Revista Chilena de Pediatria, 90, 492–499. [DOI] [PubMed] [Google Scholar]
  12. Cuesta-Gómez, J. L., Andrea Manzone, L. and Posada-De-La-Paz, M.. 2016. Modified checklist for autism in toddlers cross-cultural adaptation for Argentina. International Journal of Developmental Disabilities, 62, 117–123. [Google Scholar]
  13. DuBay, M., Sideris, J. and Rouch, E.. 2022. Is traditional back translation enough? Comparison of translation methodology for an ASD screening tool. Autism Research, 15, 1868–1882. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. DuBay, M., Watson, L. R., Baranek, G. T., Lee, H., Rojevic, C., Brinson, W., Smith, D. and andSideris, J.. 2021a. Rigorous translation and cultural adaptation of an autism screening tool: First Years Inventory as a case study. Journal of Autism and Developmental Disorders, 51, 1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. DuBay, M., Watson, L. R., Méndez, L. I. and Rojevic, C.. 2021b. Psychometric comparison of the English and Spanish western-hemisphere versions of the Modified Checklist for Autism in Toddlers—Revised. Journal of Developmental & Behavioral Pediatrics, 42, 717–725. [DOI] [PubMed] [Google Scholar]
  16. Eugenin, M. I., Rosario Moore, M. D., Martinez-Gutierrez, J., Perez, C. A. and Valenzuela, P. M.. 2015. Screening for autism in Santiago Chile: Community perspectives. International Journal of Child and Adolescent Health, 8, 439–448. [Google Scholar]
  17. Fombonne, E., Marcin, C., Bruno, R., Tinoco, C. M. and Marquez, C. D.. 2012. Screening for Autism in Mexico. Autism Research, 5, 180–189. [DOI] [PubMed] [Google Scholar]
  18. Hyman, S. L., Levy, S. E., Myers, S. M., Kuo, D. Z., Apkon, S., Davidson, L. F., Ellerbeck, K.A., Foster, J.E., Noritz, G.H., Leppert, M.O., Saunders, B.S., Stille, C., Yin, L., Weitzman, C.C., Childers, D.O., Jr., Levine, J.M., Peralta-Carcelen, A.M., Poon, J.K., Smith, P.J., Blum, N.J., Takayama, J.I., Baum, R., Voigt, R.G. and Bridgemohan, C.. 2020. Identification, evaluation, and management of children with autism spectrum disorder. Pediatrics, 145, e20193447. [DOI] [PubMed] [Google Scholar]
  19. Jo, H., Schieve, L. A., Rice, C. E., Yeargin-Allsopp, M., Tian, L. H., Blumberg, S. J., Kogan, M. D. and Boyle, C. A.. 2015. Age at Autism Spectrum Disorder (ASD) diagnosis by race, ethnicity, and primary household language among children with special health care needs, United States, 2009-2010. Maternal and Child Health Journal, 19, 1687–1697. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Kimple, K. S., Bartelt, E. A., Wysocki, K. L. and Steiner, M. J.. 2014. Performance of the modified checklist for autism in toddlers in Spanish-speaking patients. Clinical Pediatrics, 53, 632–638. [DOI] [PubMed] [Google Scholar]
  21. Kuhn, J., Levinson, J., Udhnani, M. D., Wallis, K., Hickey, E., Bennett, A., Fenick, A. M., Feinberg, E. and Broder-Fingert, S. 2021. What happens after a positive primary care autism screen among historically underserved families? Predictors of evaluation and autism diagnosis. Journal of Developmental & Behavioral Pediatrics, 42, 515–523. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Lipkin, P. H., Macias, M. M., Baer Chen, B., Coury, D., Gottschlich, E. A., Hyman, S. L., Sisk, B., Wolfe, A. and Levy, S. E.. 2020. Trends in pediatricians’ developmental screening: 2002–2016. Pediatrics, 145, e20190851. [DOI] [PubMed] [Google Scholar]
  23. Lugo-Marín, J., Díez-Villoria, E., Magán-Maganto, M., Pérez-Méndez, L., Alviani, M., de la Fuente-Portero, J. A. and Canal-Bedia, R.. 2019. Spanish validation of the autism quotient short form questionnaire for adults with autism spectrum disorder. Journal of Autism and Developmental Disorders, 49, 4375–4389. [DOI] [PubMed] [Google Scholar]
  24. MacDonald, R., Parry-Cruwys, D., Dupere, S. and Ahearn, W.. 2014. Assessing progress and outcome of early intensive behavioral intervention for toddlers with autism. Research in Developmental Disabilities, 35, 3632–3644. [DOI] [PubMed] [Google Scholar]
  25. Maenner, M. J., Shaw, K. A., Bakian, A. V., Bilder, D. A., Durkin, M. S., Esler, A., Furnier, S. M., Hallas, L., Hall-Lande, J., Hudson, A., Hughes, M. M., Patrick, M., Pierce, K., Poynter, J.N., Salinas, A., Shenouda, J., Vehorn, A., Warren, Z., Constantino, J.N., DiRienzo, M., Fitzgerald, R.T., Grzybowski, A., Spivey, M.H., Pettygrove, S., Zahorodny, W., Ali, A., Andrews, J.G., Baroud, T., Gutierrez, J., Hewitt, A., Lee, L., Lopez, M.m., Mancilla, K.C., McArthur, D., Schwenk, Y.D., Washington, A., Williams, S. and Cogswell, M. E.. 2021. Prevalence and characteristics of autism spectrum disorder among children aged 8 years – Autism and Developmental Disabilities Monitoring Network, 11 sites, United States, 2018. MMWR Surveillance Summary, 70, 1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Magaña, S., Lopez, K., Aguinaga, A. and Morton, H.. 2013. Diagnosis and treatment services among Latino children with autism spectrum disorders. Intellectual and Developmental Disabilities, 51, 141–153. [DOI] [PubMed] [Google Scholar]
  27. Magaña, S. and Smith, L.. 2013. Are there differences in ADI-R lifetime scores between Latino and non-Latino adolescents and adults with autism spectrum disorders. Journal of Autism and Developmental Disorders, 43, 141–153. [Google Scholar]
  28. Magán-Maganto, M., Canal-Bedia, R., Hernández-Fabián, A., Bejarano-Martín, Á., Fernández-Álvarez, C. J., Martínez-Velarte, M., Martín-Cilleros, M.V., Flores-Robaina, N., Roeyers, H. and Posada de la Paz, M. P.. 2020. Spanish cultural validation of the modified checklist for autism in toddlers, revised. Journal of Autism and Developmental Disorders, 50, 2412–2423. [DOI] [PubMed] [Google Scholar]
  29. Mayo-Ortega, L., Oyama-aniko, R., Leblanc, J.,bSchroeder, S. R., Brady, N., Butler, M. G., Reese, M. R., Richman, D. M., Peacock, G., Foster, J. and Marquis, J. 2012. Mass screening for severe problem behavior among infants and toddlers in Peru. Journal of Mental Health Research in Intellectual Disabilities, 5, 246–259. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Miller, J. S., Gabrielsen, T., Villalobos, M., Alleman, R., Wahmhoff, N., Carbone, P. S. and Segura, B.. 2011. The each child study: Systematic screening for autism spectrum disorders in a pediatric setting. Pediatrics, 127, 866–871. [DOI] [PubMed] [Google Scholar]
  31. Moher, D., Liberati, A., Tetzlaff, J. and Altman, D. G.. 2009. Research methods & reporting. BMJ, 339, 332–336. [Google Scholar]
  32. Moher, D., Liberati, A., Tetzlaff, J. and Altman, D. G.. 2009. Preferred reporting items for systematic reviews and meta-analyses: The PRISMA statement. PLoS Medicine, 6, e1000097. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Morales-Hidalgo, P., Hernández-Martínez, C., Voltas, N. and Canals, J.. 2017b. EDUTEA: A DSM-5 teacher screening questionnaire for autism spectrum disorder and social pragmatic communication disorder. International Journal of Clinical and Health Psychology, 17, 269–281. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Morales-Hidalgo, P., Roigé‐Castellví, J., Vigil‐Colet, A. and Canals Sans, J.. 2017a. The childhood autism spectrum test (CAST): Spanish adaptation and validation. Autism Research, 10, 1491–1498. [DOI] [PubMed] [Google Scholar]
  35. Paez, A. 2017. Gray literature: An important resource in systematic reviews. Journal of Evidence‐Based Medicine, 10, 233–240. [DOI] [PubMed] [Google Scholar]
  36. Page, M. J., McKenzie, J. E., Bossuyt, P. M., Boutron, I., Hoffmann, T. C., Mulrow, C. D., Shamseer, L., Tetzlaff, J. M., Akl, E. A., Brennan, S. E., Chou, R., Glanville, J., Grimshaw, J. M., Hróbjartsson, A., Lalu, M. M., Li, T., Loder, E. W., Mayo-Wilson, E., McDonald, S., McGuinness, L. A., Stewart, L. A., Thomas, J., Tricco, A. C., Welch, V. A., Whiting, P. and Moher, D.. 2021. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ (Clinical Research ed.), 372, n71. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Reyes, N. M., Moody, E., Hightshoe, K., Davidon, S., Rosenberg, S., Dahl, E. and Kubicek, L. F.. 2021. Factors that may influence social communication questionnaire (SCQ) scores: An examination of the Spanish and English SCQ in school-aged students. School Psychology Review, 1–11. [Google Scholar]
  38. Robins, D., Frein, D. and Barton, M. MCHAT R/F (Internet). Adaptación por el Equipo T EA UC. 2017. M-CHAT-R/F. Español-Chile. http://www.mchatscreen.com/wp-content/uploads/2017/01/M-CHAT-R_F_Espanol-Chile2017.pdf. [Google Scholar]
  39. Rosenberg, S. A., Moody, E. J., Lee, L., DiGuiseppi, C., Windham, G. C., Wiggins, L. D., Schieve, L. A., Ledbetter, C. M., Levy, S. E., Blaskey, L., Young, L., Bernal, P., Rosenberg, C. R. and Fallin, M. D.. 2018. Influence of family demographic factors on social communication questionnaire scores. Autism Research, 11, 695–706. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Scarpa, A., Reyes, N. M., Patriquin, M. A., Lorenzi, J., Hassenfeldt, T. A., Desai, V. J. and Kerkering, K. W.. 2013. The modified checklist for autism in toddlers: Reliability in a diverse rural American sample. Journal of Autism and Developmental Disorders, 43, 2269–2279. [DOI] [PubMed] [Google Scholar]
  41. Scott, F. J., Baron-Cohen, S., Bolton, P. and Brayne, C.. 2002. The CAST (Childhood Asperger Syndrome Test): Preliminary development of a UK screen for mainstream primary-school-age children. Autism, 6, 9–31. [DOI] [PubMed] [Google Scholar]
  42. Troxel, M., Linas, K., Jacobstein, D., Biel, M., Soto, S. H., Lorenzo-Hubert, I., Dos-Santos, S., Mendez, R. and Anthony, B. J.. 2018. Implementing autism screening for latino children in primary care: Perspectives from parents and providers. Journal of Health Disparities Research and Practice, 12, 88–105. [Google Scholar]
  43. Voelkel, H., LeCroy, C. W., Williams, L. R. and Holschuh, J.. 2013. The full spectrum: Hispanic understanding of autism. Best Practices in Mental Health: An International Journal, 9, 31–46. [Google Scholar]
  44. Whiting, P. F., Rutjes, A. W. S., Westwood, M. E., Mallett, S., Deeks, J. J., Reitsma, J. B., Leeflang, M. M. G., Sterne, J. A. C. and Bossuyt, P. M. M. and QUADAS-2 Group*. 2011. QUADAS-2: A revised tool for the quality assessment of diagnostic accuracy studies. Annals of Internal Medicine, 155, 529–536. [DOI] [PubMed] [Google Scholar]
  45. Windham, G. C., Smith, K. S., Rosen, N., Anderson, M. C., Grether, J. K., Coolman, R. B. and Harris, S.. 2014. Autism and developmental screening in a public, primary care setting primarily serving Latinxs: Challenges and results. Journal of Autism and Developmental Disorders, 44, 1621–1632. [DOI] [PubMed] [Google Scholar]
  46. Zuckerman, K. E., Mattox, K., Donelan, K., Batbayar, O., Baghaee, A. and Bethell, C.. 2013. Pediatrician identification of Latino children at risk for autism spectrum disorder. Pediatrics, 132, 445–453. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Zuckerman, K. E., Sinche, B., Mejia, A., Cobian, M., Becker, T. and Nicolaidis, C.. 2014. Latino parents’ perspectives of barriers to autism diagnosis. Academic Pediatrics, 14, 301–308. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Material

Articles from International Journal of Developmental Disabilities are provided here courtesy of The British Society of Developmental Disabilities

RESOURCES