Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2021 Aug 1.
Published in final edited form as: Autism. 2020 Feb 14;24(6):1566–1571. doi: 10.1177/1362361320903138

Short Report: Effects of Pivotal Response Treatment on Reciprocal Vocal Contingency in a Randomized Controlled Trial of Children with Autism Spectrum Disorder

Jena McDaniel 1, Paul Yoder 2, Madison Crandall 2, Maria Estefania Millan 3, Christina Mich Ardel 3, Grace W Gengoux 3, Antonio Y Hardan 3
PMCID: PMC7375927  NIHMSID: NIHMS1549246  PMID: 32054315

Abstract

A pivotal response treatment package (PRT-P) consisting of clinician-delivered and parent-implemented strategies was recently found to be effective in improving language and social communication deficits in children with autism spectrum disorder (ASD; Gengoux et al., 2019). Reciprocal vocal contingency (RVC), an automated measure of vocal reciprocity, may provide stronger and complementary evidence of the effects of PRT-P. RVC is derived through an automated process from day-long audio samples from the child’s natural environment. Therefore, RVC is at lower risk for detection bias than parent report and brief parent-child interaction measures. Although differences were non-significant at baseline and after 12 weeks of intervention for the 48 children with ASD who were randomly assigned to PRT-P or a delayed treatment control group, the PRT-P group had higher ranked RVC scores than the control group after 24 weeks (U = 125, p = .04). These findings are consistent with results from parent report and parent-child interaction measures obtained during the trial. The participants in PRT-P exhibited greater vocal responsiveness to adult vocal responses to their vocalizations than the control group. Findings support the effectiveness of PRT-P on vocal reciprocity of children with ASD, which may be a pivotal skill for language development.

Keywords: autism spectrum disorder, Interventions – psychosocial/behavioral, communication and language, pre-school children

Lay Abstract

A recent randomized controlled trial (Gengoux et al., 2019) found that children with autism spectrum disorder (ASD) who received a pivotal response treatment package (PRT-P) showed improved language and social communication skills following the intervention. PRT-P includes clinician-delivered and parent-implemented strategies. Reciprocal vocal contingency (RVC) is an automated measure of vocal reciprocity derived from day-long audio samples from the child’s natural environment. It may provide stronger and complementary evidence of the effects of PRT-P because it is at lower risk for detection bias than parent report and brief parent-child interaction measures. The current study compared RVC for 24 children with ASD in the PRT-P group and 24 children with ASD in the control group. The PRT-P group received 24 weeks of PRT-P intervention. The control group received their usual intervention services during that time. The groups did not differ in RVC when the intervention started or after 12 weeks of intervention. However, after 24 weeks the PRT-P group had higher ranked RVC scores than the control group. These findings are consistent with results from parent report and parent-child interaction measures obtained during the trial. The participants in PRT-P exhibited greater vocal responsiveness to adult vocal responses to their vocalizations than the control group. Findings support the effectiveness of PRT-P on vocal reciprocity of children with ASD, which may be a pivotal skill for language development.


One instantiation of naturalistic developmental behavioral interventions (NDBIs) combines clinician-delivered intervention and parent training. Pivotal response treatment (PRT) is a naturalistic developmental behavioral intervention for children with autism spectrum disorder (ASD) that targets “pivotal” skills (e.g., motivation and responsivity) to facilitate broad changes across functional skills (e.g., Koegel, Koegel, & Brookman, 2005). However, prior randomized controlled trials (RCTs) for PRT had investigated either clinician-delivered or parent-implemented PRT (e.g., Hardan et al., 2015; Mohammadzaheri et al., 2014).

A recent RCT (Gengoux et al., 2019) evaluated the effectiveness of combining PRT parent training with in-home, clinician-delivered PRT (PRT-Package or PRT-P) on communication skills of children with ASD aged 2 to 5 years old. The treatment group received 12 weeks of weekly parent training sessions and 10 hours per week of clinician-delivered in-home treatment, followed by 12 weeks of monthly parent training sessions and 5 hours per week of clinician-delivered in-home treatment (Gengoux et al., 2019). The delayed treatment control group received continued community services during the 24-week trial. Participants who received PRT-P (n = 23) exhibited greater gains in frequency of functional utterances during structured laboratory observations (SLO) and in parent-reported expressive vocabulary (MacArthur-Bates Communicative Development Inventory, MB-CDI; Fenson et al., 2007), Clinical Global Impressions - Improvement subscale (CGI-I; Guy, 1976), and Brief Observation of Social Communication Change (BOSCC, Grzadzinski et al., 2016) during a parent-child interaction relative to the control group (n = 20; Gengoux et al., 2019). Investigators and research staff blind to group assignment completed outcome measures requiring clinical assessments (CGI-I) or video scoring (SLO, BOSCC).

In this more strenuous analysis we test whether the PRT-P group outperforms the control group on a more objective measure less influenced by context and at lower risk of detection bias – reciprocal vocal contingency (RVC) from day-long vocal samples in the home environment. Two 16-hour weekend-day audio recordings were collected using Language ENvironment Analysis (LENA) devices for participants at baseline, intervention midpoint (12 weeks post study entry), and end of intervention (24 weeks post study entry). Briefly, RVC is the degree to which an adult vocal response to a child vocalization increases the likelihood of an immediately following child vocalization beyond chance (Harbison et al., 2018). RVC quantifies the extent to which child vocalizations occur after parent responses to child vocalizations more than after other event types or times (Harbison et al., 2018). It measures a three-event sequence (child vocalization→adult vocalization→child vocalization) rather than a two-event sequence (child vocalization→adult vocalization; adult vocalization→child vocalization) used by the LENA conversational turn variable.

RVC is an appealing outcome measure for the PRT trial for at least three reasons. First, RVC quantifies vocal reciprocity, which is related to communication and language development based on empirical and theoretical evidence in children with ASD (Harbison et al. 2018; Warlaumont, Richards, Gilkerson, & Oller, 2014). Second, its automated nature reduces bias risk because no human coding is used. With other measures, the coder’s lack of blindness to treatment assignment can increase risk of overestimating the treatment groups’ true skills relative to the control group, particularly when communication acts are marginally intelligible. Third, because RVC is calculated from audio recordings of day-long vocal samples in the child’s natural environment, it represents the child’s interaction with multiple adults across natural settings. Even on weekends, parents, grandparents, adult friends, and other visitors may talk with the child. Assuming that only one of these adults received parent training, some adults will not be using the trained interaction style. In this way, RVC is a measure of children’s generalized tendency to be vocally responsive to adults’ vocal responses. This characteristic of RVC is a large advantage over measures of parent-child interaction in the lab (e.g., SLO, BOSCC) because when parents are the adults eliciting the child behavior, the parents are playing the “examiner” role in the measurement context for the child dependent variable. When half of the parents are using an interaction style that immediately elicits child vocalizations, as is possibly the case for parents receiving PRT-P training, children in the PRT-P group are tested under importantly different conditions than children in the control group. Thus, child skills derived from parent-child interaction sessions in parent-implemented intervention trials are at risk of correlated measurement error (i.e., true score is systematically elevated for the predicted superior group and systematically underestimated for the control group; Yoder, Lloyd, & Symons, 2018). RVC from day-long vocal samples in the natural environment is much less likely to be biased in this manner. Ultimately, PRT-P is designed to influence children’s generalized tendencies, not just their behaviors when interacting with their parents in a laboratory environment.

Purpose

This study evaluated whether participants in the PRT-P group exhibited higher RVC values than the participants in the control group (community treatment) after 12 and 24 weeks of intervention. If the PRT package including clinician-delivered and parent-implemented PRT is effective on automatic measures of RVC from day-long recordings, a stronger argument about effects of PRT-P could be made than has been previously documented.

Methods

Participants

Participants in this study were enrolled in the PRT-P RCT (Gengoux et al., 2019). They were referred by local professionals or recruited through flyers and word of mouth. Participants enrolled December 2013 through July 2016. See Gengoux et al., 2019 for Consort Form flow chart. ASD diagnosis was based on meeting criteria on the Autism Diagnostic Interview – Revised (ADI-R; Lord, Rutter, & Le Couteur, 1994), Autism Diagnostic Observation Schedule (ADOS-2; Lord et al., 2012), Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition diagnostic criteria for ASD (American Psychiatric Association, 2013), and expert clinical judgment. Inclusion criteria also included scoring at least one standard deviation below the mean for 2- and 3-year-olds, two standard deviations below the mean for 4-year-olds, and three standard deviations below the mean for 5-year-olds on the Preschool Language Scale – Fifth Edition expressive communication subscale (Zimmerman, Steiner, & Pond, 2011). Exclusion criteria included individual speech therapy more than 1 hour per week, one-on-one applied behavior analysis treatment more than 15 hours per week, severe psychiatric disorder, genetic abnormality or active medical disease, primary language other than English, or living more than 50 miles from the research center.

Forty-eight participants were randomized (PRT-P = 24; control = 24). Forty-three families completed the trial (PRT-P = 23; control = 20). Data for the current analysis (i.e., naturalistic audio recording with at least 60 minutes of “meaningful time”; LENA Research Foundation, 2015) were available for 40 participants (PRT-P = 20; control = 20). Meaningful time is defined consistent with LENA user guidelines as “usable, distinguishable speech that is included in the reported information” (LENA Research Foundation, 2015, p. 35).

Participating children were primarily male (88%). Participating parents were primarily female (79%) and college graduates (84%). The sample was ethnically diverse with 56% Asian, 28% Caucasian, 7% biracial/other, 7% Hispanic, and 2% Native Hawaiian. Median annual household income was reported in the $150,000-200,000 range. Table 1 displays additional participant characteristics. Mann-Whitney U tests only revealed a group difference at baseline for words understood on the MacArthur-Bates Communicative Development Inventory – Words and Gestures Form (MB-CDI WG; Fenson et al., 2007). However, because MB-CDI WG Receptive was not correlated with RVC at Time 3 (Spearman’s rho = .25; p = .12), this baseline difference cannot explain the treatment effect presented below.

Table 1.

Participant Characteristics at Study Initiation

PRT-P Group (n = 20) Control Group (n = 20)

M SD M SD U p
Chronological age (months) 49.85 11.92 46.85 9.66 171.5 .45
Mental age (months) 21.63 6.62 19.40 6.78 171.5 .45
Developmental quotient 44.50 12.62 42.26 14.80 168.0 .39
MB-CDI WG Receptive (raw score) 215.50 112.22 144.45 96.52 124.0 .04
MSEL Receptive (age equivalent in months) 17.85 7.66 16.20 8.00 175.5 .52
MB-CDI WS Expressive (raw score) 151.40 135.94 78.35 107.30 139.0 .10
MSEL Expressive (age equivalent in months) 19.60 8.85 14.75 9.51 133.5 .07

Note. Control Group = delayed treatment control group; Developmental quotient = mental age divided by chronological age multiplied by 100; MB-CDI WG = MacArthur-Bates Communicative Development Inventory – Words and Gestures Form (Fenson et al., 2007); MB-CDI WS = MacArthur-Bates Communicative Development Inventory – Words and Sentences Form (Fenson et al., 2007); Mental age = mean age equivalent score across subtests of the Mullen Scales of Early Learning (MSEL; Mullen, 1995); PRT-P = pivotal response treatment package including parent training and clinician-delivered intervention; U = test statistic for Mann-Whitney U test.

PRT-P and Control Conditions

The PRT-P included weekly individual parent training sessions and 10 hours per week of in-home intervention for 12 weeks, followed by 12 additional weeks of monthly parent training sessions and 5 hours per week of in-home intervention. Parent training sessions were conducted in a clinic and involved systematically introducing the motivational components of PRT to parents, clinician modeling of implementation with the child, and parent practicing directly with the child with in vivo clinician feedback. During in-home sessions, a trained clinician played with the child using child-preferred toys and implemented PRT strategies to encourage functional communication practice. See Gengoux et al., 2019 for a detailed description of the intervention procedures. Children in the delayed treatment control group continued stable community services during this study.

Reciprocal Vocal Contingency (RVC)

We quantified RVC using a three-event sequential analysis to calculate the degree to which children’s vocal response was contingent on prior adult vocal responses to children’s immediately preceding vocalizations. RVC is independent of chance sequencing of events (Yoder et al., 2018). Because this analysis requires long vocal samples, we used two day-long LENA system audio recordings per participant per period. Recordings were collected at baseline and after 12 and 24 weeks of intervention. We used the LENA Pro computer program to segment and classify adult and child vocalizations and an open-sourced, cross-platform computer program (Version 1; https://github.com/HomeBankCode/LENA_contingencies) to compute RVC values for each day-long vocal sample using a 2-second pause duration (Yoder, Wade, Tapp, Warlaumont, & Harbison, 2016).

Results

The mean length for the audio recordings was 15 hours, 25 minutes, 26 seconds (SD = 1 hour, 42 minutes, 25 seconds) for the PRT-P group and 15 hours, 2 minutes (SD = 2 hours, 39 minutes) for the control group. These lengths did not vary by group (t = 0.55, p = .59). Two families contributed a single recording at Time 1 (PRT group = 0), seven did so at Time 2 (PRT group = 3), and eight at Time 3 (PRT group = 4). Three of the single recordings included less than 60 minutes of meaningful time and therefore were excluded. All participants with usable Time 3 recordings had usable Time 1 and Time 2 recordings except for one participant due to zero values in multiple contingency table cells for Time 2 RVC.

As a measure of RVC stability across recordings, we calculated intraclass correlation coefficients (ICCs) for two day-long audio recordings at each time point. The ICCs were .59, .77, and .63 at Times 1,2, and 3 respectively. Mitchell (1979) describes ICCs of .7 as “very good.” The following RVC analyses used the sum of the two day-long recordings per participant per time point.

Because the population distribution of RVC is unknown, we used the nonparametric Mann-Whitney U test to assess between-group differences. Although group differences were non-significant at baseline (Time 1; n1 = 20 [PRT-P group]; n2 = 20 [control group]; ΣR1 = 458; ΣR2 = 362; U = 152; p = .20) and after 12 weeks of intervention (Time 2; n1 = 20; n2 = 19; ΣR1 = 449; ΣR2 = 331; U = 141; p = .18), children in the PRT-P group had higher ranked RVC scores than children in the control group after 24 weeks of treatment (Time 3; n1 = 20; n2 = 20; ΣR1 = 485; ΣR2 = 335; U = 125, p = .04). The percentage of all possible pairwise comparisons for which the PRT-P group exceeded the control group at Time 3 was 69%, which is a moderate effect size (Wuensch, 2015). Figure 1 displays the median and confidence intervals for Time 3 RVC by group.

Figure 1.

Figure 1.

Control = delayed treatment control group. PRT-P = pivotal response treatment package with parent training and clinician-delivered intervention. Each open circle represents one participant’s reciprocal vocal contingency value (RVC) at Time 3 (immediately after 24 weeks of intervention). The filled diamonds represent the median RVC for each group. The filled circles represent the limits of the 95% confidence interval around the median values.

Discussion

Compared with the control group, the PRT-P group exhibited significantly higher RVC values after 24 weeks of intervention. No significant difference was observed at baseline or intervention midpoint. The lack of an effect at the intervention midpoint is not explained by reduced stability because RVC stability was higher at the midpoint (Time 2) than the endpoint (Time 3), for which an effect was identified. The findings support the effectiveness of PRT-P on vocal reciprocity of children with ASD, which may be a pivotal skill for language development. The RVC analysis results are consistent with results from expressive language measures obtained during the RCT (Gengoux et al., 2019) and earlier PRT studies (e.g., Gengoux et al., 2015; Hardan et al., 2015; Mohammadzaheri et al., 2014). They provide additional support for PRT-P benefits. Arguably, RVC assesses a more generalized behavioral tendency and is at lower detection bias risk than previously reported PRT-P benefits (e.g., parent-reported expressive vocabulary and social communication within parent-child interaction sessions). Additionally, RVC is derived from vocal samples collected from the child’s natural environment, which often included multiple communication partners, not only the parent who had learned to implement the intervention.

Limitations

The identities of the adult communication partners in the day-long audio recordings are unknown. Therefore, we cannot differentiate between participants’ RVC values with the parent who received PRT training versus other individuals who did not receive PRT training. There is also some known error in the classification of acoustic events for the LENA system. These classification errors involve misidentifying the speaker (e.g., child versus adult) and event type (e.g., speech-like child vocalization versus other child vocalization; Warren et al., 2010). Reports indicate that the LENA system algorithms are generally conservative relative to human coders (e.g., Warren et al., 2010). Collecting a vast amount of data (i.e., multiple day-long samples) reduces the impact of these classification errors.

Future Directions

Given the novel nature of the RVC treatment effect, replication is required. Continued investigation is also required to determine the specific time range and developmental level for which RVC is most useful.

Conclusions

In summary, participants in the PRT-P group exhibited a greater generalized tendency to be vocally responsive to adults’ vocal responses than participants in the control group. The identified treatment effect on vocal reciprocity as measured by RVC, which is at low risk for bias and evaluates children’s performance in naturalistic settings, lends further support to the broad reaching effects of PRT. The finding supports continued development and evaluation of PRT for children with ASD.

Acknowledgments

This research was funded by the NIDCD (DC01368902) and supported by two US Department of Education grants (H325D140087 and H325D140077). We thank all of the children and families who participated to make this work possible.

Footnotes

The authors have no conflicts of interest to disclose. JM participated in the study design, analyzed the data, and drafted the manuscript; PY conceived the study, participated in the study design, helped interpret the data, and helped draft the manuscript; MC participated in the study design and analyzed the data; MEM collected the data and helped interpret the data; CMA collected the data and helped interpret the data; GWG and AYH conceived the study, participated in the study design, supervised data collection, and helped interpret the data. All authors read and approved the final manuscript.

References

  1. American Psychiatric Association. (2013). Diagnostic and statistical manual of mental disorders (5th ed.). Arlington, VA: American Psychiatric Publishing. [Google Scholar]
  2. Fenson L, Marchman VA, Thal DJ, Dale PS, Reznick JS, & Bates E (2007). MacArthur-Bates Communicative Development Inventories: User’s guide and technical manual (2nd ed.). Baltimore, MD: Brookes. [Google Scholar]
  3. Gengoux GW, Abrams DA, Schuck R, Millan ME, Libove R, Ardel CM, … Hardan AY. (2019). A pivotal response treatment package for children with autism spectrum disorder: An RCT. Pediatrics, 144, e20190178. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Gengoux GW, Berquist KL, Salzman E, Schapp S, Phillips JM, Frazier TW, … & Hardan AY. (2015). Pivotal response treatment parent training for autism: Findings from a 3-month follow-up evaluation. Journal of Autism and Developmental Disorders, 45, 2889–2898. doi: 10.1007/s10803-015-2452-3 [DOI] [PubMed] [Google Scholar]
  5. Grzadzinski R, Carr T, Colombi C, McGuire K, Dufek S, Pickles A, & Lord C (2016). Measuring Changes in Social Communication Behaviors: Preliminary Development of the Brief Observation of Social Communication Change (BOSCC). Journal of Autism and Developmental Disorders, 46, 2464–2479. doi: 10.1007/s10803-016-2782-9 [DOI] [PubMed] [Google Scholar]
  6. Guy W (1976). ECDEU Assessment Manual for Psychopharmacology. Rockville, MD: U.S. Department of Health, Education, and Welfare. [Google Scholar]
  7. Harbison A, Woynaroski T, Warlaumont A, Wade J, Tapp J & Yoder P (2018). A new measure of vocal contingency in young children with autism spectrum disorder. Autism Research, 11, 903–915. doi: 10.1002/aur.1942 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Hardan AY, Gengoux GW, Berquist KL, Libove RA, Ardel CM, Phillips J, … Minjarez MB. (2015). A randomized controlled trial of Pivotal Response Treatment Group for parents of children with autism. Journal of Child Psychology and Psychiatry, 56, 884–892. doi: 10.1111/jcpp.12354 [DOI] [PubMed] [Google Scholar]
  9. Koegel LK, Koegel RL, & Brookman LI (2005). Child-initiated interactions that are pivotal in intervention for children with autism In Hibbs ED & Jensen PS (Eds.), Psychosocial treatments for child and adolescent disorders: Empirically based strategies for clinical practice (2nd ed., pp. 633–657). Washington, DC: American Psychological Association. [Google Scholar]
  10. LENA Research Foundation (2015). LENA Pro User Guide. Retrieved from https://cdn.shopify.com/s/files/1/0596/9601/files/LENA_Pro_User_Guide.pdf
  11. Lord C, Rutter M, DiLavore P, Risi S, Gotham K, & Bishop S (2012). Autism Diagnostic Observation Schedule (ADOS-2). Torrance, CA: Western Psychological Services. [Google Scholar]
  12. Lord C, Rutter M, & Le Couteur A (1994). Autism Diagnostic Interview-Revised: A revised version of a diagnostic interview for parents of individuals with possible pervasive developmental disorders. Journal of Autism and Developmental Disorders, 24, 659–685. doi: 10.1007/bf02172145 [DOI] [PubMed] [Google Scholar]
  13. Mitchell SK (1979). Interobserver agreement, reliability, and generalizability of data collected in observational studies. Psychological Bulletin, 86, 376–390. doi: 10.1037/0033-2909.86.2.376 [DOI] [Google Scholar]
  14. Mohammadzaheri F, Koegel LK, Rezaee M, & Rafiee SM (2014). A randomized clinical trial comparison between pivotal response treatment (PRT) and structured applied behavior analysis (ABA) intervention for children with autism. Journal of Autism and Developmental Disorders, 44, 2769–2777. doi: 10.1007/s10803-014-2137-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Mullen EM (1995). Mullen Scales of Early Learning (AGS ed.). Los Angeles, CA: Western Psychological Services. [Google Scholar]
  16. Warlaumont AS, Richards JA, Gilkerson J, & Oller DK (2014). A social feedback loop for speech development and its reduction in autism. Psychological Science, 25, 1314–1324. doi: 10.1177/0956797614531023 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Warren SF, Gilkerson J, Richards JA, Oller DK, Xu D, Yapanel U, & Gray S (2010). What automated vocal analysis reveals about the vocal production and language learning environment of young children with autism. Journal of Autism and Developmental Disorders, 40, 555–569. doi: 10.1007/s10803-009-0902-5 [DOI] [PubMed] [Google Scholar]
  18. Wuensch KL (2015). Nonparametric effect size estimators. Retrieved from http://core.ecu.edu/psyc/wuenschk/docs30/Nonparametric-EffectSize.pdf on August 18, 2019.
  19. Yoder PJ, Lloyd BP, & Symons FJ (2018) Observational measurement of behavior (2nd ed.). Baltimore, MD: Paul H. Brooks Publishing Company. [Google Scholar]
  20. Yoder P, Wade J, Tapp J, Warlaumont A, & Harbison AL (2016) Contingencies from LENA data. Retrieved from https://github.com/HomeBankCode/LENA_contingencies
  21. Zimmerman IL, Steiner VG, & Pond RE (2011) The Preschool Language Scale – Fifth Edition. San Antonio, TX: The Psychological Corporation. [Google Scholar]

RESOURCES