Abstract
Purpose:
Open science is a collection of practices that seek to improve the accessibility, transparency, and replicability of science. Although these practices have garnered interest in related fields, it remains unclear whether open science practices have been adopted in the field of communication sciences and disorders (CSD). This study aimed to survey the knowledge, implementation, and perceived benefits and barriers of open science practices in CSD.
Method:
An online survey was disseminated to researchers in the United States actively engaged in CSD research. Four core open science practices were examined: preregistration, self-archiving, gold open access, and open data. Data were analyzed using descriptive statistics and regression models.
Results:
Two hundred twenty-two participants met the inclusion criteria. Most participants were doctoral students (38%) or assistant professors (24%) at R1 institutions (58%). Participants reported low knowledge of preregistration and gold open access. There was, however, a high level of desire to learn more for all practices. Implementation of open science practices was also low, most notably for preregistration, gold open access, and open data (< 25%). Predictors of knowledge and participation, as well as perceived barriers to implementation, are discussed.
Conclusion:
Although participation in open science appears low in the field of CSD, participants expressed a strong desire to learn more in order to engage in these practices in the future.
Accurate and reliable research is a cornerstone of evidence-based practice and a key driver of progress in the field of communication sciences and disorders (CSD). In the past decade, a number of concerns have been raised surrounding the reproducibility of research findings across many disciplines (e.g., psychology, education, biology, and ecology; Baggerly & Coombes, 2009; Fraser et al., 2018; Makel & Plucker, 2014; Open Science Collaboration, 2015). These concerns were sparked by the “replication crisis” in psychology; the Open Science Collaboration attempted to replicate 100 experimental and correlational studies and found that only 36% of the replicated studies had statistically significant findings compared with 97% of the original studies (Open Science Collaboration, 2015). In addition, the effect sizes in the replicated studies were approximately half the original effects, suggesting that selective reporting and/or publication bias may have inflated effects in the original studies.
At the core of the replication crisis are an overall lack of transparency in scientific studies and the use of questionable research practices, both of which have been argued to stem from an academic culture that incentivizes publication quantity over quality (Munafò et al., 2017). Lack of transparency can mean incomplete reporting of methodology or failure to provide access to materials, protocols, data sets, or publications (Samsa & Samsa, 2019). Questionable research practices are those used with the intention of increasing the likelihood of finding evidence to support the researcher's hypotheses. Examples include the selective reporting of findings, hypothesizing after results are known (HARKing), and deciding to collect more data after finding nonsignificant results with a given sample size, among others (John et al., 2012).
Selective reporting refers to the practice of deliberately not reporting research findings fully or accurately in order to serve the researcher's agenda, hide undesirable results, or present only significant findings. HARKing refers to researchers forming hypotheses after they have seen the study results. HARKing can be detrimental because it can take a finding that was due to statistical error (a Type I error) and translate it into theory. Both selective reporting and HARKing forfeit the opportunity to communicate what did not work in research, which is as important as what did work (Kerr, 1998). Collecting more data after finding nonsignificant results can lead to p-hacking, in which a researcher knowingly makes analytic choices after seeing the results in order to reach a significant finding. The consequences of p-hacking include wasted time and resources, an increase in the number of false positives, and a biased literature base that does not replicate. Some of these practices, such as collecting additional data, may be necessary in the context of exploratory work. They are problematic, however, when they are undisclosed and used selectively to obtain interesting findings that would not otherwise exist. The use of questionable research practices can therefore lead to the publication of misleading findings that cannot be replicated. Journal publication biases that encourage selective reporting further contribute to the ongoing replication crisis (Ioannidis et al., 2014). To mitigate these issues, the field of psychology has increasingly adopted open science practices (Nelson et al., 2018). The call to reconsider scientific methods and processes has also been heard across other fields, such as cancer research (Begley & Ellis, 2012) and strategic management in business (Hubbard et al., 1998).
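The mechanics of this inflation are easy to demonstrate. The following is an illustrative simulation, not material from the original article: it assumes a simple two-stage optional-stopping rule and shows that the false-positive rate rises above the nominal 5% even when no true effect exists.

```r
# Illustrative sketch: optional stopping inflates the false-positive rate.
# Both groups are drawn from the SAME distribution (the null is true).
set.seed(2021)

optional_stop <- function() {
  x <- rnorm(20); y <- rnorm(20)               # initial sample: n = 20 per group
  if (t.test(x, y)$p.value < .05) return(TRUE) # stop early if "significant"
  x <- c(x, rnorm(20)); y <- c(y, rnorm(20))   # otherwise collect more data
  t.test(x, y)$p.value < .05                   # ...and test again
}

mean(replicate(10000, optional_stop()))  # ~.08, well above the nominal .05
```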
To date, no empirical evidence has examined the extent to which open science practices have been adopted in the field of CSD. The closest proxy comes from otolaryngology, a related medical field that includes research by CSD scientists: fewer than 6% of 300 randomly selected articles published between 2014 and 2018 reported reproducible and transparent research practices (Johnson et al., 2020). Consequently, there is a need to better understand the knowledge, attitudes, and implementation of open science in CSD.
Open Science and Potential Benefits to CSD
Open science refers to a collection of research practices that aim to increase the accessibility and transparency of science. These practices include preregistration, self-archiving, gold open access, and open data, defined in Table 1. The principles of open science can be incorporated into all stages of the scientific process, from preregistering a study plan to sharing study materials or increasing access to research publications. Implementing open science, however, is not an all-or-nothing endeavor; researchers can incrementally add open science practices to their workflow with the long-term goal of opening their science.
Table 1.
Open science practices definitions.
| Practice | Definition |
|---|---|
| Preregistration | The practice of documenting the research plan, study design, hypotheses, and/or analyses prior to data collection and submitting it to a registry. Preregistration distinguishes hypothesis-generating (exploratory) from hypothesis-testing (confirmatory) research. |
| Self-archiving | The act of making a version of a manuscript legally and freely available online on a lab/personal website or in a repository. The version may be the submitted, accepted, or published version of the manuscript, depending on publisher policy. |
| Gold open access | Unrestricted public availability of a research paper on the Internet through formal publication systems (e.g., open access publishers). Gold open access indicates that researchers paid the publisher a fee to make their work openly available online. |
| Open data | Unrestricted public availability of research data and/or any resource necessary for the collection of these data (methodology, protocol, software packages, etc.), generally through online repositories. |
Open science has a number of potential benefits for the field of CSD, both for scientific discovery and for clinicians' evidence-based practice. First, open science is associated with increased transparency and reproducibility, potentially facilitating higher quality research output (Hardwicke et al., 2020; OECD, 2015; Rubin, 2020). Higher quality, reproducible research can make science more cost effective, as scientific discoveries are more robust; the cost of irreproducible preclinical research in the United States alone, for example, has been estimated at $28 billion annually (Freedman et al., 2015). Second, open science improves the efficiency of science by reducing the duplication and costs associated with creating, transferring, and reusing data (OECD, 2015). This improved efficiency may help to reduce the time required for clinical uptake of research findings, currently thought to take approximately 17–20 years in health care research (Balas & Boren, 2000). Third, open science can increase the global impact of research, as it promotes collaboration and faster knowledge transfer (OECD, 2015). Optimizing assessment, diagnosis, and treatment practices in CSD is a global issue and one that stands to benefit from worldwide collaboration. Ultimately, more widespread adoption of open science practices could result in a more robust, transparent, and replicable body of literature, as well as faster clinical translation and implementation.
Factors Affecting Adoption of Open Science Practices
Several factors may affect the uptake of open science practices. These factors operate at the level of the individual scientist, the institution, and, more broadly, the field (Zečević et al., 2021). At the individual level, they include age, seniority, and position, as well as knowledge of and attitudes toward open science (Houtkoop et al., 2018; Toribio-Flórez et al., 2021; Zhu, 2017). Researchers may face financial limitations in paying the fees associated with publishing gold open access or may fear being scooped if they make their data open (Bahlai et al., 2019). In health care research in particular, scientists may be unsure how to share data while maintaining patient privacy and confidentiality (Kostkova et al., 2016). Power hierarchies can also affect early career researchers, as more senior collaborators may not want to adopt open science practices (Bahlai et al., 2019).
At the institutional level, uptake may be affected by the availability of funding and infrastructure to support scientists in adopting open science practices (Zečević et al., 2021). Larger institutions may be better equipped to provide financial support to offset costs associated with opening up research and to provide dedicated support through education and training on open science (Bahlai et al., 2019; Zečević et al., 2021).
At the field level, the movement toward open science can be influenced by structural support and incentives. For instance, tenure committees and funding agencies do not uniformly recognize or incentivize the contribution of nontraditional research outputs, such as open research materials and data, to progress in the field. Additionally, not all journals facilitate open science by accepting preprints for publication or publishing registered reports. A recent study found that, as measured by the TOP (Transparency and Openness Promotion) Factor metric, CSD journals currently provide little encouragement for researchers to participate in open science practices (Schroeder et al., 2022). A small number of journals in CSD, however, have begun to facilitate and promote open science practices. For example, the Journal of Speech, Language, and Hearing Research and Language Learning now accept registered reports (a publication format that involves full peer review of the methods protocol and an in-principle acceptance for publication before data collection begins, regardless of the outcome of the study; Chambers, 2019; Marsden et al., 2018; Storkel & Gallun, 2022), and Ear and Hearing introduced a badge system to reward authors who implement open science practices (Svirsky, 2020). The badge reward system is also available to authors who publish in any of the American Speech-Language-Hearing Association (ASHA) journals. Additionally, the ASHA journals adopted the TOP guidelines, which encourage the use of open science practices, for example, by requiring an explicit Data Availability Statement from authors. These journal initiatives suggest that open science practices are feasible in the field of CSD; however, multifaceted support across all levels may be necessary for widespread adoption.
Study Aims
The goal of this study was to survey researchers in CSD to better understand their knowledge, implementation, and perceived benefits of open science practices, as well as to identify barriers to implementation. For the purposes of this work, we defined four core practices of interest (hereafter referred to as open science practices): preregistration, self-archiving, gold open access, and open data. Specifically, we aimed to
Describe CSD researchers' knowledge and perceived benefit of open science practices;
Describe the frequency of CSD researchers' participation in open science practices;
Report perceived barriers to implementation of open science practices;
Examine the relationship between demographics and knowledge and participation in these open science practices; and
Examine whether perceived knowledge or benefit differs across practices.
Hypotheses
We hypothesized that participants would report “low” knowledge of open science practices, which we defined as a median score of 3 or lower on a 6-point Likert scale;
We hypothesized that participants would report “low” participation in open science practices, which we defined as ≤ 50%;
We explored known barriers in the implementation of open science practices included in this study (preregistration, self-archiving, gold open access, and open data) and their associations with knowledge and participation in open science practices;
We hypothesized that participants with less research experience and in more junior roles would report higher knowledge of open science practices but would not report higher participation;
We hypothesized that the perceived knowledge and benefit of preregistration and gold open access would be higher than other open science practices (open data and self-archiving).
Findings on the field's current state of open science practices have the potential to elucidate directions for growth within the field.
Method
This study was approved by the University of Georgia Institutional Review Board. The study preregistration, data, and analysis code can be found on the Open Science Framework: https://osf.io/2f7xp/. There were no deviations from the preregistered statistical analyses. Thematic analysis of participants' open-text responses to survey questions was added to the analysis plan following preregistration and is reflected in an addendum.
Survey Development
A 57-item online survey was created to examine research scientists' knowledge of, participation in, and barriers to implementing open science practices (available in Supplemental Material S1). To develop the survey, we first performed a literature search to identify studies examining similar constructs in related fields. Our final survey was adapted from Toribio-Flórez et al. (2021), who explored attitudes toward open science among early career researchers in the Max Planck Society. Our survey followed a format similar to the original by assessing the knowledge, attitudes, perceived benefit, and implementation of each open science practice. We modified the Toribio-Flórez et al. (2021) survey to use a 6-point Likert scale for questions asking about degree or extent and adapted the demographic questions for our sample of CSD scholars. Our survey was created and distributed through the Qualtrics platform. It comprised seven sections: (a) informed consent, (b) eligibility screening, (c) demographic information, (d) preregistration, (e) self-archiving, (f) gold open access, and (g) open data.
The demographics section included eight questions. Participants were asked to indicate (a) their research position/job title, (b) what year their PhD was awarded (if applicable), (c) years of experience conducting research in CSD, (d) the Carnegie classification of their current institution (Indiana University Center for Postsecondary Research, 2021), (e) their research area, (f) an approximate number of peer-reviewed manuscripts submitted in the past 3 years, (g) type(s) of regular research engagement, and (h) background in authoring scientific research.
The remaining sections of the survey asked participants about their knowledge, participation, and perceived benefits and barriers of implementing the four core open science practices of interest: preregistration, self-archiving, gold open access, and open data, defined in Table 1. Participants were presented with nine to 12 questions in each section. Response formats included Likert rating scales (1–6; 1 = not at all; 6 = extremely; 2–5 not labeled), slider scales (0%–100%), yes/no, multiple-choice, and forced-choice options. All multiple-choice and forced-choice questions included a free-text "Other" option for writing in alternative responses.
During development, the survey was shared with six external, unaffiliated researchers for pilot assessment. Pilot participants were required to be actively engaged in publishing scientific research, though not necessarily in the field of CSD. These researchers completed the survey and provided feedback regarding the clarity of instructions and questions, the adequacy of response options, potential omissions, and the time required to complete the survey. The survey was revised based on this feedback prior to formal dissemination.
Participation Criteria
Inclusion criteria for survey participation were (a) active engagement in research in the field of CSD and (b) residence in the United States. We defined engagement as participation in any aspect of the research process; this included doctoral students, postdoctoral researchers, research scientists, and faculty members, but not undergraduate students. The survey was restricted to participants based in the United States because open science practices are expected to differ by country. For example, 20% of funders in the United Kingdom mandate that resulting publications be made open access, compared with less than 5% in the United States (Open Science Monitor, 2019). Examining country-specific differences in policy was beyond the scope of this study.
Procedure
Convenience sampling was used to recruit a sample of researchers in the field of CSD. Recruitment occurred through two primary methods. First, we identified all universities in the United States with CSD programs through the ASHA EdFind website (https://find.asha.org/ed; February to March 2021). For each CSD department on EdFind, we manually searched for the department chair's contact information. We sent a recruitment e-mail to all of the identified department chairs (n = 311) with a description of the survey and the inclusion/exclusion criteria and asked them to forward the survey to eligible students, faculty, and staff. A reminder e-mail was sent 2 weeks after the initial contact. Second, we promoted the survey across social media platforms (Instagram, Facebook, and Twitter) and to an ASHA Special Interest Group (SIG 13, Swallowing and Swallowing Disorders [Dysphagia]) of which one author was a member. The survey was open for 6 weeks (July to August 2021).
Statistical Analysis
R (v. 4.0.1) was used for descriptive and inferential statistical analyses (R Core Team, 2018) with the following packages: ordinal (v12.10) for cumulative link regression models, car (v3.0) for likelihood ratio tests, and lsmeans (v2.30) for post hoc comparisons (Christensen, 2019; Fox, 2019; Lenth, 2016). For descriptive analyses within each open science practice, frequencies were used for categorical variables, medians and interquartile ranges for ordinal variables, and means and standard deviations for continuous variables. For inferential analyses, we used an alpha level of .05. Multiple post hoc pairwise comparisons were conducted with Tukey's honestly significant difference test.
To examine the relationship between demographic variables and knowledge of each open science practice (measured on a Likert scale from 1 to 6), cumulative link ordinal regression models were performed with the following independent variables: years of research experience, Carnegie classification of the participant's institution, year the doctoral degree was awarded, and current research position. Binary logistic regression models were performed with prior participation in an open science practice as the dependent variable and the aforementioned demographic predictors.
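As a concrete illustration, the sketch below shows how these two model families can be specified in R with the packages named above. It is a minimal sketch under assumed data, not the authors' analysis code; the data frame `df` and its column names (`knowledge`, `participated`, `years_experience`, `carnegie`, `position`) are hypothetical.

```r
library(ordinal)

# The 1-6 knowledge rating must be an ordered factor for a cumulative link model.
df$knowledge <- factor(df$knowledge, levels = 1:6, ordered = TRUE)

# Cumulative link ordinal regression: knowledge predicted by demographics.
m_knowledge <- clm(knowledge ~ years_experience + carnegie + position, data = df)
summary(m_knowledge)

# Binary logistic regression: prior participation (0/1) predicted by the same set.
m_participation <- glm(participated ~ years_experience + carnegie + position,
                       data = df, family = binomial)
summary(m_participation)
```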
To examine differences in knowledge, behaviors, and perceived benefit between open science practices, we performed separate ordinal regression models. Open science practice was included as a dummy-coded categorical predictor in the full model. A likelihood ratio test then compared models with and without this predictor. If this test was statistically significant, follow-up pairwise comparisons were performed to examine differences between each type of open science practice.
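A hedged sketch of this comparison follows, again with hypothetical names: ratings for all four practices are stacked into one long data frame (`long_df`) with a four-level `practice` factor, the practice effect is tested with a likelihood ratio test, and significant effects are followed up with Tukey-adjusted pairwise comparisons.

```r
library(ordinal)
library(lsmeans)

long_df$rating   <- factor(long_df$rating, levels = 1:6, ordered = TRUE)
long_df$practice <- factor(long_df$practice)  # 4 levels, dummy-coded by default

m_full    <- clm(rating ~ practice, data = long_df)
m_reduced <- clm(rating ~ 1, data = long_df)
anova(m_reduced, m_full)  # likelihood ratio test for the overall practice effect

# Follow-up pairwise comparisons between practices, Tukey-adjusted.
lsmeans(m_full, pairwise ~ practice, adjust = "tukey")
```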
For inferential statistical models with multiple predictors, we used a model-fitting procedure that began with the null model, iteratively added a predictor, and then examined the Akaike information criterion (AIC) to determine model fit. A reduction in AIC of at least 2 was required for a predictor variable to be included (Burnham & Anderson, 2004). To account for multicollinearity, independent variables with a variance inflation factor > 3 or a correlation > .80 were excluded from the model. Residuals from the final full model were examined to ensure that assumptions were satisfied. If statistical models did not converge, we scaled continuous predictors or collapsed categorical predictors, as necessary. Due to unequal distributions, two variables were collapsed: Carnegie classification (R1 vs. all other categories) and research position (PhD student vs. all other categories). Nagelkerke's R² and Tjur's R² served as measures of variance explained for cumulative link ordinal and binary logistic models, respectively. Likelihood ratios (LRs) were used as a measure of effect size for model comparisons. LRs between 2 and 5 were considered "small," between 5 and 10 "moderate," and greater than 10 "large" (McGee, 2002).
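The decision rules in this paragraph can be expressed compactly. The sketch below is an assumed reconstruction with the same hypothetical `df` as above (`years_since_phd` is an illustrative stand-in for the excluded variable), not the preregistered code, which is available at the OSF link above.

```r
library(ordinal)
library(car)

# Forward step: retain a predictor only if it lowers AIC by at least 2.
m_null <- clm(knowledge ~ 1, data = df)
m_exp  <- clm(knowledge ~ years_experience, data = df)
keep_experience <- (AIC(m_null) - AIC(m_exp)) >= 2

# Multicollinearity screens: pairwise correlation among continuous predictors
# (exclude one of the pair if r > .80)...
cor(df$years_experience, df$years_since_phd)

# ...and variance inflation factors (exclude if VIF > 3), here computed on a
# linear-model proxy of the predictor set, since car::vif() targets lm/glm fits.
vif(lm(as.numeric(knowledge) ~ years_experience + carnegie + position, data = df))
```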
Thematic Analysis
Respondents were given the opportunity to elaborate on their responses concerning barriers to implementing open science practices through open-ended text responses on the survey. These responses were analyzed using coding reliability thematic analysis (Braun et al., 2019). The first author (M.E.A.) read through the responses and identified potential themes. The third author (H.L.L.) reviewed the themes and collapsed them into larger overarching themes. After the themes were identified, the first author coded the responses of participants for each open science practice. To establish the reliability of the coding procedure, the fourth author (M.A.K.) coded 25% of the statements for each open science practice (percent agreement = 64%). Consensus coding was conducted between M.E.A. and M.A.K. for statements where discrepancies were found until agreement was reached. One example of a discrepancy was the following statement, coded by M.E.A. as "lack of buy-in" but by M.A.K. as "worry about confidentiality": "…I spend a lot of time and money developing experiments and running subjects. So, to just hand that data over to someone else doesn't seem quite fair." After meeting for consensus coding, we decided that "interest in retaining data for own analyses" better reflected the statement. Additional discrepancies occurred where M.A.K. coded N/A for statements she was unsure about (14% of statements), for example, "I try to publish open access." After the consensus meeting, this statement was coded as "preferred gold." The two authors discussed all points of discrepancy until they were in full agreement on the codes for those statements.
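For readers unfamiliar with the reliability metric: percent agreement is simply the proportion of double-coded statements that received identical codes. A minimal sketch with hypothetical code vectors:

```r
# One element per double-coded statement, for each coder (hypothetical values).
codes_mea <- c("lack of buy-in", "lack of knowledge", "preferred gold", "lack of time")
codes_mak <- c("worry about confidentiality", "lack of knowledge", "preferred gold", "N/A")

percent_agreement <- 100 * mean(codes_mea == codes_mak)
percent_agreement  # discrepant statements then proceed to consensus coding
```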
Results
Demographics
Two hundred sixty-four participants responded to the survey. Two participants reported living outside the United States, and 17 did not indicate whether they lived in the United States. Eleven participants reported that they did not engage in research, and five did not respond to this question. Therefore, 245 participants met the eligibility criteria. Of these, 23 did not complete any survey questions, leaving a final analysis sample of 222 participants.
The majority of participants were doctoral students (38.29%) or assistant professors (23.87%). Most respondents were employed at an R1 (i.e., very high research activity) institution (58.56%; see Figure 1). On average, participants reported 10.33 years of research experience (SD = 8.57) and 8.78 manuscripts submitted in the past 3 years (SD = 36.02). Among the 136 participants with a PhD, the average time since the degree was awarded was 8.95 years (SD = 9.59). The most frequently reported research areas were language learning and/or language disorders, neurogenic communication disorders, cognitive aspects of communication, and swallowing and/or swallowing disorders (see Figure 2).
Figure 1.
Distribution of (A) participants' research positions and (B) Carnegie classifications.
Figure 2.
Distribution of participant research areas in communication sciences and disorders. CSD = communication sciences and disorders.
Preregistration
Knowledge, Participation, and Barriers
As described above, Likert-scale items ranged from 1 (not at all) to 6 (extremely). Participants reported median scores of 3 (IQR = 2) for knowledge of preregistration, 4 (IQR = 2) for desire to learn more, 3 (IQR = 2) for extent of barriers, 4 (IQR = 1) for benefits of preregistration to daily life, 4 (IQR = 2) for benefits to research, and 4 (IQR = 2) for benefits to public society (see Figure 3). The distribution of knowledge, participation, and barriers to preregistration by research position is shown in Supplemental Material S2. Twenty-five percent of respondents reported previously preregistering at least one study. These projects were primarily preregistered on the researcher's personal website (32%) or the Open Science Framework (30%); PROSPERO (10.7%), clinicaltrials.gov (16.1%), AsPredicted (5.4%), and internal institutional review board (IRB) submissions (5.4%) were also used. Among respondents who reported previously preregistering a study, the mean percentage of their studies that were preregistered was 44% (SD = 27%). Thirty-six percent of respondents stated that they planned to preregister a study in the next year. The most common barrier to preregistration was a lack of knowledge on how to preregister a study (see Table 2). This barrier was also frequently reported in the analysis of free-text responses (see Table 3), in addition to a lack of buy-in from others and a perception that preregistration occurs through existing processes (e.g., when submitting an ethics application to the IRB).
Figure 3.
Preregistration: knowledge, barriers, and perceived benefit.
Table 2.
Perceived barriers to preregistration.
| Perceived barrier | Frequency |
|---|---|
| “I don't know how to preregister my work” | 100 |
| “Lack of time is why I don't preregister my studies” | 78 |
| “I feel that it limits my ability to change the study moving forward” | 70 |
| “I have never heard of preregistration” | 65 |
| “Lack of buy-in from colleagues/the field to preregistration” | 65 |
| “I fear that other authors might steal my work” | 45 |
| “No barriers” | 21 |
| “I don't feel like my research needs to be fully open” | 21 |
| “Institutional/university policies are a barrier” | 15 |
| Other (free-text response) | 14 |
Note. Participants were allowed to select more than one answer.
Table 3.
Thematic analysis of barriers to preregistration.
| Theme | Example response | Frequency |
|---|---|---|
| Lack of knowledge and experience | “No orientation and complicated process” | 5 |
| Preregistration considered to occur elsewhere | “clinicaltrials.gov was required in IRB submission, uncertain how to preregister in ‘full’ format” | 4 |
| Lack of buy-in from others | “I conduct basic science research where preregistration is not the norm” | 4 |
| Negative perceptions | “pre-registration appears to completely unnecessary and dangerous to a research project and program” | 3 |
| The impact of the study design and related policies | “I think the type of study is important here - RTCs absolutely should be pre-registered (as required by clinical trials.gov) but for chart reviews or student projects that are limited in scope, pre-registration may just be yet another administrative hurdle with little scientific/societal benefit.” | 2 |
| Lack of time | “…I don't have the time to look into it” | 2 |
Note. A single response can include several themes.
Demographic Predictors of Knowledge and Participation in Preregistration
Due to a high correlation (r = .92) between years since doctoral degree was awarded and years of research experience, we excluded the former variable from all inferential model-fitting procedures. For knowledge of preregistration, research years of experience (AIC Δ = 1.41), Carnegie classification (AIC Δ = 0.45), and research position (AIC Δ = 0.40) did not improve model fit compared to the null model.
For participation in preregistration, research years of experience did not uniquely contribute to the model (AIC Δ = 1.79). However, Carnegie classification (AIC Δ = 3.72) and research position (AIC Δ = 3.12) improved model fit. Results showed that Carnegie classification, χ²(3) = 5.49, p = .139, and research position, χ²(7) = 10.95, p = .141, did not show statistically significant associations with participation in preregistration (R² = .065).
Self-Archiving
Knowledge, participation, and barriers. Participants reported median scores of 4 (IQR = 2) for knowledge of self-archiving, 5 (IQR = 3) for desire to learn more, 3 (IQR = 2) for extent of barriers, 5 (IQR = 3) for benefits of self-archiving to daily life, 5 (IQR = 2) for benefits to research, and 5 (IQR = 2) for benefits to public society (see Figure 4). The distribution of knowledge, participation, and barriers to self-archiving by research position is shown in Supplemental Material S3. Thirty-eight percent of participants reported previously self-archiving, including on a personal website (42.86%), lab or university website (34.52%), institutional repository (16.67%), external server (19.05%), and social networking site (39.29%). Fifty-nine percent of participants reported planning to self-archive in the next year. The most common barriers were journal policies (45.94%) and difficulty interpreting copyright rules (40.99%; see Table 4). Analysis of the free-text responses also highlighted a lack of knowledge and time and a preference for publishing gold open access as barriers to self-archiving (see Table 5).
Figure 4.
Self-archiving: knowledge, barriers, and perceived benefit.
Table 4.
Perceived barriers to self-archiving.
| Perceived barrier | Frequency |
|---|---|
| “Journal policies are a barrier to self-archiving” | 102 |
| “Copyright rules are too difficult to figure out” | 91 |
| “I don't know how to self-archive” | 71 |
| “Publishing in open access journals costs too much” | 71 |
| “Lack of time is why I don't self-archive” | 52 |
| “I have never heard of self-archiving” | 39 |
| “No barriers” | 31 |
| “Institutional/university policies are a barrier” | 24 |
| “Lack of buy-in from colleagues/the field to self-archiving” | 20 |
| “I don't feel like my research needs to be fully open” | 3 |
| Other (free-text response) | 7 |
Note. Participants were allowed to select more than one answer.
Table 5.
Thematic analysis of barriers to self-archiving.
| Theme | Example response | Frequency |
|---|---|---|
| Lack of knowledge | “I fully support complete open access to research. Particularly in our field of CSD, I think this is a moral imperative, but I have little to no knowledge about how this works policy/rights/finance-wise.” | 2 |
| Preferred gold | “I routinely pay open access fees to make work accessible after peer review.” | 2 |
| Lack of time and resources | “I do self-archive now, but it takes time” | 3 |
Note. A single response can include several themes.
Demographic predictors of knowledge and participation in self-archiving. For predictors of knowledge of self-archiving, research years of experience (AIC Δ = 5.55) uniquely contributed to the model, whereas Carnegie classification (AIC Δ = 1.54) and research position (AIC Δ = 0.93) did not improve model fit. Results showed that participants with more research experience reported greater knowledge of self-archiving, LR χ²(1) = 24.14, p < .001, R² = .036.
For participation in self-archiving, Carnegie classification (AIC Δ = 3.77) and research position (AIC Δ = 5.47) uniquely contributed to the model, whereas research years of experience did not improve model fit (AIC Δ = 0.77). Results showed that Carnegie classification, χ²(3) = 1.68, p = .642, and research position, χ²(7) = 8.53, p = .288, did not show statistically significant associations with participation in self-archiving (R² = .007).
Gold Open Access
Knowledge, participation, and barriers. Participants reported median scores of 3 (IQR = 3) for knowledge of gold open access, 4 (IQR = 2) for desire to learn more, 4 (IQR = 2) for extent of barriers, 4 (IQR = 2) for benefits of gold open access to daily life, 4 (IQR = 3) for benefits to research, and 5 (IQR = 3) for benefits to public society (see Figure 5). The distribution of knowledge, participation, and barriers to gold open access by research position is shown in Supplemental Material S4. Twenty-two percent of participants reported previously publishing gold open access; these participants reported that approximately 45% (SD = 26%) of their papers were published gold open access. Eighteen percent of participants reported planning to use gold open access in the next year. The most common barriers were journal cost (59%) and a lack of buy-in from colleagues/the field to pay for publishing (22.97%; see Table 6). The analysis of free-text responses also revealed a lack of interest in publishing gold open access (see Table 7).
Figure 5.
Gold open access: knowledge, barriers, and perceived benefit.
Table 6.
Perceived barriers to gold open access.
| Perceived barrier | Frequency |
|---|---|
| “Publishing in open access journals costs too much” | 131 |
| “I don't feel like I need to pay to publish my work” | 72 |
| “I find that there is a lack of buy-in from colleagues/the field to pay for publishing” | 51 |
| “I don't know how to publish in gold open-access journals” | 49 |
| “I have never heard of gold open access” | 44 |
| “Institutional/university policies are a barrier” | 27 |
| “No barriers” | 19 |
| “I don't feel like my research needs to be fully open” | 4 |
| Other (free-text response) | 16 |
Note. Participants were allowed to select more than one answer.
Table 7.
Thematic analysis of barriers to gold open access.
| Theme | Example response | Frequency |
|---|---|---|
| Lack of interest/negative perceptions | “I don't WANT to pay to publish my work because I shouldn't have to. My work is funded by taxpayer dollars and should be publicly available.” “I find it unethical to have to pay to publish” | 8 |
| Financial barriers | “Cost” | 7 |
| Other | “Lack of institutional support for research in general” “Open access journals generally have requirements for preregistration/open science that I have not followed to the letter from the time that I conceived of the study” | 2 |
Note. A single response can include several themes.
Demographic predictors of knowledge and participation in gold open access. For predictors of knowledge of gold open access, research years of experience (AIC Δ = 4.22), research position (AIC Δ = 2.21), and Carnegie classification (AIC Δ = 2.88) contributed to the model. Results showed main effects of Carnegie classification, LR χ²(3) = 125.20, p < .001, and research position, LR χ²(7) = 65, p < .001, R² = .098, whereas research experience was nonsignificant, LR χ²(1) = 0.78, p = .377. The probability of higher self-reported knowledge of gold open access increased by 4% for each additional year of research experience. Post hoc pairwise comparisons between Carnegie classifications were nonsignificant (p > .05; Supplemental Material S5).
For participation in gold open access, Carnegie classification (AIC Δ = 3.96) and research position (AIC Δ = 6.38) uniquely contributed to the model, whereas research years of experience did not (AIC Δ = 0.20). Results showed that Carnegie classification, χ²(3) = 2.03, p = .567, and research position, χ²(7) = 7.62, p = .368, did not show statistically significant associations with participation in gold open access (R² = .001).
Open Data
Knowledge, perceived benefit, participation, and barriers. Participants reported median scores of 4 (IQR = 1) for knowledge of sharing open data, 4 (IQR = 2) for desire to learn more, 3 (IQR = 2) for extent of barriers, 4 (IQR = 2) for benefits of sharing open data to daily life, 5 (IQR = 3) for benefits to research, and 4 (IQR = 3) for benefits to public society (see Figure 6). The distribution of knowledge, participation, and barriers to sharing open data by research position is shown in Supplemental Material S6. Twenty-six percent of participants reported previously sharing open data, and 37% reported planning to share open data in the next year. The most common barriers were a lack of knowledge on how to share open data (34.68%) and concern for the confidentiality of participants (31.53%; see Table 8). Analysis of the free-text responses also revealed confidentiality concerns as a key barrier (see Table 9).
Figure 6.
Open data: knowledge, barriers, and perceived benefit.
Table 8.
Perceived barriers to sharing open data.
| Perceived barrier | Frequency |
|---|---|
| “I don't know how to share open data” | 77 |
| “I fear for the confidentiality of my participants (can be identified)” | 70 |
| “Lack of time is why I don't share open data” | 61 |
| “I fear for my copyright over the data I'm sharing” | 61 |
| “Lack of buy-in from colleagues/the field to sharing open data” | 44 |
| “Institutional/university policies are a barrier” | 36 |
| “No barriers” | 31 |
| “I have never heard of open data” | 26 |
| “I don't feel like my research needs to be fully open” | 19 |
| Other (free-text response) | 13 |
Note. Participants were allowed to select more than one answer.
Table 9.
Thematic analysis of barriers to open data.
| Theme | Example response | Frequency |
|---|---|---|
| Concerns about confidentiality | “I work with vulnerable populations, Even though I can fully de-identify my data, I want my families to feel protected.” | 3 |
| Lack of time/resources | “I do try to share code now, but preparing it to be publicly available takes time…” “Lack of a central source for depositing data” | 3 |
| Worry about perceptions/judgment | “I also worry about what others will think of my code.” “Not sure anyone would be interested in or know how to read my data” | 2 |
| IRB/institutional policies | “It is very difficult to get IRB approval to make data (especially audio recordings of speech) publicly available. This greatly lengthens the amount of time it takes to get IRB approval, and there is a ton of pushback.” | 3 |
| Interest in retaining data for own analyses | “I spend a lot of time and money developing experiments and running subjects. So, to just hand that data over to someone else doesn't seem quite fair.” “We are not yet done working on the dataset” | 2 |
| Other | “I need to learn more about how to do it properly” “Getting scooped.” | 2 |
Note. A single response can include several themes.
Demographic predictors of knowledge and participation in sharing open data. For predictors of knowledge of open data, research years of experience (AIC Δ = 3.32), Carnegie classification (AIC Δ = 2.12), and research position (AIC Δ = 7.27) contributed to the model. Results showed that Carnegie classification, LR χ²(3) = 164.63, p < .001, and research position, LR χ²(7) = 80.02, p < .001, were significantly associated with knowledge of sharing open data; however, years of research experience, LR χ²(1) = 0.29, p = .588, was nonsignificant (R² = .097). Post hoc pairwise comparisons examining differences between Carnegie classifications showed that participants from R1 (very high research activity) institutions reported higher knowledge compared to doctoral/professional institutions (p = .013). All other comparisons of Carnegie classifications were nonsignificant (p > .05; Supplemental Material S7). All pairwise comparisons between research positions were nonsignificant (p > .05; Supplemental Material S8).
For participation in sharing open data, Carnegie classification (AIC Δ = 2.84) and research position (AIC Δ = 3.84) uniquely contributed to the model, whereas research years of experience did not improve model fit (AIC Δ = 1.63). Results showed that Carnegie classification, χ²(3) = 2.18, p = .536, and research position, χ²(7) = 10.16, p = .180, did not show statistically significant associations with participation in sharing open data (R² = .007).
Differences Between Open Science Practices
There was a significant main effect of open science practices on perceived knowledge, χ²(3) = 22.40, p < .001, R² = .027 (see Figure 7). Specifically, knowledge of self-archiving (p = .013), open data (p = .001), and gold open access (p = .005) was higher than knowledge of preregistration. All other pairwise comparisons between open science practices on knowledge were nonsignificant (p > .05; Supplemental Material S9).
Figure 7.
Comparison of knowledge and perceived benefit between open science practices.
There was a significant main effect of open science practices on perceived benefit to the daily life of a researcher, χ²(3) = 34.30, p < .001, R² = .039. Specifically, self-archiving was viewed as more beneficial to the daily life of a researcher compared to preregistration (p < .001) and gold open access (p < .001). Open data were also rated as more beneficial compared with preregistration (p = .01). All other pairwise comparisons were nonsignificant (p > .05; Supplemental Material S10).
There was a significant main effect of open science practices on perceived benefit to one's research field, χ²(3) = 31.48, p < .001, R² = .036. Specifically, self-archiving was viewed as more beneficial to research fields compared to preregistration (p < .001) and gold open access (p < .001). Open data were also rated as more beneficial compared to preregistration (p = .004). All other pairwise comparisons were nonsignificant (p > .05; Supplemental Material S11).
There was a significant main effect of open science practices on perceived benefit to public society, χ²(3) = 31.86, p < .001, R² = .036. Specifically, self-archiving was viewed as more beneficial to public society compared with preregistration (p < .001), gold open access (p = .014), and open data (p < .001). All other pairwise comparisons were nonsignificant (p > .05; Supplemental Material S12).
Discussion
There has been a growing movement to promote open science practices to improve the transparency, openness, and replicability of research. In the social sciences, the adoption of these practices has rapidly increased over the past decade, potentially signaling a shift in cultural and normative scientific values (Christensen et al., 2020). Despite this growing implementation in adjacent disciplines, it remains unclear whether researchers in the field of CSD are familiar with these practices and implement them in their own research. This study had five aims. Specifically, we sought to (a) describe CSD researchers' knowledge and perceived benefit of open science practices, (b) describe the frequency of CSD researchers' participation in open science practices, (c) report perceived barriers to implementation of open science practices, (d) examine the relationship between demographics and knowledge and participation in these open science practices, and (e) examine whether perceived knowledge or benefit differs across practices. Across all open science practice areas, we hypothesized overall low knowledge and low participation, higher knowledge in more junior scientists, and the highest perceived knowledge and benefit for preregistration and gold open access. A discussion of these questions with respect to each open science practice is provided below.
Overall, our findings demonstrate that CSD researchers report low knowledge related to preregistration and gold open access, as well as low participation across all core open science practices. However, many reported a strong desire to learn more and engage in these practices in the future. The key barriers that may impede the adoption of open science practices in the field of CSD include lack of knowledge, time, and costs associated with implementation. Collectively, these findings suggest that initiatives to increase knowledge and reduce barriers in the implementation of open science practices are desired by the scientific community.
Knowledge, Participation, and Barriers
Preregistration
Preregistration requires researchers to specify hypotheses, methods, and analyses prior to data collection and/or analysis. This time-stamped document is then made available to readers so they can identify discrepancies between the original plan and the published study. With regard to our research questions, we predicted low knowledge of and low participation in preregistration, as well as higher perceived knowledge and benefit than for self-archiving or open data. Although preregistration has become increasingly popular in the social sciences (Christensen et al., 2020), CSD researchers reported low knowledge of this open science practice. Furthermore, participation was rare, with only a quarter of respondents having preregistered at least one of their studies. These findings suggest that preregistration is not commonly used in the field of CSD, which may be due, in part, to a gap in knowledge and familiarity with preregistering a research study. Indeed, nearly half of participants reported that they did not know how to preregister, whereas those with a history of preregistering reported doing so for nearly half of their studies. Therefore, initiatives to promote preregistration will require training on the logistics of preregistration, which may, in turn, promote participation.
Our qualitative findings highlight several common misconceptions of preregistration. Many participants reported concerns that they would be unable to change aspects of their study once it was preregistered. Preregistration facilitates transparency by allowing readers to understand when decisions were made or changed and how that compares to the authors' published work. If deviations are required, then the preregistration is updated and revisions are disclosed in the published manuscript (Claesen et al., 2019). In these situations, some may view deviating from a preregistered protocol as indicative of poor quality. However, deviations are common and may be expected if data collection or analysis issues arise (Claesen et al., 2019). A preregistration merely ensures that these deviations are recorded and made publicly available. Finally, there was confusion related to the definition of a preregistration, with many participants reporting that an ethics application to an institutional review board or a protocol uploaded to a personal website satisfied the requirements. Instead, preregistrations are most commonly submitted to a third-party registry, such as the Open Science Framework, which provides a public and shareable time-stamped record of the protocol and subsequent revisions.
Advocates of preregistration contend that it affords unique benefits, permitting practices that would otherwise be considered questionable without a time-stamped document, such as distinguishing confirmatory from exploratory aims, specifying optional stopping rules, and using one-tailed statistical tests to optimize power (Rubin, 2020). Findings from this study suggest that CSD researchers acknowledge its potential benefits not only to their individual work and research field but also to public society as a whole. However, preregistration is not a panacea for issues related to reproducibility and questionable research practices, and it is arguably the most controversial open science practice. Opponents have posited that preregistration alone is ill-equipped to resolve issues related to statistical inference (i.e., p-hacking; Navarro, 2020) and that selective reporting may impede its ability to adequately promote transparency (Claesen et al., 2019). Ultimately, preregistration should be viewed as one tool that, when used in combination with other practices, may enhance the transparency of one's research process. Our findings suggest that while this tool is currently underutilized in the field of CSD, researchers acknowledge its potential benefits and report a desire to learn more.
Self-Archiving
Self-archiving, also known as green open access, is an open access route in which the author shares an accepted version of the peer-reviewed manuscript online or deposits it in a repository such as PubMed Central or the Open Science Framework. For this variable, we predicted low knowledge and low participation. Contrary to our hypothesis, participants in our sample reported adequate knowledge of self-archiving, with greater knowledge among participants with more research experience. Although only 38% of our sample reported actively self-archiving their research, more than half indicated plans to self-archive in the future, suggesting increasing awareness of and interest in this practice. It should be noted that a volunteer-based group aimed at increasing awareness of self-archiving advertised this study through social media. Researchers already following this group (and potentially already curious about open science practices) likely comprised a substantial portion of our participants, which may explain these findings.
Despite participants reporting a high desire to self-archive in the future, several barriers to implementation were noted. Difficulty interpreting copyright rules and journal policies was the most frequently cited barrier. Guidelines on self-archiving can be found on publisher or journal websites, as well as online resources, such as Sherpa Romeo (https://v2.sherpa.ac.uk/romeo/) and ShareYourPaper (https://shareyourpaper.org/). Additional barriers included a lack of knowledge and time. This highlights the potential benefits of conducting training for researchers on self-archiving, particularly for early career researchers. Lack of time also underscores the need for institutional support and dedicated staff to assist in this process. Finally, CSD researchers reported a preference for publishing gold open access over self-archiving. This may reflect the relative ease of paying journals to make work freely available; however, this convenience has a substantial monetary cost that prohibits most researchers from affording gold open access.
The possible benefits of self-archiving include increased access to research findings by clinicians, policymakers, and science communicators. A significant barrier clinicians face in staying up to date with research findings and implementing evidence-based practice is the lack of access to research articles (Thome et al., 2020). It can take 17 years for research findings to be translated to clinical practice (Balas & Boren, 2000); thus, increasing access to research can help decrease the research-to-practice gap. Other important stakeholders, such as policymakers and science communicators, need access to research articles in order to make informed policy decisions and share findings with the general public. Scientists can also benefit from self-archiving through increased visibility of their work and cost savings in publishing fees. Self-archived studies have been linked to a citation benefit, receiving 30% more citations than research made open access by paying a publisher fee (Piwowar et al., 2018). Moreover, self-archiving is free, which frees up funding that might otherwise have been used to pay the article processing charges associated with publishing gold open access.
It is also important to note that many scientists may confuse gold open access availability with the PubMed Central (PMC) Open Access Subset. Per licensing and copyright terms, many manuscripts are made available through PMC when the research was supported by National Institutes of Health (NIH) funding. Many researchers also use NIH funds to pay for gold open access through the publisher, which means that both the published version (through the publisher) and the accepted version (through PMC) are openly available, further complicating the differentiation between open versions.
Gold Open Access
Gold open access is the open access route in which a payment to the publisher makes the manuscript available for anyone to read without a subscription. We predicted low knowledge of and low participation in gold open access publishing, but higher perceived knowledge and participation than for self-archiving and data sharing. Participants in our sample reported a low level of knowledge of gold open access publishing, with greater knowledge among participants with more research experience. Our sample also indicated low participation in gold open access publishing, with only 22% indicating that they had published gold open access in the past. These findings suggest that the gold open access route is not commonly used in CSD. This finding is contrary to Toribio-Flórez et al. (2021), whose sample of early career researchers from the Max Planck Society showed higher knowledge of gold open access publishing. Thus, researchers in CSD may have lower knowledge of open access publishing; however, the trend of low implementation was also observed in that study, where only 31% of the sample indicated previously publishing open access papers.
The most commonly cited barriers to gold open access in our study were cost and a lack of buy-in to pay for publishing. Given that more than half of our sample were early career researchers, this likely reflects a lack of funds to cover these costs, which doctoral students rarely have and which are often not built into research grants.
The thematic analysis of open-text responses revealed negative perceptions associated with "paying for publishing" and gold open access. Several researchers indicated that they believed taxpayer money should not be spent on open access publishing, and others mentioned that it is unethical to have to pay to publish open access. Journals such as Nature contend that article processing charges are necessary if they are to provide these articles without a paywall (Else, 2020). A possible solution for researchers who are either impeded by publication costs or who do not want to support the pay-to-publish open access model is to self-archive their publications. However, as previously mentioned, lack of time and knowledge can be barriers to self-archiving. Therefore, institutions can support researchers by connecting them with librarians who are knowledgeable about journal policies and can support CSD scientists in their endeavors to make research accessible to everyone.
Open Data
Open data refer to the public sharing of de-identified research data and/or other resources created for the collection and analysis of these data, typically through online repositories. In this study, we predicted low knowledge of and low participation in open data sharing among CSD scientists. Our participants indicated a higher degree of knowledge of open data sharing than predicted, with greater knowledge among participants from R1 (very high research activity) institutions compared with participants from doctoral/professional institutions. However, only a quarter of participants indicated actively sharing data. Participants reported a high desire to learn more about this practice, and more than a third of our sample reported plans to share data over the next year. Similar trends have been reported in related fields (Christensen et al., 2020; Johnson et al., 2020; Toribio-Flórez et al., 2021). These findings indicate a generally positive outlook on open data sharing; however, barriers continue to prevent implementation.
The barriers to open data sharing most commonly reported by our sample of CSD scientists were a lack of knowledge and concern for participant confidentiality. Several online guides describe how to share data openly, including the ASHA Journals Academy (ASHA, 2021) and the Inter-university Consortium for Political and Social Research (ICPSR, 2021), as well as several tutorials on the topic (Klein et al., 2018; Martone et al., 2018). Other commonly reported barriers, such as lack of time and lack of buy-in from colleagues, may stem from a lack of data sharing knowledge at the field level. Studies suggest that the long-term effects of data sharing include improved time and cost efficiency in science and knowledge transfer (Balas & Boren, 2000; Freedman et al., 2015), as well as benefits for individual researchers, such as increased citation rates (Piwowar & Vision, 2013).
Reidentification of open data is a legitimate confidentiality concern for CSD and other human subjects researchers. Incorporating clear data retention and sharing clauses into IRB submissions and consent forms can reduce confidentiality concerns (Meyer, 2018), and example IRB and consent templates for data retention and sharing are available online (ICPSR, 2021). Following the safe-harbor provision of the Health Insurance Portability and Accountability Act Privacy Rule (Department of Health and Human Services, 2002), researchers can anonymize identifiers through "masking" (i.e., replacing original identifiable variables with random values) or by reducing the resolution of variables (e.g., presenting age ranges instead of exact ages) to decrease the chance of reidentification (Barth-Jones, 2012; Meyer, 2018). Synthetically generated data that maintain the statistical distribution of an original data set are also an increasingly popular alternative in health care and biobehavioral research (Chen et al., 2021; Quintana, 2020). Ultimately, scientists hold a professional and ethical responsibility to follow the FAIR Data Principles, keeping open data Findable, Accessible, Interoperable, and Reusable (Wilkinson et al., 2016).
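To make the masking and resolution-reduction steps concrete, the sketch below applies both to a small, simulated participant table in Python using pandas. The column names, study codes, and age bins are hypothetical illustrations, not a compliance protocol; actual de-identification decisions should be made with IRB guidance.

    # Illustrative de-identification sketch on simulated data (not a compliance tool).
    import numpy as np
    import pandas as pd

    df = pd.DataFrame({
        'participant_name': ['A. Smith', 'B. Jones', 'C. Lee'],  # direct identifier
        'age': [24, 37, 61],
        'severity_score': [12, 19, 8],
    })

    # Masking: replace the direct identifier with random study codes.
    rng = np.random.default_rng(seed=2023)
    codes = rng.permutation(len(df))
    df['participant_id'] = [f'P{c:03d}' for c in codes]
    df = df.drop(columns=['participant_name'])

    # Resolution reduction: report age ranges rather than exact ages.
    df['age_range'] = pd.cut(df['age'], bins=[18, 30, 45, 65],
                             labels=['18-30', '31-45', '46-65'])
    df = df.drop(columns=['age'])

    print(df)  # shareable: study code, severity score, age range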
Demographic Predictors of Knowledge and Participation
Across all four practices studied, Carnegie classification and years of research experience were the strongest predictors of knowledge and participation. For both preregistration and self-archiving, higher Carnegie classification and more senior research position predicted a higher level of participation but not a higher level of knowledge. For gold open access publishing, participants with more research experience and those at institutions with a higher Carnegie classification reported greater knowledge. Carnegie classification was also significantly associated with knowledge of open data sharing. These findings potentially indicate greater support for open science practices in more research-intensive settings, highlighting a critical need for university- and department-level support (alongside federal- and community-level initiatives) to continue building knowledge of data sharing and other open science practices.
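Associations between Likert-type outcomes and demographic predictors such as these are typically estimated with ordinal (cumulative link) regression, consistent with the R ordinal package cited in our references (Christensen, 2019). As a hedged illustration, the sketch below fits an analogous proportional-odds model in Python's statsmodels, with simulated data standing in for the survey responses; the variable names and coding are assumptions.

    # Hedged sketch: proportional-odds (cumulative link) model on simulated data.
    import numpy as np
    import pandas as pd
    from statsmodels.miscmodels.ordinal_model import OrderedModel

    rng = np.random.default_rng(1)
    n = 200
    years = rng.integers(0, 30, size=n)  # hypothetical years of research experience
    r1 = rng.integers(0, 2, size=n)      # hypothetical indicator: 1 = R1 institution
    # Latent propensity: knowledge rises with experience and R1 status.
    latent = 0.08 * years + 0.9 * r1 + rng.logistic(size=n)
    knowledge = pd.cut(latent, bins=[-np.inf, 0.5, 1.5, 2.5, np.inf],
                       labels=[1, 2, 3, 4]).astype(int)  # 4-point knowledge rating

    X = pd.DataFrame({'years': years, 'r1': r1})
    model = OrderedModel(knowledge, X, distr='logit')
    result = model.fit(method='bfgs', disp=False)
    print(result.summary())  # slope estimates plus threshold cutpoints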
Differences Between Practices
Because open science is an umbrella term for a variety of practices, researchers can decide to participate in one or more practices when deemed appropriate. However, it has remained unclear whether CSD scholars differ in their knowledge and perceived benefit of these practices. Our results showed that participants reported the least knowledge of preregistration relative to the other practices, highlighting the need for education on preregistration and its potential utility in the field of CSD. Additionally, we examined differences in the perceived benefit of the practices to researchers' daily lives, their research field, and public society. Self-archiving showed a higher perceived benefit in these domains than the other practices, suggesting that it may be viewed as the most beneficial. This may be due, in part, to its relative ease of implementation and immediate deliverable (i.e., readers can access a free and legal version of the published manuscript). These results should be interpreted with caution, however, as they may be biased by our sampling methods, as well as by recent initiatives to promote self-archiving in CSD. Additionally, our results suggest that open data were perceived as more beneficial than preregistration, highlighting the perceived benefits of making data, code, and materials accessible.
Limitations and Future Directions
There are several limitations that warrant discussion. Although we attempted a comprehensive sampling approach to capture various research positions and experiences, our sample was relatively small and skewed toward younger researchers and more research-intensive institutions. The small sample may reflect the fact that a primary recruitment method relied on department chairs forwarding the study information to the wider population. This method made it difficult to quantify how much of the target population was reached and, as a result, an accurate response rate could not be calculated. Low response rates, however, have also been noted in prior surveys of open science practices (Houtkoop et al., 2018; Paret et al., 2022; Schmidt et al., 2016; Tenopir et al., 2011). Future work may supplement this sampling method through other means, for example, by extracting the contact information of authors publishing in CSD journals.
The sampling bias toward younger researchers and more research-intensive institutions may have occurred because (a) researchers interested in open science were more likely to participate in this study and (b) the survey was promoted through social media platforms associated with open science. Given this bias, our results likely overestimate the knowledge and use of open science practices. It is also important to note that many researchers may not be in favor of open science practices (and thus may have been unlikely to participate); additional research directly studying these scientists' perspectives on the disadvantages of individual open science practices is needed to obtain a more comprehensive view of this topic. Ultimately, future work should use more comprehensive recruitment methods to obtain a representative sample of CSD researchers.
A second limitation is that, due to time and resource constraints, we limited our survey to four core open science practices. Our survey did not incorporate other common practices such as registered reports and replication studies. Future studies surveying CSD scientists' knowledge, participation, and perspectives on additional open science practices are warranted.
Finally, results from the thematic analysis should be interpreted with caution, as the number of participants who answered the open-text questions was minimal. More comprehensive work on this topic is underway using qualitative methodology to interview scientists in CSD and better understand their perceptions of open science practices and the barriers to implementation. Future studies will also investigate doctoral students' explicit training in open science practices in CSD. Ultimately, methodological reforms should be held to the same rigor and standards as empirical research (Devezer et al., 2021). The relative benefits of these open science practices for transparency, openness, and replicability in the field of CSD will therefore require future study, as well as ongoing discussion of optimal and nuanced evaluations of their use.
Conclusions
The results of this study show that researchers perceive substantial benefits of open science practices for their daily research, their research field, and public society, despite their currently limited knowledge of and participation in these practices. These findings highlight a critical need for university, departmental, and community support to improve scientists' access to and knowledge of open science practices. Although outside the scope of this study, approaches to increasing adoption have been discussed in adjacent fields (Gagliardi et al., 2014) and include engagement within and across scholarly communities (Armeni et al., 2021), increased departmental support and incentives for implementing open science practices, badges recognizing open practices in academic journals (Kidwell et al., 2016), and registered reports (Munafò et al., 2017). Adoption of open science practices can be cumulative, as each incrementally added tool affords additional transparency and openness. Ultimately, increased implementation of these practices may improve the rigor and reproducibility of science in the field of CSD.
Data Availability Statement
The preregistration, data, and analysis script that support the findings of this study are publicly available in the Open Science Framework at https://osf.io/2f7xp/.
Acknowledgments
Research reported in this publication was supported by the National Institute of Child Health & Human Development (T32HD007489, appointee: Long) and the University of Wisconsin–Madison. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. Mary Alice Keller discloses that this research was supported in part by HCA Healthcare Graduate Medical Education. The views expressed in these publications represent those of the authors and do not necessarily represent the official views of HCA Healthcare or any of its affiliated entities. The authors would like to thank their pilot reviewers for providing insightful feedback into the design and content of the survey. The authors would also like to thank Meredith Harold, Merry Spratford, and Danika Pfeiffer for their support in developing this study and providing feedback during manuscript preparation. Finally, the authors would like to thank Jennifer Brown for her feedback and support during the institutional review board preparation and submission process.
References
- American Speech-Language-Hearing Association. (2021). Research data standards. ASHA Journals Academy. https://academy.pubs.asha.org/asha-journals-author-resource-center/manuscript-preparation/guidelines-for-reporting-your-research/research-data-standards/
- Armeni, K., Brinkman, L., Carlsson, R., Eerland, A., Fijten, R., Fondberg, R., Heininga, V. E., Heunis, S., Koh, W. Q., Masselink, M., Moran, N., Baoill, A. Ó., Sarafoglou, A., Schettino, A., Schwamm, H., Sjoerds, Z., Teperek, M., van den Akker, O. R., van Veer, A., & Zurita-Milla, R. (2021). Towards wide-scale adoption of open science practices: The role of open science communities. Science and Public Policy, 48(5), 605–611. https://doi.org/10.1093/scipol/scab039
- Baggerly, K. A., & Coombes, K. R. (2009). Deriving chemosensitivity from cell lines: Forensic bioinformatics and reproducible research in high-throughput biology. The Annals of Applied Statistics, 3(4), 1309–1334. https://doi.org/10.1214/09-AOAS291
- Bahlai, C., Bartlett, L. J., Burgio, K. R., Fournier, A. M. V., Keiser, C. N., Poisot, T., & Stack Whitney, K. (2019). Open science isn't always open to all scientists. American Scientist. https://www.americanscientist.org/article/open-science-isnt-always-open-to-all-scientists. https://doi.org/10.1511/2019.107.2.78
- Balas, E. A., & Boren, S. A. (2000). Managing clinical knowledge for health care improvement. Yearbook of Medical Informatics, 9(1), 65–70. https://doi.org/10.1055/s-0038-1637943
- Barth-Jones, D. (2012). The debate over "re-identification" of health information: What do we risk? Health Affairs Forefront. https://doi.org/10.1377/forefront.20120810.021952
- Begley, C. G., & Ellis, L. M. (2012). Raise standards for preclinical cancer research. Nature, 483, 531–533. https://doi.org/10.1038/483531a
- Braun, V., Clarke, V., Hayfield, N., & Terry, G. (2019). Thematic analysis. In P. Liamputtong (Ed.), Handbook of research methods in health social sciences (pp. 843–860). Springer Singapore. https://doi.org/10.1007/978-981-10-5251-4_103
- Burnham, K. P., & Anderson, D. R. (2004). Multimodel inference: Understanding AIC and BIC in model selection. Sociological Methods & Research, 33(2), 261–304. https://doi.org/10.1177/0049124104268644
- Chambers, C. (2019). What's next for registered reports? Nature, 573(7773), 187–189. https://doi.org/10.1038/d41586-019-02674-6
- Chen, R. J., Lu, M. Y., Chen, T. Y., Williamson, D. F. K., & Mahmood, F. (2021). Synthetic data in machine learning for medicine and healthcare. Nature Biomedical Engineering, 5(6), 493–497. https://doi.org/10.1038/s41551-021-00751-8
- Christensen, R. H. B. (2019). Cumulative link models for ordinal regression with the R package ordinal.
- Christensen, G., Wang, Z., Levy Paluck, E., Swanson, N., Birke, D., Miguel, E., & Littman, R. (2020). Open science practices are on the rise: The State of Social Science (3S) Survey. https://doi.org/10.31222/osf.io/5rksu
- Claesen, A., Gomes, S. L. B. T., Tuerlinckx, F., & Vanpaemel, W. (2019). Preregistration: Comparing dream to reality [Preprint]. PsyArXiv. https://doi.org/10.31234/osf.io/d8wex
- Department of Health and Human Services. (2002). Standards for privacy of individually identifiable health information. https://www.federalregister.gov/documents/2002/08/14/02-20554/standards-for-privacy-of-individually-identifiable-health-information
- Devezer, B., Navarro, D. J., Vandekerckhove, J., & Ozge Buzbas, E. (2021). The case for formal methodology in scientific reform. Royal Society Open Science, 8(3), 200805. https://doi.org/10.1098/rsos.200805
- Else, H. (2020). Nature journals reveal terms of landmark open-access option. Nature, 588(7836), 19–20. https://doi.org/10.1038/d41586-020-03324-y
- Fox, J., & Weisberg, S. (2019). An R companion to applied regression (3rd ed.). SAGE Publications. https://socialsciences.mcmaster.ca/jfox/Books/Companion/
- Fraser, H., Parker, T., Nakagawa, S., Barnett, A., & Fidler, F. (2018). Questionable research practices in ecology and evolution. PLOS ONE, 13(7), Article e0200303. https://doi.org/10.1371/journal.pone.0200303
- Freedman, L. P., Cockburn, I. M., & Simcoe, T. S. (2015). The economics of reproducibility in preclinical research. PLOS Biology, 13(6), Article e1002165. https://doi.org/10.1371/journal.pbio.1002165
- Gagliardi, D., Cox, D., & Li, Y. (2014). What are the factors driving and hindering the adoption of open science? An exploratory study. https://www.research.manchester.ac.uk/portal/en/publications/what-are-the-factors-driving-and-hindering-the-adoption-of-open-science-an-exploratory-study(9fe4bae2-66e8-4f3b-b545-053c3d4059a5).html
- Hardwicke, T. E., Bohn, M., MacDonald, K., Hembacher, E., Nuijten, M. B., Peloquin, N., Yoon, E. J., & Frank, M. C. (2020). Analytic reproducibility in articles receiving open data badges at the journal Psychological Science: An observational study. Royal Society Open Science, 8(9). https://doi.org/10.31222/osf.io/h35wt
- Houtkoop, B. L., Chambers, C., Macleod, M., Bishop, D. V., Nichols, T. E., & Wagenmakers, E.-J. (2018). Data sharing in psychology: A survey on barriers and preconditions. Advances in Methods and Practices in Psychological Science, 1(1), 70–85. https://doi.org/10.1177/2515245917751886
- Hubbard, R., Vetter, D. E., & Little, E. L. (1998). Replication in strategic management: Scientific testing for validity, generalizability, and usefulness. Strategic Management Journal, 19(3), 243–254. https://doi.org/10.1002/(SICI)1097-0266(199803)19:3<243::AID-SMJ951>3.0.CO;2-0
- Indiana University Center for Postsecondary Research. (2021). The Carnegie Classification of Institutions of Higher Education.
- Inter-university Consortium for Political and Social Research. (2021). Guide to social science data preparation and archiving: Best practice throughout the data life cycle (6th ed.).
- Ioannidis, J. P. A., Munafò, M. R., Fusar-Poli, P., Nosek, B. A., & David, S. P. (2014). Publication and other reporting biases in cognitive sciences: Detection, prevalence, and prevention. Trends in Cognitive Sciences, 18(5), 235–241. https://doi.org/10.1016/j.tics.2014.02.010
- John, L. K., Loewenstein, G., & Prelec, D. (2012). Measuring the prevalence of questionable research practices with incentives for truth telling. Psychological Science, 23(5), 524–532. https://doi.org/10.1177/0956797611430953
- Johnson, A. L., Torgerson, T., Skinner, M., Hamilton, T., Tritz, D., & Vassar, M. (2020). An assessment of transparency and reproducibility-related research practices in otolaryngology. The Laryngoscope, 130(8), 1894–1901. https://doi.org/10.1002/lary.28322
- Kerr, N. L. (1998). HARKing: Hypothesizing after the results are known. Personality and Social Psychology Review, 2(3), 196–217. https://doi.org/10.1207/s15327957pspr0203_4
- Kidwell, M. C., Lazarević, L. B., Baranski, E., Hardwicke, T. E., Piechowski, S., Falkenberg, L. S., Kennett, C., Slowik, A., Sonnleitner, C., Hess-Holden, C., Errington, T. M., Fiedler, S., & Nosek, B. A. (2016). Badges to acknowledge open practices: A simple, low-cost, effective method for increasing transparency. PLOS Biology, 14(5), Article e1002456. https://doi.org/10.1371/journal.pbio.1002456
- Klein, O., Hardwicke, T. E., Aust, F., Breuer, J., Danielsson, H., Hofelich Mohr, A., IJzerman, H., Nilsonne, G., Vanpaemel, W., & Frank, M. C. (2018). A practical guide for transparency in psychological science. Collabra: Psychology, 4(1), 1–15. https://doi.org/10.1525/collabra.158
- Kostkova, P., Brewer, H., de Lusignan, S., Fottrell, E., Goldacre, B., Hart, G., Koczan, P., Knight, P., Marsolier, C., McKendry, R. A., Ross, E., Sasse, A., Sullivan, R., Chaytor, S., Stevenson, O., Velho, R., & Tooke, J. (2016). Who owns the data? Open data for healthcare. Frontiers in Public Health, 4. https://doi.org/10.3389/fpubh.2016.00007
- Lenth, R. V. (2016). Least-squares means: The R package lsmeans. Journal of Statistical Software, 69(1), 1–33. https://doi.org/10.18637/jss.v069.i01
- Makel, M. C., & Plucker, J. A. (2014). Facts are more important than novelty. Educational Researcher, 43(6), 304–316. https://doi.org/10.3102/0013189X14545513
- Marsden, E., Morgan-Short, K., Trofimovich, P., & Ellis, N. C. (2018). Introducing registered reports at Language Learning: Promoting transparency, replication, and a synthetic ethic in the language sciences. Wiley Online Library.
- Martone, M. E., Garcia-Castro, A., & VandenBos, G. R. (2018). Data sharing in psychology. The American Psychologist, 73(2), 111–125. https://doi.org/10.1037/amp0000242
- McGee, S. (2002). Simplifying likelihood ratios. Journal of General Internal Medicine, 17(8), 647–650. https://doi.org/10.1046/j.1525-1497.2002.10750.x
- Meyer, M. N. (2018). Practical tips for ethical data sharing. Advances in Methods and Practices in Psychological Science, 1(1), 131–144. https://doi.org/10.1177/2515245917747656
- Munafò, M. R., Nosek, B. A., Bishop, D. V. M., Button, K. S., Chambers, C. D., Percie du Sert, N., Simonsohn, U., Wagenmakers, E.-J., Ware, J. J., & Ioannidis, J. P. A. (2017). A manifesto for reproducible science. Nature Human Behaviour, 1(1), 1–9. https://doi.org/10.1038/s41562-016-0021
- Navarro, D. (2020). Paths in strange spaces: A comment on preregistration [Preprint]. PsyArXiv. https://doi.org/10.31234/osf.io/wxn58
- Nelson, L. D., Simmons, J., & Simonsohn, U. (2018). Psychology's renaissance. Annual Review of Psychology, 69(1), 511–534. https://doi.org/10.1146/annurev-psych-122216-011836
- OECD. (2015). Making open science a reality (OECD Science, Technology and Industry Policy Papers No. 25). https://doi.org/10.1787/5jrs2f963zs1-en
- Open Science Collaboration. (2015). Estimating the reproducibility of psychological science. Science, 349(6251). https://www.science.org/doi/abs/10.1126/science.aac4716
- Open Science Monitor. (2019). Study on open science: Monitoring trends and drivers. https://ec.europa.eu/info/sites/default/files/research_and_innovation/knowledge_publications_tools_and_data/documents/ec_rtd_open_science_monitor_final-report.pdf
- Paret, C., Unverhau, N., Feingold, F., Poldrack, R. A., Stirner, M., Schmahl, C., & Sicorello, M. (2022). Survey on open science practices in functional neuroimaging. NeuroImage, 257, 119306. https://doi.org/10.1016/j.neuroimage.2022.119306
- Piwowar, H. A., Priem, J., Larivière, V., Alperin, J. P., Matthias, L., Norlander, B., Farley, A., West, J., & Haustein, S. (2018). The state of OA: A large-scale analysis of the prevalence and impact of open access articles. PeerJ, 6, Article e4375. https://doi.org/10.7717/peerj.4375
- Piwowar, H. A., & Vision, T. J. (2013). Data reuse and the open data citation advantage. PeerJ, 1, Article e175. https://doi.org/10.7717/peerj.175
- Quintana, D. S. (2020). A synthetic dataset primer for the biobehavioural sciences to promote reproducibility and hypothesis generation. eLife, 9. https://doi.org/10.7554/eLife.53275
- R Core Team. (2018). R: A language and environment for statistical computing. R Foundation for Statistical Computing. https://www.R-project.org/
- Rubin, M. (2020). Does preregistration improve the credibility of research findings? The Quantitative Methods for Psychology, 16(4), 376–390. https://doi.org/10.20982/tqmp.16.4.p376
- Samsa, G., & Samsa, L. (2019). A guide to reproducibility in preclinical research. Academic Medicine: Journal of the Association of American Medical Colleges, 94(1), 47–52. https://doi.org/10.1097/ACM.0000000000002351
- Schmidt, B., Gemeinholzer, B., & Treloar, A. (2016). Open data in global environmental research: The Belmont Forum's open data survey. PLOS ONE, 11(1), Article e0146695. https://doi.org/10.1371/journal.pone.0146695
- Schroeder, S. R., Gaeta, L., El Amin, M., Chow, J., & Borders, J. C. (2022). Evaluating research transparency and openness in communication sciences and disorders journals. https://doi.org/10.31234/osf.io/dy5zs
- Storkel, H. L., & Gallun, F. J. (2022). Announcing a new registered report article type at the Journal of Speech, Language, and Hearing Research. Journal of Speech, Language, and Hearing Research, 65(1), 1–4. https://doi.org/10.1044/2021_JSLHR-21-00513
- Svirsky, M. A. (2020). Editorial: Preregistration and open science practices in hearing science and audiology: The time has come. Ear and Hearing, 41(1), 1–2. https://doi.org/10.1097/AUD.0000000000000817
- Tenopir, C., Allard, S., Douglass, K., Aydinoglu, A. U., Wu, L., Read, E., Manoff, M., & Frame, M. (2011). Data sharing by scientists: Practices and perceptions. PLOS ONE, 6(6), Article e21101. https://doi.org/10.1371/journal.pone.0021101
- Thome, E. K., Loveall, S. J., & Henderson, D. E. (2020). A survey of speech-language pathologists' understanding and reported use of evidence-based practice. Perspectives of the ASHA Special Interest Groups, 5(4), 984–999. https://doi.org/10.1044/2020_PERSP-20-00008
- Toribio-Flórez, D., Anneser, L., deOliveira-Lopes, F. N., Pallandt, M., Tunn, I., & Windel, H. (2021). Where do early career researchers stand on open science practices? A survey within the Max Planck Society. Frontiers in Research Metrics and Analytics, 5, 17. https://doi.org/10.3389/frma.2020.586992
- Wilkinson, M. D., Dumontier, M., Aalbersberg, I. J., Appleton, G., Axton, M., Baak, A., Blomberg, N., Boiten, J. W., da Silva Santos, L. B., Bourne, P. E., Bouwman, J., Brookes, A. J., Clark, T., Crosas, M., Dillo, I., Dumon, O., Edmunds, S., Evelo, C. T., Finkers, R., & Mons, B. (2016). The FAIR guiding principles for scientific data management and stewardship. Scientific Data, 3(1), Article 160018. https://doi.org/10.1038/sdata.2016.18
- Zečević, K., Houghton, C., Noone, C., Lee, H., Matvienko-Sikar, K., & Toomey, E. (2021). Exploring factors that influence the practice of open science by early career health researchers: A mixed methods study. HRB Open Research, 3. https://doi.org/10.12688/hrbopenres.13119.2
- Zhu, Y. (2017). Who support open access publishing? Gender, discipline, seniority and other factors associated with academics' OA practice. Scientometrics, 111(2), 557–579. https://doi.org/10.1007/s11192-017-2316-z