Skip to main content
Biological Psychiatry Global Open Science logoLink to Biological Psychiatry Global Open Science
. 2023 Aug 22;4(1):110–119. doi: 10.1016/j.bpsgos.2023.08.007

Open Science Practices in Psychiatric Genetics: A Primer

Adrianna P Kępińska a,b,c,d,e, Jessica S Johnson a,f, Laura M Huckins a,b,c,d,g,
PMCID: PMC10829621  PMID: 38298792

Abstract

Open science ensures that research is transparently reported and freely accessible for all to assess and collaboratively build on. Psychiatric genetics has led among the health sciences in implementing some open science practices in common study designs, such as replication as part of genome-wide association studies. However, thorough open science implementation guidelines are limited and largely not specific to data, privacy, and research conduct challenges in psychiatric genetics. Here, we present a primer of open science practices, including selection of a research topic with patients/nonacademic collaborators, equitable authorship and citation practices, design of replicable, reproducible studies, preregistrations, open data, and privacy issues. We provide tips for informative figures and inclusive, precise reporting. We discuss considerations in working with nonacademic collaborators and distributing research through preprints, blogs, social media, and accessible lecture materials. Finally, we provide extra resources to support every step of the research process.

Keywords: Equity, Methodology, Open access, Open science, Psychiatric genetics, Reproducibility


Open science is an approach where research and resulting publications are transparent and freely accessible. Consequently, findings may be replicated, and errors prevented or corrected (1). The core of open science is a commitment to sharing research with everyone (2). Thus, several definitions of open science also address collaboration, inclusion, and diversity to improve participation in science (3,4). Consistent with this multifaceted view, we discuss open science practices related to transparency, integrity, diversity, inclusivity, equity, reproducibility, rigor, and accessibility (see Box S1 for definitions).

Psychiatric genetics has been praised as an early adopter of open science, championing genome analyses with embedded replication, initially through genome-wide association studies (GWASs) (5,6). Multiple consortia now exist to improve data pooling and provide free bioinformatic methods, e.g., the Psychiatric Genomics Consortium (PGC) (6), the Genotype-Tissue Expression Project (7), and the Global Biobank Meta-analysis Initiative (8). Geneticists frequently use preprint servers (where manuscripts get freely deposited before peer review), e.g., bioRxiv (9). All these efforts are aimed at remedying years of unreplicable research (10,11) due to small samples that were underpowered to detect associations between psychiatric disorders (12), singular gene variants, and environmental factors (13); questionable statistical practices generating false positives (14,15); and inconsistent testing of different variants on the same genes (16).

However, GWASs may be losing their status as a “paragon of open science” (17). GWASs have been criticized for being increasingly difficult to reproduce owing to inconsistent, un(der)reported methods and increasing use of private company data to enlarge samples. Commercial releases of partial data hinder replication. Finally, data come from unrepresentative samples (individuals who are predominantly affluent, of European ancestry, better educated, and/or healthy volunteers).

For science to remain open, existing research standards must be consistently implemented and constantly refined (1). Recent literature has addressed open science in psychiatry (18, 19, 20) or GWASs (21,22), but not specifically psychiatric GWASs. When reviews have addressed psychiatric genetics, they have prioritized replicability and reproducibility (23,24). These otherwise crucial dimensions are not the only facets of open scholarship. Here, we provide a primer on implementing open science practices in psychiatric genetics at multiple research steps (Figure 1): topic choice and authorship order, study design and preregistration, data access, open code, informative figures, inclusive language, reporting standards, preprints, nonacademic (citizen scientist) involvement, and research distribution. We identify common issues, suggest solutions, and include resources for every step.

Figure 1.

Figure 1

Open science practices at every research step, values which these actions support, and barriers to open science implementation. Figure created with Biorender.com.

Research Topic and Authorship

Several open science values–integrity, transparency, equity, and accessibility–are intertwined with study conceptualization.

Choice of Research Topic

Patient experts and researchers increasingly recognize that patients and community representatives should be successfully involved in every step of the research process (25). Such inclusion ensures that research addresses nonstigmatizing topics that are of the highest benefit to community-specific health needs (26).

Equity in Authorship and Author Order

Transparency and equity should be considered in authorship decisions. Authorship disclosures may be standardized forms of contributions, e.g., Contributor Roles Taxonomy (CRediT) (27). However, authorship is academic currency, but author order is still unequally attributed to men over women (28). Liboiron et al. (29) argued for equity as the guiding principle in establishing author order, considering care work and unequal division of labor. Care work (mentoring and teaching) is poorly acknowledged and disproportionately delegated to women and early-career or marginalized scholars (30). However, care work is vital to early-career scholars’ skillsets and well-being in research groups (31). Thus, we argue that care work must be credited, e.g., using CRediT (27).

To improve transparency and equity, we suggest that, prior to any project, researchers declare detailed criteria for determining author order (32, 33, 34), such as extent of contribution, author seniority, and the degree/rarity or number of biological samples contributed by each collaborator. Policies could define a significant contribution, e.g., whether author order is based on the number of completed tasks and how these tasks are quantified. This is crucial for studies that require different types of expertise.

Another solution is designating multiple corresponding authors for specific analyses, thereby distributing extra credit among authors who are responsible for technical contributions (e.g., programmers, statisticians). These authors are often placed in less desirable middle positions in author lists (35) or, historically in the case of female programmers, given acknowledgments but not authorship (36). Unfortunately, joint authorships are not consistently acknowledged (37,38). A novel approach proposes crediting authors throughout the manuscript text, thus making the contributions of each author clearer to readers, including hiring committees (39). While implementing this approach may prove complicated, we agree that changes in hiring practices are necessary. Diverse contributions should not be underappreciated if they do not result in first or senior authorship.

Equality in Citations

Genetic studies fail to represent global genetic diversity and come predominantly from Western sites (40), which have effectively monopolized the literature. Currently limited ways of redressing this imbalance are transparency about how a lack of global perspective impacts results and limits generalizability of study findings and referencing both large-scale, European ancestry-skewed studies and smaller studies of more diverse populations. Where studies are unavailable, researchers should cite theoretical work, protocols, and software by underrepresented scholars.

Study Design

Next, we briefly outline key study design considerations related to research clarity, reproducibility, replicability, and diversity.

Preventing Researcher Bias in Secondary Data Analysis

Secondary data analysis is common in psychiatric genetics. However, knowledge of data before analysis may introduce cognitive bias, thus distorting conclusions. Several solutions may limit bias (41): 1) a preregistration (Preregistration and Registered Reports); 2) sampling (e.g., hold out samples for exploratory research); 3) blinded analyses (42) or scrambling data (41); 4) multiverse analyses (testing whether findings remain consistent across multiple paradigms, e.g., frequentist and Bayesian analyses) (43); or 5) coordinated analyses (testing models across separate large-scale cohorts) (44).

Clear Phenotype Specification

Heterogeneous psychiatric phenotypes are frequently defined under a single diagnosis (45). Granular phenotypes (symptoms, brain structures, endophenotypes), which are difficult to obtain retrospectively, may be subject to stricter data sharing restrictions or may be difficult to harmonize across studies. Detailed publication of all phenotypic information, where appropriate, may partially address concerns about heterogeneity.

Another issue is phenotyping quality in genomic datasets that are linked to electronic health records. Electronic health records may reflect diagnostic difficulties (e.g., rare phenotypes), stigma, or race- or gender-specific bias in admission, diagnosis, and treatment. These biases may be ameliorated through 1) replication of phenotypic and genetic associations; 2) accounting for order, length, and time between episodes and treatment to avoid conflating unrelated episodes into a diagnosis; and 3) using known disorder biology to verify diagnoses, e.g., using polygenic scores or genetic correlations to disentangle whether diagnoses match their predicted underlying biology (46).

Studies should report what details were requested, whether questionnaires were applied across the cohort (e.g., whether trauma histories were obtained from posttraumatic stress disorder controls or only cases), who determined the phenotypes, and potential inaccuracies. It is also crucial to state whether results were compared across samples with different phenotype operationalizations. For instance, a GWAS of age of onset in psychiatric and nonpsychiatric disorders yielded different significant hits according to the age of onset definitions that were applied (47). Finally, data collection protocols should emphasize the maximum level of detail so that analysts can eventually understand sources of variability in the data (48).

Sample Size Justifications

Sample size justifications should include power calculations or clarification of why power calculations are absent (a whole population study; resource constraints; heuristics based on literature or norms in the field; or no reason to specify power, e.g., no clear inference goal) (49).

Ameliorating Eurocentric Bias

In 2017, 88% of GWAS findings came from European ancestry samples. Indigenous samples represented only 0.02% of GWAS findings (40). Genetic association statistics are not generalizable across ancestries (50). Eurocentric bias is also present in pharmacogenetics. While 30% of studies use European ancestry samples, ancestry reporting is frequently incomplete or vague. This results in biased assessments of drug targets, which are potentially not relevant across populations (51).

To ameliorate Eurocentric bias, researchers should analyze data from minoritized populations. Data from different ancestries should be analyzed separately. Researchers should acknowledge underpowered samples (52) or use well-powered international biobank data or summary statistics from biobank networks (8). Researchers should also implement pipelines for multiancestral data analysis (53, 54, 55, 56).

Preregistration and Registered Reports

Preregistration specifies hypotheses (or research questions for hypothesis-free designs), study design, and analysis plans before data collection or secondary data analyses begin (57). Registered reports (RRs) are peer-reviewed, preregistered methods and data collection protocols (58) that result in publication regardless of study outcome. We encourage journals to publish RRs, null results, and nonreplications. Guidance on preregistrations/RRs, including in consortia, is detailed elsewhere (57,59, 60, 61, 62, 63).

Despite increasing guidance, not all aspects of preregistration have been resolved. Preregistrations require substantial effort, but they can be completed using research plans required for proposals to ethics boards (64) or consortia for data access (59). Another issue is the cost of open RR publications, which is paid early in the process. This may prevent researchers from choosing RRs. We call on journals to provide clear RR pricing guidance and on funders to provide incentives (e.g., https://www.cancerresearchuk.org/funding-for-researchers/how-we-deliver-research/positive-research-culture/registered-reports).

Data Access and Data Sharing

Open annotated data and metadata are the gold standards for replicability. However, reasonable privacy concerns prevent making much genetic data fully open. The European Commission Directorate-General for Research and Innovation has called for data “as open as possible and as closed as necessary”: “open” where possible for reuse, but “closed” to protect privacy (65). Studies of public perception of genomic data sharing in 22 countries demonstrated that attitudes vary on increasing public trust in the process (66) and willingness to share data with for-profit researchers and medical doctors (67) in return for results (68). Thus, we argue that participant safety and wishes must be prioritized and should not be assumed to be universal.

Nonetheless, respect for study participants is at odds with full data access. The onus falls on researchers to optimize approaches for making data accessible without risking identification of participants or to develop methods whereby summary statistics (which aggregate cohort-level genetic information) are sufficient to perform analyses. For multiancestral projects, ancestry-specific summary statistics are also necessary (69).

Within Consortia

The “secondary analysis proposal” model gives access to deidentified individual-level genotype and phenotype data to researchers with an approved analysis plan [implemented, for example, by the PGC (70)]. Where external users cannot easily access data, sites may employ a confederate model whereby analytical procedures and scripts are shared or site analysts run analyses and share summary data.

Within Institutions

Data sharing between teams may accelerate collaboration, reduce time spent on quality control (Data Quality Control and Analysis), and enable researchers to detect errors prior to publication. Institutions should incentivize such data sharing. A blueprint is the Quality-Ethics-Open Science-Translation Center (Berlin Institute of Health, Germany), which funds institution-wide data sharing (71).

A novel approach to data sharing is adversarial collaborations. Adversarial collaborations have research teams jointly access the data but test opposing hypotheses. Teams also reproduce analyses of their opponents and edit each other’s descriptions of theory to prevent misrepresentation of opposing views (72).

Summary-Level Data

Alternatively, researchers may prioritize analyzing open summary statistics, including associated statistics such as linkage disequilibrium matrices. Gold standard approaches include, e.g., GCTA-COJO (genome-wide complex trait analysis-conditional and joint analysis) (73) for conditional analysis of GWAS summary statistics; FINEMAP (74) and PAINTOR (75) for fine-mapping loci; COLOC (76, 77, 78) and summary data-based Mendelian randomization (79) for expression quantitative trait loci annotation of GWAS summary statistics; and S-PrediXcan (80) for transcriptomic imputation. However, we note that these approaches perform best with linkage disequilibrium matrices of the original data.

Another issue may be limited access to summary-level data from commercial entities (17). We believe that journals could enforce data sharing, e.g., by accepting a publication only after its authors have provided complete summary statistics.

Indigenous Data Sovereignty

Indigenous data sovereignty (IDS) describes the right of Indigenous peoples to control access to data and samples that are collected on their lands and from their communities (81,82). Although IDS precludes broad data access, the benefits to open science are nevertheless myriad: inclusive research development, innovation, and improved citizen engagement (because Indigenous communities fear that fully open data could result in misinterpretation of eventual results and increased stigma) (81,82). Increased IDS will enhance accountability through transparency of data and sample handling and improve equity when Indigenous communities benefit through publication, research, or commercialization. Increased Indigenous participation would increase diversity and inclusion in genetics (81). These values, together with degrees of improved transparency and equity, are also open science values.

IDS guidance is summarized in the CARE Principles for Indigenous Data Governance (collective benefit, authority to control, responsibility and ethics) (83). CARE principles reflect tribal expectations of research conduct: tribal consent and input are followed throughout, e.g., when and how samples are returned to honor participants’ spiritual practices; research benefits reflect tribal priorities; Indigenous individuals are employed in research projects and compensated; research materials and data belong to tribes who govern access, review research progress, provide institutional review boards, and grant secondary uses of data and research materials; findings are returned to communities; researchers respect Indigenous cultural and spiritual values; researchers and Indigenous communities are equal collaborators (84).

Data Quality Control and Analysis

Quality Control

Genetic data quality remains inconsistent (17). We suggest consistently following and describing completed quality control and imputation protocols and explaining any deviations from protocols in manuscripts and open datasets (which frequently lack information on the quality control protocol that was applied).

Open Source

Open source tools are commonly used and potentially improve replicability and reproducibility [e.g., PLINK (22,85)]. However, while software used for analysis is often shared, software and scripts for preprocessing data are not, which limits reproducibility (86).

Detailed Record of Analytical Steps and Open Code

While preregistrations (Preregistration and RRs) ideally provide specific research plans, these plans can expand (e.g., following reviewer suggestions). Researchers should keep detailed, transparent analysis records, e.g., in open online laboratory notebooks (87). Code should be openly deposited with version control and a codebook in which all variables are summarized (88). Workflows should be automated (files are not edited manually but result from how the script executes), with labeled input and output files. Relationships between data, data preprocessing, software, and analysis outcomes should be explained in a compendium article (89), e.g., an interactive website Jupyter notebook, or R Markdown files, which are convertible into books, articles, or websites (90).

Bioinformatics Tools

Development and maintenance of bioinformatics tools provides research tools and hands-on software development training, which most biologists (or psychiatry researchers) are not trained in. However, tool development is unscalable without continuous funding for maintenance, reproducibility, and training. Tool development is also not credited with coveted senior authorship, and software is not consistently cited (91). We echo the call from the bioinformatics community for consistent funding and counting software development toward career progression. To that end, researchers could make their bioinformatic work citable with persistent handles (e.g., generated through Zenodo, https://zenodo.org/) and consistently attribute authorship of software, data visualization tools, etc.

Reporting

Language

Detailed reporting supports transparency, replicability, and reproducibility but may also be in tension with open science values. Using language that participants prefer (to describe their experience, communities, etc.) improves inclusivity and reproduces information shared by participants. However, using unstandardized descriptors contradicts facilitating reproducibility through shared terminology. Consequently, we suggest that researchers provide descriptions from participants, operationalizations (e.g., diagnostic criteria), or, if these are unavailable, inclusion/exclusion criteria (e.g., self-diagnosis, disorder for which formal diagnostic criteria are lacking, symptom types, frequency, intensity, etc.). Below, we also briefly outline discussions about language choices for commonly reported data as a starting point for researchers.

Ancestry, Race, and Ethnicity

The use of race as interchangeable with ancestry and ethnicity is decreasing (92) but has not stopped. The terms have different meanings. Race is a sociopolitical concept whereby group identification is based on often stereotypical or politically influenced defining physical characteristics, e.g., skin pigmentation (54,93). Ethnicity, on the other hand, is a category that is based on group belonging through shared language and traditions. Ethnic groups may share genetic factors because of similar ancestral origins or be self-identified based on shared culture (54,94). Genetic ancestry refers to populations from which an individual has descended, which is reflected in DNA inherited from recent biological ancestors (54). Biogeographical ancestry labeling (African, Asian, and European) has increased (92). However, it has been criticized for being reminiscent of historical racial taxonomies (95), not capturing heterogeneity within local subpopulations (96), and being inconsistent with global genetic variation, which is continuous, not discrete continental (93). In addition, individuals who are not monoracial are frequently grouped into monolithic, imprecise, or othering categories, e.g., multicultural, admixed, bi/multiracial, and other (97). These descriptors are unacceptable.

At a minimum, researchers should follow antiracist genetics publication recommendations (94), which require clearly stated definitions and reasons for addressing race and ancestry. Race impacts health care delivery, disorder etiologies, and health outcomes. Accounting for race/ethnicity in modeling is necessary, but explicitly as a marker of inequality, never as a proxy for genetic ancestry (54). Ideally, researchers should work in interdisciplinary teams with affected communities (93) and consider outlining their own identities and motivations in manuscripts to build trust with affected communities (97).

Sex and Gender

Sex and gender are not synonymous, essentialist constructs. Gender encompasses sociopolitically constructed roles, behaviors, and identities (98). Sex is a set of physical and physiological variables (genitalia, gametes, karyotypes, gene expression, hormones) (98) that do not necessarily or fully determine sex (99). Common issues with reporting gender/sex differences include inconsistent or conflated use of sex and gender; sex or sample sizes per sex not reported; statistical evidence for claimed gender differences not provided; and no stated criteria for determining or ascertaining sex/gender (100,101).

Diagnosis and Patient Identity

Two general ways of describing patients are person-first language (e.g., a person with autism) and identity-first language (e.g., an autistic person). Person-first language prioritizes an individual, while communities, e.g., the English-speaking autism community, argue that identity-first descriptions emphasize that diagnoses can be proudly integral to one’s identity (102). Participant preference should be respected; where unknown, researchers should be sensitive to historical agendas, priorities, and different experiences within the community (103). We also suggest that researchers transparently report how descriptions were determined.

Reporting Guidelines

Health care studies insufficiently report statistical methods, missing values, or reporting guidelines followed (104). Multiple reporting standards exist for common genetic designs (105, 106, 107, 108). Furthermore, researchers should ensure that their results are not overinterpreted. Researchers should state why the evidence may be causal and clearly communicate any uncertainty (109, 110, 111), including in press releases.

Figures

Good figures clearly communicate approach, results, and key messages, thus improving transparency and reproducibility. To make figures accessible, researchers should address font size, readability, and color selection for individuals with visual impairments or color blindness. Accessibility and inclusivity also extend to imagery, mindful of lived experience. For example, eating disorder studies should avoid potentially triggering visuals, e.g., scales or distorted body images in mirrors. Graphics for case-control groups should not depict a singular gender presentation/race/body type as a sample case. Differences between cases and controls could be labeled with neutral colors (not green and red, implying good and bad for controls and cases; these colors are also unreadable to individuals with color-vision deficiencies). We provide free resources for creating accessible figures in Table S1.

Crucially, researchers should be aware of how data visualizations may be misinterpreted through a racist lens and co-opted by far-right extremists to further white supremacy, especially given that much of the genetic research is Eurocentric (Study Design). Researchers have to anticipate how their visualizations, out of context and without labels, could be misconstrued. A preventive solution would be to standardize plot presentation in the field (112).

Preprints

Preprints, nonpeer reviewed papers, are now integral to fast and free research sharing (113). Preprints also streamline submission to journals. In bioRxiv, 139 journals relevant to psychiatric genetics, psychiatry, and bioinformatics endorsed direct journal transfer. However, the preprint explosion during the COVID-19 pandemic highlighted limitations of rapid sharing, which we suggest that researchers be mindful of. Rapidly generated preprints, some of which may be methodologically flawed or fraudulent, may end up being integrated into unreliable meta-analyses or flawed health policy. Furthermore, flawed studies may lead to wasteful or harmful follow-up research even after retraction of the original preprints (114).

Citizen Science and Participant Inclusion

Knowledge coproduction (between patient experts and researchers) may be difficult due to pressures of timescales and funding (115). More extremely, coproduction could leave patients feeling tokenized or invalidated when their experience is deemed wrong against the views of psychiatrists or Eurocentric models of knowledge (116). Specific approaches may foster inclusive collaborations (115,116) including the following: 1) resources dedicated to supporting relationships with patient experts during and between projects; 2) open events where patients and researchers explore a specific key issue to identify creative solutions; 3) published evidence of the impact of coproduced research, supporting continuous collaborations; and 4) careful consultations between researchers and patient experts. Researchers should prepare to address boundaries to participation, conflicting views, and difficult journeys through health services (e.g., due to racism or coercion) (116,117). Patient experts should not be expected to share sensitive community knowledge. It should be clear how researchers will use knowledge that is obtained from patient experts (118).

In citizen science, nonacademic members of the public are active research collaborators, not passive data donors. Collaborations should reflect citizen science values: autonomy, fun, respect, altruism, inclusivity, openness, reciprocity, and solidarity (119,120). Researchers should plan how to navigate divergent collaborator priorities, e.g., when and how to release results to comply with institutional or regulatory requirements (121). In terms of compensation, citizen scientists wish for accessible data, clearly communicated findings, and acknowledgments in publications (122). Credit or intellectual property compensation, to be legally binding, needs to be detailed in contracts or institutional policy (121).

Research Distribution

Researchers should consider how to communicate findings in such a way as to uphold open science values. While psychiatric geneticists support returning results to patients, most do not deem their knowledge of the process to be adequate (123) [although see recent international guidelines (124, 125, 126)]. In addition, scientists incorrectly assume that publication in peer-reviewed journals makes results accessible (121). Scientific publications may be publicly unavailable or written in a language that is difficult for nonspecialists to understand. To ensure access, American genomic citizen science projects provide links to open access publications on project websites (127).

Writing manuscripts in a nontechnical manner would be a good general practice. However, we acknowledge that it can be challenging. Extra (technical) details of analyses are required for replicability/reproducibility, but they decrease the overall accessibility of manuscripts. Several solutions could be put in place; for example, manuscripts could be accompanied by nontechnical press releases sent to participants in automated e-mails or letters. Researchers already explain their work or summarize talks (128) using blogs or social media posts (129). Accessible materials explaining research to nonacademic readers could be provided, e.g., on manuscript companion websites and websites of research societies (e.g., https://ispg.net/resources/educational-presentations, https://pgc.unc.edu/for-the-public/basic-genetics/), laboratories, or patient advocacy groups.

An alternative to written communication is live events. Accessibility practices include live captions/transcriptions, inclusive language, and translations, including sign language; affordable online conferences and ensuring that scholars who require visas can attend meetings; even division of time among speakers; and implementing a code of conduct (130). Alternatively, nonreal-time web conferences allow attendees to watch presentations, while remaining friendly toward time zones and caretaking commitments. Nonlive conferences also do not require a stable Internet connection (131).

Finally, because psychiatric genetics requires multidisciplinary technical training, open teaching materials would support learning about the newest techniques. Platforms such as Open Science Framework (https://osf.io/) or Zenodo permit free content uploads (slides, posters, and recordings of talks, which should include captions, transcripts, and translations). These teaching materials can have citable digital object identifiers generated. Textbooks can also be easily uploaded online from R Markdown files (Data Quality Control and Analysis).

Conclusions

Methodological rigor and careful collection of open data and their transparent, easy-to-follow presentation are key to improving the reproducibility, replicability, and accessibility of psychiatric genetics studies. However, psychiatric genetics remaining at the forefront of open science will depend on the ongoing support of and advocacy for collaborations and credit for junior, minority, technical, and nonprofessional researchers. Continuous funding is needed for research with non-European ancestry samples in a manner which respects the rights and privacy of underrepresented groups, following the increasingly embraced call to make data “as open as possible, as closed as necessary.” Funding is also required for the continuous software development that is necessary to analyze these complex data. Finally, consistent funding and institutional change of research norms would support researchers by providing them with extra time needed to run reproducible and replicable studies, providing clear research explanations for the public, creating open teaching materials, and maintaining ongoing collaborations with patient experts.

Acknowledgments and Disclosures

APK was funded in part by the National Institute for Health and Care Research Maudsley Biomedical Research Centre at South London and Maudsley National Health Service Foundation Trust and King’s College London, and the Klarman Family Foundation. JSJ was funded by National Institute of Mental Health (Grant No. R01MH124839) and the Klarman Family Foundation. LMH was funded by National Institute of Mental Health (Grant Nos. R01MH118278 and R01MH124839), National Institute of Environmental Health Sciences (Grant No. R01ES033630), and the Klarman Family Foundation. The views expressed are those of the authors and not necessarily those of the funders.

We thank Carina Seah, Rebecca Signer, and Kayla Townsley for their helpful feedback.

APK and LMH conceptualized the study; APK, JSJ, and LMH wrote the original draft; APK, JSJ, and LMH reviewed and edited; APK created visualizations; APK administered the project; JSJ provided resources; LMH supervised the project.

The authors report no biomedical financial interests or potential conflicts of interest.

Footnotes

Supplementary material cited in this article is available online at https://doi.org/10.1016/j.bpsgos.2023.08.007.

Supplementary Material

Supplementary Data
mmc1.pdf (231.3KB, pdf)

References

  • 1.Munafò M.R., Nosek B.A., Bishop D.V.M., Button K.S., Chambers C.D., Percie du Sert N., et al. A manifesto for reproducible science. Nat Hum Behav. 2017;1 doi: 10.1038/s41562-016-0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Leonelli S., Spichtinger D., Prainsack B. Sticks and carrots: Encouraging open science at its source. Geo. 2015;2:12–16. doi: 10.1002/geo2.2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Robinson D. Open Source Alliance, What is open? | OSAOS [The Open Source Alliance for Open Scholarship] handbook. 2018. https://osaos.codeforscience.org/what-is-open/ Available at:
  • 4.Murphy M.C., Mejia A.F., Mejia J., Yan X., Cheryan S., Dasgupta N., et al. Open science, communal culture, and women’s participation in the movement to improve science. Proc Natl Acad Sci USA. 2020;117:24154–24164. doi: 10.1073/pnas.1921320117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Sullivan P.F., Agrawal A., Bulik C.M., Andreassen O.A., Børglum A.D., Breen G., et al. Psychiatric genomics: An update and an agenda. Am J Psychiatry. 2018;175:15–27. doi: 10.1176/appi.ajp.2017.17030283. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Uffelmann E., Huang Q.Q., Munung N.S., de Vries J., Okada Y., Martin A.R., et al. Genome-wide association studies. Nat Rev Methods Primers. 2021;1:59. [Google Scholar]
  • 7.GTEx Consortium The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science. 2020;369:1318–1330. doi: 10.1126/science.aaz1776. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Zhou W., Kanai M., Wu K.-H.H., Rasheed H., Tsuo K., Hirbo J.B., et al. Global Biobank Meta-analysis Initiative: Powering genetic discovery across human disease. Cell Genomics. 2022;2:100192. doi: 10.1016/j.xgen.2022.100192. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Abdill R.J., Blekhman R. Tracking the popularity and outcomes of all bioRxiv preprints. eLife. 2019;8 doi: 10.7554/eLife.45133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Johnson E.C., Border R., Melroy-Greif W.E., de Leeuw C.A., Ehringer M.A., Keller M.C. No evidence that schizophrenia candidate genes are more associated with schizophrenia than noncandidate genes. Biol Psychiatry. 2017;82:702–708. doi: 10.1016/j.biopsych.2017.06.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Border R., Johnson E.C., Evans L.M., Smolen A., Berley N., Sullivan P.F., Keller M.C. No support for historical candidate gene or candidate gene-by-interaction hypotheses for major depression across multiple large samples. Am J Psychiatry. 2019;176:376–387. doi: 10.1176/appi.ajp.2018.18070881. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Sullivan P.F. Spurious genetic associations. Biol Psychiatry. 2007;61:1121–1126. doi: 10.1016/j.biopsych.2006.11.010. [DOI] [PubMed] [Google Scholar]
  • 13.Duncan L.E., Keller M.C. A critical review of the first 10 years of candidate gene-by-environment interaction research in psychiatry. Am J Psychiatry. 2011;168:1041–1049. doi: 10.1176/appi.ajp.2011.11020191. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Claussnitzer M., Cho J.H., Collins R., Cox N.J., Dermitzakis E.T., Hurles M.E., et al. A brief history of human disease genetics. Nature. 2020;577:179–189. doi: 10.1038/s41586-019-1879-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Colhoun H.M., McKeigue P.M., Davey Smith G.D. Problems of reporting genetic associations with complex outcomes. Lancet. 2003;361:865–872. doi: 10.1016/s0140-6736(03)12715-8. [DOI] [PubMed] [Google Scholar]
  • 16.Grabe H.J., Van der Auwera S. In: Personalized Psychiatry. Bernhard T.B., editor. Academic Press; London: 2020. Gene-environment interaction in psychiatry; pp. 363–373. [Google Scholar]
  • 17.Burt C., Munafò M. Has GWAS lost its status as a paragon of open science? PLoS Biol. 2021;19 doi: 10.1371/journal.pbio.3001242. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Bell V. Open science in mental health research. Lancet Psychiatry. 2017;4:525–526. doi: 10.1016/S2215-0366(17)30244-4. [DOI] [PubMed] [Google Scholar]
  • 19.Burke N.L., Frank G.K.W., Hilbert A., Hildebrandt T., Klump K.L., Thomas J.J., et al. Open science practices for eating disorders research. Int J Eat Disord. 2021;54:1719–1729. doi: 10.1002/eat.23607. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Scheibein F., Donnelly W., Wells J.S. Assessing open science and citizen science in addictions and substance use research: A scoping review. Int J Drug Policy. 2022;100 doi: 10.1016/j.drugpo.2021.103505. [DOI] [PubMed] [Google Scholar]
  • 21.Marigorta U.M., Rodríguez J.A., Gibson G., Navarro A. Replicability and prediction: Lessons and challenges from GWAS. Trends Genet. 2018;34:504–517. doi: 10.1016/j.tig.2018.03.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Lin X. Learning lessons on reproducibility and replicability in large scale genome-wide association studies. Harv Data Sci Rev. 2020;2 doi: 10.1162/99608f92.33703976. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Carter C.S., Bearden C.E., Bullmore E.T., Geschwind D.H., Glahn D.C., Gur R.E., et al. Enhancing the informativeness and replicability of imaging genomics studies. Biol Psychiatry. 2017;82:157–164. doi: 10.1016/j.biopsych.2016.08.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Hübel C., Leppä V., Breen G., Bulik C.M. Rigor and reproducibility in genetic research on eating disorders. Int J Eat Disord. 2018;51:593–607. doi: 10.1002/eat.22896. [DOI] [PubMed] [Google Scholar]
  • 25.Pratte M.M., Audette-Chapdelaine S., Auger A.M., Wilhelmy C., Brodeur M. Researchers’ experiences with patient engagement in health research: A scoping review and thematic synthesis. Res Involv Engagem. 2023;9:22. doi: 10.1186/s40900-023-00431-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Garrison N.A., Hudson M., Ballantyne L.L., Garba I., Martinez A., Taualii M., et al. Genomic research through an Indigenous lens: Understanding the expectations. Annu Rev Genomics Hum Genet. 2019;20:495–517. doi: 10.1146/annurev-genom-083118-015434. [DOI] [PubMed] [Google Scholar]
  • 27.Brand A., Allen L., Altman M., Hlava M., Scott J. Beyond authorship: Attribution, contribution, collaboration, and credit. Learn Pub. 2015;28:151–155. [Google Scholar]
  • 28.Ross M.B., Glennon B.M., Murciano-Goroff R., Berkes E.G., Weinberg B.A., Lane J.I. Women are credited less in science than men. Nature. 2022;608:135–145. doi: 10.1038/s41586-022-04966-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Liboiron M., Ammendolia J., Winsor K., Zahara A., Bradshaw H., Melvin J., et al. Equity in author order: A feminist laboratory’s approach. Catalyst. 2017;3:1–17. [Google Scholar]
  • 30.Heijstra T.M., Steinthorsdóttir F.S., Einarsdóttir T. Academic career making and the double-edged role of academic housework. Gend Educ. 2017;29:764–780. [Google Scholar]
  • 31.Limas J.C., Corcoran L.C., Baker A.N., Cartaya A.E., Ayres Z.J. The impact of research culture on mental health & diversity in STEM. Chemistry. 2022;28 doi: 10.1002/chem.202102957. [DOI] [PubMed] [Google Scholar]
  • 32.Veldink J., Al-Chalabi A. Project MinE Publication Policy. 2017. https://www.projectmine.com/wp-content/uploads/2016/06/ProjectMinE_publicationpolicy_v2.pdf Available at:
  • 33.Corvin A. Schizophrenia Working Group of the PGC [Psychiatric Genomics Consortium] revision of authorship. 2015. https://www.med.unc.edu/pgc/wp-content/uploads/sites/959/2019/01/SCZ_authorship.docx Available at:
  • 34.PTSD Working Group of the PGC PTSD [post-traumatic stress disorder] Working Group of the PGC [Psychiatric Genomics Consortium] draft authorship. 2015. https://pgc-ptsd.com/wp-content/uploads/2017/06/Authorship-Guidelines-PGC-PTSD.pdf Available at:
  • 35.Larivière V., Desrochers N., Macaluso B., Mongeon P., Paul-Hus A., Sugimoto C.R. Contributorship and division of labor in knowledge production. Soc Stud Sci. 2016;46:417–435. doi: 10.1177/0306312716650046. [DOI] [PubMed] [Google Scholar]
  • 36.Dung S.K., López A., Barragan E.L., Reyes R.J., Thu R., Castellanos E., et al. Illuminating women’s hidden contribution to historical theoretical population genetics. Genetics. 2019;211:363–366. doi: 10.1534/genetics.118.301277. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Lount R.B., Pettit N.C. Shared first authorship should be declared on academic CVs. Nat Hum Behav. 2023;7 doi: 10.1038/s41562-023-01588-8. 659-659. [DOI] [PubMed] [Google Scholar]
  • 38.Lapidow A., Scudder P. Shared first authorship. J Med Libr Assoc. 2019;107:618–620. doi: 10.5195/jmla.2019.700. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Rechavi O., Tomancak P. Who did what: Changing how science papers are written to detail author contributions. Nat Rev Mol Cell Biol. 2023;24:519–520. doi: 10.1038/s41580-023-00587-x. [DOI] [PubMed] [Google Scholar]
  • 40.Mills M.C., Rahal C. A scientometric review of genome-wide association studies. Commun Biol. 2019;2:9. doi: 10.1038/s42003-018-0261-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Baldwin J.R., Pingault J.B., Schoeler T., Sallis H.M., Munafò M.R. Protecting against researcher bias in secondary data analysis: Challenges and potential solutions. Eur J Epidemiol. 2022;37:1–10. doi: 10.1007/s10654-021-00839-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.MacCoun R.J., Perlmutter S. In: Psychological Science Under Scrutiny. Lilienfeld S.O., Waldman I.D., editors. John Wiley & Sons, Inc.; Hoboken, NJ: 2017. Blind analysis as a correction for confirmatory bias in physics and in psychology; pp. 295–322. [Google Scholar]
  • 43.Steegen S., Tuerlinckx F., Gelman A., Vanpaemel W. Increasing transparency through a multiverse analysis. Perspect Psychol Sci. 2016;11:702–712. doi: 10.1177/1745691616658637. [DOI] [PubMed] [Google Scholar]
  • 44.Hofer S.M., Piccinin A.M. Integrative data analysis through coordination of measurement and analysis protocol across independent longitudinal studies. Psychol Methods. 2009;14:150–164. doi: 10.1037/a0015566. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Cai N., Choi K.W., Fried E.I. Reviewing the genetics of heterogeneity in depression: Operationalizations, manifestations and etiologies. Hum Mol Genet. 2020;29:R10–R18. doi: 10.1093/hmg/ddaa115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Dueñas H.R., Seah C., Johnson J.S., Huckins L.M. Implicit bias of encoded variables: Frameworks for addressing structured bias in EHR–GWAS data. Hum Mol Genet. 2020;29:R33–41. doi: 10.1093/hmg/ddaa192. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Feng Y.-C.A., Ge T., Cordioli M., FinnGen GannaA., Smoller J.W., Neale B.M. Findings and insights from the genetic investigation of age of first reported occurrence for complex disorders in the UK Biobank and FinnGen. medRxiv. 2020 doi: 10.1101/2020.11.20.20234302. [DOI] [Google Scholar]
  • 48.Lambert C.G., Black L.J. Learning from our GWAS mistakes: From experimental design to scientific method. Biostatistics. 2012;13:195–203. doi: 10.1093/biostatistics/kxr055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Lakens D. Sample size justification. Collabra Psychol. 2022;8 [Google Scholar]
  • 50.Ding Y., Hou K., Xu Z., Pimplaskar A., Petter E., Boulier K., et al. Polygenic scoring accuracy varies across the genetic ancestry continuum. Nature. 2023;618:774–781. doi: 10.1038/s41586-023-06079-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Popejoy A.B. Diversity in precision medicine and pharmacogenetics: Methodological and conceptual considerations for broadening participation. Pharmgenomics Pers Med. 2019;12:257–271. doi: 10.2147/PGPM.S179742. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Ben-Eghan C., Sun R., Hleap J.S., Diaz-Papkovich A., Munter H.M., Grant A.V., et al. Don’t ignore genetic data from minority populations. Nature. 2020;585:184–186. doi: 10.1038/d41586-020-02547-3. [DOI] [PubMed] [Google Scholar]
  • 53.Liu D., Meyer D., Fennessy B., Feng C., Cheng E., Johnson J.S., et al. Rare schizophrenia risk variant burden is conserved in diverse human populations. medRxiv. 2022 doi: 10.1101/2022.01.03.22268662. [DOI] [Google Scholar]
  • 54.Peterson R.E., Kuchenbaecker K., Walters R.K., Chen C.Y., Popejoy A.B., Periyasamy S., et al. Genome-wide association studies in ancestrally diverse populations: Opportunities, methods, pitfalls, and recommendations. Cell. 2019;179:589–603. doi: 10.1016/j.cell.2019.08.051. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Highland H.M., Wojcik G.L., Graff M., Nishimura K.K., Hodonsky C.J., Baldassari A.R., et al. Predicted gene expression in ancestrally diverse populations leads to discovery of susceptibility loci for lifestyle and cardiometabolic traits. Am J Hum Genet. 2022;109:669–679. doi: 10.1016/j.ajhg.2022.02.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Bhattacharya A., Hirbo J.B., Zhou D., Zhou W., Zheng J., Kanai M., et al. Best practices for multi-ancestry, meta-analytic transcriptome-wide association studies: Lessons from the Global Biobank Meta-analysis Initiative. Cell Genomics. 2022;2 doi: 10.1016/j.xgen.2022.100180. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Nosek B.A., Ebersole C.R., DeHaven A.C., Mellor D.T. The preregistration revolution. Proc Natl Acad Sci USA. 2018;115:2600–2606. doi: 10.1073/pnas.1708274114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Chambers C.D., Tzavella L. The past, present and future of Registered Reports. Nat Hum Behav. 2022;6:29–42. doi: 10.1038/s41562-021-01193-7. [DOI] [PubMed] [Google Scholar]
  • 59.Zugman A., Harrewijn A., Cardinale E.M., Zwiebel H., Freitag G.F., Werwath K.E., et al. Mega-analysis methods in ENIGMA: The experience of the generalized anxiety disorder working group. Hum Brain Mapp. 2022;43:255–277. doi: 10.1002/hbm.25096. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Haroz S. Comparison of preregistration platforms. MetaArXiv. 2022 doi: 10.31222/osf.io/zry2u. [DOI] [Google Scholar]
  • 61.Kiyonaga A., Scimeca J.M. Practical considerations for navigating Registered Reports. Trends Neurosci. 2019;42:568–572. doi: 10.1016/j.tins.2019.07.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Bakker M., Veldkamp C.L.S., van Assen M.A.L.M., Crompvoets E.A.V., Ong H.H., Nosek B.A., et al. Ensuring the quality and specificity of preregistrations. PLS Biol. 2020;18 doi: 10.1371/journal.pbio.3000937. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Van Den Akker O.R., Weston S., Campbell L., Chopik B., Damian R., Davis-Kean P., et al. Preregistration of secondary data analysis: A template and tutorial. Meta-Psychol. 2021;5 [Google Scholar]
  • 64.Evans T.R., Branney P., Clements A., Hatton E. Improving evidence-based practice through preregistration of applied research: Barriers and recommendations. Account Res. 2023;30:88–108. doi: 10.1080/08989621.2021.1969233. [DOI] [PubMed] [Google Scholar]
  • 65.Landi A., Thompson M., Giannuzzi V., Bonifazi F., Labastida I., da Silva Santos L.O.B., Roos M. The “A” of FAIR – As open as possible, as closed as necessary. Data Intellegence. 2020;2:47–55. [Google Scholar]
  • 66.Milne R., Morley K.I., Almarri M.A., Anwer S., Atutornu J., Baranova E.E., et al. Demonstrating trustworthiness when collecting and sharing genomic data: Public views across 22 countries. Genome Med. 2021;13:92. doi: 10.1186/s13073-021-00903-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Middleton A., Milne R., Almarri M.A., Anwer S., Atutornu J., Baranova E.E., et al. Global public perceptions of genomic data sharing: What shapes the willingness to donate DNA and health data? Am J Hum Genet. 2020;107:743–752. doi: 10.1016/j.ajhg.2020.08.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Milne R., Morley K.I., Almarri M.A., Atutornu J., Baranova E.E., Bevan P., et al. Return of genomic results does not motivate intent to participate in research for all: Perspectives across 22 countries. Genet Med. 2022;24:1120–1129. doi: 10.1016/j.gim.2022.01.002. [DOI] [PubMed] [Google Scholar]
  • 69.Martin A.R., Kanai M., Kamatani Y., Okada Y., Neale B.M., Daly M.J. Clinical use of current polygenic risk scores may exacerbate health disparities. Nat Genet. 2019;51:584–591. doi: 10.1038/s41588-019-0379-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Davis L. The PGC data access portal and genomic privacy: Data sharing procedures to satisfy all communities. Eur Neuropsychopharmacol. 2019;29:S714. [Google Scholar]
  • 71.Strech D., Weissgerber T., Dirnagl U., QUEST Group Improving the trustworthiness, usefulness, and ethics of biomedical research through an innovative and comprehensive institutional initiative. PLoS Biol. 2020;18 doi: 10.1371/journal.pbio.3000576. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Martschenko D.O., Trejo S. Ethical, anticipatory genomics research on human behavior means celebrating disagreement. HGG Adv. 2022;3 doi: 10.1016/j.xhgg.2021.100080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Yang J., Ferreira T., Morris A.P., Medland S.E., Madden P.A.F., Heath A.C., et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat Genet. 2012;44:369–375. doi: 10.1038/ng.2213. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Benner C., Spencer C.C., Havulinna A.S., Salomaa V., Ripatti S., Pirinen M. FINEMAP: Efficient variable selection using summary data from genome-wide association studies. Bioinformatics. 2016;32:1493–1501. doi: 10.1093/bioinformatics/btw018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Kichaev G., Yang W.Y., Lindstrom S., Hormozdiari F., Eskin E., Price A.L., et al. Integrating functional data to prioritize causal variants in statistical fine-mapping studies. PLoS Genet. 2014;10 doi: 10.1371/journal.pgen.1004722. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Wallace C. Eliciting priors and relaxing the single causal variant assumption in colocalisation analyses. PLoS Genet. 2020;16 doi: 10.1371/journal.pgen.1008720. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Wallace C. A more accurate method for colocalisation analysis allowing for multiple causal variants. PLoS Genet. 2021;17 doi: 10.1371/journal.pgen.1009440. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Giambartolomei C., Vukcevic D., Schadt E.E., Franke L., Hingorani A.D., Wallace C., Plagnol V. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 2014;10 doi: 10.1371/journal.pgen.1004383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Zhu Z., Zhang F., Hu H., Bakshi A., Robinson M.R., Powell J.E., et al. Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets. Nat Genet. 2016;48:481–487. doi: 10.1038/ng.3538. [DOI] [PubMed] [Google Scholar]
  • 80.Barbeira A.N., Dickinson S.P., Bonazzola R., Zheng J., Wheeler H.E., Torres J.M., et al. Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics. Nat Commun. 2018;9:1825. doi: 10.1038/s41467-018-03621-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Hudson M., Garrison N.A., Sterling R., Caron N.R., Fox K., Yracheta J., et al. Rights, interests and expectations: Indigenous perspectives on unrestricted access to genomic data. Nat Rev Genet. 2020;21:377–384. doi: 10.1038/s41576-020-0228-x. [DOI] [PubMed] [Google Scholar]
  • 82.Rainie S.C., Kukutai T., Walter M., Figueroa-Rodríguez O.L., Walker J., Axelsson P. In: The State of Open Data: Histories and Horizons. Davies T., Walker S., Rubinstein M., Perini F., editors. African Minds and International Development Research Centre; Cape Town and Ottawa: 2019. Indigenous data sovereignty; pp. 300–319. [Google Scholar]
  • 83.Carroll S.R., Garba I., Figueroa-Rodríguez O.L., Holbrook J., Lovett R., Materechera S., et al. The CARE principles for Indigenous data governance. Data Sci J. 2020;19:43. [Google Scholar]
  • 84.Carroll S.R., Garba I., Plevel R., Small-Rodriguez D., Hiratsuka V.Y., Hudson M., Garrison N.A. Using Indigenous standards to implement the CARE principles: Setting expectations through tribal research codes. Front Genet. 2022;13 doi: 10.3389/fgene.2022.823309. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Chang C.C., Chow C.C., Tellier L.C., Vattikuti S., Purcell S.M., Lee J.J. Second-generation PLINK: Rising to the challenge of larger and richer datasets. GigaScience. 2015;4:7. doi: 10.1186/s13742-015-0047-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Peng R.D., Dominici F., Zeger S.L. Reproducible epidemiologic research. Am J Epidemiol. 2006;163:783–789. doi: 10.1093/aje/kwj093. [DOI] [PubMed] [Google Scholar]
  • 87.Schapira M., Open Lab Notebook Consortium. Harding R.J. Open laboratory notebooks: Good for science, good for society, good for scientists. F1000Res. 2019;8:87. doi: 10.12688/f1000research.17710.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Arslan R.C. How to automatically document data with the codebook package to facilitate data reuse. Adv Methods Pract Psychol Sci. 2019;2:169–187. [Google Scholar]
  • 89.Peng R.D. Reproducible research in computational science. Science. 2011;334:1226–1227. doi: 10.1126/science.1213847. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Xie Y., Allaire J.J., Grolemund G. Chapman & Hall/CRC; Boca Raton, FL: 2019. R Markdown: The Definitive Guide. [Google Scholar]
  • 91.Prins P., de Ligt J., Tarasov A., Jansen R.C., Cuppen E., Bourne P.E. Toward effective software solutions for big biology. Nat Biotechnol. 2015;33:686–687. doi: 10.1038/nbt.3240. [DOI] [PubMed] [Google Scholar]
  • 92.Byeon Y.J.J., Islamaj R., Yeganova L., Wilbur W.J., Lu Z., Brody L.C., Bonham V.L. Evolving use of ancestry, ethnicity, and race in genetics research-A survey spanning seven decades. Am J Hum Genet. 2021;108:2215–2223. doi: 10.1016/j.ajhg.2021.10.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Lewis A.C.F., Molina S.J., Appelbaum P.S., Dauda B., Di Rienzo A., Fuentes A., et al. Getting genetic ancestry right for science and society. Science. 2022;376:250–252. doi: 10.1126/science.abm7530. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Brothers K.B., Bennett R.L., Cho M.K. Taking an antiracist posture in scientific publications in human genetics and genomics. Genet Med. 2021;23:1004–1007. doi: 10.1038/s41436-021-01109-w. [DOI] [PubMed] [Google Scholar]
  • 95.Rajagopalan R., Fujimura J.H. In: Genetics and the Unsettled Past. Wailoo K., Nelson A., Lee C., editors. Rutgers University Press; New Brunswick, NJ: 2012. Making history via DNA, making DNA from history: Deconstructing the race-disease connection in admixture mapping; pp. 143–163. [Google Scholar]
  • 96.Weiss K.M., Long J.C. Non-Darwinian estimation: My ancestors, my genes’ ancestors. Genome Res. 2009;19:703–710. doi: 10.1101/gr.076539.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Martschenko D.O., Wand H., Young J.L., Wojcik G.L. Including multiracial individuals is crucial for race, ethnicity and ancestry frameworks in genetics and genomics. Nat Genet. 2023;55:895–900. doi: 10.1038/s41588-023-01394-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Heidari S., Babor T.F., De Castro P., Tort S., Curno M. Sex and gender equity in research: Rationale for the SAGER guidelines and recommended use. Res Integr Peer Rev. 2016;1:2. doi: 10.1186/s41073-016-0007-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Miyagi M., Guthman E.M., Sun S.E.D. Transgender rights rely on inclusive language. Science. 2021;374:1568–1569. doi: 10.1126/science.abn3759. [DOI] [PubMed] [Google Scholar]
  • 100.Garcia-Sifuentes Y., Maney D.L. Reporting and misreporting of sex differences in the biological sciences. eLife. 2021;10 doi: 10.7554/eLife.70817. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Rechlin R.K., Splinter T.F.L., Hodges T.E., Albert A.Y., Galea L.A.M. An analysis of neuroscience and psychiatry papers published from 2009 and 2019 outlines opportunities for increasing discovery of sex differences. Nat Commun. 2022;13:2137. doi: 10.1038/s41467-022-29903-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Monk R., Whitehouse A.J.O., Waddington H. The use of language in autism research. Trends Neurosci. 2022;45:791–793. doi: 10.1016/j.tins.2022.08.009. [DOI] [PubMed] [Google Scholar]
  • 103.Vivanti G. Ask the Editor: What is the most appropriate way to talk about individuals with a diagnosis of autism? J Autism Dev Disord. 2020;50:691–693. doi: 10.1007/s10803-019-04280-x. [DOI] [PubMed] [Google Scholar]
  • 104.Held U., Steigmiller K., Hediger M., Cammann V.L., Garaiman A., Halvachizadeh S., et al. The incremental value of the contribution of a biostatistician to the reporting quality in health research-A retrospective, single center, observational cohort study. PLoS One. 2022;17 doi: 10.1371/journal.pone.0264819. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Wand H., Lambert S.A., Tamburro C., Iacocca M.A., O’Sullivan J.W., Sillari C., et al. Improving reporting standards for polygenic scores in risk prediction studies. Nature. 2021;591:211–219. doi: 10.1038/s41586-021-03243-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Skrivankova V.W., Richmond R.C., Woolf B.A.R., Yarmolinsky J., Davies N.M., Swanson S.A., et al. Strengthening the reporting of observational studies in epidemiology using Mendelian randomization: The STROBE-MR statement. JAMA. 2021;326:1614–1621. doi: 10.1001/jama.2021.18236. [DOI] [PubMed] [Google Scholar]
  • 107.Little J., Higgins J.P.T., Ioannidis J.P.A., Moher D., Gagnon F., von Elm E., et al. STrengthening the REporting of Genetic Association Studies (STREGA): An extension of the STROBE statement. PLoS Med. 2009;6:e22. doi: 10.1371/journal.pmed.1000022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 108.Little J., Higgins J., editors. The HuGENetTM HuGE Review Handbook, Version 1.0. 2006. http://www.medicine.uottawa.ca/public-health-genomics/web/assets/documents/HuGE_Review_Handbook_V1_0.pdf Available at: [Google Scholar]
  • 109.Haber N.A., Wieten S.E., Rohrer J.M., Arah O.A., Tennant P.W.G., Stuart E.A., et al. Causal and associational language in observational health research: A systematic evaluation. Am J Epidemiol. 2022;191:2084–2097. doi: 10.1093/aje/kwac137. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 110.Ioannidis J.P., Boffetta P., Little J., O’Brien T.R., Uitterlinden A.G., Vineis P., et al. Assessment of cumulative evidence on genetic associations: Interim guidelines. Int J Epidemiol. 2008;37:120–132. doi: 10.1093/ije/dym159. [DOI] [PubMed] [Google Scholar]
  • 111.van der Bles A.M., van der Linden S., Freeman A.L.J., Mitchell J., Galvao A.B., Zaval L., Spiegelhalter D.J. Communicating uncertainty about facts, numbers and science. R Soc Open Sci. 2019;6 doi: 10.1098/rsos.181870. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112.Carlson J., Henn B.M., Al-Hindi D.R., Ramachandran S. Counter the weaponization of genetics research by extremists. Nature. 2022;610:444–447. doi: 10.1038/d41586-022-03252-z. [DOI] [PubMed] [Google Scholar]
  • 113.Kirkham J.J., Penfold N.C., Murphy F., Boutron I., Ioannidis J.P., Polka J., Moher D. Systematic examination of preprint platforms for use in the medical and biomedical sciences setting. BMJ, (Open) 2020;10 doi: 10.1136/bmjopen-2020-041849. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 114.Watson C. Rise of the preprint: How rapid data sharing during COVID-19 has changed science forever. Nat Med. 2022;28:2–5. doi: 10.1038/s41591-021-01654-6. [DOI] [PubMed] [Google Scholar]
  • 115.Staniszewska S., Hickey G., Coutts P., Thurman B., Coldham T. Co-production: A kind revolution. Res Involv Engagem. 2022;8:4. doi: 10.1186/s40900-022-00340-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 116.Rose D., Kalathil J. Power, privilege and knowledge: The untenable promise of co-production in mental “health.”. Front Sociol. 2019;4:57. doi: 10.3389/fsoc.2019.00057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 117.Blakey H. University of Bradford, Department of Peace Studies, International Centre for Participation Studies. ICPS Working Paper 2; 2005. Participation . . . why bother?: The views of Black and minority ethnic mental health service users on participation in the NHS in Bradford.http://hdl.handle.net/10454/3798 Available at: [Google Scholar]
  • 118.Montgomery L., Hartley J., Neylon C., Gillies M., Gray E., Herrmann-Pillath C., et al. Open Knowledge Institutions: Reinventing Universities. The MIT Press; Cambridge, MA: 2021. Diversity; pp. 47–65. [Google Scholar]
  • 119.Eitzel M.V., Cappadonna J.L., Santos-Lang C., Duerr R.E., Virapongse A., West S.E., et al. Citizen science terminology matters: Exploring key terms. CSTP. 2017;2:1. [Google Scholar]
  • 120.Guerrini C.J., Trejo M., Canfield I., McGuire A.L. Core values of genomic citizen science: Results from a qualitative interview study. BioSocieties. 2022;17:203–228. doi: 10.1057/s41292-020-00208-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 121.Guerrini C.J., Contreras J.L. Credit for and control of research outputs in genomic citizen science. Annu Rev Genomics Hum Genet. 2020;21:465–489. doi: 10.1146/annurev-genom-083117-021812. [DOI] [PubMed] [Google Scholar]
  • 122.de Vries M., Land-Zandstra A., Smeets I. Citizen scientists’ preferences for communication of scientific output: A literature review. CSTM. 2019;4:2. [Google Scholar]
  • 123.Lázaro-Muñoz G., Torgerson L., Pereira S. Return of results in a global survey of psychiatric genetics researchers: Practices, attitudes, and knowledge. Genet Med. 2021;23:298–305. doi: 10.1038/s41436-020-00986-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 124.Matimba A., Ali S., Littler K., Madden E., Marshall P., McCurdy S., et al. Guideline for feedback of individual genetic research findings for genomics research in Africa. BMJ Glob Health. 2022;7 doi: 10.1136/bmjgh-2021-007184. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 125.Aizawa Y., Nagami F., Ohashi N., Kato K. A proposal on the first Japanese practical guidance for the return of individual genomic results in research settings. J Hum Genet. 2020;65:251–261. doi: 10.1038/s10038-019-0697-y. [DOI] [PubMed] [Google Scholar]
  • 126.Lewis A.C.F., Knoppers B.M., Green R.C. An international policy on returning genomic research results. Genome Med. 2021;13:115. doi: 10.1186/s13073-021-00928-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 127.Guerrini C.J., Lewellyn M., Majumder M.A., Trejo M., Canfield I., McGuire A.L. Donors, authors, and owners: How is genomic citizen science addressing interests in research outputs? BMC Med Ethics. 2019;20:84. doi: 10.1186/s12910-019-0419-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 128.Ekins S., Perlstein E.O. Ten simple rules of live tweeting at scientific conferences. PLoS Comput Biol. 2014;10 doi: 10.1371/journal.pcbi.1003789. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 129.Cheplygina V., Hermans F., Albers C., Bielczyk N., Smeets I. Ten simple rules for getting started on Twitter as a scientist. PLoS Comput Biol. 2020;16 doi: 10.1371/journal.pcbi.1007513. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 130.Whitaker K.J., Guest O. #bropenscience is broken science. Psychologist. 2020;33:34–37. [Google Scholar]
  • 131.Arnal A., Epifanio I., Gregori P., Martínez V. Ten simple rules for organizing a non-real-time web conference. PLoS Comput Biol. 2020;16 doi: 10.1371/journal.pcbi.1007667. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Data
mmc1.pdf (231.3KB, pdf)

Articles from Biological Psychiatry Global Open Science are provided here courtesy of Elsevier

RESOURCES