Abstract
The gender gap is a well-known problem in academia and, despite its gradual narrowing, recent estimations indicate that it will persist for decades. Short-term descriptive studies suggest that this gap may have actually worsened during the months of confinement following the start of the COVID-19 pandemic in 2020. In this work, we evaluate the impact of the COVID-19 lockdown on female and male academics’ research productivity using preprint drop-off data. We examine a total of 307,902 unique research articles deposited in 5 major preprint repositories during the period between January and May each year from 2017 to 2020. We find that the proportion of female authors in online repositories steadily increased over time; however, the trend reversed during the confinement and gender parity worsened in two respects. First, the proportion of male authors in preprints increased significantly during lockdown. Second, the proportion of male authors in COVID-19-related articles was significantly higher than that of women. Overall, our results imply that the gender gap in academia suffered an approximately 1-year setback during the strict lockdown months of 2020, and COVID-related research areas suffered an additional 1.5-year setback.
Introduction
The under-representation of women in scientific publications is well established in the literature. Despite their growing presence in all research areas, women continue to publish less than men [1–3], even in fields where they are not a minority [4]. In addition, women are much less likely to be included as first or last authors in an article [3, 5]. According to recent estimates by Holman and his colleagues [6], a significant gender gap will persist for decades, especially in the areas of computer science, physics, and mathematics. As a result, women are less likely than men to be granted tenure or promoted [1, 7, 8].
How has the gender gap in scientific production changed during the global lockdown? And how might these changes affect the career paths of men and women in the short and medium term? Here, we build on recent research to argue that the move towards gender parity could slow down as a result of COVID-19. On the one hand, surveys examining time allocation and research production during the pandemic suggest that research productivity has decreased more for women, particularly those with young children [9–11]. On the other hand, research funds are being redirected to support coronavirus-related studies, even at the cost of other less cutting-edge topics. Those with the ability to initiate COVID-related research projects will be more likely to benefit from these new lines of funding. If the majority of scholars conducting such novel research are men, then they will have gained a further advantage in the coming years. All in all, lower levels of scientific production during confinement could be detrimental to women—particularly those in early career stages [12]—who may see diminished promotion possibilities or even risk losing their jobs.
Assessing the magnitude and scope of the new gap in scientific production is, therefore, crucial to designing and implementing effective actions that will prevent a backslide in academic gender equity. With this aim, we model the evolution of the gender gap in preprint submissions from January 2017 to May 2020 to measure the impact of COVID-19 on male and female scientific productivity. Specifically, we examine a total of 307,902 unique research articles deposited in 5 major repositories: arXiv, medRxiv, bioRxiv, PsyArXiv, and SocArXiv. A preprint is a full draft article that is shared publicly before it has been peer reviewed. Preprints offer strong benefits, such as the possibility of receiving feedback and increased visibility, which often results in a higher number of citations. In addition, during the COVID-19 pandemic, they have enabled researchers to share data and essential findings at unprecedented speeds. We aim to quantify the effect of confinement on the likelihood of men and women to publish a preprint and, more specifically, to publish COVID-related research.
Our research contributes to prior studies in several ways. First, unlike previous research focusing on a very limited period [13–15], we examine preprint submission trends from 2017 to 2020. By expanding the observation window, we are able to discern how much of the observed change in 2020 corresponds to the effect of lockdown and social distancing and how much is due to gender differences in the circulation of scientific knowledge over time [16–18]. Second, unlike previous studies, we provide a systematic analysis of academic fields. Some scholars have examined very broad areas of knowledge, such as mathematics, physics, or economics [13–15], while others have focused on journals on a very specific topic, such as medicine [19, 20] or biological sciences [21]. Our study seeks to fill this gap by covering a total of 10 academic fields and 250 sub-fields. Thus, we provide a comprehensive view of the pandemic’s impact on scientific productivity. Also, as we will see, this level of granularity is crucial to avoid incurring Simpson’s paradox, where group trends disappear or reverse when data is aggregated. Third, our model discriminates between COVID-19-related research and general research. While the coronavirus has brought many challenges to academic research, it has also created opportunities for research moving forward. Therefore, it is key to look at who is taking advantage of these new opportunities.
Finally, we pay detailed attention to authorship order. Given the growing tendency across scientific disciplines to write multi-author papers, the sequence of names is becoming a major topic in recruitment processes, promotion, and tenure [3, 22, 23]. Thus, we not only examine whether gendered patterns of authorship vary after lockdown among all authors but also among solo authors, first authors, and last authors. It is common practice across many areas that the first author contributes most to the work and receives the most credit. Therefore, given the asymmetric share of domestic responsibilities women have assumed during confinement [24–26], we expect a decrease in females listed as first authors and as solo authors, the two positions requiring the most intensive research work. This decline might be particularly noticeable in areas of knowledge where women have recently joined, since young female academics are more likely to have kids and experience the caregiver burden. As for the other authorship positions, the pre-lockdown expectations are not so clear. In some disciplines, the last author position is reserved for the senior author (first-last-author-emphasis norm, or FLAE), while in other fields the author sequence reflects their relative contributions to the manuscript (sequence-determines-credit approach, or SDC) [23]. Consequently, variations in gender composition after lockdown cannot be easily anticipated.
Data and methods
The complete dataset is publicly available on Zenodo [27], where the software repository containing the replication scripts is linked as supplementary materials.
Data collection and integration
Using their APIs, we collected public data from the top 5 preprint servers in terms of submissions with availability since January 2017 (namely, arXiv, bioRxiv, medRxiv, PsyArXiv, and SocArXiv). We used R [28], and, specifically, the following packages: aRxiv [29], medrxivr [30] and osfr [31]. We downloaded all submission records from January to May for the years 2017, 2018, 2019, and 2020. The arXiv API returns the date of the last update for preprints with more than one submission. For the other repositories, the date is defined as the first posting date. Specifically, we collected information on full author names, order of appearance, publishing date, categories, subcategories, article title, abstract, and keywords. We restricted the analysis to the March-May period, when the lockdown was considerably uniform among countries. When contagions dropped during June and July, internal and external border restrictions were relaxed. However, the restrictions were lifted unevenly and the confinement situation became more heterogeneous across countries and continents, potentially skewing the data. In addition, focusing on the short-term period allows us to capture gender differences when researchers are put under time pressure and must carry out their work in challenging circumstances.
Preprint categorization
Different preprint servers require different approaches to preprint categorization. On the one hand, the arXiv repository consists of 8 categories (such as Computer Science, Mathematics, or Physics) and a reduced set of subcategories for large subject areas within those main categories. bioRxiv and medRxiv follow a similar approach for the Biology and Health Sciences categories, respectively. On the other hand, PsyArXiv and SocArXiv (Psychology and Social Sciences) allow the authors to freely tag submissions with a (potentially unlimited) number of areas, sub-areas, and even more specific fields of study. As a result, compared to the rest of the repositories, PsyArXiv and SocArXiv contain a large number of subcategories for a relatively small number of preprints, as shown in S1 Table.
Thus, to better balance our categorization, we pre-processed PsyArXiv and SocArXiv data to consider just the preprint’s main subcategory. To identify this main subcategory, we first sorted all the unique tags in descending order based on the number of papers using them. The main subcategory for each preprint was then defined as the first appearance in the previous list, and the rest of the tags were removed. We also discarded those subcategories with fewer than 100 preprints, and manually recoded some SocArXiv subcategories that were still too specific (see S2 Table). Finally, arXiv categories “Quantitative Biology” and “Quantitative Finance” were recoded and merged into Biology and Economics, respectively, and the Psychology subcategory contained in SocArXiv was merged into the Psychology category.
The summary after this pre-processing is shown in Table 1. While public repositories have gained popularity in all fields of knowledge, the table reveals that significant differences among disciplines persist. Physics, Computer Science, and Mathematics are the three areas with the highest number of submissions. At the other extreme are Economics, Social Science, and Psychology. These figures are consistent with previous research showing that journals from STEM disciplines have clearer policies regarding preprinting than journals from the Social Sciences and Humanities, which could affect authors’ decisions to submit their work to open-access repositories [32].
Table 1. Number of subcategories and preprints per category after pre-processing.
Category | # subcategories | # preprints |
---|---|---|
Biology | 34 | 44454 |
Computer Science | 40 | 79653 |
Economics | 12 | 2869 |
Elec. Eng. Systems Science | 4 | 10686 |
Health Sciences | 52 | 6100 |
Mathematics | 32 | 78187 |
Physics | 51 | 116983 |
Psychology | 14 | 4120 |
Social Sciences | 5 | 1938 |
Statistics | 6 | 22040 |
Total: | 250 | 367030 |
Next, a data quality assessment was conducted to detect and remove possible inconsistencies. First, we processed full names to remove stop words, places, and institutions, so that, for example, an author’s institution did not register as an additional author. Second, we removed articles with inconsistent dates. Finally, because in some repositories supplementary materials appeared as an additional posting, we removed them, as well as preprints that were marked as withdrawn from the repository.
Inferring gender from authors’ given names
We used the genderize.io database to assign gender to authors based on their first names. This is one of the most effective gender prediction tools [33] and has been widely used in the literature ([6], among others). One of the most important advantages of using name-to-gender inference services is that, compared to standard approaches for name-to-gender inference based on administrative data (census data, administration records, or country-specific birth lists), they allow for a robust prediction for names from countries all over the world (see [34] for an evaluation of different web services). Our choice, Genderize, is a database of name–gender associations assembled from all over the web (> 114M given names for approximately 80 countries as of January 2021), and thus is a good option for analyses on data outside of a national context. Gender data was collected via Genderize’s API using the genderizeR R package [35] for a total of 1,235,037 unique authors.
In addition to predicting gender for a given name, Genderize returns additional information to quantify the precision of such predictions, namely, count and probability. The count shows how many instances in the database associate a given first name with the predicted gender, and the probability corresponds to the proportion, or frequency, of such associations. Unfortunately, gender cannot be predicted when the authors’ given names were written as initials or were absent from the Genderize database, and these instances were reported as missing cases. The Genderize API reported a total of 28% of missing cases. Additionally, following Holman and his colleagues [6], we only consider gender identification with a probability higher or equal to 0.95 and a frequency of at least 10 appearances in the Genderize database. This simple procedure preserved 80% of cases. After filtering out missing cases, the proportion of preprints included in the analyses were equivalent for all the years (see Table 2), indicating that this procedure did not introduce any under- or over-representation bias.
Table 2. Preprints considered per year and model.
Model | 2017 | 2018 | 2019 | 2020 | ||||
---|---|---|---|---|---|---|---|---|
p | N | p | N | p | N | p | N | |
all authors | 0.76 | 39843 | 0.78 | 49623 | 0.80 | 60939 | 0.79 | 91030 |
first author | 0.66 | 13941 | 0.65 | 19002 | 0.65 | 24857 | 0.64 | 37993 |
last author | 0.74 | 15379 | 0.73 | 21252 | 0.73 | 27765 | 0.73 | 43014 |
single author | 0.78 | 6605 | 0.77 | 7010 | 0.77 | 7328 | 0.76 | 9965 |
Measuring the effect of lockdown
We are interested in estimating the gender gap in preprint submissions and measuring how much of this gap can be attributed to the global lockdown. Previous research examining gender differences in publications has mostly used generalized linear models (GLMs) to estimate the gender proportion and its rate of change ([6, 36], among others). However, this approach takes the averages of the individual-level variables, discarding valuable within-group information that may reveal opposing trends. A potential alternative would be to disaggregate and introduce categories and subcategories as fixed effects in the GLM design, but this would violate the assumption of independence of the observations, thus biasing the results. To account for this drawback, we employ a hierarchical (or multilevel) GLM, which explicitly models the nested nature of this data.
More concretely, we define a fractional hierarchical GLM model that captures the proportion of males as a function of time and measures the effect of the lockdown and type of research (directly related to COVID-19 versus not directly related), with a random intercept per category and subcategory. We consider the quasi-binomial family to describe the error distribution (to account for overdispersion) with the logit link function:
(1) |
where i, j, and k index observations, subcategories, and categories, respectively; the terms ϵi, γj, and ηk are normal errors at the individual and cluster levels; the response variable pmale is the proportion of males; ‘year’ is a continuous variable that takes a value of 0 at the start of our time window (i.e., 2017 → 0, 2018 → 1 and so on); ‘lockdown’ is a binary factor that is equal to 1 during the lockdown period, from March to May 2020; ‘COVID paper’ is a binary factor that is equal to 1 for preprints directly related to COVID-19, defined as those preprints containing “coronavirus”, “sars-cov-2,” or “covid-19” in their title (restricted to 2020 and with case-insensitive matching); and where we consider a random intercept that varies across categories and subcategories inside categories.
Within this framework, we consider four distinct models. Namely, we estimate the monthly proportion of males for (1) all authors, (2) first authors, (3) last authors, and (4) single authors. In all cases, pmale is computed as the total number of males over the total number of males and females identified per month, excluding missing values. In the case of all authors, preprints with missing gender rates greater than 25% are not considered, and subcategories with fewer than 30 authors per month are dropped too. In the case of first and last authors, preprints with an alphabetically-ordered list of authors are discarded, as alphabetical sequence is frequently used to acknowledge similar contributions or to avoid disharmony among collaborating groups [23]. In the case of single authors, only preprints with one author are considered.
Results and discussion
Gender trends in preprints submission over time
Global numbers in Fig 1 show that the trend of preprint submissions has accelerated notably over the previous three years, and especially during lockdown. This effect is particularly pronounced in fields where COVID-related production is more likely—such as biology, health sciences (vaccines, epidemiology, etc.), and mathematics (epidemiological models)—but is also clearly evident in computer science, economics, engineering, physics, and psychology. Also, the time trend in the social sciences and economics is less constant than in the other areas. This is largely due to the lower number of submissions registered in these fields. As we have seen in Table 1, both social sciences and economics rank at the bottom in terms of the number of preprints received, with N = 1938 and N = 2869, respectively, far behind other areas such as physics (N = 116983), computer science (N = 79653), and mathematics (N = 78187). In spite of these variations, results suggest that research in all areas has been very prolific during 2020, particularly during the lockdown months.
Next, Fig 2 displays the male proportion by category. In general terms, we do observe a slowly declining global trend. This is mainly driven by categories that are already more feminized than the average, such as biology, health sciences, and psychology. As for the rest of the categories, the gender gap has remained rather stable during the period considered, in consonance with previous findings [6]. The social sciences are the only exception to the general trend, showing a slight upswing in the proportion of males. As discussed above, this oddity might be related to a less frequent and irregular use of online servers in this particular field of study.
Consistent with previous studies [14, 15], Fig 2 reveals a slowing down of the feminization process during the pandemic. This slowdown can be separately observed in biology and psychology, and the gender gap has even started to grow in health sciences and economics. However, it is difficult to conclude with certainty whether such an effect exists and, even if it does, whether it can be ultimately attributable to the lockdown period, as other authors have suggested from similar descriptive analyses. To better account for these trends, we next explore this issue in a hierarchical modelling framework.
Explaining the gender gap
We estimate the monthly proportion of males for (1) all authors, (2) first authors, (3) last authors, and (4) single authors, as described in the Methods section. Our model separates the temporal trend from the effect of the lockdown, controlling for COVID-related work, and uses random intercepts to take into account the hierarchical nature of the data, which is nested in categories and subcategories. Table 3 shows the coefficient estimates and main summary statistics for all the models considered.
Table 3. Regression analysis results.
Dependent variable: | |||||
---|---|---|---|---|---|
Proportion of males | |||||
all authors | all authors | first author | last author | single author | |
fractional GLM | fractional GLMM | ||||
(0) | (1) | (2) | (3) | (4) | |
year | −0.104*** | −0.049*** | −0.044*** | −0.059*** | −0.029 |
(0.003) | (0.003) | (0.009) | (0.009) | (0.018) | |
lockdown | 0.072*** | 0.031*** | 0.035* | 0.008 | 0.136*** |
(0.007) | (0.007) | (0.020) | (0.021) | (0.052) | |
COVID paper | −0.614*** | 0.076*** | 0.399*** | 0.129** | 0.715** |
(0.019) | (0.022) | (0.064) | (0.065) | (0.298) | |
(Intercept) | 1.826*** | 1.595*** | 1.520*** | 1.819*** | 2.480*** |
(0.006) | (0.199) | (0.195) | (0.173) | (0.169) | |
Observations | 3,368 | 3,368 | 3,996 | 4,027 | 3,517 |
N (subcategory) | 192 | 201 | 201 | 200 | |
N (category) | 10 | 10 | 10 | 10 | |
sd(subcategory) | 0.29 | 0.32 | 0.30 | 0.45 | |
sd(category) | 0.62 | 0.60 | 0.53 | 0.47 | |
Log Likelihood | −39,496.310 | −11,327.900 | −7,182.127 | −6,955.634 | −3,788.911 |
Akaike Inf. Crit. | 79,000.620 | 22,667.800 | 14,376.250 | 13,923.270 | 7,589.823 |
Bayesian Inf. Crit. | 22,704.540 | 14,414.010 | 13,961.070 | 7,626.815 | |
Pseudo-R2 | 0.04 | 0.83 | 0.24 | 0.18 | 0.11 |
Note:
*p<0.1;
**p<0.05;
***p<0.01
The use of a GLMM model is justified by comparing model (1) with model (0), which is a GLM for the same data but does not account for this hierarchical structure. There are three reasons that lead us to opt for the GLMM model. First, the GLMM achieves a much better fit and predictive power (83% of variance explained). Second, the model intercept correctly captures the overall average proportion of males at the beginning of our time window (∼83% of males as of January 2017, as can be seen in Fig 2). Finally, once we include categories and subcategories as random intercepts, we observe that the sign of the ‘COVIDpaper’ coefficient changes. This result reveals that disaggregation is necessary to avoid incurring a Simpson’s paradox. Moreover, these random intercepts also correctly capture well-known differences among categories and subcategories. For example, we observe that STEM categories have a wider gender gap than the average, while non-STEM ones are much more feminized (see S1 Fig). Similarly, subcategories such as “high-energy” and “quantum physics” are more masculinized than the average for all physics, while astrophysics-related research as well as “bio-medical physics” are more feminized (see S2 Fig).
To facilitate model comparison, Fig 3 displays the relative effect size with 95% confidence intervals (CI) for the three fixed effects in (1–4). Results can be summarized in three key points. First, findings confirm a slow but significative decreasing trend in the overall proportion of male authors (OR 0.95, 95% CI 0.95–0.96); this holds for first (OR 0.96, 95% CI 0.94–0.97) as well as last authors (OR 0.94, 95% CI 0.93–0.96). The reduced number of single authors, however, does not provide enough statistical power to measure such a small effect (should it exist), but the point estimate is consistent with the estimates for the rest of the models (OR 0.97, 95% CI 0.94–1.01).
Second, we find that for all authors, there is a measurable lockdown effect that is slightly smaller than the yearly effect but with the opposite sign (OR 1.03, 95% CI 1.02–1.05): the lockdown has partially reversed the yearly decrease in the proportion of male authors that would be expected in 2020 given the trend from previous years. A very similar effect for first authors could exist, although the variability is higher and thus there is not enough statistical power to reach a stronger conclusion (OR 1.04, 95% CI 1.00–1.08). Regarding single authors, a statistically significant and potentially much stronger effect can be observed (OR 1.15, 95% CI 1.03–1.27). This can possibly be explained by the greater effort and time required to produce single-authored papers, which could negatively affect women, especially those with children [24–26]. Finally, no effect is found for last authors (OR 1.01, 95% CI 0.97–1.05). This may be due to the fact that this co-authorship position has different meanings across disciplines. While in disciplines following the first-last-author-emphasis norm, last positions tend to be reserved for senior researchers with consolidated careers (e.g., senior female authors), the last position in the sequence-determines-credit approach is more likely to correspond to less-contributing authors (e.g., young women with child-rearing responsibilities). It seems reasonable to think that the latter may have had more trouble juggling child care and research than the former. These factors might result in a lower average value, and otherwise larger variability, for last author effects.
Third, we find an additional masculinization effect in COVID-related preprints that is equal or larger than the yearly feminization effect for all the models considered (all authors: OR 1.08, 95% CI 1.03–1.13; first author: OR 1.49, 95% CI 1.31–1.69; last author: OR 1.14, 95% CI 1.00–1.29; single author: OR 2.04, 95% CI 1.14–3.67). In other words, overall, female authors’ production has been penalized during lockdown compared to their male peers, but especially in those disciplines where increased productivity is directly linked to COVID-19 research. While the COVID-19 pandemic has resulted in unprecedented research opportunities worldwide, women have not benefited as much as men have. This is particularly noticeable among single authors and first authors, the two most time-consuming positions. As for the last authorship position, the relative effect size is lower but still significant.
Conclusion
In this paper, we examine how the global lockdown has affected the gender productivity gap in academia. More specifically, we model the evolution of the gender gap in preprint submissions between 2017 an 2020. Our findings show that the progress towards gender parity that has been observed over the past few years partially reversed during the lockdown period. The pandemic confinements penalized women in two ways. Not only were they less likely to complete research during the pandemic, but they were also less likely to produce COVID-related research than men, despite the increasing research opportunities that COVID-19 provided in many fields. Overall, results indicate that the gender gap in academia suffered an approximately 1-year setback during the strict lockdown months of 2020, and COVID-related research areas (which incidentally have a better male-female balance) suffered an additional 1.5-year setback.
The results of our research are relevant from an empirical and substantive point of view. From the empirical perspective, our analysis indicates that dissagregated data at the sub-field level is key to untangling within-group trends that are otherwise concealed in global averages due to the profound gender gap difference that still exist across disciplines and sub-disciplines. In this regard, generalized linear mixed models are a natural choice to model the hierarchical structure of such data. From the substantive perspective, results show that COVID-19 lockdowns exacerbated gender inequalities such that their effects will be felt in the years to come.
Current differences in productivity levels might result in higher rates of gender inequality in the next few years. Negative effects in the short and medium term might be twofold. On the one hand, we expect a reallocation of research money at the expense of research areas funded prior to the pandemic, which can lead to an unequal distribution of resources. In addition, lower productivity levels will result in fewer citations, fewer research grants, and lower likelihood of promotion among women. While our work only examines the impact of COVID-19 lockdowns in academia, this prognosis might be valid for all women in high-skilled occupations where promotion tracks and human capital accumulation are crucial during early career years, to the point that early productivity declines might lead to job loss [37, 38]. Therefore, the implementation of gender equity actions is necessary in order to ensure that the COVID-19-related penalty does not translate into inequality in future recruitment and promotion processes.
Supporting information
Data Availability
The complete dataset is archived on Zenodo (https://doi.org/10.5281/zenodo.5142676). R code used to analyse the data is archived at https://github.com/CONCIERGE-CM-UC3M/COVID19-gender-gap.
Funding Statement
This work has been supported by the Madrid Government (Comunidad de Madrid) under the Multiannual Agreement with UC3M in the line of "Fostering Young Doctors Research" (CONCIERGE-CM-UC3M), and in the context of the V PRICIT (Regional Programme of Research and Technological Innovation. M.T. and I.U. also acknowledge support from the Spanish Ministry of Science and Innovation through the research grant RTI2018-098182-A-I00.
References
- 1. Huang J, Gates AJ, Sinatra R, Barabási AL. Historical comparison of gender inequality in scientific careers across countries and disciplines. Proceedings of the National Academy of Sciences. 2020;117(9):4609–4616. doi: 10.1073/pnas.1914221117 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Larivière V, Ni C, Gingras Y, Cronin B, Sugimoto CR. Bibliometrics: Global gender disparities in science. Nature News. 2013;504(7479):211. doi: 10.1038/504211a [DOI] [PubMed] [Google Scholar]
- 3. West JD, Jacquet J, King MM, Correll SJ, Bergstrom CT. The Role of Gender in Scholarly Authorship. PLOS ONE. 2013;8(7):e66212. doi: 10.1371/journal.pone.0066212 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Teele DL, Thelen K. Gender in the Journals: Publication Patterns in Political Science. PS: Political Science & Politics. 2017;50(2):433–447. doi: 10.1017/s1049096516002985 [DOI] [Google Scholar]
- 5. Bendels MHK, Müller R, Brueggmann D, Groneberg DA. Gender disparities in high-quality research revealed by Nature Index journals. PLOS ONE. 2018;13(1):e0189136. doi: 10.1371/journal.pone.0189136 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Holman L, Stuart-Fox D, Hauser CE. The gender gap in science: How long until women are equally represented? PLOS Biology. 2018;16(4):e2004956. doi: 10.1371/journal.pbio.2004956 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Catalyst. Quick Take: Women in Academia; 2020. Available from: https://www.catalyst.org/research/women-in-academia/.
- 8. Hechtman LA, Moore NP, Schulkey CE, Miklos AC, Calcagno AM, Aragon R, et al. NIH funding longevity by gender. Proceedings of the National Academy of Sciences. 2018;115(31):7943–7948. doi: 10.1073/pnas.1800615115 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Myers KR, Tham WY, Yin Y, Cohodes N, Thursby JG, Thursby MC, et al. Unequal effects of the COVID-19 pandemic on scientists. Nature human behaviour. 2020;4(9):880–883. doi: 10.1038/s41562-020-0921-y [DOI] [PubMed] [Google Scholar]
- 10. Barber BM, Jiang W, Morse A, Puri M, Tookes H, Werner IM. What Explains Differences in Finance Research Productivity During the Pandemic? The Journal of Finance. 2021;. doi: 10.1111/jofi.13028 [DOI] [Google Scholar]
- 11. Staniscuaski F, Kmetzsch L, Soletti RC, Reichert F, Zandonà E, Ludwig ZM, et al. Gender, race and parenthood impact academic productivity during the COVID-19 pandemic: from survey to action. Frontiers in psychology. 2021;12. doi: 10.3389/fpsyg.2021.663252 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Harrop C, Bal V, Carpenter K, Halladay A. A lost generation?The impact of the COVID-19 pandemic on early career ASD researchers. Autism Research. 2021;. doi: 10.1002/aur.2503 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Cooke SJ, Cramp RL, Madliger CL, Bergman JN, Reeve C, Rummer JL, et al. Conservation physiology and the COVID-19 pandemic. Conservation Physiology. 2021;9(1):coaa139. doi: 10.1093/conphys/coaa139 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Viglione G. Are women publishing less during the pandemic? Here’s what the data say. Nature. 2020;581:365–366. doi: 10.1038/d41586-020-01294-9 [DOI] [PubMed] [Google Scholar]
- 15. Matthews D. Pandemic lockdown holding back female academics, data show. Times Higher Education. 2020;. [Google Scholar]
- 16. Zhu Y. Who support open access publishing? Gender, discipline, seniority and other factors associated with academics’ OA practice. Scientometrics. 2017;111(2):557–579. doi: 10.1007/s11192-017-2316-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. J T, A K, J M, C R, E MH, C P, et al. Gender differences and bias in open source: pull request acceptance of women versus men. PeerJ Computer Science 3:e111. 2017. doi: 10.7717/peerjcs.111 [DOI] [Google Scholar]
- 18. Ruggieri R, Pecoraro F, Luzi D. An intersectional approach to analyse gender productivity and open access: a bibliometric analysis of the Italian National Research Council. Scientometrics. 2021;126(2):1647–1673. doi: 10.1007/s11192-020-03802-0 [DOI] [Google Scholar]
- 19. Andersen JP, Nielsen MW, Simone NL, Lewiss RE, Jagsi R. Meta-Research: COVID-19 medical papers have fewer women first authors than expected. elife. 2020;9:e58807. doi: 10.7554/eLife.58807 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Gayet-Ageron A, Messaoud KB, Richards M, Schroter S. Female authorship of covid-19 research in manuscripts submitted to 11 biomedical journals: cross sectional study. BMJ. 2021;375. doi: 10.1136/bmj.n2288 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Ribarovska AK, Hutchinson MR, Pittman QJ, Pariante C, Spencer SJ. Gender inequality in publishing during the COVID-19 pandemic. Brain, behavior, and immunity. 2021;91:1. doi: 10.1016/j.bbi.2020.11.022 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Macaluso B, Larivière V, Sugimoto T, Sugimoto CR. Is science built on the shoulders of women? A study of gender differences in contributorship. Academic Medicine. 2016;91(8):1136–1142. doi: 10.1097/ACM.0000000000001261 [DOI] [PubMed] [Google Scholar]
- 23. Tscharntke T, Hochberg ME, Rand TA, Resh VH, Krauss J. Author Sequence and Credit for Contributions in Multiauthored Publications. PLOS Biology. 2007;5(1):1–2. doi: 10.1371/journal.pbio.0050018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Ruppanner L, Tan X, Scarborough W, Landivar LC, Collins C. Shifting Inequalities? Parents’ Sleep, Anxiety, and Calm during the COVID-19 Pandemic in Australia and the United States. Men and Masculinities. 2021;24(1):181–188. doi: 10.1177/1097184X21990737 [DOI] [Google Scholar]
- 25. Collins C, Landivar LC, Ruppanner L, Scarborough WJ. COVID-19 and the Gender Gap in Work Hours. Gender Work Organ. 2020;n/a(n/a). doi: 10.1111/gwao.12506 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Landivar LC, Ruppanner L, Scarborough WJ, Collins C. Early Signs Indicate That COVID-19 Is Exacerbating Gender Inequality in the Labor Force. Socius. 2020;6:2378023120947997. doi: 10.1177/2378023120947997 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Ucar I, Torre M, Fernández AE. CONCIERGE-CM-UC3M/COVID19-gender-gap; 2021. Available from: 10.5281/zenodo.5142676. [DOI]
- 28.R Core Team. R: A Language and Environment for Statistical Computing; 2019. Available from: https://www.R-project.org/.
- 29.Ram K, Broman K. aRxiv: Interface to the arXiv API; 2019.
- 30.McGuinness LA, Schmidt L. medrxivr: Accessing medRxiv data in R; 2020. Available from: https://github.com/mcguinlu/medrxivr.
- 31. Wolen AR, Hartgerink CHJ, Hafen R, Richards BG, Soderberg CK, York TP. osfr: An R Interface to the Open Science Framework. Journal of Open Source Software. 2020;5(46):2071. doi: 10.21105/joss.02071 [DOI] [Google Scholar]
- 32. Klebel T, Reichmann S, Polka J, McDowell G, Penfold N, Hindle S, et al. Peer review and preprint policies are unclear at most major journals. PLOS ONE. 2020;15(10):1–19. doi: 10.1371/journal.pone.0239518 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Wais K. Gender Prediction Methods Based on First Names with genderizeR. R J. 2016;8(1):17. doi: 10.32614/RJ-2016-002 [DOI] [Google Scholar]
- 34. Santamaría L, Mihaljević H. Comparison and benchmark of name-to-gender inference services. PeerJ Computer Science. 2018;4:e156. doi: 10.7717/peerj-cs.156 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Wais K. Gender Prediction Methods Based on First Names with genderizeR. The R Journal. 2006;8(1):17–37. doi: 10.32614/RJ-2016-002 [DOI] [Google Scholar]
- 36. van den Besselaar PAA, Sandstrom U. Vicious circles of gender bias, lower positions and lower impact: gender differences in scholarly productivity and impact. PLoS ONE. 2017;12(8):e0183301. doi: 10.1371/journal.pone.0183301 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Antecol H, Bedard K, Stearns J. Equal but Inequitable: Who Benefits from Gender-Neutral Tenure Clock Stopping Policies? American Economic Review. 2018;108(9):2420–41. [Google Scholar]
- 38. Rosen S. Contracts and the Market for Executives. National Bureau of Economic Research; 1990. 3542. Available from: http://www.nber.org/papers/w3542. [Google Scholar]