Abstract
A number of studies suggest that scientific papers with women in leading-author positions attract fewer citations than those with men in leading-author positions. We report the results of a matched case-control study of 1,269,542 papers in selected areas of medicine published between 2008 and 2014. We find that papers with female authors are, on average, cited between 6.5 and 12.6% less than papers with male authors. However, the standardized mean differences are very small, and the percentage overlaps between the distributions for male and female authors are extensive. Adjusting for self-citations, number of authors, international collaboration and journal prestige, we find near-identical per-paper citation impact for women and men in first and last author positions, with self-citations and journal prestige accounting for most of the small average differences. Our study demonstrates the importance of focusing greater attention on within-group variability and between-group overlap of distributions when interpreting and reporting results of gender-based comparisons of citation impact.
Research organism: None
Introduction
Over the past four decades, the share of female graduates in medicine has increased from less than 10% to more than 50% in OECD countries, and recent statistics suggest near-parity in the representation of women and men as authors in medical research in Australia, Brazil, Chile, Europe and North America (Huggett, 2017; OECD, 2019). However, gender inequalities persist in the upper echelons of academic medicine. As of 2013, women constituted just 21% of full professors in the United States and just 23% in Europe, with the proportion of female department chairs and deans also remaining low (European Commission, 2016; Lautenberger et al., 2014).
These gender imbalances likely reflect myriad obstacles to women's career progress, including chilly and sometimes hostile work climates (Carr et al., 2003; Jenner et al., 2019; Pololi et al., 2013), bias in recruitment and selection practices (Van den Brink, 2011), societal cultures that still expect a strongly gendered division of domestic labor (Jolly et al., 2014), an underrepresentation of women in last-author positions (González-Álvarez and Cervera-Crespo, 2019; Jagsi et al., 2006; Lerchenmueller and Sorenson, 2018), and disparities in research funding (Jagsi et al., 2009; Sege et al., 2015). Given that citation indicators are increasingly being used to inform tenure, hiring and funding decisions in many areas of the medical sciences, possible gender differences in citation impact have the potential to contribute to the perpetuation of these inequalities.
A survey of the literature revealed 22 papers on gender and citations in the medical sciences published between 2006 and 2016 (see Supplementary file 1). The study designs, impact measures and statistics used in these papers are too heterogeneous for meta-analytical comparisons, and this literature is also characterized by notable variations in results depending on specialty, country, study design and type of citation indicator (h-index, citations per paper, cumulative citations, m-quotient and journal impact factor). Some studies report an average citation advantage in favor of men (see e.g. Larivière et al., 2011; Nielsen, 2016); others do not observe any notable gender difference (see e.g. Mirnezami et al., 2016; Pagel and Hudetz, 2011). Existing articles are in most cases based on convenience samples and limit their focus to single specialties or sub-specialties (16 out of 22), and the literature is characterized by a North American bias, with only five studies focusing on countries outside the US and Canada. Furthermore, only six of the papers report direct comparisons of the average number of citations per paper for male and female authors (Housri et al., 2008; Larivière et al., 2011; Mirnezami et al., 2016; Nielsen, 2016; Pagel and Hudetz, 2015; Pagel and Hudetz, 2011).
Researchers have also studied gender and citation distributions in fields other than medicine, and again these studies are characterized by ambiguous results that vary by geographical focus, time period and discipline. Some report average differences in favor of male authors (Aksnes et al., 2011; Caplar et al., 2017; Eagly and Miller, 2016; Maliniak et al., 2013), some report smaller average differences in favor of female authors (Borrego et al., 2010; Long, 1992; van Arensbergen et al., 2012), and some report no discernable gender difference (Nielsen, 2017; Slyder et al., 2011; Symonds et al., 2006).
A recurring limitation in the literature is the lack of attention paid to within-group variability and between-group overlap in citation distributions. Many studies rely on simple, mean-based comparisons to derive generic conclusions about gender differences (or similarities) in per-paper citation impact. This practice can be misleading for at least two reasons. First, the reported gender differences are generally small, and will, given the expected distribution for citation data, imply a great deal of overlap for women and men (we return to this below). Second, the lack of attention to co-varying factors influencing a paper’s likelihood of being cited (e.g. institutional affiliation, country-affiliation, self-citations and number of authors) will inevitably limit what can be learned from bivariate comparisons of this sort.
Here we report the results of a comprehensive, global analysis of possible gender variations in the per-paper citation impact of medical researchers. We analyzed 1,269,542 papers on disease-specific medical research published between 2008 and 2014. To reduce confounding and ensure balanced case-control groups, three matching covariates (institutional prestige, geographic location and medical specialty) were used to generate three datasets: sample 1 had female first authors as the case and male first authors as the control (n=1,018,665); sample 2 had female last authors as the case and male last authors as the control (n=653,233); and in sample 3, pairs of female first and last authors constituted the case group and all other author combinations were included in the control group (n=368,374). The outcome variable was field-normalized citations per paper, and regression analyzes were used to explore the influence of additional co-varying factors (self-citations, number of authors, international collaboration and journal prestige) on differences in per-paper citation impact (see Methods). Given the large sample size, global scope, and matched design, our study is less vulnerable to biases resulting from sample-specific variance, confounders and selection than previous studies in the medical field.
Results
Table 1 specifies the gender composition of the unmatched sample (n=1,269,542) by main specialty, institutional prestige and geographic location. Male researchers dominate all five main specialty groupings. Female last authors are underrepresented, in comparison to their representation in the global population, in all five groupings, but most notably in Surgical/Procedural specialties. The proportion of female first and last authors is highest in Latin America and lowest in South East Asia. Note here that numerous countries located in Eastern Asia have been excluded from the analysis due to unreliable gender disambiguation based on first-name and country information (see Methods for more details). In Western Europe and North America the proportions of female first and last authors lie close to the Global averages.
Table 1. Women’s share of authorships overall, across five main specialties, institutional prestige, and geocultural area.
Overall | f_w | f_first | f_last | f_both |
---|---|---|---|---|
0.35 | 0.40 | 0.26 | 0.15 | |
Main specialty | f_w | f_first | f_last | f_both |
Basic science | 0.39 | 0.46 | 0.30 | 0.18 |
Hospital based | 0.37 | 0.43 | 0.28 | 0.16 |
Medical | 0.33 | 0.38 | 0.24 | 0.13 |
Pediatric | 0.46 | 0.52 | 0.37 | 0.24 |
Surgical/procedural | 0.29 | 0.32 | 0.21 | 0.11 |
Institutional prestige | f_w | f_first | f_last | f_both |
Top-100 University | 0.36 | 0.42 | 0.27 | 0.16 |
Other university | 0.35 | 0.39 | 0.25 | 0.14 |
Geographic location | f_w | f_first | f_last | f_both |
Arab countries | 0.33 | 0.34 | 0.27 | 0.16 |
Commonwealth of Independent States | 0.40 | 0.45 | 0.30 | 0.17 |
East Asia | 0.19 | 0.19 | 0.09 | 0.04 |
Latin America | 0.46 | 0.52 | 0.39 | 0.25 |
North America | 0.36 | 0.40 | 0.27 | 0.15 |
Oceania | 0.40 | 0.48 | 0.31 | 0.20 |
South and Central Europe | 0.40 | 0.44 | 0.31 | 0.18 |
Sub-Saharan Africa | 0.36 | 0.39 | 0.31 | 0.20 |
South-West Asia | 0.29 | 0.31 | 0.24 | 0.10 |
Western Europe | 0.35 | 0.42 | 0.24 | 0.14 |
Citation impact per paper is measured by field-normalized citation scores with a four-year fixed citation window (NCS). Using NCS as the outcome variable strengthens our matched design by adjusting for sub-specialty variations in citation practices. NCS is a continuous outcome variable. It is calculated by dividing the citations accrued by a paper within the first four years after publication with the expected citation score of other papers published in the same year and field (Waltman et al., 2012). Fields are delineated using the same article-level classification system as the Leiden Ranking. This classification system consists of 4,047 micro-level clusters of publications and offers one of the best, current approaches to item-oriented field normalization (Waltman and van Eck, 2012). This item-oriented field normalization procedure allows for comparison of papers published in different sub-fields, with different publication dates.
Figure 1 displays the density distributions of log-transformed citation scores for the matched sets of papers with female first authors and male first authors (Sample 1), female last authors and male last authors (Sample 2), and female first and last authors and other gender combinations of first and last authors (Sample 3). For all distributions, the absolute uncertainty of the mean is between 0.001 and 0.005. On average, papers with female first authors are cited 8.7% less than papers with male first authors (Sample 1. Female first authors: n = 509,330; x̄ = 1.16; σ = 1.83; x̃ =0.73. Male first authors: n = 509.335; x̄ = 1.27; σ = 2.00; x̃ =0.76); however, the overlap between the two distributions is extensive (Cohen's d = −.06; Weitzman's Δ = 95.4%; Weitzman, 1970). Papers with female last authors are cited 6.5% less than papers with male last authors (Sample 2. Female last authors: n = 326,611; x̄ = 1.16; σ = 1.93; x̃ =0.72. Male first authors: n = 326,622; x̄ = 1.24; σ = 1.93; x̃ =0.76); again, the overlap between the two distributions is extensive (Cohen's d = −.04; Weitzman's Δ = 95.6%). Finally, papers in which both the first and last authors are female are cited 12.6% less than papers with other gender combinations (Sample 3. Female first and last authors: n = 184,183; x̄ = 1.11; σ = 1.75; x̃ =0.71. Other combinations: n = 184,191; x̄ = 1.27; σ= 2.28; x̃ =0.76); again, the overlap between the two distributions is extensive (Cohen's d = −.08; Weitzman's Δ = 93.1%).
To obtain a closer approximation of the observed gender variation on the right side of the distribution curves (Figure 1), we calculated the percentage share of case papers among the top 5% and top 10% most cited papers in each sample. Note here that the case and control-papers, given our matching approach, each comprise 50% of all papers in Samples 1, 2, and 3. Our calculations showed that papers with female first authors comprise 43.6% of the top 5% most cited and 45.5% of the top 10% most cited in Sample 1. Papers with female last authors comprise 45.8% of the top 5% most cited and 46.8% of the top 10% most cited in Sample 2. Finally, papers in which both the first and last authors are female comprise 41.1% of the top 5% most cited and 43.5% of the top 10% most cited in Sample 3.
While these results indicate large within-group variation and very small between-group differences, they are consistent with previous reports of a slight average citation advantage for papers by male lead authors. We decided to use regression analyses to explore the underlying variations that may drive this average difference.
We ran Tweedie regressions to estimate the residual effect of author gender after adjusting for self-citations, numbers of authors per paper, field-normalized journal impact (MNCS journal) and international collaboration. MNCS journal is calculated as the mean-normalized citation score (NCS) of all papers published in a journal in a given year; in this case the same year as the observed paper. MNCS journal is essentially a measure of journal prestige that takes into account that most journals publish papers in more than one field, and that different fields have different citation characteristics. (See Methods for a discussion of the relationship between NCS and MNCS).
Exponentiated beta coefficients and 95% confidence intervals for the three models with NCS as outcome are displayed in Figure 2. The exponentiated beta coefficients should be interpreted as, ceteris paribus, the predicted relative change in the outcome resulting from a one unit increase in the predictor (Coates et al., 2018). For instance, an exponentiated coefficient of 0.95 for the case variable in Sample 1 (male first author = 0, female first author = 1) would indicate that female first authors, on average, receive 0.95 times the citations accrued by comparable papers with male first authors, hence on average 5% fewer citations.
The numeric input variables have been rescaled by dividing by two standard deviations (Gelman, 2008). We did this to allow the numeric inputs (i.e. MNCS journal, N authors, self-citations) to be interpreted on the same scale as the binary case variables (i.e. F_first, F_last, F_both). The standardized coefficients should be interpreted as two-standard deviation changes on a logit scale, from a low value to a high value (Gelman, 2008). Figure 2—source data 1 summarizes the exponentiated values for both the direct and standardized coefficients.
The models indicate very small residual effects of author gender on citation impact per paper. The exponentiated coefficient for the case variable, F_first in Sample 1 (F_first: female first author=1, male first authors=0) is 0.98 (CI=0.98–0.98). The exponentiated coefficient for the case variable in Sample 2 (F_last: female last author=1, male last author=0) is 0.99 (CI=0.98–0.99), and the exponentiated coefficient for the case variable in Sample 3 (F_last: female first and last authors=1, all other author constellations=0) is 0.96 (CI=0.96–0.97). MNCS journal and self-citations are the strongest predictors in the models although their effect sizes can be considered small. The two remaining covariates (N authors and International collaboration) both have exponentiated coefficients close to one, indicating small effects.
Figure 3 displays the estimated marginal means (EM-means) and 95% CIs for the case variables in Samples 1, 2 and 3. The EM-means are used to report the predicted, average citation score per paper for each group, adjusting for self-citations, number of authors, MNCS journal and international collaboration. In accordance with Figure 1, we observe only very small differences between the EM-means for the case and control groups in the three samples (Figure 3). Robustness checks were carried out to examine the sensitivity of the regression results to alternative model and sample specifications (for specifications see Methods). All of these analyses yielded qualitatively similar results. All of the models indicated very small residual effects of author gender on citation impact per paper (see Figure 2—source datas 2–4).
To examine which of the covariates that vary the most by lead-author gender, we ran three logistic regressions with F_first as outcome in Sample 1, F_last as outcome in Sample 2, and F_both as outcome in Sample 3. The results are presented in Figure 4. Again, the regression inputs have been rescaled by dividing by two standard deviations to allow for meaningful comparisons of the binary and numeric variables. Figure 4—source data 1 summarizes both the direct and the standardized coefficients. Self-citations have the largest standardized coefficients in all three samples, followed by N authors in Samples 1 and 2, and MNCS journal in Sample 3. We observe no discernable effect of international collaboration on the outcome variable in any of the samples. The results indicate that papers with high self-citation rates and high MNCS journal scores are less likely to be led by female authors than male authors in all samples. Notice that the effect sizes are small. Further, papers with a high value of N authors are slightly more likely to have a female first author than a male first author, and slightly less likely to have a female last author than a male last author. As indicated in Figure 2, N authors has exponentiated coefficients extremely close to 1.0 in the Tweedie regressions with NCS as outcome. Hence, we restrict our focus to self-citations and MNCS journal in the remaining part of the analysis.
Descriptive analysis indicates larger average gender differences in self-citation rates compared to MNCS journal scores in Samples 1, 2 and 3 (Table 2), but the standardized mean differences for both variables are very small, and the percentage overlaps are extensive.
Table 2. Means, standard deviations, medians, Cohen’s d, and Weitzman’s ∆ for case-control comparisons of self-citations and MNCS journal in Samples 1, 2 and 3.
case (σ) | control (σ) | case | control | d | ∆ | ||
---|---|---|---|---|---|---|---|
Sample 1 | Self-citations | 1.91 (3.18) | 2.16 (3.93) | 1 | 1 | -0.07 | |
MNCS journal | 1.16 (.90) | 1.21 (1.04) | .99 | 1.00 | -0.05 | 96.4% | |
Sample 2 | Self-citations | 1.84 (3.22) | 2.08 (3.77) | 1 | 1 | -0.07 | |
MNCS journal | 1.14 (.98) | 1.20 (.99) | .98 | 1.00 | -0.06 | 95.6% | |
Sample 3 | Self-citations | 1.74 (2.84) | 2.13 (3.91) | 1 | 1 | -0.11 | |
MNCS journal | 1.12 (.97) | 1.20 (1.02) | .97 | 1.0 | -0.08 | 93.4% |
To obtain a closer approximation of the extent to which self-citations may contribute to explain the observed gender variation on the right side of the distribution curves in Figure 1, we plotted the average proportion of per-paper self-citations (number of per-paper self-citations/raw per-paper citation scores) in 5% intervals from the quantile of least cited papers to the quantile of top cited papers in Samples 1, 2 and 3. As displayed in the upper panel of Figure 5, the average proportion of self-citations for papers in the top 5% bin is ~15% in all three samples. This implies that at least part of the gender variation observed on the right side of the curves in Figure 1 may be attributable to average gender differences in self-citation rates per paper. It should also be noted that our citation indicators are calculated with a four-year window, which may contribute to explain the relatively large proportion of self-citations in the samples.
Finally, to examine the observed gender variation in MNCS journal scores in closer detail, we plotted the average proportion of case papers (i.e. papers with female first authors in Sample 1, papers with female last authors in Sample 2, and papers with female first and last authors in Sample 3) in 5% intervals – from the papers with the lowest to the highest MNCS journal scores. Note again that the case papers and the control papers each comprise 50% of all papers in Samples 1, 2, and 3. As displayed in the upper panel in Figure 6, the relative proportion of case papers is slightly overrepresented on the left side and in the middle part of the x-axis and slightly underrepresented on the right side. The relative proportion of case papers drops below 50% at the 80th percentile. The most notable drop occurs at the 90th–95th percentile. This indicates that most of the observed, average gender differences in MNCS journal scores are due to an underrepresentation of female-led papers in the most high impact publication outlets. Of the papers published in the top 5% outlets with the highest MNCS journal scores, papers with female first authors and female last authors comprise 45% and 44%, respectively, while papers with female first and last authors comprise 41%.
Discussion
Decision-makers in academic medicine increasingly use citation-based metrics to evaluate scholarly merits and allocate individual opportunities and rewards. Examining data to discern systematic gender differences in medical researchers’ citation impact is therefore more important than ever. However, existing evidence from the medical sciences falls short of providing tangible guidance for policy on this issue. In this study, we carried out a controlled, large-scale, global gender comparison of lead authors’ average citation impact per paper in disease-specific medical research.
Descriptive analysis indicated average gender differences of 6.5 to 12.6% in the three matched samples. However, the standardized mean differences were very small, and the percentage overlaps between male and female distributions were extensive.
In regressions adjusted for international collaboration, self-citations, number of authors, and field-normalized journal impact, the average citation impact per paper was close to identical for women and men, irrespective of first and last author combinations. Additional analyses indicated that self-citations and field-normalized journal impact accounted for most of the average differences observed in bivariate gender comparisons.
Compared to previous studies, these findings align with what is known about “decline effects" (Ioannidis, 2005). As sample sizes become larger and study designs more advanced, effect sizes tend to decline towards minuscule effects.
Here, it is relevant to highlight that the gender effect attributable to self-citations likely reflects a generation effect, with more senior male authors having more publications to self-cite. For instance a recent analysis of 1.5 million life-science papers demonstrates that observed gender differences in self-citation rates are leveled out when adjusting for each author’s prior publication output (Mishra et al., 2018; see also King et al., 2017).
Given this latent generation effect, one could argue that it is somewhat surprising that we end up observing very small residual effects of author gender on per-paper citation impact. If the male authors in our sample, on average, are older and more established researchers, one would expect them to have a slight citation advantage in a cross-sectional analysis like ours.
Note also that women may still receive less credit for their citations due to gender-based double standards in evaluative judgments (Botelho and Abraham, 2017; Moss-Racusin et al., 2012). For instance, a sophisticated analysis of scientists receiving early-career grants from the National Institutes of Health in the US (1985–2009) shows that women compared to men gain fewer returns on citations in terms of transition time from a postdoc grant to an R01 grant. Adjusting for a large number of factors, women on average spent one year longer transitioning to an R01 grant than men with the same number of citations (Lerchenmueller and Sorenson, 2018).
The observed differences in the relative proportion of female- and male-led papers published in the journals with the highest field-normalized impact echo the findings of previous research in the medical sciences (Jagsi et al., 2006; Lerchenmueller and Sorenson, 2018). Part of this difference may be due to the generation effect described above. Other possible explanations may be that female lead authors are less likely to submit their research to journals with high impact factors (Berg, 2017; Nature Neuroscience, 2018), or that they have slightly lower success rates in peer review, e.g. due to gender bias or topic bias in journals with high impact factors. In the future, we hope to see closer examinations of the mechanisms driving this gender gap. This issue is critical given the strong emphasis on publishing in journals with high impact factors in evaluations of the performance of individual researchers (McKiernan et al., 2019).
Building diverse and inclusive research organizations requires careful attention to the mechanisms perpetuating gender inequalities in the higher academic ranks. The results reported here demonstrate that gender differences in per-paper citation impact are a negligible factor in this stratification process. Instead, universities and research leaders should develop strategies for effectively counteracting the broader societal and institutional barriers to the scientific advancement of women, including chilly and hostile work climates (Carr et al., 2003; Pololi et al., 2013), gender bias in recruitment and selection practices (Van den Brink, 2011) and work-family conflicts in the early stages of the academic career (Jolly et al., 2014).
Counteracting such barriers may be critical to improving the rigor and precision of medical research. For instance, an association between the representation of women as last authors (in combination with male first authors) and adequate statistical power in clinical trials was reported recently (Otte et al., 2018). Another recent study suggests a robust positive correlation between the attention to gender- and sex-related variations in medical research and the involvement of women as first and last authors (Nielsen et al., 2017).
Our study has some limitations. First, the analysis excluded articles with first or last authors from 18 countries, due to unreliable gender determination (see Methods). Despite a relatively small reduction in sample size (7.2% of the total population), this exclusion implies that countries located in East Asia and Sub-Saharan Africa, are underrepresented in the analysis, and the results should be interpreted accordingly.
Second, this study limited its focus to average gender differences in citation impact per paper. This focus precludes us from drawing any conclusions on potential long-term differences in male and female researchers’ cumulative citation impact. If men, for instance, have higher average publication rates (e.g. due to shorter career breaks, more collaborative research articles, more funding and more people in their labs), this would imply that their average cumulative citation impact would be higher as well. Hence, while differences in per-paper citations appear to be a negligible factor in the perpetuation of gender inequalities in academic medicine, disparities in publication rates may still play a role – especially at the early-career stages and especially in evaluation systems where publication rates have a strong influence on decisions about tenure, hiring and funding. Future research could use author-disambiguation algorithms to compare the cumulative citation impact and publication rates of large samples of individual researchers over time, adjusting for institutional affiliation, country affiliation, research area, scientific age and other relevant factors.
Third, per-paper citation impact represents a very specific proxy of scholarly impact. In practice, research evaluators at funding organizations and universities may use citation indicators in other ways, e.g. by restricting their focus to the five most influential publications of an applicant. Indeed, having a few highly cited papers may in some evaluative contexts do more for a researcher’s career progression than having a higher than average per-paper citation impact.
Fourth, the relatively small gender differences observed in the descriptive analysis should be interpreted with caution. Such results are sensitive to generic noise in the data (Schneider, 2013), and inherent uncertainties associated with statistical inferences based on non-random samples of “found data” (Freedman et al., 2003).
In conclusion, our results demonstrate that adjusting for co-varying factors, men and women in first and last author positions are cited at similar rates. The analysis presented here raises concerns that at least parts of the gender differences reported in prior research may be distorted by methodological limitations and imprecision in how the results are interpreted. We acknowledge the critical importance of recognizing even the small drawbacks that can add up over time and become cumulative disadvantages for women in science (Caplan, 1993; Cole and Singer, 1991). However, our study demonstrates the importance of focusing greater attention on within-group variability and between-group overlap of distributions when interpreting and reporting results.
Methods
Figure 7 displays the data-selection process. Peer-reviewed articles published between 2008 and 2014 were collected in PubMed Medline. To target core medical research and enable exact matching based on primary medical specialty, we needed information on the disease-specific Medical Subject Headings assigned to each paper. Hence, the initial sample was limited to records indexed with the broad MeSH descriptor “Diseases Category" (n=2,336,805). Eligible PubMed records were matched to article metadata in Web of Science (WoS) (citation data, author first names and affiliations), using Publication identifiers (PMID, DOI) and a fuzzy matching of reference data (source, volume, pagination, etc.). The matching percentage by journal is given in Figure 7—figure supplement 1. All papers lacking full first-name information for one or more authors were excluded from the sample (n=362,453; 15.5% of the population sample). The name-to-gender assignment algorithm, Gender API (Gender API, 2016), was used to determine the gender of all authors per paper for the remaining sample. This algorithm estimates a given author's likelihood of being a man or a woman based on first name and country affiliation. The accuracy of the algorithm has previously been validated in a random subsample (N=500 authors) drawn from the same dataset (Nielsen et al., 2017), and was recently evaluated as the best-performing service, in a bench-mark of five name-to-gender assignment algorithms (Santamaría and Mihaljević, 2018). Gender API provided valid name-to-gender estimates for 1,434,715 papers (61.4% of the population sample) (for further specification on Gender API, See Figure 7—figure supplement 3). A sensitivity analysis indicated unreliable Gender API estimates for authors from 18 countries, located in Eastern Asia and Sub-Saharan Africa. All documents with first and last authors from these countries were excluded (see Figure 7—figure supplement 2). This reduced the sample by 7.2% (n=165,173), resulting in a final sample of 1,269,542 papers (54.3% of the population sample).
In the analysis of citation impact per paper, exact matching covariates were included for institutional prestige, geographical region and medical specialty. All three factors are known to influence citation impact (Judge, 2016; Stremersch et al., 2007; van Eck et al., 2013). In addition, research shows that the participation of women in medical research varies considerably across geographical regions, top and lower-tier research institutions and medical specialties (Lautenberger et al., 2014; Nielsen et al., 2017; Weeden et al., 2017). Matching of institutional prestige was based on a binary variable specifying whether a paper includes authors affiliated with a top-100 university according to the Leiden Ranking [www.leidenranking.com]. The matching of geographical region was based on ten variables specifying the location of the first and last author. The matching of medical specialties was based on 124 specialties identified using the HeTOP MeSH specialty-classification algorithm (Darmoni et al., 2006) (for specifications on country groupings and specialty-disambiguation, see Figure 7—source datas 1–3). We used replacement sampling, resulting in case and control groups of equal sizes.
Five covariates were included in the regression models. Journal prestige is arguably the strongest single predictor of a paper's citation impact (Judge, 2016). Prior work suggest that women are less likely than men to publish in journals with high impact factors (see, for example, González-Álvarez and Cervera-Crespo, 2019; Lerchenmüller et al., 2018). To adjust for this factor, we computed the mean NCS-score per journal (MNCS journal). This indicator is advantageous compared to the journal impact factor, most notably because it corrects for subfield-specific citation characteristics.
International collaboration is another recognized predictor of citation impact (Smith et al., 2014). Again, extant research suggests that women, on average, co-author fewer papers with international colleagues compared to men (see e.g. Abramo et al., 2013; Larivière et al., 2013). A binary variable adjusts for this factor (collaboration between authors from different countries = 1).
Finally, we included two count variables that adjust for the number of authors per paper and the number of per-paper self-citations within the first four years after publication. Extant research demonstrates that per-paper citation impact is positively correlated with author-group size, and that women, on average, have fewer self-citations and fewer collaborators per paper (see e.g. Araújo et al., 2017; King et al., 2017).
A Tweedie distribution was used to estimate the relationship between author gender and NCS (Funk et al., 2010; Jørgensen, 1987). The continuous outcome variable, NCS, is highly right-skewed with a probability mass at zero. Tweedie distributions are a class of mixed compound Poisson-gamma distributions with a discrete mass at zero. This makes them useful for modeling continuous outcome variables with a mixture of zeros and positive values. Tweedie distributions belong to the exponential family of generalized linear models (GLM). The mean and variance for the Tweedie random variable are and , respectively, where is the dispersion parameter and p is the parameter controlling the variance of the distribution. Tweedie distributions take variance-power values p in the range >1 and < 2. We estimated three basic GLM-models using link power=0 corresponding to a log-link function and variance power of p=1.65, p=1.72 and p=1.6 for the three models, F_first, F_last and F_both, respectively. Variance power was derived empirically through iterative algorithms seeking an optimal fit. The dispersion parameter was used to test for goodness of fit and examine possible overdispersion. Robustness checks were carried out to examine the sensitivity of the results to alternative model and sample specifications (see Figure 2—source datas 2–4). First, we ran negative binomial regressions with raw per-paper citation scores (with a four-year citation window) (CS) as the outcome variable in Samples 1, 2 and 3. Next, we ran Tweedie regressions with NCS as outcome variable based on the full, un-matched data set. Finally, we ran Tweedie regressions with dummy variables for different levels of MNCS journal (low, medium, and high). This allowed us to examine whether adjusting for journal prestige at different thresholds influenced the case coefficients in Samples 1, 2, and 3. The dummy variables were created based on percentile ranks of MNCS journal. The percentile thresholds were ≥ 95% for the high-category variable, ≥ 50% < 95% for the medium-category variable and < 50% for the low category variable.
Logistic regression was used to estimate the relationship between the four covariates (self-citations, N authors, MNCS journal and international collaboration) and the case variable in each sample.
The predictors and covariates in all the regression models had Variance Inflation Factors below two, indicating very low levels of multicollinearity.
The statistical analyses were conducted in R version 3.4.3. For the matching procedure, we used the R “Matching" package (Sekhon, 2011), for the Tweedie regressions we used the “tweedie" and “statmod" packages (Dunn, 2017; Dunn and Smyth, 2008; Dunn and Smyth, 2005). Finally, we used the “emmeans’ package to calculate the estimated marginal means for the case variables in the Tweedie regressions (Lenth, 2019).
Information for the calculation of bibliometric indices (CS, NCS, JS, MNCS journal, self-citations and international collaboration) were obtained from the Centre for Science and Technology Studies (CWTS), Leiden University. CWTS hosts a curated, quality-added version of the Web of Science which enables the calculation of field-normalized citation indicators, which is not immediately possible in the standard version available online. Calculation methods are standard operations, as described in Waltman et al. (2012). For clarity, we briefly explain the NCS and MNCS journal indicators here. The purpose of using field-normalized citation indicators is to account for very large differences in citation activity and density across fields, stemming from differences in the referencing behavior and norms for various fields. The operation makes comparison between fields possible, as the score expresses impact relative to the field a given paper is published in, rather than an absolute impact, which may have different meanings across fields. To normalize citation scores, the raw citation count (CS) is divided by the mean citation scores of equivalent papers from the same field. These are papers published in the same year and field, and in this case, with citations counted in the same number of years. This gives us the NCS. Fields are here delimited by an algorithmic approach developed by Waltman and van Eck (2012), where papers are assigned to clusters based on their citing, citation and topical commonalities. These clusters thus define small fields with common referencing cultures, increasing internal consistency when calculating field normalizations. The MNCS journal is simply the mean NCS of all papers published in a given journal in a given year. Like the Journal Impact Factor, the MNCS journal changes from one year to another, and the MNCS journal for a paper is then calculated for the year the paper was published.
Weitzman’s measure, or Δ, is well-defined for density functions. Let f(x) and g(x) be two probability density functions, then:
However, for empirical distributions the solution is not as well-defined. We used the “overlapping” R-package (Pastore, 2018), which divides two empirical density distributions into intervals and calculates the cumulative sum (integral) of minimum values per interval. As both distributions by definition have a cumulative sum of 1, the result is in the range 0 to 1, where 1 implies identical distributions and 0 the complete absence of any overlap. Estimating the overlap empirically heavily depends on the number of bins the distribution is divided into. We tested various bin ranges for our samples and found estimates stabilized around 5,000 bins and upward, and thus used 10,000 bins for the analysis.
Biographies
Jens Peter Andersen is in the Danish Centre for Studies on Research and Research Policy, Aarhus University, Aarhus, Denmark
Jesper Wiborg Schneider is in the Danish Centre for Studies on Research and Research Policy, Aarhus University, Aarhus, Denmark
Reshma Jagsi is in the Department of Radiation Oncology and the Center for Bioethics and Social Sciences in Medicine, University of Michigan, Ann Arbor, Michigan, United States
Mathias Wullum Nielsen is in the Danish Centre for Studies on Research and Research Policy, Aarhus University, Aarhus, Denmark
Funding Statement
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Author contributions
Conceptualization, Resources, Data curation, Software, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing—original draft, Project administration, Writing—review and editing.
Conceptualization, Resources, Formal analysis, Funding acquisition, Validation, Investigation, Visualization, Methodology, Writing—original draft, Writing—review and editing.
Conceptualization, Data curation, Formal analysis, Validation, Writing—original draft, Writing—review and editing.
Conceptualization, Resources, Formal analysis, Funding acquisition, Validation, Investigation, Visualization, Methodology, Writing—original draft, Project administration, Writing—review and editing.
Competing interests
No competing interests declared.
RJ: Stock options in Equity Quotient; advisory role and personal fees from Amgen; and consulting for Vizient.
Contributor Information
Jens Peter Andersen, Email: jpa@ps.au.dk.
Peter Rodgers, eLife, United Kingdom.
Peter Rodgers, eLife, United Kingdom.
Funding Information
This paper was supported by the following grants:
Uddannelses- og Forskningsministeriet 6183-00001B to Jesper Wiborg Schneider.
Aarhus University Research Foundation AUFF-2018-7-5 to Mathias Wullum Nielsen.
Additional files
Data availability
All final data and analytical scripts are available on GitHub: https://github.com/ipoga/gendcit (copy archived at https://github.com/elifesciences-publications/gendcit).
References
- Abramo G, D’Angelo CA, Murgia G. Gender differences in research collaboration. Journal of Informetrics. 2013;7:811–822. doi: 10.1016/j.joi.2013.07.002. [DOI] [Google Scholar]
- Aksnes DW, Rorstad K, Piro F, Sivertsen G. Are female researchers less cited? A large-scale study of Norwegian scientists. Journal of the American Society for Information Science and Technology. 2011;62:628–636. doi: 10.1002/asi.21486. [DOI] [Google Scholar]
- Araújo EB, Araújo NAM, Moreira AA, Herrmann HJ, Andrade JS. Gender differences in scientific collaborations: Women are more egalitarian than men. PLOS ONE. 2017;12:e0176791. doi: 10.1371/journal.pone.0176791. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Berg J. Looking inward at gender issues. Science. 2017;355:329. doi: 10.1126/science.aam8109. [DOI] [PubMed] [Google Scholar]
- Borrego Á, Barrios M, Villarroya A, Ollé C. Scientific output and impact of postdoctoral scientists: A gender perspective. Scientometrics. 2010;83:93–101. doi: 10.1007/s11192-009-0025-y. [DOI] [Google Scholar]
- Botelho TL, Abraham M. Pursuing quality: How search costs and uncertainty magnify gender-based double standards in a multistage evaluation process. Administrative Science Quarterly. 2017;62:698–730. doi: 10.1177/0001839217694358. [DOI] [Google Scholar]
- Caplan PJ. Lifting a Ton of Feathers: A Woman’s Guide for Survivng in the Academic World. University of Toronto Press; 1993. [Google Scholar]
- Caplar N, Tacchella S, Birrer S. Quantitative evaluation of gender bias in astronomical publications from citation counts. Nature Astronomy. 2017;1:0141. doi: 10.1038/s41550-017-0141. [DOI] [Google Scholar]
- Carr PL, Szalacha L, Barnett R, Caswell C, Inui T. A "ton of feathers": Gender discrimination in academic medical careers and how to manage it. Journal of Women's Health. 2003;12:1009–1018. doi: 10.1089/154099903322643938. [DOI] [PubMed] [Google Scholar]
- Coates JM, Gullo MJ, Feeney GFX, Young RM, Connor JP. A randomized trial of personalized cognitive-behavior therapy for alcohol use disorder in a public health clinic. Frontiers in Psychiatry. 2018;9:1–12. doi: 10.3389/fpsyt.2018.00297. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cole JR, Singer B. A theory of limited differences. In: Zuckerman H, Cole JR, Bruer J, editors. The Outer Circle: Women in the Scientific Community. New York: Norton; 1991. pp. 277–310. [Google Scholar]
- Darmoni SJ, Névéol A, Renard JM, Gehanno JF, Soualmia LF, Dahamna B, Thirion B. A MEDLINE categorization algorithm. BMC Medical Informatics and Decision Making. 2006;6:7. doi: 10.1186/1472-6947-6-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dunn PK. Tweedie: Evaluation of Tweedie exponential family models. 2017 https://cran.r-project.org/web/packages/tweedie
- Dunn PK, Smyth GK. Series evaluation of Tweedie exponential dispersion model densities. Statistics and Computing. 2005;15:267–280. doi: 10.1007/s11222-005-4070-y. [DOI] [Google Scholar]
- Dunn PK, Smyth GK. Evaluation of Tweedie exponential dispersion model densities by Fourier inversion. Statistics and Computing. 2008;18:73–86. doi: 10.1007/s11222-007-9039-6. [DOI] [Google Scholar]
- Eagly AH, Miller DI. Scientific eminence. Perspectives on Psychological Science. 2016;11:899–904. doi: 10.1177/1745691616663918. [DOI] [PubMed] [Google Scholar]
- European Commission . She Figures, 2015: Gender in Research and Innovation, Statistics and Indicators. European Commission; 2016. https://ec.europa.eu/research/swafs/pdf/pub_gender_equality/she_figures_2015-final.pdf [Google Scholar]
- Freedman DA, Berk RA, Freedman DA. Statistical assumptions as empirical commitments. In: Blomberg TG, Cohen S, editors. Law, Punishment, and Social Control: Essays in Honor of Sheldon Messinger. New York, NY: Walter de Gruyter; 2003. pp. 235–265. [Google Scholar]
- Funk LM, Weiser TG, Berry WR, Lipsitz SR, Merry AF, Enright AC, Wilson IH, Dziekan G, Gawande AA. Global operating theatre distribution and pulse oximetry supply: An estimation from reported data. The Lancet. 2010;376:1055–1061. doi: 10.1016/S0140-6736(10)60392-3. [DOI] [PubMed] [Google Scholar]
- Gelman A. Scaling regression inputs by dividing by two standard deviations. Statistics in Medicine. 2008;27:2865–2873. doi: 10.1002/sim.3107. [DOI] [PubMed] [Google Scholar]
- Gender API Gender API - Determines the gender of a first name. 2016 https://gender-api.com
- González-Álvarez J, Cervera-Crespo T. Contemporary psychology and women: A gender analysis of the scientific production. International Journal of Psychology. 2019;54:135–143. doi: 10.1002/ijop.12433. [DOI] [PubMed] [Google Scholar]
- Housri N, Cheung MC, Koniaris LG, Zimmers TA. Scientific impact of women in academic surgery. Journal of Surgical Research. 2008;148:13–16. doi: 10.1016/j.jss.2008.02.015. [DOI] [PubMed] [Google Scholar]
- Huggett S. Gender in the Global Research Landscape. Elsevier; 2017. https://www.elsevier.com/__data/assets/pdf_file/0008/265661/ElsevierGenderReport_final_for-web.pdf [Google Scholar]
- Ioannidis JP. Contradicted and initially stronger effects in highly cited clinical research. JAMA. 2005;294:218–228. doi: 10.1001/jama.294.2.218. [DOI] [PubMed] [Google Scholar]
- Jagsi R, Guancial EA, Worobey CC, Henault LE, Chang Y, Starr R, Tarbell NJ, Hylek EM. The "gender gap" in authorship of academic medical literature: A 35-year perspective. New England Journal of Medicine. 2006;355:281–287. doi: 10.1056/NEJMsa053910. [DOI] [PubMed] [Google Scholar]
- Jagsi R, Motomura AR, Griffith KA, Rangarajan S, Ubel PA. Sex differences in attainment of independent funding by career development awardees. Annals of Internal Medicine. 2009;151:804–811. doi: 10.7326/0003-4819-151-11-200912010-00009. [DOI] [PubMed] [Google Scholar]
- Jenner S, Djermester P, Prügl J, Kurmeyer C, Oertelt-Prigione S. Prevalence of sexual harassment in academic medicine. JAMA Internal Medicine. 2019;179:108–111. doi: 10.1001/jamainternmed.2018.4859. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jolly S, Griffith KA, DeCastro R, Stewart A, Ubel P, Jagsi R. Gender differences in time spent on parenting and domestic responsibilities by high-achieving young physician-researchers. Annals of Internal Medicine. 2014;160:344-353–353. doi: 10.7326/M13-0974. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jørgensen B. Exponential dispersion models. Journal of the Royal Statistical Society: Series B. 1987;49:127–145. [Google Scholar]
- Judge TA. What causes a management article to be cited ? Acad Manag J. 2016;50:491–506. [Google Scholar]
- King MM, Bergstrom CT, Correll SJ, Jacquet J, West JD. Men set their own cites high: Gender and self-citation across fields and over time. Socius: Sociological Research for a Dynamic World. 2017;3 doi: 10.1177/2378023117738903. [DOI] [Google Scholar]
- Larivière V, Vignola-Gagné E, Villeneuve C, Gélinas P, Gingras Y. Sex differences in research funding, productivity and impact: An analysis of Québec university professors. Scientometrics. 2011;87:483–498. doi: 10.1007/s11192-011-0369-y. [DOI] [Google Scholar]
- Larivière V, Ni C, Gingras Y, Cronin B, Sugimoto CR. Bibliometrics: Global gender disparities in science. Nature. 2013;504:211–213. doi: 10.1038/504211a. [DOI] [PubMed] [Google Scholar]
- Lautenberger DM, Dandar VM, Raezer CL. The State of Women in Academic Medicine: The Pipeline and Pathways to Leadership 2013-2014. Association of American Medical Colleges; 2014. [Google Scholar]
- Lenth R. emmeans: Estimated marginal means, aka least-squares means. 2019 https://cran.r-project.org/web/packages/emmeans
- Lerchenmueller MJ, Sorenson O. The gender gap in early career transitions in the life sciences. Research Policy. 2018;47:1007–1017. doi: 10.1016/j.respol.2018.02.009. [DOI] [Google Scholar]
- Lerchenmüller C, Lerchenmueller MJ, Sorenson O. Long-term analysis of sex differences in prestigious authorships in cardiovascular research supported by the National Institutes of Health. Circulation. 2018;137:880–882. doi: 10.1161/CIRCULATIONAHA.117.032325. [DOI] [PubMed] [Google Scholar]
- Long JS. Measures of sex differences in scientific productivity. Social Forces. 1992;71:159–178. doi: 10.1093/sf/71.1.159. [DOI] [Google Scholar]
- Maliniak D, Powers R, Walter BF. The gender citation gap in international relations. International Organization. 2013;67:889–922. doi: 10.1017/S0020818313000209. [DOI] [Google Scholar]
- McKiernan EC, Schimanski LA, Muñoz Nieves C, Matthias L, Niles MT, Pablo Alperin J. Use of the journal impact factor in academic review, promotion, and tenure evaluations. PeerJ Preprints. 2019;7:e27638v2. doi: 10.7287/peerj.preprints.27638v2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mirnezami SR, Beaudry C, Larivière V. What determines researchers’ scientific impact? A case study of Quebec researchers. Science and Public Policy. 2016;43:262–274. doi: 10.1093/scipol/scv038. [DOI] [Google Scholar]
- Mishra S, Fegley BD, Diesner J, Torvik VI. Self-citation is the hallmark of productive authors, of any gender. PLOS ONE. 2018;13:e0195773. doi: 10.1371/journal.pone.0195773. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moss-Racusin CA, Dovidio JF, Brescoll VL, Graham MJ, Handelsman J. Science faculty's subtle gender biases favor male students. PNAS. 2012;109:16474–16479. doi: 10.1073/pnas.1211286109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nature Neuroscience Promoting diversity in neuroscience. Nature Neuroscience. 2018;21:1. doi: 10.1038/s41593-017-0052-6. [DOI] [PubMed] [Google Scholar]
- Nielsen MW. Gender inequality and research performance: Moving beyond individual-meritocratic explanations of academic advancement. Studies in Higher Education. 2016;41:2044–2060. doi: 10.1080/03075079.2015.1007945. [DOI] [Google Scholar]
- Nielsen MW. Gender and citation impact in management research. Journal of Informetrics. 2017;11:1213–1228. doi: 10.1016/j.joi.2017.09.005. [DOI] [Google Scholar]
- Nielsen MW, Andersen JP, Schiebinger L, Schneider JW. One and a half million medical papers reveal a link between author gender and attention to gender and sex analysis. Nature Human Behaviour. 2017;1:791–796. doi: 10.1038/s41562-017-0235-x. [DOI] [PubMed] [Google Scholar]
- OECD OECD statistics. 2019 https://stats.oecd.org/
- Otte WM, Tijdink JK, Weerheim PL, Lamberink HJ, Vinkers CH. Adequate statistical power in clinical trials is associated with the combination of a male first author and a female last author. eLife. 2018;7:e34412. doi: 10.7554/eLife.34412. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pagel PS, Hudetz JA. An analysis of scholarly productivity in United States academic anaesthesiologists by citation bibliometrics. Anaesthesia. 2011;66:873–878. doi: 10.1111/j.1365-2044.2011.06860.x. [DOI] [PubMed] [Google Scholar]
- Pagel PS, Hudetz JA. Scholarly productivity and National Institutes of Health funding of Foundation for Anesthesia Education and Research grant recipients: Insights from a bibliometric analysis. Anesthesiology. 2015;123:683–691. doi: 10.1097/ALN.0000000000000737. [DOI] [PubMed] [Google Scholar]
- Pastore M. Overlapping: Estimation of overlapping in empirical distributions. 2018 https://cran.r-project.org/web/packages/overlapping
- Pololi LH, Civian JT, Brennan RT, Dottolo AL, Krupat E. Experiencing the culture of academic medicine: Gender matters, a national study. Journal of General Internal Medicine. 2013;28:201–207. doi: 10.1007/s11606-012-2207-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Santamaría L, Mihaljević H. Comparison and benchmark of name-to-gender inference services. PeerJ Computer Science. 2018;4:e156. doi: 10.7717/peerj-cs.156. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schneider JW. Caveats for using statistical significance tests in research assessments. Journal of Informetrics. 2013;7:50–62. doi: 10.1016/j.joi.2012.08.005. [DOI] [Google Scholar]
- Sege R, Nykiel-Bub L, Selk S. Sex differences in institutional support for junior biomedical researchers. JAMA. 2015;314:1175–1177. doi: 10.1001/jama.2015.8517. [DOI] [PubMed] [Google Scholar]
- Sekhon JS. Multivariate and propensity score matching software with automated balance optimization: The matching package for R. Journal of Statistical Software. 2011;42:1–52. [Google Scholar]
- Slyder JB, Stein BR, Sams BS, Walker DM, Jacob Beale B, Feldhaus JJ, Copenheaver CA. Citation pattern and lifespan: A comparison of discipline, institution, and individual. Scientometrics. 2011;89:955–966. doi: 10.1007/s11192-011-0467-x. [DOI] [Google Scholar]
- Smith MJ, Weinberger C, Bruna EM, Allesina S. The scientific impact of nations: Journal placement and citation performance. PLOS ONE. 2014;9:e109195. doi: 10.1371/journal.pone.0109195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stremersch S, Verniers I, Verhoef PC. The quest for citations: Drivers of article impact. Journal of Marketing. 2007;71:171–193. doi: 10.1509/jmkg.71.3.171. [DOI] [Google Scholar]
- Symonds MR, Gemmell NJ, Braisher TL, Gorringe KL, Elgar MA. Gender differences in publication output: Towards an unbiased metric of research performance. PLOS ONE. 2006;1:1–5. doi: 10.1371/journal.pone.0000127. [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Arensbergen P, van der Weijden I, van den Besselaar P. Gender differences in scientific productivity: A persisting phenomenon? Scientometrics. 2012;93:857–868. doi: 10.1007/s11192-012-0712-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Van den Brink M. Scouting for talent: Appointment practices of women professors in academic medicine. Social Science & Medicine. 2011;72:2033–2040. doi: 10.1016/j.socscimed.2011.04.016. [DOI] [PubMed] [Google Scholar]
- van Eck NJ, Waltman L, van Raan AFJ, Klautz RJM, Peul WC. Citation analysis may severely underestimate the impact of clinical research as compared to basic research. PLOS ONE. 2013;8:e62395. doi: 10.1371/journal.pone.0062395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Waltman L, Calero-Medina C, Kosten J, Noyons ECM, Tijssen RJW, van Eck NJ, van Leeuwen TN, van Raan AFJ, Visser MS, Wouters P. The Leiden ranking 2011/2012: Data collection, indicators, and interpretation. Journal of the American Society for Information Science and Technology. 2012;63:2419–2432. doi: 10.1002/asi.22708. [DOI] [Google Scholar]
- Waltman L, van Eck NJ. A new methodology for constructing a publication-level classification system of science. Journal of the American Society for Information Science and Technology. 2012;63:2378–2392. doi: 10.1002/asi.22748. [DOI] [Google Scholar]
- Weeden K, Thébaud S, Gelbgiser D. Degrees of difference: Gender segregation of US doctorates by field and program prestige. Sociological Science. 2017;4:123–150. doi: 10.15195/v4.a6. [DOI] [Google Scholar]
- Weitzman M. Measures of Overlap of Income Distributions of White and Negro Families in the United States. Washington, DC: US Bureau of the Census; 1970. [Google Scholar]