Skip to main content
Lippincott Open Access logoLink to Lippincott Open Access
. 2024 May 10;280(6):945–953. doi: 10.1097/SLA.0000000000006339

Comparison of Postoperative Outcomes Among Patients Treated by Male Versus Female Surgeons

A Systematic Review and Meta-analysis

Natsumi Saka *,, Norio Yamamoto †,, Jun Watanabe †,§,, Christopher Wallis ¶,#,**, Angela Jerath ††, Hidehiro Someko †,‡‡, Minoru Hayashi †,§§, Kyosuke Kamijo †,∥∥, Takashi Ariie †,¶¶, Toshiki Kuno ##, Hirotaka Kato ***, Hodan Mohamud , Ashton Chang ††, Raj Satkunasivam †††,‡‡‡,§§§, Yusuke Tsugawa ∥∥∥,¶¶¶,
PMCID: PMC11542977  PMID: 38726676

Abstract

Objective:

To compare clinical outcomes of patients treated by female surgeons versus those treated by male surgeons.

Background:

It remains unclear as to whether surgical performance and outcomes differ between female and male surgeons.

Methods:

We conducted a meta-analysis to compare patients’ clinical outcomes—including patients’ postoperative mortality, readmission, and complication rates—between female versus male surgeons. MEDLINE, Embase, CENTRAL, ICTRP, and ClinicalTrials.gov were searched from inception to September 8, 2022. The update search was conducted on July 19, 2023. We used random-effects models to synthesize data and GRADE to evaluate the certainty.

Results:

A total of 15 retrospective cohort studies provided data on 5,448,121 participants. We found that patients treated by female surgeons experienced a lower postoperative mortality compared with patients treated by male surgeons [8 studies; adjusted odds ratio (aOR), 0.93; 95% CI, 0.88–0.97; I 2=27%; moderate certainty of the evidence]. We found a similar pattern for both elective and nonelective (emergent or urgent) surgeries, although the difference was larger for elective surgeries (test for subgroup difference P=0.003). We found no evidence that female and male surgeons differed for patient readmission (3 studies; aOR, 1.20; 95% CI, 0.83–1.74; I 2=92%; very low certainty of the evidence) or complication rates (8 studies; aOR, 0.94; 95% CI, 0.88–1.01; I 2=38%; very low certainty of the evidence).

Conclusion:

This systematic review and meta-analysis suggests that patients treated by female surgeons have a lower mortality compared with those treated by male surgeons.

Key Words: gender disparities, female physician, mortality, surgeon gender, surgeon sex, survival


Although the number of female doctors is increasing over time, women are still underrepresented in surgery compared with their male counterparts. In 2019, women comprised only 22% of general surgeons in the United States,1,2 18% of surgeons in the United Kingdom,3 6% of surgeons in Japan,4 and 16% of surgeons in 8 general surgical societies in North America, Europe, and Oceania.5 These rates were even lower in surgical subspecialties, including thoracic and orthopedic surgery.1,2 These numbers are notably lower than the proportions of female physicians in fields such as medicine and pediatrics, where female physicians may outnumber male counterparts in some countries.6,7

The success of a surgical procedure can be attributed to several factors, including the quality of preoperative planning, the surgeon’s technical expertize and skill sets, and the ability to manage complications that may arise during the operation and in the postoperative period. These factors are determined by the surgeon’s clinical knowledge, communication skills with other team members, and the ability to make sound clinical judgments.8 Research on gender differences in clinical practice patterns suggests that female and male surgeons may treat patients differently. For example, female surgeons treating breast cancer patients have been found to be more likely to offer guideline-concordant therapies,9 recommend adjuvant radiotherapy following breast-conservation surgery (BCS),10 perform contralateral prophylactic mastectomy,11 and are less likely perform lymph node dissection along with BCS.12 Considering the existing structural barriers that hinder women from pursuing a career in surgery,13 the current pool of female surgeons might be a selected group displaying greater expertize, determination, and attentiveness than their male counterparts. These differences could potentially influence the outcomes after surgery. However, existing studies that have compared patient outcomes between female and male surgeons have produced mixed findings2,1416 with some studies showing better patient outcomes for female surgeons2 and others showing no difference between the 2 groups of surgeons. However, to our knowledge, no study to date using quantitative meta-analysis exists that provides a summative comparison of postoperative outcomes between female and male surgeons across a large number of surgical procedures.

Given that female surgeons are compensated less and less likely to be promoted than male surgeons,1724 a better understanding of the performance of female surgeons has important clinical and policy implications. In this context, we performed a systematic review and meta-analysis of available evidence comparing patients’ postoperative outcomes (mortality, readmission, and complication rates) between female and male surgeons.

METHODS

We followed a registered protocol (https://osf.io/pgh78/)25 and reported this systematic review in accordance with the Reporting Checklist for Meta-analyses of Observational Studies (MOOSE)26 and Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidance.27 The amendments from the registered protocol are available in Methods section in the Supplemental Digital Content 1 (http://links.lww.com/SLA/F109).

Data Source

We searched Medline, Embase, the Cochrane Central Register of Controlled Trials (CENTRAL) from database inception to September 8, 2022 and updated the search on July 19, 2023. We also searched the World Health Organization International Clinical Trials Platform Search Portal (ICTRP) and ClinicalTrials.gov for unpublished and ongoing trials. The details of the full search strategy, which were constructed by investigators and a research librarian, are available in Methods section in the Supplemental Digital Content 1 (http://links.lww.com/SLA/F109). We investigated the reference lists of eligible studies and articles citing eligible studies that reported the surgical outcome of female surgeons using Citationchaser.28 We also examined the reference lists of the studies that were included in the previously published meta-analysis of surgical outcomes based on the surgeons’ factors.29,30 We also evaluated the major surgical guidelines (Methods section in the Supplemental Digital Content 1, http://links.lww.com/SLA/F109).

Study Eligibility

We included studies that have stratified their results by the sex or gender of the surgeon and evaluated health outcomes (ie, mortality, readmission, and complication rates) of patients who underwent surgeries for our eligibility criteria. We included both longitudinal/cohort studies and case-control studies, as well as controlled before and after studies, cross-sectional studies without the restriction of language, country, or publication year. We included observational studies which was conducted using the data from randomized controlled trials (RCTs) for other objective. We included the studies that report the outcome of surgical trainees (eg, surgical residents), not only surgical attending physicians. We excluded case reports, case series, studies that use surveys to gather data from surgeons or patients. We did not include RCTs that were not conducted to assess the effect of surgeon gender, even if they contain some information on surgeon gender, due to limited information available to assess their generalizability. We also excluded studies that were published as conference abstracts due to difficulty in assessing their quality, but conducted the sensitivity analysis including those studies to assess the robustness of our analysis. We selected studies with the largest cohorts or the most detailed information for analyses if multiple studies used the same or overlapping data.

Study participants were defined as patients undergoing any surgeries, including major and minor surgery,31 outpatient surgery, both emergent and elective surgery, and revision surgery. We defined exposure as surgical treatment performed by female versus male surgeons. The definition of surgeon gender was decided by the original authors by either biological sex or socially constructed gender.

Two independent reviewers (N.S. and one of the others: H.S., M.H., K.K., or T.A.) screened titles and abstracts using Rayyan,32 followed by the assessment of eligibility based on the full texts. Disagreements between the 2 reviewers were resolved by discussion, and if that failed, a third reviewer acted as an arbiter (N.Y. or J.W.).

Data Extraction and Quality Assessment

Two reviewers (N.S. and one of the others: H.S., M.H., K.K., or T.A.) conducted independent data extraction of the included studies using a standardized data collection form. Data collection included study characteristics (study design, study setting, study location, sample size, language of the published study, age of participants, enrollment period, type of surgery, follow-up period, exposure, outcomes, the definition of complications, adjusted confounders) and outcomes. The outcomes of interest were patients’ postoperative mortality, patients’ readmission and complication rates. We defined the postoperative period as 30 days after the surgery, a definition widely used in previous studies.2,14,33,34 When the data on 30-day follow-up was unavailable, we used the outcome variables’ follow-up period closest to 30 days after the surgery. For the effect measure of the outcomes, we extracted both unadjusted and adjusted odds ratios (ORs), and if there were several analyses conducted in the study, we used the data on the most extensively adjusted ORs. For the studies which reported the adjusted result from matched cohort and entire cohort, we used the adjusted effect estimate from entire cohort to reflect the result from larger sample. We contacted the original authors and requested unpublished data whenever necessary.

Risk of Bias and Certainty of the Evidence Assessment

Two reviewers (N.S. and one of the others: H.S., M.H., K.K., or T.A.) independently evaluated the risk of bias using the Risk Of Bias In Non-randomised Studies–of Interventions (ROBINS-I) tool.35 We a priori defined the following possible confounders for the evaluation of study confounding—surgeons’ age, specialty, and surgical volume; and patients’ sex/gender, age, and comorbidities—based on the previous studies.2,30,33,3638 The overall risk of bias was determined based on the ratings of the ROBINS-I domains. Traffic light plots were generated on each outcome using a robvis tool.39

We evaluated the certainty of evidence based on the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) approach,40 which divided evidence as either very low, low, moderate, or high. We rated the certainty of the observational evidence as high in the beginning and downgraded based on the risk of bias, imprecision, inconsistency, indirectness, and publication bias and rated up for size and trend of the effect, using the ROBINS-I tool.41 A summary of findings (SoF) table was created to describe the evidence and absolute effect for each outcome. The risk in control groups in the SoF table was calculated from the median of event rates in the control group across studies, based on the recommendation in the Cochrane Handbook.42

Data Synthesis

We conducted the meta-analysis of the crude and adjusted ORs using a random-effects model. When both crude and adjusted ORs were reported in the original articles, most adjusted ORs were used for the synthesis. If ORs are not available and if the risk ratios (RRs) are reported in original studies, we used a method reported by Zhang and Yu43 to transform RRs into ORs. We calculated the ORs with 95% CIs for the following binary outcome variables: patient mortality, complications, and readmission rates using Review Manager software (RevMan 5.4). We evaluated the statistical heterogeneity by visual inspection of the Forest plot and calculating the I 2 statistic, with 0% to 40%: might not be important; 30% to 60%: may represent moderate heterogeneity; 50% to 90%: may represent substantial heterogeneity; and 75% to 100%: considerable heterogeneity.42 All significance testing was considered statistically significant if P-value was <0.05.

Subgroup Analysis

To examine whether our findings varied by factors related to procedures, we conducted subgroup analyses by: types of surgery (for procedures with sufficiently large sample sizes; types of surgeries based on the specialty, or major surgery versus minor surgery based on the definition from the study by Powell et al31), the location where the study was conducted (North America vs Europe vs Asia Pacific), and elective versus nonelective (ie, emergent or urgent) surgeries.

Sensitivity Analysis

We conducted a series of sensitivity analyses to evaluate the robustness of our findings. First, to test whether our findings were sensitive to the adjustment for potential confounders in individual studies, we reanalyzed the data after restricting it to studies that reported adjusted ORs. Second, as the duration of the follow-up period may affect our findings, we reanalyzed the data restricting it to studies that reported 30-day postoperative outcomes. Third, to examine whether our results were sensitive by the selection of studies in our analyses, we reanalyzed the data including the data from studies published as conference abstracts. Fourth, we reanalyzed the data after excluding studies in which surgical trainees conducted the operations or the qualification of the operators was unclear. Finally, for the outcome of complication, we excluded studies in which only reoperation was defined as a complication. Publication bias using funnel plots and the Egger test were not assessed since there were <10 included studies for each outcome.42

RESULTS

Study Selection, Characteristics, and Quality Assessment

Figure 1 describes the flow of review. We identified 3722 records after removing duplicates. After screening the titles and citation of the research, 123 records were subsequently reviewed. A list of the studies that were excluded and the reason for their exclusion can be found in the in Supplemental Digital Content 1, Table 1 (http://links.lww.com/SLA/F109). Twenty-one studies were included in the qualitative synthesis.2,1416,4460 Finally, 15 studies with 5,448,121 participants were eligible for data extraction and quantitative synthesis.2,1416,4448,50,5456,59,60 Supplemental Digital Content 1, Table 2 (http://links.lww.com/SLA/F109) summarizes the characteristics of the studies included in the quantitative synthesis and Supplemental Digital Content 1, Table 3 (http://links.lww.com/SLA/F109) summarizes the further details of study characteristics included in the qualitative synthesis. Among the studies included in the qualitative synthesis, 6 studies were excluded from the quantitative analysis because 2 of them reported only the composite outcome of mortality and readmission rate,51,53 1 study did not report the quantitative result,52 and 3 studies were conference abstracts.49,57,58 Among the studies that were included only in the qualitative synthesis, all studies reported the equivalent outcomes between female surgeons and male surgeons. The included studies for the quantitative synthes were published from 2000 to 2023. All studies were observational studies. All studies were published in English. A total of 15 studies were included, exhibiting a diverse geographic composition, with a predominance from North America and Asia, followed by contributions from European countries. While 3 studies included surgical procedures across a broad spectrum of specialties,2,14,54 others focused on specific surgical fields. The definition of postoperative complication varied among the studies, with 2 studies using reoperation as the criterion.47,48 The majority of studies used population-based or registry-based data, while the remaining studies employed hospital-based data. The time frames for measuring outcomes were evenly divided among 30-day, intrahospital, and outcomes exceeding 30 days. A small number of studies (n=4) included surgical trainees as surgeons, while other studies either exclusively involved attending surgeons or did not specify the qualifications of the surgeons.

FIGURE 1.

FIGURE 1

Literature search flow diagram.

The overall risk of bias evaluated by the Risk Of Bias In Non-randomised Studies—of Interventions (ROBINS-I) tool was moderate in 8 studies,2,14,16,45,48,54,55,60 and serious in 7 studies.15,44,46,47,50,56,59

Overall risk of bias for each study was rated as serious when at least 1 of 7 domains in ROBINS-I tool (Confounding, Selection bias, Bias in classification of interventions, Bias due to deviations from intended interventions, Bias due to missing data, Bias in measurement of outcomes, and Bias in selection of the reported result) were rated as serious, and rated as moderate if one domain rated as moderate. For example, whether the studies prespecified potential confounders and adjusted for them was defined as one of the domains of the assessment of the risk of bias. Studies which did not include prespecified potential confounders in their analysis plan were rated as serious or moderate risk of bias in confounding, based on the number of confounders they used. The risk of bias on each outcome is described in Supplemental Digital Content 1, Figures 1 to 3 (http://links.lww.com/SLA/F109).

Synthesized Results

Table 1 summarizes the findings using the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) approach.

TABLE 1.

Summary of Findings Table on Postoperative Outcomes

Anticipated absolute effects
Outcomes Risk with male surgeons* Risk with female surgeons
(95% CI)
Relative effect (95% CI) No. participants (studies) Certainty of the evidence
Mortality 10 per 1000 9 per 1000
(9–10)
OR 0.93 (0.88–0.97) 5,390,762
(8 studies)
Moderate§
Readmission 66 per 1000 78 per 1000
(55 –109)
OR 1.20 (0.83–1.74)
1,179,107
(3 studies)
Very low
Complication 94 per 1000 89 per 1000
(84–95)
OR 0.94 (0.88–1.01)
1,306,128
(8 studies)
Very Low
*

Median event rate of the included studies.

Risk in the exposed group and its 95% CI was based on the assumed risk in the nonexposed group and the relative effect of the exposure (and its 95% CI).

GRADE Working Group grades of evidence.

§

Two of 8 studies were rated as having a serious risk of bias due to confounding factors. Consequently, the certainty of the evidence was downgraded by one level because of this serious risk of bias.

Two of 3 studies of the studies were rated as serious risk of bias due to confounding factors. The result from each study were inconsistent and the CI was wide. Consequently, the certainty of the evidence was downgraded by 3 levels by serious risk of bias, inconsistency, and imprecision.

Four of 8 studies were rated as serious risk of bias due to confounding factors. The result from each study were inconsistent and the CI was wide. Consequently, the certainty of the evidence was downgraded by 3 levels by serious risk of bias, inconsistency, and imprecision.

OR indicates odds ratio.

Eight studies with 5,390,762 participants reported postoperative mortality.2,1416,44,45,54,60 Figure 2 shows the forest plot of the synthesized result. Due to the risk in confounding, 2 of the 8 studies were assessed as having a serious risk of bias. Consequently, this risk led to the downgrading of the evidence certainty by one level. Moderate certainty of the evidence supported that postoperative mortality is lower among patients treated by female surgeons compared with patients treated by male surgeons [adjusted odds ratio (aOR), 0.93; 95% CI, 0.88–0.97; I 2=27%].

FIGURE 2.

FIGURE 2

Pooled estimates in postoperative mortality. Effect size was determined using the random-effects model weighted by the inverse of the variance estimate. Squares represent effect size, with marker size reflecting the statistical weight of the study, obtained using random-effects meta-analysis; diamond represents the overall odds ratios and 95% CI.

Subgroup analyses performed to assess postoperative mortality showed significant differences among surgical specialties (test for subgroup differences, P=0.007) with a significantly lower mortality rate seen for patients treated by female general surgeons (aOR, 0.90; 95% CI, 0.87–0.93; I 2=19%), but no differences found in other specialties (Supplemental Digital Content 1, Fig. 4, http://links.lww.com/SLA/F109). Although most of the large cohort studies were from North American region, there was no subgroup differences stratified by the location of the studies (test for subgroup differences P=0.53) (Supplemental Digital Content 1, Fig. 5, http://links.lww.com/SLA/F109). We found significant differences in the effect of surgeons’ gender on mortality both in elective and nonelective (emergent or urgent) procedures. The difference in patient mortality between female and male surgeons was larger for elective surgeries compared with nonelective surgeries (test for subgroup differences, P=0.003) (Supplemental Digital Content 1, Fig. 6, http://links.lww.com/SLA/F109).

Three studies with 1,179,107 participants reported postoperative readmission rates (Fig. 3A).2,15,46 In an assessment of 3 studies, 2 were identified as exhibiting a serious risk of bias attributed to confounding variables. Moreover, the evidence derived from these studies demonstrated a lack of consistency, accompanied by wide CIs. Consequently, the certainty of the evidence was downgraded by 3 levels due to serious risks of bias, heterogeneity, and imprecision. Very low certainty of the evidence supports that the rates of postoperative readmission were not different between female and male surgeons (aOR, 1.20; 95% CI, 0.83–1.74; I 2=92%). Eight studies with 1,360,128 participants reported rates of postoperative complications (Fig. 3B).2,47,48,50,55,56,59,60 In an assessment of 8 studies, 4 were identified as exhibiting a serious risk of bias attributed to confounding variables. Moreover, the evidence derived from these studies demonstrated a lack of consistency, accompanied by wide CIs. Consequently, the certainty of the evidence was downgraded by 3 levels due to serious risks of bias, heterogeneity, and imprecision. Very low certainty of the evidence supported that the rate of postoperative complications was not different between female and male surgeons (aOR, 0.94; 95% CI, 0.88–1.01; I 2=38%).

FIGURE 3.

FIGURE 3

Pooled estimates in postoperative readmission rate and complication rate. A, Postoperative readmission rate. B, Complication rate. Effect size was determined using the random-effects model weighted by the inverse of the variance estimate. Squares represent effect size, with marker size reflecting the statistical weight of the study, obtained using random-effects meta-analysis; diamonds represent the overall odds ratios and 95% CI.

Subgroup analyses of readmission rate showed no significant differences stratified by major and minor surgery and by the elective versus nonelective (emergent or urgent) surgery (Supplemental Digital Content 1, Figs. 7 and 8, http://links.lww.com/SLA/F109), but showed the significant subgroup effect by the location of studies (test for subgroup differences P<0.00001), due to the effect of one study (Supplemental Digital Content 1, Fig. 9, http://links.lww.com/SLA/F109).46 When assessing the effect of surgeon gender on complications, the results did not vary by type of the surgery, location where the studies were conducted, and elective versus nonelective (emergent or urgent) surgery (Supplemental Digital Content 1, Figs. 10–12, http://links.lww.com/SLA/F109).

Sensitivity Analysis

For mortality and readmission, our findings were qualitatively unaffected by restricting to studies that reported adjusted outcomes; restricting to studies using 30-day postoperative outcomes, including studies with conference abstract,49 and excluding studies in which surgical trainees conducted the operations or the qualification of the operators remained undisclosed (Supplemental Digital Content 1, Figs. 13–20, http://links.lww.com/SLA/F109). For complication, the result showed the lower complication rate for female surgeons when we restricted our analysis to studies with 30-day follow-up period, and also when we restricted our analysis to studies which included only attending surgeons (Supplemental Digital Content 1, Figs. 21 and 22, http://links.lww.com/SLA/F109). The sensitivity analysis also showed better patient outcomes for female surgeon when we excluded studies in which only reoperation was defined as complication (Supplemental Digital Content 1, Fig. 23, http://links.lww.com/SLA/F109). Other sensitivity analysis on complication showed the similar result as main analysis (Supplemental Digital Content 1, Figs. 24 and 25, http://links.lww.com/SLA/F109).

DISCUSSION

We conducted a systematic review and meta-analysis, which encompassed ∼5.5 million patients, and found a lower mortality among patients treated by female surgeons compared with male surgeons with a moderate certainty of evidence. When we stratified by electiveness of surgeries, a lower patient mortality for female surgeons was observed for both elective and nonelective surgeries. We found no evidence that patient readmission and complication rates differed between female and male surgeons, although the certainty of evidence was very low. Taken together, these findings suggest that the performance of female surgeons is equivalent to, or better than, that of male surgeons with respect to patients’ clinical outcomes. Despite substantial efforts to improve gender diversity in the field of surgery, prior studies have shown that female surgeons are compensated less,18,61 and are less likely to be promoted to the rank of full professor,18,19 compared with their male peers. Our findings, which exemplify the equivalent or slightly superior performance of female surgeons, highlight the need for more equitable opportunities for female surgeons.

Possible Explanation for the Findings

There are several potential mechanisms that could explain why female surgeons exhibited a lower patient mortality than male surgeons. First, it is possible that female surgeons may be more compliant with clinical guidelines,9,10 deliver more patient-centered care, and exhibit superior communication skills compared with male surgeons. Although evidence regarding differential practice patterns between female and male surgeons is limited, one study found that patients treated by female surgeons were more likely to receive evidence-based care (defined as adjuvant radiotherapy after BCS).10 Existing evidence showing that female primary care physicians are more likely to provide guideline-concordant care and communicate better also support this hypothesis.6264 Research indicates that for medical care, patients of physicians who adhere to guidelines generally experience better outcomes than patients of physicians less likely to adhere to guidelines.65 Research has also shown that hospitals that adhere to guidelines more exhibit better outcomes among patients undergoing surgery.66,67 Although it remains unknown as to whether surgeons who adhere to guidelines show better outcomes, it is possible that differences in guideline concordance between female and male surgeons may explain our findings.

Second, research in economics and psychology suggests that women are generally more risk-averse than men,68,69 which also may influence decision-making processes in surgical care that affect patient outcomes. While no study clarified the details of practicing pattern among female surgeons for preoperative and postoperative care, female surgeons may be more risk-averse and deliberately manage surgical risks, including addressing preexisting comorbidities before surgery and maintaining vigilant postoperative observation. Third, female surgeons may be more willing to collaborate with other team members, such as nurses, than male surgeons, which may avert scenarios that could otherwise result in the “failure to rescue.”70 Fourth, female surgeons may have better surgical knowledge than male surgeons, as reported in a relatively small study of medical students showing that female students completed tests of basic surgical skills more efficiently and had higher scores on tests of theoretical surgical knowledge compared with male students.71 Finally, it is also possible that the observed difference in patient mortality may not solely be attributed to the surgeons’ gender, but also to the health care system to which they are affiliated. An extensive body of research on medical mistakes suggests that systems, rather than individual practitioners, tend to offer stronger explanations for undesirable outcomes in health care.72,73 It could be the case that female surgeons often operate in environments that adhere closely to guidelines, and offer a better surgical skill training, which could potentially result in improved surgical results.

Previous studies have found that female surgeons face several obstacles in choosing their careers and acquiring surgical proficiency, including an imbalance in the distribution of surgical cases assigned to female versus male surgeons during their training.13,52 For example, a study from the United Kingdom reported that female medical students are advised not to choose surgical careers and are subjected to sexual discrimination.13 As another example, Foley et al74 reported differences based on gender in experience with robotic surgery among male and female trainees in colorectal surgery training programs, with females having fewer opportunities to operate the consoles and complete procedures. Notably, the same study reported that male supervisors offered fewer opportunities for console participation to female residents compared with male residents, whereas female supervisors provided an equal number of opportunities to use the console to both female and male trainees. These obstacles might set a higher entry standard for women aiming to join the surgical profession compared with their male counterparts. As a result, the existing population of female surgeons may exhibit a relatively higher level of expertize, determination, and meticulousness, which consequently contributes to the superior outcomes observed among female surgeons in our study. Given the recent surge in the number of female surgeons and possible change in their characteristics, it is plausible that there will be a shift in our outcomes in the future.

There are several reasons as to why we did not observe difference in readmission and complication rates between female and male surgeons. First, compared with the mortality which included 5 million patients, the sample size for the analyses of readmission and complication rates was smaller, potentially limiting the statistical power to detect small differences. Second, the certainty of the evidence was “very low” for the analyses of readmission and complication rates; the potential bias may explain null findings for these outcomes. Last, patient mortality and readmission/complication rates may be reflecting different aspects of surgeons’ practice patterns. For example, a lower patient mortality rate for female surgeons could be attributed not only to their superior surgical skills but also to their overall better management of surgical patients, such as improved preoperative and postoperative care. However, this difference may be less noticeable concerning complication and readmission rates.

Comparison With Other Studies

To our knowledge, this is the first meta-analysis that compared the surgical outcomes of patients treated by female versus male surgeons. Our findings were largely consistent with existing large, observational studies. For example, previous literature analyzed the nationally representative sample of older Americans who underwent 1 of 20 surgical procedures (n=892,187) and found that female surgeons have a lower patient mortality than male surgeons for elective surgeries, but no difference was observed for nonelective surgeries.14 In addition, Wallis et al2 studied patients who underwent 1 of 12 surgical procedures in Ontario, Canada (n=1,159,687), and reported similar findings.

Limitations

This study has limitations. First, as is the case with any observational studies, unmeasured confounding may exist. In particular, while a significant proportion of the data was derived from studies that adjusted for potential confounders—such as surgeon age and surgical volume; and patient age, sex/gender, and comorbidities—the presence of unmeasured and residual confounders, such as surgical complexity or disease stage, remains possible. However, it is important to note that unmeasured confounding is unlikely to explain our findings for the analyses of emergent surgeries. This is because, in the context of emergency surgeries, surgeons are unlikely to select their patients, and patients are also unlikely to select their surgeons. Therefore, patients are plausibly “quasi-randomized” to surgeons based on the timing of when they required a surgical procedure. Our findings that female surgeons exhibit lower patient mortality rates not only for elective surgeries but also for emergency surgeries (Supplemental Digital Content 1, Fig. 6, http://links.lww.com/SLA/F109) indicate that these findings could not be explained by unmeasured confounding. Second, the certainty of evidence for readmissions and complication was very low; therefore, the results for readmission and complication rates should be interpreted with caution. Third, most of the studies included in our analysis relied on self-reported data to identify the gender of patients and surgeons. This may lead to misclassification of this variable, especially for patients and surgeons who are transgender. Nevertheless, given that the estimated prevalence of transgender and gender nonconforming individuals ranges from 0.1% to 2% in the general population,75 a misclassification is unlikely to substantially impact the overall outcome. Finally, a significant proportion of the synthesized results were derived from samples in North America. Other areas included in our meta-analysis were European countries, Taiwan, and Japan. The proportion of female surgeons, surgical training, and how surgical care is provided may vary substantially among the countries, and therefore, the findings from our meta-analysis might not be generalizable to health systems in countries that were not included in our analysis.

CONCLUSIONS

Our meta-analysis on 15 studies found that patients treated by female surgeons had a small but statistically significantly lower postoperative mortality and similar readmission and complication rates, compared with those treated by male surgeons. These findings support ongoing efforts to recruit and retain women in surgery, as well as to better understand the differences in the processes of care that contribute to these outcomes.

Supplementary Material

sla-280-945-s001.docx (6.6MB, docx)

ACKNOWLEDGMENT

The authors thank Dr Per Jolbäck for sharing the unpublished data of their studies.

Footnotes

Concept and design: N.S., N.Y., J.W., H.S., M.H., K.K., T.A., and Y.T. Acquisition, analysis, or interpretation of data: N.S., N.Y., J.W., H.S., M.H., K.K., T.A., C.W., A.J., T.K., H.K., and Y.T. Drafting of the manuscript: N.S., N.Y., J.W., T.K., and Y.T. Statistical analysis: N.S. and H.K. Supervision: Y.T. Critical revision of the manuscript for important intellectual content: all authors.

This study was supported by the National Institute of Health (NIH)/National Institute on Minority Health and Health Disparities (R01 MD013913; PI, Y.T.), Gregory Annenberg Weingarten GRoW @Annenberg (PI, Y.T.) and JSPS KAKENHI Grant Number JP23K15698 (PI, N.S.). Y.T. was supported by NIH/National Institute on Aging (R01AG068633 and R01AG082991) for other work not related to this study.

The authors report no conflicts of interest.

Supplemental Digital Content is available for this article. Direct URL citations are provided in the HTML and PDF versions of this article on the journal's website, www.annalsofsurgery.com.

Contributor Information

Natsumi Saka, Email: natsumi613@gmail.com.

Norio Yamamoto, Email: lovescaffe@yahoo.co.jp.

Jun Watanabe, Email: m06105jw@jichi.ac.jp.

Christopher Wallis, Email: wallis.cjd@gmail.com.

Angela Jerath, Email: angela.jerath@sunnybrook.ca.

Hidehiro Someko, Email: hidehirosomeko@gmail.com.

Minoru Hayashi, Email: fukuiben17@yahoo.co.jp.

Kyosuke Kamijo, Email: m10025kk@jichi.ac.jp.

Takashi Ariie, Email: tks.ar1212@gmail.com.

Toshiki Kuno, Email: kuno-toshiki@hotmail.co.jp.

Hirotaka Kato, Email: kato.hir.lh@yokohama-cu.ac.jp.

Hodan Mohamud, Email: hodan.mohamud@mail.utoronto.ca.

Ashton Chang, Email: ashton.chang@mail.utoronto.ca.

Raj Satkunasivam, Email: raj.satkunasivam@gmail.com.

Yusuke Tsugawa, Email: ytsugawa@mednet.ucla.edu.

REFERENCES

  • 1. AAMC Physician Specialty Data Report; 2019. Accessed Jun 20, 2023. https://www.aamc.org/data-reports/workforce/interactive-data/active-physicians-sex-and-specialty-2019
  • 2. Wallis CJ, Ravi B, Coburn N, et al. Comparison of postoperative outcomes among patients treated by male and female surgeons: a population based matched cohort study. Brit Med J. 2017;359:j4366. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Newman TH, Parry MG, Zakeri R, et al. Gender diversity in UK surgical specialties: a national observational study. BMJ Open. 2022;12:e055516. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Statistics of Physicians, Dentists, and Pharmacists; 2018. Accessed Jun 20, 2023. https://www.mhlw.go.jp/toukei/saikin/hw/ishi/18/dl/kekka-1.pdf
  • 5. Wu B, Bhulani N, Jalal S, et al. Gender disparity in leadership positions of general surgical societies in North America, Europe, and Oceania. Cureus. 2019;11:e6285. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Spector ND, Asante PA, Marcelin JR, et al. Women in pediatrics: progress, barriers, and opportunities for equity, diversity, and inclusion. Pediatrics. 2019;144:e20192149. [DOI] [PubMed] [Google Scholar]
  • 7. OECD Library. Health at a Glance 2021: OECD Indicators; 2021. Accessed Jun 20, 2023. https://www.oecd-ilibrary.org/sites/aa9168f1-en/index.html?itemId=/content/component/aa9168f1-en
  • 8. Thomas WE. Teaching and assessing surgical competence. Ann R Coll Surg Engl. 2006;88:429–432. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Silliman RA, Demissie S, Troyan SL. The care of older women with early-stage breast cancer: what is the role of surgeon gender? Med Care. 1999;37:1057–1067. [DOI] [PubMed] [Google Scholar]
  • 10. Hershman DL, Buono D, McBride RB, et al. Surgeon characteristics and receipt of adjuvant radiotherapy in women with breast cancer. J Natl Cancer Inst. 2008;100:199–206. [DOI] [PubMed] [Google Scholar]
  • 11. Arrington AK, Jarosek SL, Virnig BA, et al. Patient and surgeon characteristics associated with increased use of contralateral prophylactic mastectomy in patients with breast cancer. Ann Surg Oncol. 2009;16:2697–2704. [DOI] [PubMed] [Google Scholar]
  • 12. Gilligan MA, Neuner J, Sparapani R, et al. Surgeon characteristics and variations in treatment for early-stage breast cancer. Arch Surg. 2007;142:17–22. [DOI] [PubMed] [Google Scholar]
  • 13. Kerr H-L, Armstrong LA, Cade JE. Barriers to becoming a female surgeon and the influence of female surgical role models. Postgrad Med J. 2016;92:576–580. [DOI] [PubMed] [Google Scholar]
  • 14. Tsugawa Y, Jena AB, Orav EJ, et al. Age and sex of surgeons and mortality of older surgical patients: observational study. Brit Med J. 2018;361:k1343. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Jolbäck P, Rogmark C, Bedeschi Rego De Mattos C, et al. The influence of surgeon sex on adverse events following primary total hip arthroplasty: a register-based study of 11,993 procedures and 200 surgeons in Swedish public hospitals. J Bone Joint Surg Am. 2022;104:1327–1333. [DOI] [PubMed] [Google Scholar]
  • 16. Okoshi K, Endo H, Nomura S, et al. Comparison of short term surgical outcomes of male and female gastrointestinal surgeons in Japan: retrospective cohort study. Brit Med J. 2022;378:e070568. [DOI] [PubMed] [Google Scholar]
  • 17. Association of American Medical Colleges Center for Workforce Studies . 2014 Physician Specialty Data Book; 2014.
  • 18. Jena AB, Olenski AR, Blumenthal DM. Sex differences in physician salary in US public medical schools. JAMA Intern Med. 2016;176:1294–1304. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Jena AB, Khullar D, Ho O, et al. Sex differences in academic rank in US medical schools in 2014. JAMA. 2015;314:1149–1158. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Ly DP, Seabury SA, Jena AB. Differences in incomes of physicians in the United States by race and sex: observational study. Brit Med J. 2016;353:i292. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Capek L, Edwards DE, Mackinnon SE. Plastic surgeons: a gender comparison. Plast Reconstr Surg. 1997;99:289–299. [DOI] [PubMed] [Google Scholar]
  • 22. Dresler CM, Padgett DL, Mackinnon SE, et al. Experiences of women in cardiothoracic surgery: a gender comparison. Arch Surg. 1996;131:1128–1134. [DOI] [PubMed] [Google Scholar]
  • 23. Spencer ES, Deal AM, Pruthi NR, et al. Gender differences in compensation, job satisfaction and other practice patterns in urology. J Urol. 2016;195:450–455. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Peckham C. Medscape General Surgeon Compensation Report; 2016. Accessed Jun 20, 2023. http://www.medscape.com/features/slideshow/compensation/2016/generalsurgery
  • 25. Saka N, Norio Y, Hidehiro S, et al. Association between surgeons’ sex and postoperative outcomes: protocol of a systematic review and meta-analysis of observational studies; 2022. 10.17605/OSF.IO/PGH78 [DOI] [Google Scholar]
  • 26. Brooke BS, Schwartz TA, Pawlik TM. MOOSE Reporting Guidelines for Meta-analyses of Observational Studies. JAMA Surg. 2021;156:787–788. [DOI] [PubMed] [Google Scholar]
  • 27. Page MJ, McKenzie JE, Bossuyt PM, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. Brit Med J. 2021;372:n71. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Haddaway NR, Grainger MJ, Gray CT. Citationchaser: a tool for transparent and efficient forward and backward citation chasing in systematic searching. Res Synth Methods. 2022;13:533–545. [DOI] [PubMed] [Google Scholar]
  • 29. Hoshijima H, Wajima Z, Nagasaka H, et al. Association of hospital and surgeon volume with mortality following major surgical procedures: meta-analysis of meta-analyses of observational studies. Medicine. 2019;98:e17712. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Jung Y, Kim K, Choi ST, et al. Association between surgeon age and postoperative complications/mortality: a systematic review and meta-analysis of cohort studies. Sci Rep. 2022;12:11251. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Powell JS, Apte S, Chambost H, et al. Long-acting recombinant factor IX Fc fusion protein (rFIXFc) for perioperative management of subjects with haemophilia B in the phase 3 B-LONG study. Br J Haematol. 2015;168:124–134. [DOI] [PubMed] [Google Scholar]
  • 32. Ouzzani M, Hammady H, Fedorowicz Z, et al. Rayyan—a web and mobile app for systematic reviews. Syst Rev. 2016;5:10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Birkmeyer JD, Stukel TA, Siewers AE, et al. Surgeon volume and operative mortality in the United States. N Engl J Med. 2003;349:2117–2127. [DOI] [PubMed] [Google Scholar]
  • 34. Waljee JF, Greenfield LJ, Dimick JB, et al. Surgeon age and operative mortality in the United States. Ann Surg. 2006;244:353. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Sterne JA, Hernán MA, Reeves BC, et al. ROBINS-I: a tool for assessing risk of bias in non-randomised studies of interventions. Brit Med J. 2016;355:i4919. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Roche JJ, Wenn RT, Sahota O, et al. Effect of comorbidities and postoperative complications on mortality after hip fracture in elderly people: prospective observational cohort study. Brit Med J. 2005;331:1374. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Hogue CW, Barzilai B, Pieper KS, et al. Sex differences in neurological outcomes and mortality after cardiac surgery. Circulation. 2001;103:2133–2137. [DOI] [PubMed] [Google Scholar]
  • 38. Polanczyk CA, Marcantonio E, Goldman L, et al. Impact of age on perioperative complications and length of stay in patients undergoing noncardiac surgery. Ann Intern Med. 2001;134:637–643. [DOI] [PubMed] [Google Scholar]
  • 39. McGuinness LA, Higgins JPT. Risk-of-bias VISualization (robvis): an R package and Shiny web app for visualizing risk-of-bias assessments. Res Synth Methods. 2021;12:55–61. [DOI] [PubMed] [Google Scholar]
  • 40. Guyatt G, Oxman AD, Akl EA, et al. GRADE guidelines: 1. Introduction-GRADE evidence profiles and summary of findings tables. J Clin Epidemiol. 2011;64:383–394. [DOI] [PubMed] [Google Scholar]
  • 41. Schünemann HJ, Cuello C, Akl EA, et al. GRADE guidelines: 18. How ROBINS-I and other tools to assess risk of bias in nonrandomized studies should be used to rate the certainty of a body of evidence. J Clin Epidemiol. 2019;111:105–114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Higgins JPT, Thomas J, Chandler J, et al. Cochrane Handbook for Systematic Reviews of Interventions, version 63 (updated February 2022). Cochrane; 2022. [Google Scholar]
  • 43. Zhang J, Yu KF. What’s the relative risk? A method of correcting the odds ratio in cohort studies of common outcomes. JAMA. 1998;280:1690–1691. [DOI] [PubMed] [Google Scholar]
  • 44. O’Neill L, Lanska DJ, Hartz AJ. Surgeon characteristics associated with mortality and morbidity following carotid endarterectomy. Neurology. 2000;55:773–781. [DOI] [PubMed] [Google Scholar]
  • 45. Chai CY, Chen CH, Lin HW, et al. Association of increasing surgeon age with decreasing in-hospital mortality after coronary artery bypass graft surgery. World J Surg. 2009;34:3–9. [DOI] [PubMed] [Google Scholar]
  • 46. Ho J-D, Kuo NW, Tsai CY, et al. Surgeon age and operative outcomes for primary rhegmatogenous retinal detachment: a 3-year nationwide population-based study. Eye. 2009;24:290–296. [DOI] [PubMed] [Google Scholar]
  • 47. Wu M-P, Wu C-J, Weng S-F. The choice of reoperation after primary surgeries for uterine prolapse: a nationwide study. Gynecol Minim Invasive Ther. 2015;4:120–125. [Google Scholar]
  • 48. Lin DL, Wu CS, Tang CH, et al. The safety and risk factors of revision adenoidectomy in children and adolescents: a nationwide retrospective population-based cohort study. Auris Nasus Larynx. 2018;45:1191–1198. [DOI] [PubMed] [Google Scholar]
  • 49. Vora H, Amersi F, Chung A, et al. The effect of surgeon gender on breast cancer surgical practice patterns and outcomes. Ann Surg Oncol. 2019;26:S95–S96. [Google Scholar]
  • 50. Flodin J, Juthberg R, Edman G. Patient-reported outcome and healing biomarkers in patients treated by female versus male surgeons—a cohort study on achilles tendon ruptures. Muscle Ligaments Tendons J. 2019;9:531. [Google Scholar]
  • 51. Chapman TR, Zmistowski B, Votta K, et al. Patient Complications after Total Joint Arthroplasty: Does Surgeon Gender Matter? J Am Acad Orthop Surg. 2020;28:937–944. [DOI] [PubMed] [Google Scholar]
  • 52. Gill HK, Niederer RL, Danesh-Meyer HV. Gender differences in surgical case volume among ophthalmology trainees. Clin Exp Ophthalmol. 2021;49:664–671. [DOI] [PubMed] [Google Scholar]
  • 53. Guiab K, Evans T, Brigode W, et al. Complications after inpatient laparoscopic cholecystectomy: effect of surgeon experience, procedure volume, and other surgeon-based characteristics. Am Surg. 2022;88:1798–1804. [DOI] [PubMed] [Google Scholar]
  • 54. Wallis CJD, Jerath A, Satkunasivam R, et al. Association between patient-surgeon sex concordance and post-operative mortality in the United States. Brit Med J. 2023;383:e075484. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. Bouchghoul H, Deneux-Tharaux C, Georget A, et al. Association between surgeon gender and maternal morbidity after cesarean delivery. JAMA Surg. 2023;158:273–281. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. Mazilescu LI, Bernheim I, Treckmann J, et al. Donor, Recipient and Surgeon Sex and Sex-Concordance and their Impact on Liver Transplant Outcome. J Pers Med. 2023;13:281 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Engdahl J, Bech Larsen S, Öberg A, et al. Effect of surgeon gender on postoperative morbidity, mortality and long-term survival following colon cancer resections. Colorectal Dis. 2022;24:112. [Google Scholar]
  • 58. Indart C, Galhotra S, Hu C, et al. Surgical Outcomes for Laparoscopic Hysterectomy Based on Surgeon Gender. J Minim Invasive Gynecol. 2022;29:S9. [Google Scholar]
  • 59. Kobylianskii A, Murji A, Matelski JJ, et al. Surgeon gender and performance outcomes for hysterectomies: retrospective cohort study. J Minim Invasive Gynecol. 2023;30:108–114. [DOI] [PubMed] [Google Scholar]
  • 60. Blohm M, Sandblom G, Enochsson L, et al. Differences in Cholecystectomy Outcomes and Operating Time Between Male and Female Surgeons in Sweden. JAMA Surg. 2023;158:1168–1175. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61. Okoshi K, Nomura K, Taka F, et al. Suturing the gender gap: Income, marriage, and parenthood among Japanese Surgeons. Surgery. 2016;159:1249–1259. [DOI] [PubMed] [Google Scholar]
  • 62. Lurie N, Slater J, McGovern P, et al. Preventive care for women. Does the sex of the physician matter? N Engl J Med. 1993;329:478–482. [DOI] [PubMed] [Google Scholar]
  • 63. Bertakis KD, Helms LJ, Callahan EJ, et al. The influence of gender on physician practice style. Med Care. 1995;33:407–416. [DOI] [PubMed] [Google Scholar]
  • 64. Berthold HK, Gouni-Berthold I, Bestehorn KP, et al. Physician gender is associated with the quality of type 2 diabetes care. J Intern Med. 2008;264:340–350. [DOI] [PubMed] [Google Scholar]
  • 65. Komajda M, Schöpe J, Wagenpfeil S, et al. Physicians’ guideline adherence is associated with long-term heart failure mortality in outpatients with heart failure with reduced ejection fraction: the QUALIFY international registry. Eur J Heart Fail. 2019;21:921–929. [DOI] [PubMed] [Google Scholar]
  • 66. Emond YEJJM, Calsbeek H, Peters YAS, et al. Increased adherence to perioperative safety guidelines associated with improved patient safety outcomes: a stepped-wedge, cluster-randomised multicentre trial. Br J Anaesth. 2022;128:562–573. [DOI] [PubMed] [Google Scholar]
  • 67. Kaslow SR, Ma Z, Hani L, et al. Adherence to guidelines at the patient- and hospital-levels is associated with improved overall survival in patients with gastric cancer. J Surg Oncol. 2022;126:479–489. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68. Eckel CC, Grossman PJ. Men, women and risk aversion: experimental evidence. Handb Exp Econ Results. 2008;1:1061–1073. [Google Scholar]
  • 69. Mittal MV. A study of psychological reasons for gender differences in preferences for risk and investment decision making. IUP J Behav Finan. 2011;8:45–60. [Google Scholar]
  • 70. Wakeam E, Hyder JA. Raising the bar for failure to rescue: critical appraisal of current measurement and strategies to catalyze improvement. JAMA Surg. 2015;150:1023–1024. [DOI] [PubMed] [Google Scholar]
  • 71. Lou Z, Yan F-H, Zhao Z-Q, et al. The sex difference in basic surgical skills learning: a comparative study. J Surg Educ. 2016;73:902–905. [DOI] [PubMed] [Google Scholar]
  • 72. Vincent C, Moorthy K, Sarker SK, et al. Systems approaches to surgical quality and safety: from concept to measurement. Ann Surg. 2004;239:475–482. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73. Birkmeyer JD. Progress and challenges in improving surgical outcomes. Br J Surg. 2012;99:1467–1469. [DOI] [PubMed] [Google Scholar]
  • 74. Foley KE, Izquierdo KM, von Muchow MG, et al. Colon and rectal surgery robotic training programs: an evaluation of gender disparities. Dis Colon Rectum. 2020;63:974–979. [DOI] [PubMed] [Google Scholar]
  • 75. Goodman M, Adams N, Corneil T, et al. Size and distribution of transgender and gender nonconforming populations: a narrative review. Endocrinol Metab Clin North Am. 2019;48:303–321. [DOI] [PubMed] [Google Scholar]

Articles from Annals of Surgery are provided here courtesy of Wolters Kluwer Health

RESOURCES