Abstract
Objective To examine the effect of surgeon sex on postoperative outcomes of patients undergoing common surgical procedures.
Design Population based, retrospective, matched cohort study from 2007 to 2015.
Setting Population based cohort of all patients treated in Ontario, Canada.
Participants Patients undergoing one of 25 surgical procedures performed by a female surgeon were matched by patient age, patient sex, comorbidity, surgeon volume, surgeon age, and hospital to patients undergoing the same operation by a male surgeon.
Interventions Sex of treating surgeon.
Main outcome measure The primary outcome was a composite of death, readmission, and complications. We compared outcomes between groups using generalised estimating equations.
Results 104 630 patients were treated by 3314 surgeons, 774 female and 2540 male. Before matching, patients treated by female doctors were more likely to be female and younger but had similar comorbidity, income, rurality, and year of surgery. After matching, the groups were comparable. Fewer patients treated by female surgeons died, were readmitted to hospital, or had complications within 30 days (5810 of 52 315, 11.1%, 95% confidence interval 10.9% to 11.4%) than those treated by male surgeons (6046 of 52 315, 11.6%, 11.3% to 11.8%; adjusted odds ratio 0.96, 0.92 to 0.99, P=0.02). Patients treated by female surgeons were less likely to die within 30 days (adjusted odds ratio 0.88; 0.79 to 0.99, P=0.04), but there was no significant difference in readmissions or complications. Stratified analyses by patient, physician, and hospital characteristics did not significant modify the effect of surgeon sex on outcome. A retrospective analysis showed no difference in outcomes by surgeon sex in patients who had emergency surgery, where patients do not usually choose their surgeon.
Conclusions After accounting for patient, surgeon, and hospital characteristics, patients treated by female surgeons had a small but statistically significant decrease in 30 day mortality and similar surgical outcomes (length of stay, complications, and readmission), compared with those treated by male surgeons. These findings support the need for further examination of the surgical outcomes and mechanisms related to physicians and the underlying processes and patterns of care to improve mortality, complications, and readmissions for all patients.
What is already known on this topic
Female and male physicians differ in their practice of medicine in ways that might substantially affect patient outcomes. Outcomes after surgery depend on the technical and cognitive skills of treating physicians, so findings from medical specialties might not apply to surgical specialties.
What this study adds
This large, population based, matched cohort analysis found small differences in surgical outcomes between patients treated by female and male surgeons, with the former having a small but statistically significant decreased risk of short term postoperative death.
Introduction
Successful surgical practice has four core components: knowledge, communication skills, judgment, and technical proficiency.1 The acquisition and maintenance of technical skills distinguishes surgeons from many other doctors, and these skills are directly associated with short term postoperative outcomes.2 Women and men practise medicine differently,3 4 5 6 7 although little research exists on the differences in learning styles, acquisition of skills, or outcomes for female and male surgeons.8 Tsugawa et al found that beneficiaries of US Medicare that were treated by female general internists in hospital had lower rates of 30 day mortality and readmission than those treated by male internists.9 Suggested mechanisms for this difference include female doctors being more likely to use a patient centred approach and to follow evidence based guidelines. Surgery, however, has a major technical component, so there is less reason to expect a difference in outcomes between female and male surgeons.
Surgical disciplines are disproportionately male despite increasing numbers of female medical students.3 4 5 6 7 Gender equity in the surgical profession, including disparities in compensation and promotion, is a growing concern.10 11 Assessing outcomes for female and male surgeons is important for combatting implicit bias and gender schemas that might perpetuate current inequalities.10 We sought to determine whether postoperative outcomes for patients having one of 25 surgical procedures (emergent and elective) differ between male and female operating surgeons using a large, population based cohort.
Methods
Overview
We conducted a population based, retrospective cohort study of patients having surgical procedures in Ontario, Canada, between 1 January 2007 and 31 December 2015 to assess the hypothesis that sex of the operating surgeon would significantly affect 30 day postoperative outcomes. The study sample included patients who had one of 25 surgical interventions, selected by multidisciplinary discussion with consultants from all surgical specialties. We selected surgical procedures using the following criteria: inclusion of all surgical subspecialties with female surgeons12 and either frequently performed in Ontario13 or having an increased likelihood of complications. We selected coronary artery bypass grafting, femoral-popliteal bypass, abdominal aortic aneurysm repair, appendectomy, cholecystectomy, gastric bypass, colon resection, liver resection, hysterectomy, anterior or posterior spinal decompression, anterior or posterior spinal arthrodesis, craniotomy for brain tumour, total knee replacement, total hip replacement, open repair of femoral neck or shaft fracture, total thyroidectomy, neck dissection, lung resection, radical cystectomy, radical prostatectomy, transurethral resection of prostate, carpal tunnel release, and breast reduction.
Despite using administrative data sources, we performed a sample size calculation to ensure that the proposed analysis was feasible. Based on previous estimates of the study primary outcome (composite event rate of death, readmission, and postoperative complications; 17%)14and effect size (4%, or odds ratio 0.96),9 we calculated a necessary sample size of 94 270 (see appendix). We designed and conducted this study according to STROBE (strengthening the reporting of observational studies in epidemiology) guidelines15 and the RECORD (reporting of studies conducted using observational routinely collected health data) statement.11 Sunnybrook Health Sciences Centre Research Ethics Board approved this study (project identification number 375-2016).
Data sources
We linked the following datasets using encrypted patient identifiers: the Ontario Health Insurance Plan (OHIP) database, which tracks claims paid for physician billings, laboratories, and out-of-province providers16; the Canadian Institute for Health Information Discharge Abstract Database (CIHI-DAD), which contains records for hospital admissions17; the CIHI National Ambulatory Care Reporting System, which contains records for emergency department visits; the Registered Persons Database for demographic information18; and the Corporate Provider Database for surgeon data.
Cohort derivation
We identified all patients who had one of the 25 index procedures in the study period (n=1 534 592). We excluded patients treated by physicians whose primary declared specialty was non-surgical (n=8753) and patients under 18 (n=29 158). We included only the first one of the 25 procedures for each patient, thus excluding 282 399 procedures in patients already in the cohort and ensuring that each patient was included only once. We excluded patients for whom the date of death preceded the date of surgery (n=332). We think these cases represent coding errors as the majority died several years before the date of surgery. A sensitivity analysis including the 10 patients who died one day before surgery did not alter the outcome of the primary analysis (see appendix). To capture data on hospital factors, we excluded patients for whom the treating institution could not be identified (n=54 263). After these exclusions, the study sample comprised 1 159 687 patients who had surgery between 2007 and 2015.
Outcomes
Our primary outcome was a composite of death, complications, or readmission (to any hospital in the province of Ontario) in the 30 days after surgery, as previously described.16 We included deaths that occurred in the initial postoperative period in hospital. We think this outcome best captures the overall burden of short term postoperative complications. Our secondary outcomes were the individual components of the primary outcome (death, complications, and readmission within 30 days of surgery), and length of stay in hospital. We used a definition of surgical complications that has previously been used and represents major morbidity.16 These outcomes were ascertained from health administrative data using a combination of procedural and diagnostic codes that are uniformly collected for all hospitals and patients in Ontario.14 19
Covariates
We collected patient age at surgery, geographic location (local health integration networks20), sex, socioeconomic status (based on geographic location), rurality, and general comorbidity according to the Johns Hopkins aggregate disease group score (ADG)21 from linked administrative databases. The ADG score discriminates better than the Charlson score.22 We identified data on the treating surgeons including age, sex, years in practice, specialty, and surgical volume. Surgeon sex refers to biological sex, as reported by physicians at the time of registering for licenses to practise medicine in the province of Ontario. Patients’ biological sex is captured in the Registered Persons Database, which we used for demographic information. We were, therefore, unable to identify transgendered patients or surgeons. To determine surgical volume, for each patient and for the specific procedure they had, we identified the number of identical procedures their operating surgeon performed in the previous year. We defined this variable categorically into quarters for each procedure performed, among the whole cohort. We collected hospital institution identifiers to account for variability between institutions. We defined a surgical procedure as emergent or elective using the CIHI-DAD database admission variables, which denote urgent, emergent, or admission from the emergency department before surgery. We considered all same day surgery procedures to be elective.
Matching
Surgeon age, years in practice, operative volumes, surgical specialty, and patient demographics differed between participants treated by female and male surgeons, so we conducted a matched analysis. Variables such as surgeon age could potentially confound the relationship between surgeon sex and outcomes, as male surgeons were older on average. Far fewer patients were treated by female surgeons, so we identified these first and matching them to patients treated by male surgeons. We matched patients 1:1 using a hard match comprising procedural fee code (a unique identifier for each procedure), surgeon volume (separated into quarters based on the number of index procedures performed by each surgeon in the year before index), surgeon age (±3 years), hospital identifier, patient age (±5 years), patient sex, and patient comorbidity using ADG score (categorical). Surgeon years in practice was collinear with surgeon age, so we selected only age for matching. We selected all covariates for matching based on a priori identification without variable selection.
Statistical analysis
In all analyses we examined surgeon sex as an exposure potentially associated with postoperative outcomes. Descriptive statistics were used to compare patients treated by male and female physicians, as well as to compare the characteristics of the male and female surgeons. Given the large sample size, traditional statistical measures were likely to show statistically significant differences where no clinically important differences exist. So we compared groups using standardised differences—the difference in the mean of a variable between two groups divided by an estimate of the standard deviation of that variable among both groups.23 We defined a clinically important standardised difference as greater than 0.10.23
Matched data are inherently correlated, so we used multivariable generalised estimating equations (GEE) with a logit link to estimate the association between surgeon sex and the outcomes. We used Poisson regression to examine the association between surgeon sex and hospital length of stay.
We conducted subgroup analyses to assess for an interaction between patient (age, sex, and comorbidity), surgeon (specialty, age, years in practice, and annual volume), and hospital characteristics and to assess the association between surgeon sex and outcomes. To minimise the effect of patients selecting their physician, we performed a retrospective stratified analysis of the matched cohort according to whether the surgery was elective or emergent.
Matching considerably reduced our sample size, which might adversely affect the applicability of our findings. Thus, we examined the entire cohort (n=1 159 687) using regression with multivariable GEE models with a logit link, rather than matching, to account for differences in patient, surgeon, and hospital factors between patients treated by female and male surgeons. The GEE model accounted for correlation in outcomes based on surgeon, hospital, and procedural fee code.
Statistical significance was set at P<0.05 based on a two tailed comparison. We used SAS Enterprise Guide 6.1 (SAS Institute Inc., Cary, NC, USA) for all analyses.
Patient involvement
No patients were involved in setting the research question or the outcome measures, nor were they involved in developing plans for implementation of the study. No patients were asked to advise on interpretation or writing up of results. There are no plans to disseminate the results of the research to study participants or the relevant patient community. Individual patient consent was not sought given the use of anonymised administrative data.
Results
We identified 1 159 687 eligible patients who had one of the 25 index procedures in the study period. Surgery was performed by 3314 surgeons, 774 (23.4%) of whom were female, and 2540 (76.6%) were male (table 1). Of the 1 159 687 patients, 144 119 (12.4%) were treated by female surgeons, and 1 015 568 (87.6%) were treated by male surgeons. Patients treated by female physicians were more likely to be female and younger (table 1). We found no difference in year of surgery, comorbidity, income, rurality, or region of residence between groups. Female surgeons were younger and had fewer years in practice. Surgeon volume (based on the number of identical surgical procedures that surgeon had performed in the preceding year) was lower for female surgeons. Female surgeons performed proportionally more operations than men in general surgery, obstetrics and gynaecology, and plastic surgery. Before matching, patients treated by female surgeons had statistically significantly lower rates of the composite endpoint (15 731 of 144 119, 10.9%, 10.8% to 11.1%) than those treated by male surgeons (122 468 of 1 015 568, 12.1%, 12.0% to 12.1%; unadjusted odds ratio 0.94, 0.91 to 0.97, P<0.001).
Table 1.
Before matching | After matching | ||||||
---|---|---|---|---|---|---|---|
Treated by female surgeon | Treated by male surgeon | St diff | Treated by female surgeon | Treated by male surgeon | St diff | ||
Number of patients | 144 119 | 1 015 568 | NA | 52 315 | 52 315 | NA | |
Surgeon characteristics | |||||||
Age, years: | |||||||
Mean (SD) | 44.2 (8.1) | 49.3 (9.6) | 0.57 | 44.2 (7.6) | 44.3 (7.6) | 0.02 | |
Median (IQR) | 43 (38-50) | 49 (41-57) | 0.55 | 43 (38-50) | 43 (38-50) | 0.02 | |
Years in practice: | |||||||
Mean (SD) | 11.6 (8.0) | 15.4 (8.4) | 0.46 | 11.7 (7.8) | 11.9 (7.9) | 0.03 | |
Median (IQR) | 10 (5-19) | 17 (8-22) | 0.45 | 10 (5-18) | 10 (5-19) | 0.02 | |
Surgical volume among all patients, n (%): | |||||||
First quarter (lowest) | 52 730 (36.6) | 232 356 (22.9) | 0.22 | 19 230 (36.8) | 19 230 (36.8) | 0 | |
Second quarter | 43 284 (30.0) | 256 528 (25.3) | 15 096 (28.9) | 15 096 (28.9) | |||
Third quarter | 29 203 (20.3) | 257 512 (25.4) | 11 515 (22.0) | 11 515 (22.0) | |||
Fourth quarter (highest) | 18.902 (13.1) | 269 172 (26.5) | 6474 (12.4) | 6474 (12.4) | |||
Surgical specialty: | |||||||
Cardiothoracic surgery | 5272 (3.7) | 63 494 (6.3) | 0.36 | 4023 (7.7) | 4039 (7.7) | 0.01 | |
General surgery | 57 814 (40.1) | 326 911 (32.2) | 25 711 (49.1) | 25 759 (49.2) | |||
Neurosurgery | 2370 (1.6) | 44 264 (4.4) | 764 (1.5) | 675 (1.3) | |||
Obstetrics/Gynaecology | 48 301 (33.5) | 86 641 (8.5) | 12 839 (24.5) | 12 608 (24.1) | |||
Orthopaedic surgery | 11 400 (7.9) | 319 634 (31.5) | 5407 (10.3) | 5515 (10.5) | |||
Otolaryngology | 2879 (2.0) | 23 529 (2.3) | 664 (1.3) | 729 (1.4) | |||
Plastic surgery | 14 123 (9.8) | 52 719 (5.2) | 2141 (4.1) | 2140 (4.1) | |||
Thoracic surgery | 1078 (0.7) | 11 075 (1.1) | 371-375* | 416-420* | |||
Urology | 862 (0.6) | 83 456 (8.2) | 390 (0.7) | 418 (0.8) | |||
Vascular surgery | 20 (0) | 3845 (0.4) | <6* | 12-16* | |||
Patient characteristics | |||||||
Age, years: | |||||||
Mean (SD) | 52.7 (16.5) | 59.7 (17.2) | 0.42 | 52.1 (17.0) | 52.1 (17.0) | 0 | |
Median (IQR) | 51 (41-65) | 61 (48-73) | 0.44 | 51 (40-65) | 51 (40-65) | 0 | |
Sex, n (%): | |||||||
Female | 111 705 (77.5) | 584 042 (57.5) | 0.44 | 38 149 (72.9) | 38 149 (72.9) | 0 | |
Male | 32 414 (22.5) | 431 526 (42.5) | 14 166 (27.1) | 14 166 (27.1) | |||
Comorbidity, n (%): | |||||||
ADG 0-5 | 39 761 (27.6) | 270 369 (26.6) | 0.02 | 15 136 (28.9) | 15 136 (28.9) | 0 | |
ADG 6-7 | 35 984 (25.0) | 243 640 (24.0) | 12 506 (23.9) | 12 506 (23.9) | |||
ADG 8-10 | 43 520 (30.2) | 303 605 (29.9) | 15 962 (30.5) | 15 962 (30.5) | |||
ADG ≥11 | 24 854 (17.2) | 197 954 (19.5) | 8711 (16.7) | 8711 (16.7) | |||
Rurality, n (%) | |||||||
Urban | 125 309 (89.9) | 858 825 (84.6) | 0.07 | 46 412 (88.7) | 46 408 (88.7) | 0 | |
Rural | 18 680 (13.0) | 155 884 (15.3) | 5860 (11.2) | 5861 (11.2) | |||
Missing | 130 (0.1) | 859 (0.01) | 43 (0.1) | 46 (0.1) | |||
Income, n (%): | |||||||
First fifth (lowest) | 26 184 (18.2) | 190 796 (18.8) | 0.01 | 9872 (18.9) | 9942 (19.0) | 0.01 | |
Second fifth | 28 094 (19.5) | 202 670 (20.0) | 10 374 (19.8) | 10 309 (19.8) | |||
Third fifth | 28 762 (20.0) | 202 843 (20.0) | 10 396 (19.9) | 10 515 (20.0) | |||
Fourth fifth | 30 833 (21.4) | 211 167 (20.8) | 11 109 (21.2) | 10 876 (21.0) | |||
Fifth fifth (highest) | 29 698 (20.6) | 203 627 (20.1) | 10 358 (19.8) | 10 420 (19.9) | |||
Missing | 548 (0.4) | 4465 (0.4) | 206 (0.4) | 253 (0.5) | |||
Hospital characteristics | |||||||
Practice setting, n (%): | |||||||
Academic | 54 801 (38.0) | 344 027 (33.9) | 0.09 | 20 878 (39.9) | 20 878 (39.9) | 0 | |
Community | 89 318 (62.0) | 671 541 (66.1) | 31 437 (60.1) | 31 437 (60.1) | |||
Year of surgery, n (%) | |||||||
2007 | 14 481 (10.0) | 128 186 (12.6) | 0.04 | 5230 (10.0) | 6906 (13.2) | 0.05 | |
2008 | 15 224 (10.6) | 121 318 (11.9) | 5346 (10.2) | 6292 (12.0) | |||
2009 | 15 432 (10.7) | 117 545 (11.6) | 5579 (10.7) | 58,75 (11.2) | |||
2010 | 15 363 (10.7) | 112 699 (11.1) | 5467 (10.5) | 5690 (10.9) | |||
2011 | 15 265 (10.6) | 111 693 (11.0) | 5382 (10.3) | 5624 (10.8) | |||
2012 | 15 759 (10.9) | 108 527 (10.7) | 5550 (10.6) | 5210 (10.0) | |||
2013 | 16 688 (11.6) | 108 251 (10.7) | 6122 (11.7) | 5694 (10.9) | |||
2014 | 17 356 (12.0) | 104 967 (10.3) | 6564 (12.5) | 5502 (10.5) | |||
2015 | 18 551 (12.9) | 102 382 (10.1) | 7075 (13.5) | 5522 (10.6) |
ADG=aggregate disease group; IQR=interquartile range; SD=standard deviation; St diff=standardized difference.
*Because of ICES (Institute for Clinical Evaluative Sciences) privacy regulations, data fields with fewer than six events are not shown, along with cells that would allow for derivation of these small cells.
After matching, the two groups were well balanced with respect to patient age, sex, comorbidity, rurality, and income; surgeon age, years in practice, annual volume of the index procedure, and specialty; and hospital type. More patients treated by female surgeons had their operation in later calendar years, although the difference was not clinically important (standardized difference <0.10).
Differences in surgical specialty, procedural volume, and age of female surgeons meant that patients who were successfully matched to male surgeons (and, thus, included in the primary analytical cohort) were treated by younger surgeons with less experience and lower annual volumes than those who were excluded (supplementary table 2). Included patients were more likely to have had general surgery or an obstetrics and gynaecology related procedure and less likely to have had neurosurgery, orthopaedic surgery, or a urology related procedure. Included patients were younger and more likely to be female; comorbidity did not differ (supplementary table 2).
In the matched cohort, the primary outcome of death, readmission, or complications occurred in 5810 of 52 315 (11.1%, 10.9% to 11.4%) patients treated by female surgeons and in 6046 of 52 315 (11.6%, 11.3% to 11.8%) patients treated by male surgeons (table 2), translating to an absolute difference of 0.43% (number needed to treat to prevent one event=230).
Table 2.
Outcome | Patients treated by female surgeon (n=52 315) | Patients treated by male surgeon (n=52 315) | Absolute difference | Adjusted odds ratio (95%CI; P value) |
---|---|---|---|---|
Primary outcome (death, readmission, or complication within 30 days) | 5819 (11.1, 10.9 to 11.4) | 6046 (11.6, 11.3 to 11.8) | 0.43% | 0.96 (0.92 to 0.99; 0.02) |
Death within 30 days | 480 (0.9, 0.8 to 1.0) | 543 (1.0, 1.0 to 1.1) | 0.12% | 0.88 (0.78 to 0.99; 0.04) |
Readmission within 30 days | 2433 (4.7, 4.5 to 4.8) | 2518 (4.8, 4.6 to 5.0) | 0.16% | 0.96 (0.91 to 1.02; 0.20) |
Complication within 30 days | 3543 (6.8, 6.6 to 7.0) | 3674 (7.0, 6.8 to 7.2) | 0.25% | 0.96 (0.92 to 1.01; 0.10) |
Hospital length of stay, median (IQR) | 2 (0 to 4) | 2 (0 to 4) | 0 | 0.97 (0.94 to 0.99; 0.01)* |
*Adjusted relative rate rather than adjusted odds ratio.
In GEE models accounting for the correlation between matched individuals, patients treated by female surgeons were significantly less likely to experience the composite outcome (adjusted odds ratio 0.96, 0.92 to 0.99, P=0.02). Among the secondary outcomes, patients treated by female surgeons had a significantly lower likelihood of death within 30 days of surgery (adjusted odds ratio 0.88, 0.79 to 0.99, P=0.04) and comparable likelihood of readmission to hospital (0.96, 0.91 to 1.02, P=0.20) or complications (0.96, 0.92 to 1.01, P=0.10). Median length of stay was two days (interquartile range 0-4 days), regardless of the sex of the treating physician. As the mean duration of hospital admission differed among patients treated by female surgeons and those treated by male surgeons (mean 3.9 days (SD 10.0) and mean 4.0 days (SD 11.1), respectively), this was statistically significant in adjusted models (table 2).
We performed a retrospective sensitivity analysis of all outcomes in the matched cohort, stratifying by elective or emergent surgery. Patients who had emergent procedures were less likely to be female and more likely to have these procedures performed by younger surgeons with lower surgical volumes who had practiced for a shorter period of time (supplementary table 3). Specialties wherein emergent procedures are common (such as general surgery) were proportionately more represented than those where it is uncommon (such as obstetrics and gynaecology and plastic surgery), as expected. Emergent procedures were not more or less likely to be performed by a female surgeon. Using GEE models accounting for the correlation between matched individuals, patients treated by female surgeons were significantly less likely to experience the composite outcome when the procedure was elective (adjusted odds ratio 0.94, 0.89 to 0.98, P=0.007) but not emergent (adjusted odds ratio 1.01, 0.96 to 1.08, P=0.636); Pinteraction=0.048). We did not find any evidence of statistically significant effect modification by the urgency of the procedure for secondary outcomes, except length of stay (supplementary table 4).
We found no significant evidence of effect modification when analyses were stratified according to surgical specialty, surgeon age, surgeon years in practice, annual surgical volume of the index procedure, hospital status (academic or community), patient sex, patient age, or patient comorbidity (fig 1).
Although the effect estimates remained relatively consistent, many comparisons were no longer statistically significant in these subgroup analyses; for example, differences based on surgical specialty were not statistically significant (P=0.17). Given the dramatic effect of surgeon sex on outcomes of plastic surgical procedures (fig 1; GEE regression model adjusted odds ratio 0.24, 0.17 to 0.36, supplementary table 5), we repeated our primary analysis excluding plastic surgery. The direction and magnitude of the adjusted relative effects were unchanged (adjusted odds ratio of composite outcome 0.963, 0.928 to 0.999, P=0.045; adjusted odds ratio of mortality 0.886, 0.787 to 0.998, P=0.0465).
Matching considerably reduced our sample size, which might adversely affect the extrapolation of these findings to our entire patient cohort. In regression based models to account for confounding, patients treated by female surgeons had a comparably lower likelihood of the composite primary outcome (adjusted odds ratio 0.96, 0.93 to 0.99, P=0.006); the other findings, including mortality, were also comparable to the matched analysis (supplementary table 5). Notably, the year of surgery was not significantly associated with primary outcome in the regression analysis.
To explore the degree to which case-mix variation and surgical volume affected the study conclusions, we examined outcomes in regression models with sequentially greater numbers of covariates. We clustered observations based on the procedural fee code for each model to compare patients undergoing the same operation. Comparing patients treated by female surgeons to those treated by male surgeons, in unadjusted models, the crude odds ratio of the composite endpoint was 0.94 (0.91 to 0.97, P=0.0004). It remained 0.94 (0.90 to 0.97, P=0.0002) after accounting for surgeon age. Finally, we accounted for case-mix and all variables except for surgical volume, and the difference between female and male surgeons became non-significant (adjusted odds ratio 0.97, 0.94 to 1.00, P=0.08).
Discussion
Principal findings
In this population based cohort, we found small but significant differences in postoperative complications (mortality, complications, and readmission) between patients treated by male and female surgeons, using an α criterion of 0.05. Patients treated by female surgeons had lower rates of 30 day mortality. We matched patients for age, sex, and general comorbidity that had the same surgical procedure by a female or male surgeon of the same age with comparable annual surgical volume in the same hospital. Male surgeons in this cohort had higher average procedural volumes, which mitigates some of the effect of surgeon sex. Outcomes adjusted for case-mix (without adjustment for surgical volume) were comparable for patients treated by female and male surgeons. Retrospective analyses showed that the effect of surgeon sex was restricted to patients who had elective operations; outcomes did not differ among patients who had emergent operations. This finding must be interpreted within the limitations of retrospective analyses.24
A large proportion of patients were excluded because of the inability to find a suitable match. This was most prominent among patients who had procedures in specialties dominated by men. But regression models accounting for baseline differences with adjustment rather than matching gave comparable results for the same outcomes of interest. This indicates that the matched analytic approach might provide results that can be generalised.
We found an association between improved 30 day postoperative outcomes and female surgeons, but whether these differences would persist for long term survival is unclear. We focused on short term outcomes, as others have previously done,16 17 because technical factors related to the surgeon are more likely to contribute to complications in the immediate postoperative period rather than to long term surgical outcomes.8 20
We don’t know the mechanism that underlies better outcomes for patients treated by female surgeons, although it might be related to delivery of care that is more congruent with guidelines, more patient centred, and involves superior communication.1 2 3 4 5 We found that improved postoperative outcomes for patients treated by female surgeons were restricted to patients who had elective operations, which might reflect better patient selection for surgery or residual confounding. Other possible explanations include a willingness to collaborate by female surgeons, which might avert scenarios that could otherwise result in the “failure to rescue” phenomenon.25 In a relatively small cohort of medical students (n=342), Lou et al found that female students completed tests of basic surgical skills more efficiently and had significantly higher scores on tests of theoretical surgical knowledge than men.26 Further study is required to understand how differences in medical practice, clinical acumen, technical skills,27 or risk taking behaviour28 could underlie the sex based differences in mortality that we found, with a view to improving outcomes for patients treated by physicians of either sex.
The confidence intervals of the effect estimate approach might represent residual confounding or be due to chance alone. Our retrospective analyses of emergent operations, where patients are less likely able to select or be biased towards a surgeon of a particular sex, partly support the possibility of confounding. We found that the benefit in composite outcome for patients treated by female surgeons was restricted to elective procedures. By contrast, patients treated by female surgeons did not have a lower rate of 30 day mortality for either elective or emergent procedures, although this subanalysis had lower power than our main analysis. Regardless, outcomes for patients treated by female surgeons were never worse than for those treated by male surgeons.
The Canadian healthcare system is entirely publicly funded, which must be considered when interpreting these results. Patients can freely choose their operating surgeon, so factors such as lower marketing at patients might lower the likelihood of patients “shopping around” for surgeons compared with other healthcare systems.29 30 31 Whether factors such as this affect the likelihood of selection bias whereby “favourable” patients preferentially seek female surgeons in Canada is unknown.
Women undertaking a surgical career experience barriers in training and in practice.21 22 Previous work has shown that the “surgical personality,” surgical culture, and sex based discrimination, as well as lifestyle factors and workload, are deterrents for women considering a career in surgery.23 24 Difficulty identifying mentors, owing to the lack of women in leadership positions in surgery,26 32 might exacerbate this problem. These barriers might create a higher standard for women to gain entrance into the surgical workforce than men, resulting in the selection of a cohort of women that are proportionately more skilled, motivated, and harder working.
Strengths and limitations of this study
We were able to comprehensively capture patients undergoing the selected surgical procedures because of single payer healthcare in Ontario. Unlike Tsugawa et al, who were limited to Medicare beneficiaries aged 65 years and older, we identified all adult patients who had the procedures of interest. This is particularly important because patients treated by female physicians were younger than those treated by male surgeons. Moreover, the nature of the healthcare system enabled us to capture readmissions and complications occurring anywhere in the province, including at hospitals other than where the index surgery was performed.
The observational nature of our study carries some limitations. Primarily, we are unable to account for unmeasured (and unmeasurable) residual confounding. Differences in socioeconomics and comorbidity, which were not measured by our data sources, might have contributed to our findings. We used hard matching to directly compare outcomes between patients treated by female and male surgeons, but within group heterogeneity from the categorical definitions of surgical volume and comorbidity might have caused residual confounding. Finally, we did not have information on the severity of the diseases for which surgery was performed (for example, cancer stage) or case complexity, though procedures were matched directly.
Comparison with other studies
The association between sex of treating physician and patient mortality has now been assessed in three settings—after surgery, among older patients admitted to an internal medicine service,6 and in outpatient care.27 Jerant et al found no association between the treating physician’s sex and mortality in an outpatient setting,27 but their study was conducted in a smaller cohort (roughly 20 000) of relatively healthy patients. By contrast, Tsugawa et al found small, but meaningful, differences in mortality based on sex of the treating physician.
Conclusions and implications
Using a comprehensive sample of patients undergoing a broad range of surgeries in all relevant specialties in Ontario, Canada, we found small differences in postoperative outcomes between patients treated by female and male surgeons, with those treated by female surgeons having a statistically significantly lower rate of 30 day mortality.
Although the relative difference in mortality is quite modest (4%), it has potentially significant clinical implications. Tsugawa et al found a 4% relative risk reduction in mortality for patients treated by female internists, compared with those treated by male internists.6 In our primary matched analysis, we found a similar 4% relative lower likelihood of the composite outcome (adjusted odds ratio 0.96) and a 12% lower likelihood of 30 day mortality (adjusted odds ratio 0.88) among patients treated by female surgeons.
That physician sex might have a greater effect on mortality rates in patients having surgery than those treated by internists is not surprising. Outcomes after surgery are probably more dependent on individual physician actions than those admitted to general internal medicine departments, which, proportionally, might depend more on patient characteristics. Moreover, much of the care provided by an inpatient internal medicine service is performed by a team.33 Tsugawa et al identified the primary treating physician as the individual who billed the largest proportion of services, which, on average, represented half of each patient’s billings.6 By contrast, although residents, fellows, hospitalists, and nurses participate in the operative and postoperative care of patients having surgery, the primary surgeon has a direct effect on patient outcomes. This has been found when assessing the effect of surgeon volume,34 specialisation,35 and skill.2 Surgeons are not completely independent of teams, so the differences we found could not only reflect differences in surgeon sex but also how they use the resources available to them (such as other consultants or intensive care units). Because we compared surgeons at the same hospital, who shared the same resources, mortality differences are less likely to be related to system or team based factors. The buffering effect of surgical teams should result in a bias towards the null. So the effect we observed might be an underestimate of the true effect.
These results do not support the preferential selection of a surgeon of either sex in clinical practice. Rather they indicate the need for further study in independent cohorts using different research methods. In addition, they support the examination of surgical outcomes and mechanisms related to physicians and the underlying processes and patterns of care to improve mortality, complications, and readmissions for all patients. Our findings have important implications for supporting sex equality and diversity in a traditionally male dominated profession.36
Web extra.
Extra material supplied by authors
We thank Refik Saskin and Lisa Ellison for their methodological and analytical support.
Contributors and sources: CJDW and RS conceived the study and were responsible for the design and development of the data analysis. CJDW, BR, ASD and RS were responsible for data assembly, collection, and analysis. CJDW, BR, NC, RKN, ASD, and RS were responsible for data interpretation. CJDW wrote the first draft. All authors provided significant revisions for important intellectual content and approved the final version. CJDW and RS affirm that the manuscript is an honest, accurate, and transparent account of the study being reported; that no important aspects of the study have been omitted; and that any discrepancies from the study as planned have been explained.
Funding: This study received no support from any organisation. CJDW is supported by the Canadian Institute of Health Research Banting and Best Doctoral Award. NGC is supported by the Sherif and MaryLou Hanna Chair in Surgical Oncology. RKN is supported by the Ajmera Family Chair in Urologic Oncology.
Competing interests: All authors have completed the ICMJE uniform disclosure form at www.icmje.org/coi_disclosure.pdf and declare: no support from any organisation for the submitted work; no financial relationships with any organisations that might have an interest in the submitted work in the previous three years; no other relationships or activities that could appear to have influenced the submitted work.
Data sharing: Statistical code is available from the RS (raj.satkunasivam@gmail.com). Given the administrative nature of the data, patients did not give informed consent for data sharing but all data are fully anonymised, and risk of identification is low.
Disclaimer: This study made use of de-identified data from the ICES Data Repository, which is managed by the Institute for Clinical Evaluative Sciences with support from its funders and partners: Canada’s Strategy for Patient-Oriented Research (SPOR), the Ontario SPOR Support Unit, the Canadian Institutes of Health Research and the Government of Ontario. The opinions, results and conclusions reported are those of the authors. No endorsement by ICES of any of its funders or partners is intended or should be inferred.
References
- 1.Thomas WE. Teaching and assessing surgical competence. Ann R Coll Surg Engl 2006;88:429-32. 10.1308/003588406X116927 pmid:17002841. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Birkmeyer JD, Finks JF, O’Reilly A, et al. Michigan Bariatric Surgery Collaborative. Surgical skill and complication rates after bariatric surgery. N Engl J Med 2013;369:1434-42. 10.1056/NEJMsa1300625 pmid:24106936. [DOI] [PubMed] [Google Scholar]
- 3.Frank E, Dresner Y, Shani M, Vinker S. The association between physicians’ and patients’ preventive health practices. CMAJ 2013;185:649-53. 10.1503/cmaj.121028 pmid:23569163. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Lurie N, Slater J, McGovern P, Ekstrum J, Quam L, Margolis K. Preventive care for women. Does the sex of the physician matter?N Engl J Med 1993;329:478-82. 10.1056/NEJM199308123290707 pmid:8332153. [DOI] [PubMed] [Google Scholar]
- 5.Bertakis KD, Helms LJ, Callahan EJ, Azari R, Robbins JA. The influence of gender on physician practice style. Med Care 1995;33:407-16. 10.1097/00005650-199504000-00007 pmid:7731281. [DOI] [PubMed] [Google Scholar]
- 6.Roter DL, Hall JA, Aoki Y. Physician gender effects in medical communication: a meta-analytic review. JAMA 2002;288:756-64. 10.1001/jama.288.6.756 pmid:12169083. [DOI] [PubMed] [Google Scholar]
- 7.Berthold HK, Gouni-Berthold I, Bestehorn KP, Böhm M, Krone W. Physician gender is associated with the quality of type 2 diabetes care. J Intern Med 2008;264:340-50. 10.1111/j.1365-2796.2008.01967.x pmid:18397244. [DOI] [PubMed] [Google Scholar]
- 8.Ali A, Subhi Y, Ringsted C, Konge L. Gender differences in the acquisition of surgical skills: a systematic review. Surg Endosc 2015;29:3065-73. 10.1007/s00464-015-4092-2 pmid:25631116. [DOI] [PubMed] [Google Scholar]
- 9.Tsugawa Y, Jena AB, Figueroa JF, Orav EJ, Blumenthal DM, Jha AK. Comparison of hospital mortality and readmission rates for Medicare patients treated by male vs female physicians. JAMA Intern Med 2017;177:206-13. 10.1001/jamainternmed.2016.7875 pmid:27992617. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Lillemoe KD. Surgical mentorship: a great tradition, but can we do better for the next generation?Ann Surg 2017;266:401-10. 10.1097/SLA.0000000000002374 pmid:28795992. [DOI] [PubMed] [Google Scholar]
- 11.Greenberg C. Sticky floors and glass ceilings: Association for Academic Surgery 2017 presidential address. J Surg Res 2017; [forthcoming]. [DOI] [PubMed] [Google Scholar]
- 12.Canadian Medical Association. Number and percent distribution of physicians by specialty and sex, Canada 2017 2017 https://www.cma.ca/Assets/assets-library/document/en/advocacy/06-spec-sex.pdf.
- 13.Feinberg AE, Porter J, Saskin R, Rangrej J, Urbach DR. Regional variation in the use of surgery in Ontario. CMAJ Open 2015;3:E310-6.. 10.9778/cmajo.20150014 pmid:26442230. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Govindarajan A, Urbach DR, Kumar M, et al. Outcomes of Daytime Procedures Performed by Attending Surgeons after Night Work. N Engl J Med 2015;373:845-53. 10.1056/NEJMsa1415994 pmid:26308685. [DOI] [PubMed] [Google Scholar]
- 15.von Elm E, Altman DG, Egger M, Pocock SJ, Gøtzsche PC, Vandenbroucke JP. STROBE Initiative. The strengthening the reporting of observational studies in epidemiology (STROBE) statement: guidelines for reporting observational studies. Ann Intern Med 2007;147:573-7. 10.7326/0003-4819-147-8-200710160-00010 pmid:17938396. [DOI] [PubMed] [Google Scholar]
- 16.Williams JI, Young W. A summary of studies on the quality of health care administrative databases in Canada. In: Goel V, Williams J, Anderson G, et al, eds. Patterns of Health Care in Ontario, Canada: The ICES Practice Atlas.Canadian Medical Association, 1996: 339-45. [Google Scholar]
- 17.Juurlink DN, Preyra C, Croxford R, et al. Canadian Institute for Health Information Discharge Abstract Database: A validation study.Institute for Clinical Evaluation Sciences, 2006. [Google Scholar]
- 18.Iron K, Zagorski BM, Sykora K, et al. Living and Dying in Ontario: An Opportunity for Improved Health Information. Toronto, ON: ICES Investigative Report, 2008. [Google Scholar]
- 19.Urbach DR, Govindarajan A, Saskin R, Wilton AS, Baxter NN. Introduction of surgical safety checklists in Ontario, Canada. N Engl J Med 2014;370:1029-38. 10.1056/NEJMsa1308261 pmid:24620866. [DOI] [PubMed] [Google Scholar]
- 20.Network LHI. Ontario’s Local Health Integration Networks Toronto. Queen's Printer for Ontario, 2014, Available from http://www.lhins.on.ca/. [Google Scholar]
- 21.Center HSRD. The Johns Hopkins ACG Case-Mix System Reference Manual Version 7.0.The Johns Hopkins University Bloomberg School of Public Health, 2005. [Google Scholar]
- 22.Austin PC, van Walraven C, Wodchis WP, Newman A, Anderson GM. Using the Johns Hopkins Aggregated Diagnosis Groups (ADGs) to predict mortality in a general adult population cohort in Ontario, Canada. Med Care 2011;49:932-9. 10.1097/MLR.0b013e318215d5e2 pmid:21478773. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Austin PC. Using the standardized difference to compare the prevalence of a binary variable between two groups in observational research. Commun Stat Simul Comput 2009;38:1228-34 10.1080/03610910902859574. [DOI] [Google Scholar]
- 24.Elliott HL. Post hoc analysis: use and dangers in perspective. J Hypertens Suppl 1996;14:S21-4, discussion S24-5. 10.1097/00004872-199609002-00006 pmid:8934374. [DOI] [PubMed] [Google Scholar]
- 25.Wakeam E, Hyder JA. Raising the bar for failure to rescue: critical appraisal of current measurement and strategies to catalyze improvement. JAMA Surg 2015;150:1023-4. 10.1001/jamasurg.2015.1933 pmid:26331677. [DOI] [PubMed] [Google Scholar]
- 26.Lou Z, Yan FH, Zhao ZQ, et al. The sex difference in basic surgical skills learning: a comparative study. J Surg Educ 2016;73:902-5. 10.1016/j.jsurg.2016.04.002 pmid:27184180. [DOI] [PubMed] [Google Scholar]
- 27.Grantcharov TP, Bardram L, Funch-Jensen P, Rosenberg J. Impact of hand dominance, gender, and experience with computer games on performance in virtual reality laparoscopy. Surg Endosc 2003;17:1082-5. 10.1007/s00464-002-9176-0 pmid:12728373. [DOI] [PubMed] [Google Scholar]
- 28.Byrnes JP, Miller DC, Schafter WD. Gender differences in risk taking: a meta-analysis. Psychol Bull 1999;125:367-83 10.1037/0033-2909.125.3.367. [DOI] [Google Scholar]
- 29.Dixon PR, Grant RC, Urbach DR. The impact of marketing language on patient preference for robot-assisted surgery. Surg Innov 2015;22:15-9. 10.1177/1553350614537562 pmid:24902683. [DOI] [PubMed] [Google Scholar]
- 30.Wirth MP, Hakenberg OW. Surgery and marketing: comparing different methods of radical prostatectomy. Eur Urol 2009;55:1031-3. 10.1016/j.eururo.2009.02.013 pmid:19232815. [DOI] [PubMed] [Google Scholar]
- 31.Schiavone MB, Kuo EC, Naumann RW, et al. The commercialization of robotic surgery: unsubstantiated marketing of gynecologic surgery by hospitals. Am J Obstet Gynecol 2012;207:174.e1-7. 10.1016/j.ajog.2012.06.050 pmid:22835493. [DOI] [PubMed] [Google Scholar]
- 32.Scally CP, Varban OA, Carlin AM, Birkmeyer JD, Dimick JB. Michigan Bariatric Surgery Collaborative. Video ratings of surgical skill and late outcomes of bariatric surgery. JAMA Surg 2016;151:e160428 10.1001/jamasurg.2016.0428 pmid:27074114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Coyne J. Replace male doctors with female ones and save at least 32 000 lives each year? PLoS Blogs. 2017. http://blogs.plos.org/mindthebrain/2017/02/21/replace-male-doctors-with-female-ones-and-save-at-least-32000-lives-each-year/
- 34.Birkmeyer JD, Stukel TA, Siewers AE, Goodney PP, Wennberg DE, Lucas FL. Surgeon volume and operative mortality in the United States. N Engl J Med 2003;349:2117-27. 10.1056/NEJMsa035205 pmid:14645640. [DOI] [PubMed] [Google Scholar]
- 35.Sahni NR, Dalton M, Cutler DM, Birkmeyer JD, Chandra A. Surgeon specialization and operative mortality in United States: retrospective analysis. BMJ 2016;354:i3571 10.1136/bmj.i3571 pmid:27444190. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Dageforde LA, Kibbe M, Jackson GP. Recruiting women to vascular surgery and other surgical specialties. J Vasc Surg 2013;57:262-7. 10.1016/j.jvs.2012.07.029 pmid:23141685. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.