This study attempts to determine the association between measured candidate factors at the time of an Irish selection and assessment outcomes in surgical training.
Key Points
Question
Is there an association between measured candidate factors at selection and future performance in surgical training?
Findings
A cohort study was conducted including 303 successful applicants to the Royal College of Surgeons in Ireland Core Surgical Training program. Performance at the time of selection, measured across technical and nontechnical domains in a multimodal fashion, was significantly associated with future performance in the workplace and in simulation-based assessments.
Meaning
In this study, a multimodal selection framework could be used to recruit surgical trainees in a standardized and transparent manner, conforming to a predictive validity paradigm.
Abstract
Importance
Selection processes for surgical training should aim to identify candidates who will become competent independent practitioners and should aspire to high standards of reliability and validity.
Objective
To determine the association between measured candidate factors at the time of an Irish selection and assessment outcomes in surgical training, examined via rate of progression to Higher Specialist Training (HST), attrition rates, and performance as assessed through a multimodal framework of workplace-based and simulation-based assessments.
Design, Setting, and Participants
This retrospective observational cohort study included data from all successful applicants to the Royal College of Surgeons in Ireland (RCSI) national Core Surgical Training (CST) program. Participants included all trainees recruited to dedicated postgraduate surgical training from 2016 to 2020. These data were analyzed from July 11, 2016, through July 10, 2022.
Exposures
Selection decisions were based on a composite score that was derived from technical aptitude assessments, undergraduate academic performance, and a 4-station multiple mini-interview.
Main outcomes and measures
Assessment data, attrition rates, and rates of progression to HST were recorded for each trainee. CST performance was assessed using workplace-based and simulation-based technical and nontechnical skill assessments. Potential associations between selection and assessment measures were explored using Pearson correlation, logistic regression, and multiple linear-regression analyses.
Results
Data were available for 303 trainees. Composite scores were positively associated with progression to HST (odds ratio [OR], 1.09; 95% CI, 1.05-1.13). There was a weak positive correlation, ranging from 0.23 to 0.34, between scores and performance across all CST assessments. Multivariable linear regression analysis showed technical aptitude scores at application were associated with future operative performance assessment scores, both in the workplace (β = 0.31; 95% CI, 0.14-0.48) and simulated environments (β = 0.57; 95% CI, 0.33-0.81). There was evidence that the interpersonal skills interview station was associated with future performance in simulated communication skill assessments (β = 0.55; 95% CI, 0.22-0.87).
Conclusions and Relevance
In this study, performance at the time of Irish national selection, measured across technical and nontechnical domains in a multimodal fashion, was associated with future performance in the workplace and in simulated environments. Future studies will be required to explore the consequential validity of selection, including potential unintended effects of selection and ranking on candidate performance.
Introduction
Obtaining a postgraduate surgical training position is a competitive process worldwide.1,2 Selection processes vary widely.3 Common criteria informing selection include letters of recommendation,4,5,6 prior academic performance,7 results in national licensing examinations1 and professional examinations,8 student performance evaluations,9 research output,4,5 prior experience,10 personal statements,11 and performance at interview.5,9,10,12 Selection can be considered as a high-stakes assessment process and should conform to high standards of validity.13 In particular, processes should follow a predictive paradigm, aiming to identify candidates who will become competent medical professionals.14 Industry evaluations of selection processes further describe return on investment across 3 domains; administrative efficiency, the future performance of selected candidates, and subsequent attrition rates.15 Screening processes are costly1 and attrition rates from general surgery programs can be as high as 30%.16 Deficits in medical knowledge can lead to delayed progression17 and shortfalls in interpersonal skills or professionalism may be even more challenging to rectify.18 Of further consideration for surgical specialties is technical skill or aptitude.19 A multifactorial approach to trainee selection could theoretically select for high-performing residents.
While a number of studies have examined factors associated with success at an undergraduate level,20,21 the validity of selection metrics in postgraduate training are less well explored.13,22 Predicting the future performance of surgical trainees is challenging.13 A 2013 meta-analysis of selection strategies across all medical and surgical specialties showed minimal or no correlation between interviews, reference letters, or deans’ letters and future resident performance.23 Performance in the US Medical Licensing Examination correlates weakly to moderately with future in-training examination performance, with an observed weaker correlation between US Medical Licensing Examination scores and performance in surgical specialties.24 While performance in the Membership of the Royal College of Surgeons examination in the United Kingdom and Ireland is associated positively with likelihood of selection onto surgical training programs, further predictive validity evidence is lacking.25 Data-driven approaches to trainee selection are therefore increasingly being explored.15,26,27,28,29
The objective of this study was to determine the potential association between measured candidate factors at the time of a multimodal national selection process for surgical trainees in Ireland and future success in early surgical training, examined via rate of progression to higher specialist training, performance as assessed through workplace-based and simulation-based assessments, and attrition rates.
Methods
This study is reported according to the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) reporting guidelines30 and was approved by the ethics committee of the University of Medicine and Health Sciences at the Royal College of Surgeons in Ireland. Informed consent for the collection and processing of training data were obtained from trainees at the time of recruitment.
Setting and Participants
Selection into Core Surgical Training (CST) (the first 2 years of dedicated, undifferentiated postgraduate surgical training) is a centralized process conducted by the Royal College of Surgeons in Ireland (RCSI). Candidates undergo psychomotor aptitude testing (Table 1) and a 4-station multiple mini-interview evaluating clinical judgment, interpersonal skills, professional development, and suitability for specialist training. Scores from these structured interview stations and aptitude assessments are combined with a score awarded for the candidate’s undergraduate medical degree centile to produce a total score (Table 1).
Table 1. Analyzed Selection Metrics and Performance Outcomes.
| Domain | Description | Method/measures |
|---|---|---|
| Selection domain (relative scoring %) | ||
| Centile | Centile score awarded in a candidate’s undergraduate medical degree | Prescored, scaled to 15-point measure |
| Surgical aptitude (15) | Psychomotor skills, visuospatial ability, perception | 3 Equally weighted validated assessments of baseline aptitude, PicSOr, visuospatial aptitude, grooved pegboard |
| Clinical judgment (15) | 2 Structured sequential clinical scenarios | Structured interview |
| Interpersonal skills (15) | Communication, teamwork and leadership; crisis management, negotiation, and conflict resolution | Structured interview |
| Professional development (15) | Clinical research projects, attendance at meetings and courses, audit projects, and teaching activities | Structured interview |
| Suitability for specialist training (25) | References, motivation and drive; knowledge of specialty, time and stress management, work ethic, and professionalism | Structured interview |
| Assessment domain | ||
| Attrition | Withdrawal from the CST program | Binary outcome measure |
| Progression | Successful progression to HST | Binary outcome measure |
| Operative performance | SSAOP assessment scores | Mean scores awarded across all submitted SSAOP assessments over 2 y of training, scored using global (not procedure specific) checklists on a scale from 0-150 |
| Simulated operative performance | OSS assessment scores | Mean scores awarded across 8 technical skill OSCE stations conducted in annual sessions (4 stations at each assessment), scored using predefined procedure or task-specific checklists |
| Nontechnical skill performance | Communication skill assessment scores | Mean scores awarded across 8 nontechnical skill OSCE stations conducted in annual sessions, scored using predefined scenario-specific checklists and the Calgary Cambridge assessment tool31 |
| Clinical performance | SCA assessment scores | Mean scores awarded across all submitted SCA assessments, scored using generic (not diagnosis-specific) checklists from 0-100 |
| Operative experience | eLogbook scores | Scores calculated algorithmically, incorporating total procedural numbers, case complexity, and level of autonomy. Mean standardized scores were calculated, awarded across 2 rotations in year 1 (scored 0-25) and 1 rotation in year 2 (scored 0-50). |
| Trainee assessment reports | Trainer evaluations of performance in the workplace | Mean standardized scores awarded across 3 rotations over 2 y, scored from 0-50 across 2 rotations in year 1 and 0-100 in year 2 |
Abbreviations: CST, Core Surgical Training; eLogbook, electronic logbook; HST, Higher Specialist Training; OSCE, Objective Structured Clinical Examination; OSS, operative surgical skill; PicSOr, Pictorial Surface Orientation testing; SCA, structured clinical assessment; SSAOP, Supervised Structured Assessment of Operative Performance.
Successful candidates undergo 2 years of CST. Trainees complete 2 rotations of different specialties in their first year of training, followed by a further year in their intended specialty, usually in a different training location. Performance during this time is assessed through workplace-based assessments of operative and clinical performance, simulated assessments of technical and nontechnical skills, trainer evaluation reports, and operative experience—recorded and stored using an electronic logbook. On completion of CST, candidates are eligible to apply for a Higher Specialist Training (HST) position in their surgical specialty of interest. Selection at this stage is again a competitive process32,33 and is informed by the following: (1) CST Trainer Assessment Reports, (2) electronic logbook scores, (3) simulation-based communication skill and objective surgical skill assessments, (4) performance in part B of the Intercollegiate Membership of the Royal College of Surgeons examination, and (5) performance in a further standardized multi-station interview. Information on the progression pathway is available.34
A retrospective analysis of prospectively collected application data from successful applicants to RCSI’s CST program from 2016 through 2020 inclusive was performed. Assessment data, attrition rates, and progression to HST were recorded until July 2022 inclusive.
Variables, Data Sources, and Measurement
Potential explanatory variables explored are outlined in Table 1. Psychomotor aptitude testing was conducted using 3 validated assessments: Pictorial Surface Orientation testing,35,36,37,38,39,40,41,42 digitized visuospatial assessments,43,44 and dexterity testing using a grooved pegboard.45 Pictorial Surface Orientation testing requires candidates to orientate a spinning arrowhead so that it rests perpendicular to a displayed cube face; scores are calculated based on speed and accuracy with Pearson product moment correlation coefficients used to compare the orientation chosen by the candidate with the theoretically correct orientation.35 Four visuospatial aptitude tests are used at selection, derived from a Kit of Factor Referenced Cognitive Tests developed by Ekstrom et al44: surface development, map planning, card rotation, and cube comparison.43,44 Manual dexterity is assessed using a grooved pegboard, requiring candidates to orientate pegs until they slot into keyhole-shaped openings; scores are awarded for speed with deductions accrued for dropped pegs.46
Outcome variables are summarized in Table 1. Workplace-based assessments of operative performance are scored using generic checklists (supervised structured assessment of operative skill). Workplace-based assessments of clinical performance—for example, history and physical examination in the outpatient setting—are also conducted (structured clinical assessments). Global assessments of overall performance in a training post are completed at 6-monthly intervals (trainee assessment reports). A 4-station simulated skills assessment is conducted at the end of each year of training (operative surgical skill assessment). Communication skills are also assessed at these time points across 4 structured stations. Due to the COVID-19 pandemic, trainees recruited in 2019 and 2020 (n = 124) using predefined scenario-specific checklists and the Calgary Cambridge assessment tool31 had disruptions to their annual assessments and undertook 1 sitting of the operative surgical skill and communication skill assessments only. Example assessment forms are available in the eAppendix in Supplement 1. Scoring of electronic logbooks as a proxy measure of operative experience has been previously described,47 but briefly is calculated based on the number of procedures performed, the degree of trainee involvement, and the complexity of the logged procedures.
Bias and Sample Size
Potential explanatory variables were assessed prior to outcome variables. Assessors of simulation-based assessments and workplace-based performance were blinded as to the scores achieved by trainees at selection. Workplace-based assessors are registered trainers with the Royal College of Surgeons in Ireland, registered on the relevant specialist division of the Irish Medical Council, registered on a professional competence (continuous professional development) program, and practicing at a consultant (attending) surgeon level. Assessors for simulation-based assessments undergo rater training with standard setting conducted prior to each assessment session. Based on a minimum of 193 participants, this study has adequate power to detect a weak correlation (0.2 or more) between independent variables and measured outcome variables.48
Quantitative Variables and Statistical Analysis
Total scores were summarized using means and SDs. Potential associations between explanatory variables and outcome measures were explored using Pearson correlation coefficients, logistic, and linear regression analyses.49 Multicollinearity was assessed using variance inflation factor. Results are presented as β coefficients or odds ratios (ORs), 95% CIs, and P values. Variance in assessment scores across surgical specialties and training locations was explored using analysis of variance. A correlation coefficient of 0 to 0.19 was regarded as very weak, 0.20 to 0.39 as weak, 0.40 to 0.59 as moderate, 0.60 to 0.79 as strong, and 0.8 to 1 as very strong.50 All statistical analysis was performed using SPSS version 23 (SPSS).
Results
Selection data from 303 successful applicants to RCSI’s CST program from 2016 to 2020 were analyzed. Mean scores achieved during the recruitment process are outlined in Table 2.
Table 2. Mean Scores Awarded to Candidates (N = 303) Successful in Obtaining a Core Surgical Training Post.
| Measure | Mean (SD) [range] |
|---|---|
| Selection metric | |
| Surgical aptitudea (0-15) | 9.75 (2.48) [3.00-14.67] |
| Perception (PicSOr) (0-5) | 3.39 (1.73) [0.00-5.00] |
| Visuospatial aptitude (0-5) | 2.31 (1.07) [0.00-4.82] |
| Manual dexterity (pegboard) (0-5) | 4.04 (0.94) [0.00-5.00] |
| Centile (0-15) | 9.67 (3.98) [0.15-15.00] |
| Clinical judgment (0-15) | 12.09 (1.73) [5.75-15.00] |
| Interpersonal skills (0-15) | 11.49 (1.63) [5.25-15.00] |
| Professional development (0-15) | 11.09 (1.91) [4.50-15.00] |
| Suitability for specialty training (0-25) | 19.65 (2.27) [11.90-25.00] |
| Total score (0-100) | 73.43 (6.79) [56.93-89.54] |
| Assessment method | |
| Operative experience (eLogbook) (0-25) | 18.62 (4.25) [2.88-25.00] |
| Workplace-based operative performance assessments (SSAOP) (0-50) | 37.68 (3.64) [21.40-44.50] |
| Workplace-based clinical assessments (SCA) (0-30) | 26.32 (2.53) [15.00-30.00] |
| Standardized global trainer evaluations (TAR) | 45.70 (3.49) [34.75-61.65] |
| Simulation-based assessments of technical skill (OSS) (0-50) | 38.71 (5.24) [17.55-47.36] |
| Simulation-based assessments of Communication skills (0-50) | 38.89 (4.37) [23.80-48.15] |
Abbreviations: eLogbook, electronic logbook; PicSOr, Pictorial Surface Orientation testing; OSS, Operative Surgical Skill; SCA, Structured Clinical Assessment; SSAOP, Supervised Structured Assessment of Operative Performance; TAR, Trainee Assessment Report.
Surgical aptitude scores are the sum of perception, visuospatial aptitude, and manual dexterity assessment scores. The total score is the sum of surgical aptitude, centile, clinical judgment, interpersonal skills, professional development, and suitability for specialist training scores.
Relationships With Other Variables
Attrition and Progression
Of 303 trainees, 276 completed the CST program (91.09%). Of the 276 trainees who completed CST, 215 subsequently applied to HST (77.89%) and 172 progressed to HST (62.32%). Of these, 121 progressed to HST directly from CST (70.34%), while a further 51 candidates did not progress directly from CST, but did achieve an HST place in subsequent years. The total progression rate from CST to HST was 56.76%. The withdrawal rate was 29.04%. Univariable logistic regression showed no evidence of an association between total score at application and completion of the CST program (OR, 1.02; 95% CI, 0.96-1.08; P = .48) (Table 3). There was evidence of an association with progression to HST (OR, 1.09; 95% CI, 1.05-1.13; P < .001) and the same for direct progression from CST to HST on first attempt (OR, 1.09; 95% CI, 1.05-1.13; P < .001) (Table 3). When the intended specialty of HST application was included in a multiple logistic regression model, significant associations were again observed between total score and progression to HST (OR, 1.13; 95% CI, 1.06-1.20; P < .001) and direct HST progression on first attempt (OR, 1.10; 95% CI, 1.04-1.15; P < .001).
Table 3. Multiple Logistic Regression Analyses Demonstrating the Association Between Application Centile, Psychomotor, and Structured Interview Scores and Outcome Measures.
| Measure | OR (95% CI) | P value |
|---|---|---|
| Completion of CST | ||
| Centile | 0.93 (0.84-1.04) | .22 |
| Surgical aptitude | 1.07 (0.91-1.25) | .43 |
| Clinical judgment | 1.21 (0.95-1.53) | .12 |
| Interpersonal skills | 0.90 (0.69-1.18) | .45 |
| Professional development | 1.06 (0.85-1.32) | .61 |
| Suitabilitya | 1.22 (1.02-1.45) | .03 |
| Progression to HST | ||
| Centile | 1.05 (0.98-1.12) | .16 |
| Surgical aptitude | 0.96 (0.87-1.07) | .49 |
| Clinical judgmenta | 1.48 (1.26-1.74) | <.001 |
| Interpersonal skills | 1.06 (0.89-1.25) | .52 |
| Professional developmenta | 1.15 (1.01-1.32) | .04 |
| Suitabilitya | 1.13 (1.01-1.27) | .04 |
| Direct progression to HST b | ||
| Centile | 1.05 (0.98-1.12) | .12 |
| Surgical aptitudea | 1.19 (1.07-1.32) | <.001 |
| Clinical judgmenta | 1.39 (1.18-1.64) | <.001 |
| Interpersonal skills | 1.18 (0.99-1.36) | .07 |
| Professional development | 1.06 (0.92-1.21) | .43 |
| Suitabilitya | 1.01 (0.89-1.13) | <.001 |
Abbreviations: CST, core surgical training; HST, Higher Specialist Training; OR, odds ratio.
Statistically significant associations.
Direct progression to HST indicates the likelihood of progression to higher specialist training on first application from core surgical training.
On multiple logistic regression analyses, scores in the suitability for specialist training\interview were the only factor at application positively associated with CST completion (Table 3), while scores in clinical judgment, professional development, and suitability for specialist training interviews were positively associated with HST progression. Scores in surgical aptitude (baseline psychomotor assessments), along with scores in 4 of the 5 interview stations, were positively associated direct progression to HST (Table 3).
Associations With In-Training Assessment Scores
Mean scores awarded in CST assessments are outlined in Table 2. Total score at application was significantly and positively correlated with performance in all future CST assessments (r = 0.22-0.34) (eTable 1 in Supplement 1). Correlations between scores awarded at application and scores in future performance assessments are further outlined in eTable 1 in Supplement 1.
Multivariable linear regression again demonstrated a number of significant associations between assessed factors at application and future performance (Table 4). When examining the standardized coefficients, surgical aptitude was the largest contributor of the independent variables to scores in both workplace-based and simulation-based assessments of operative skill. Scores awarded in interpersonal skill interview stations were the largest contributor to performance in subsequent communication skill assessments. There was observable and significant variance in assessment scores across training locations and surgical specialties (eTables 2 and 3 in Supplement 1).
Table 4. Measured Candidate Factors at Selection and Associations With Future Performance on Multivariable Linear-Regression Analysis.
| Selection domain | Core surgical training assessment | |||||
|---|---|---|---|---|---|---|
| Mean standardized eLogbook | Mean SSAOP | Mean SCA | Mean standardized TAR | Mean OSS | Mean communication skills | |
| Centile | ||||||
| β (95% CI) | 0.19 (0.07-0.31) | 0.14 (0.04-0.25) | 0.09 (0.02-0.17) | 0.12 (0.03-0.22) | 0.17 (0.02-0.32) | 0.17 (0.05-0.30) |
| P value | .002 | .01 | .01 | .01 | .02 | .01 |
| Surgical aptitude | ||||||
| β (95% CI) | 0.22 (0.02-0.41) | 0.31 (0.14-0.48) | 0.40 (0.25-0.56) | 0.57 (0.33-0.81) | 0.57 (0.33-0.81) | 0.26 (0.06-0.46) |
| P value | .03 | <.001 | .03 | <.001 | <.001 | .01 |
| Clinical judgment | ||||||
| β (95% CI) | 0.22 (−0.06 to 0.51) | −0.26 (−0.51 to 0.02) | −0.14 (−0.31 to 0.04) | −0.22 (−0.45 to 0.01) | 0.24 (−0.11 to 0.59) | −0.38 (−0.68 to −0.09) |
| P value | .12 | .04 | .13 | .06 | .18 | .01 |
| Interpersonal skills | ||||||
| β (95% CI) | 0.62 (0.31-0.93) | 0.34 (0.07-0.61) | 0.21 (0.02-0.39) | 0.54 (0.29-0.79) | 0.60 (0.22-0.99) | 0.55 (0.22-0.87) |
| P value | <.001 | .01 | .03 | <.001 | .002 | .001 |
| Professional development | ||||||
| β (95% CI) | 0.12 (−0.14 to 0.38) | 0.18 (−0.05 to 0.40) | 0.18 (0.03-0.34) | 0.22 (0.01-0.42) | 0.07 (−0.25 to 0.39) | −0.06 (−0.33 to 0.21) |
| P value | .36 | .12 | .02 | .04 | .67 | .69 |
| Suitability | ||||||
| β (95% CI) | 0.18 (−0.04 to 0.39) | 0.14 (−0.05 to 0.33) | 0.06 (−0.07 to 0.19) | 0.14 (−0.04 to 0.31) | 0.28 (0.00-0.55) | 0.37 (0.14-0.60) |
| P value | 0.11 | .16 | .38 | .12 | .05 | .002 |
Abbreviations: eLogbook, electronic logbook; SCA, Structured Clinical Assessment; SSAOP, Supervised Structured Assessment of Operative Performance; OSS, Objective Surgical Skills; TAR, Trainee Assessment Report.
Discussion
Performance in a multifaceted selection process is not associated with completion of CST, but is positively associated with subsequent progression to HST in the trainee’s chosen subspecialty. Scores awarded at selection are significantly associated with the future performance across a range of workplace-based and simulation-based assessments. Surgical aptitude, as measured using psychomotor assessments, was the largest contributor of measured factors to future operative performance assessment scores. Candidate performance in the interpersonal skills station of a multistation interview was the most strongly associated selection variable with performance in future simulated communication skill assessments.
Candidate performance at the time of selection was not associated with withdrawal from the CST program. A low attrition rate may act as a useful measure of the efficacy of a selection process.3 However, prior work from our institution suggests that trainees do not leave surgical training due to self-perceived skill-program mismatch, but rather due to negative perceptions regarding program delivery, training atmosphere, lifestyle considerations, peer influence, and concerns regarding career progression.51 Therefore, it is appropriate to view attrition as influenced by both candidate and program factors. It is also important to consider consequential effects of selection on future performance when using progression as a measure of success. In Ireland, rank achieved at selection informs the location and specialties that trainees rotate through; a higher ranking leads to greater choice. Candidates who perform less well at selection are less likely to undertake their chosen rotations. This may lead to demotivation with effects on performance, and ultimately, a lower likelihood of progression. The candidate experience of selection, ranking, and the perceived impact on future training should be explored.
A number of significant associations are noted in this study between measured attributes at the time of selection and performance in various workplace-based and simulation-based assessments. Individual correlations between selection and future performance measures are weak. The CST selection process sits within a complex system of recruitment, training, assessment, and progression. Experiences of training can vary greatly for trainees. At our institution, we endeavor to measure operative volume and autonomy through surgical logbooks47 and have previously demonstrated the association of baseline technical aptitude assessments with operative performance when controlling for experience.29 It would be naïve to assume, however, that operative volume is the sole measure of a positive training experience. The effects of individual trainers,52 the institutional training environment,53 engagement with deliberate practice,54 and innumerable further variables contribute to trainee performance. The inherent variability in trainee experience postselection may explain in part why studies to date have struggled to relate selection metrics with performance outcomes.23
Variance in assessment scores was observable across training locations and specialties (eTable 1 in Supplement 1). There are several factors that could account for observed differences, including inter-rater variability, differences in operative exposure or autonomy granting, or differences in case mix across hospital sites. Contributing factors to assessment score variance can be explored statistically.55 Variance across training experiences could subsequently contribute to the likelihood of progression to HST. It is reasonable to anticipate that candidates who have undertaken certain posts during their training may be more likely to obtain a place in HST; whether this relates to the experiences gained in certain posts leading to better assessment outcomes or the perception of said posts by interviewers at the time of HST recruitment should be studied. Trainees rotate across specialties and locations during CST. As trainees rank their own rotation preferences, it is likely that trainees who perform well at selection seek out positive training experiences. Candidate self-selection could account for a significant proportion of observed variance in assessment outcomes and likelihood of progression. The experience of trainees as they progress through the process of selection and training is worthy of further qualitative study, and can be informed by social validity principles.56 The relative contributions of trainee aptitude and training experiences to assessment outcomes and progression are challenging, but can and should be explored by future studies.
The association between measured factors and future performance is just one source of validity evidence for the selection program. The content validity (ie, the extent to which measured factors accurately reflect the construct of interest)57 and response process validity (ie, the fit between the construct of interest and the process of selection as it is actually engaged in) can be further analyzed both quantitatively and qualitatively. For example, in this study, the suitability for specialty training mini-interview station scores demonstrated a relatively wide range and SD. Scores in this station correlate with likelihood of progression. The response process validity of this station could be evaluated by qualitatively assessing rater’s thoughts and actions during candidate interview to determine how raters come to decisions on ranking.58 This would be important to ensure that stations representing as broad a construct as suitability do not represent a source of bias, and subsequently, contribute to a lack of candidate diversity.
Beyond validity, there are further considerations regarding the utility of RCSI’s selection program across reliability, educational impact, acceptability, feasibility and cost-effectiveness domains.59 Further research will be required to analyze the additional potential benefit of assessing other skills or attributes at the time of selection, through situational judgment testing,60 assessment of grit,61 or a more nuanced quantification of academic record. Lastly, differential attainment across demographic groups62 should be explored to ensure that the selection process meets high standards of diversity, equity, and inclusion. While RCSI publishes annual reports on the gender, country of origin, and further demographic data on CST appointees,63 the impact of such attributes on candidate selection and progression was not explored by this study.
Approaches to the selection of surgical trainees vary across jurisdictions,3 ranging from highly standardized and centralized processes, to completely decentralized local institutional recruitment.3,10 Technical skill or aptitude assessments are rarely used to inform recruitment. Some studies have explored the use of simulated surgical task assessments to stratify candidates.64,65,66 Lund et al67 report that simulation-based assessment scores were associated with higher future milestone assessment scores (β = 0.45; P = .03). Assessment of performance in surgical tasks or procedures, however, may risk selecting candidates based on experience rather than aptitude. Using psychomotor tests of aptitude, far removed from the representation of real-world tasks, may represent a more valid method for assessing a candidate’s technical learning ability. Such aptitude tests are associated with the rate of skill acquisition.19,68,69 The findings of this study support the use of these inexpensive tests to inform selection decisions.
Limitations
Performance data are available only for candidates who were successful in obtaining a CST number at the Royal College of Surgeons in Ireland. Selected candidates are those who possess the same sought-after characteristics, and common method variance may therefore account for some of the correlation observed between selection and in-training assessments.13 The reliability of the findings above relies on the accuracy and validity of trainer- and trainee-submitted data. A broad range of knowledge, skill, and attitude domains likely contribute to the complex concept of future competence; it is difficult, if not impossible, to account for all of these known and unknown contributors at the time of selection.
Conclusions
In this study, performance at the time of selection for surgical training, measured across technical and nontechnical domains, was associated with future performance. Performance at the time of selection is further positively associated with progression to higher specialist training in the candidate’s preferred surgical specialty. Of measured factors at selection, surgical aptitude was the most strongly associated with future operative performance, while the interpersonal skills station of a multi-station interview was the most strongly associated with communication skill assessment scores. Future studies will explore the consequential validity of selection, including potential unintended downstream effects of selection and ranking on candidate performance.
eAppendix. Assessment tools
eTable 1. Correlation between applicant factors evaluated at the time of selection, and future performance in assessments throughout Core Surgical Training, presented as Pearson correlation coefficients (r) and P-value
eTable 2. Analysis of Variance (ANOVA) demonstrating variance in assessment scores across training locations
eTable 3. Analysis of Variance (ANOVA) demonstrating variance in assessment scores across sub-specialties
Data sharing statement
References
- 1.Gardner AK, Smink DS, Scott BG, Korndorffer JR Jr, Harrington D, Ritter EM. How much are we spending on resident selection? J Surg Educ. 2018;75(6):e85-e90. doi: 10.1016/j.jsurg.2018.10.001 [DOI] [PubMed] [Google Scholar]
- 2.Solomou G, Murphy S, Bandyopadhyay S, et al. Neurosurgery specialty training in the UK: what you need to know to be shortlisted for an interview. Ann Med Surg (Lond). 2020;57:287-290. doi: 10.1016/j.amsu.2020.07.047 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Hagelsteen K, Pedersen H, Bergenfelz A, Mathieu C. Different approaches to selection of surgical trainees in the European Union. BMC Med Educ. 2021;21(1):363. doi: 10.1186/s12909-021-02779-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.LaGrasso JR, Kennedy DA, Hoehn JG, Ashruf S, Przybyla AM. Selection criteria for the integrated model of plastic surgery residency. Plast Reconstr Surg. 2008;121(3):121e-125e. doi: 10.1097/01.prs.0000299456.96822.1b [DOI] [PubMed] [Google Scholar]
- 5.Makdisi G, Takeuchi T, Rodriguez J, Rucinski J, Wise L. How we select our residents–a survey of selection criteria in general surgery residents. J Surg Educ. 2011;68(1):67-72. doi: 10.1016/j.jsurg.2010.10.003 [DOI] [PubMed] [Google Scholar]
- 6.Marwan Y, Waly F, Algarni N, Addar A, Saran N, Snell L. The role of letters of recommendation in the selection process of surgical residents in Canada: a national survey of program directors. J Surg Educ. 2017;74(4):762-767. doi: 10.1016/j.jsurg.2017.01.006 [DOI] [PubMed] [Google Scholar]
- 7.Filiberto AC, Cooper LA, Loftus TJ, Samant SS, Sarosi GA Jr, Tan SA. Objective predictors of intern performance. BMC Med Educ. 2021;21(1):77. doi: 10.1186/s12909-021-02487-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Lammy S. Core surgical training applications: making the cut. Bulletin of The Royal College of Surgeons of England. 2014;95:48-49. doi: 10.1308/003588414X1381402167691 [DOI] [Google Scholar]
- 9.Janis JE, Hatef DA. Resident selection protocols in plastic surgery: a national survey of plastic surgery program directors. Plast Reconstr Surg. 2008;122(6):1929-1939. doi: 10.1097/PRS.0b013e31818d20ae [DOI] [PubMed] [Google Scholar]
- 10.Selber JC, Tong W, Koshy J, Ibrahim A, Liu J, Butler C. Correlation between trainee candidate selection criteria and subsequent performance. J Am Coll Surg. 2014;219(5):951-957. doi: 10.1016/j.jamcollsurg.2014.07.942 [DOI] [PubMed] [Google Scholar]
- 11.Gardner AK, Steffes CP, Nepomnayshy D, et al. Selection bias: examining the feasibility, utility, and participant receptivity to incorporating simulation into the general surgery residency selection process. Am J Surg. 2017;213(6):1171-1177. doi: 10.1016/j.amjsurg.2016.09.029 [DOI] [PubMed] [Google Scholar]
- 12.Patel NM, Khajuria A, Khajuria A. Utility of a webinar to educate trainees on UK core surgical training (CST) selection—a cross sectional study and future implications amidst the COVID-19 pandemic. Ann Med Surg (Lond). 2020;59:35-40. doi: 10.1016/j.amsu.2020.08.054 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Roberts C, Khanna P, Rigby L, et al. Utility of selection methods for specialist medical training: a BEME (best evidence medical education) systematic review: BEME guide no. 45. Med Teach. 2018;40(1):3-19. doi: 10.1080/0142159X.2017.1367375 [DOI] [PubMed] [Google Scholar]
- 14.Patterson F, Ferguson E, Zibarras L. Selection into medical education and training. In: Understanding Medical Education. Swanwick T, Forrest K, O'Brien BC, eds. The Association for the Study of Medical Education; 2018:375-388. [Google Scholar]
- 15.Gardner AK, Grantcharov T, Dunkin BJ. The science of selection: using best practices from industry to improve success in surgery training. J Surg Educ. 2018;75(2):278-285. doi: 10.1016/j.jsurg.2017.07.010 [DOI] [PubMed] [Google Scholar]
- 16.Longo WE, Seashore J, Duffy A, Udelsman R. Attrition of categoric general surgery residents: results of a 20-year audit. Am J Surg. 2009;197(6):774-778. doi: 10.1016/j.amjsurg.2008.06.038 [DOI] [PubMed] [Google Scholar]
- 17.Yaghoubian A, Galante J, Kaji A, et al. General surgery resident remediation and attrition: a multi-institutional study. Arch Surg. 2012;147(9):829-833. doi: 10.1001/archsurg.2012.1676 [DOI] [PubMed] [Google Scholar]
- 18.Torbeck L, Canal DF. Remediation practices for surgery residents. Am J Surg. 2009;197(3):397-402. doi: 10.1016/j.amjsurg.2008.10.008 [DOI] [PubMed] [Google Scholar]
- 19.Gangemi A, Shi K, Durgam S, et al. A call to evaluate manual dexterity of prospective surgical trainees. J Surg Res. 2022;279:518-525. doi: 10.1016/j.jss.2022.06.025 [DOI] [PubMed] [Google Scholar]
- 20.Eva KW, Reiter HI, Trinh K, Wasi P, Rosenfeld J, Norman GR. Predictive validity of the multiple mini-interview for selecting medical trainees. Med Educ. 2009;43(8):767-775. doi: 10.1111/j.1365-2923.2009.03407.x [DOI] [PubMed] [Google Scholar]
- 21.Patterson F, Knight A, Dowell J, Nicholson S, Cousans F, Cleland J. How effective are selection methods in medical education? a systematic review. Med Educ. 2016;50(1):36-60. doi: 10.1111/medu.12817 [DOI] [PubMed] [Google Scholar]
- 22.Tolan AM, Kaji AH, Quach C, Hines OJ, de Virgilio C. The electronic residency application service application can predict accreditation council for graduate medical education competency-based surgical resident performance. J Surg Educ. 2010;67(6):444-448. doi: 10.1016/j.jsurg.2010.05.002 [DOI] [PubMed] [Google Scholar]
- 23.Kenny S, McInnes M, Singh V. Associations between residency selection strategies and doctor performance: a meta-analysis. Med Educ. 2013;47(8):790-800. doi: 10.1111/medu.12234 [DOI] [PubMed] [Google Scholar]
- 24.Panda N, Bahdila D, Abdullah A, Ghosh AJ, Lee SY, Feldman WB. Association between USMLE step 1 scores and in-training examination performance: a meta-analysis. Acad Med. 2021;96(12):1742-1754. doi: 10.1097/ACM.0000000000004227 [DOI] [PubMed] [Google Scholar]
- 25.Scrimgeour DSG, Cleland J, Lee AJ, et al. Impact of performance in a mandatory postgraduate surgical examination on selection into specialty training. BJS Open. 2017;1(3):67-74. doi: 10.1002/bjs5.7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Louridas M, Szasz P, de Montbrun S, Harris KA, Grantcharov TP. Can we predict technical aptitude? Ann Surg. 2016;263(4):673-691. doi: 10.1097/SLA.0000000000001283 [DOI] [PubMed] [Google Scholar]
- 27.Schaverien MV. Selection for surgical training: an evidence-based review. J Surg Educ. 2016;73(4):721-729. doi: 10.1016/j.jsurg.2016.02.007 [DOI] [PubMed] [Google Scholar]
- 28.Louridas M, Szasz P, Montbrun S, Harris KA, Grantcharov TP. Optimizing the selection of general surgery residents: a national consensus. J Surg Educ. 2017;74(1):100-107. doi: 10.1016/j.jsurg.2016.06.015 [DOI] [PubMed] [Google Scholar]
- 29.Toale C, Morris M, Ryan DM, et al. Baseline assessments of psychomotor aptitude are associated with the future operative performance of surgical trainees a prospective observational cohort study. Ann Surg. 278(1):148-152. doi: 10.1097/SLA.0000000000005565 [DOI] [PubMed] [Google Scholar]
- 30.von Elm E, Altman DG, Egger M, Pocock SJ, Gøtzsche PC, Vandenbroucke JP; STROBE Initiative . The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement: guidelines for reporting observational studies. Lancet. 2007;370(9596):1453-1457. doi: 10.1016/S0140-6736(07)61602-X [DOI] [PubMed] [Google Scholar]
- 31.Iversen ED, Wolderslund MO, Kofoed PE, et al. Codebook for rating clinical communication skills based on the Calgary-Cambridge Guide. BMC Med Educ. 2020;20(1):140. doi: 10.1186/s12909-020-02050-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Gallagher AG, Neary P, Gillen P, et al. Novel method for assessment and selection of trainees for higher surgical training in general surgery. ANZ J Surg. 2008;78(4):282-290. doi: 10.1111/j.1445-2197.2008.04439.x [DOI] [PubMed] [Google Scholar]
- 33.Carroll SM, Kennedy AM, Traynor O, Gallagher AG. Objective assessment of surgical performance and its impact on a national selection programme of candidates for higher surgical training in plastic surgery. J Plast Reconstr Aesthet Surg. 2009;62(12):1543-1549. doi: 10.1016/j.bjps.2008.06.054 [DOI] [PubMed] [Google Scholar]
- 34.Royal College of Surgeons in Ireland . General surgery: overview. Accessed February 5, 2024. https://www.rcsi.com/surgery/training/surgery/general-surgery/overview
- 35.Henn P, Gallagher AG, Nugent E, et al. A computerised test of perceptual ability for learning endoscopic and laparoscopic surgery and other image guided procedures: Score norms for PicSOr. Am J Surg. 2017;214(5):969-973. doi: 10.1016/j.amjsurg.2017.01.025 [DOI] [PubMed] [Google Scholar]
- 36.Gallagher AG, Cowie R, Crothers I, Jordan-Black JA, Satava RM. PicSOr: an objective test of perceptual skill that predicts laparoscopic technical skill in three initial studies of laparoscopic performance. Surg Endosc. 2003;17(9):1468-1471. doi: 10.1007/s00464-002-8569-4 [DOI] [PubMed] [Google Scholar]
- 37.Cowie R. Measurement and modelling of perceived slant in surfaces represented by freely viewed line drawings. Perception. 1998;27(5):505-540. doi: 10.1068/p270505 [DOI] [PubMed] [Google Scholar]
- 38.Kolozsvari NO, Andalib A, Kaneva P, et al. Sex is not everything: the role of gender in early performance of a fundamental laparoscopic skill. Surg Endosc. 2011;25(4):1037-1042. doi: 10.1007/s00464-010-1311-8 [DOI] [PubMed] [Google Scholar]
- 39.Haluck RS, Gallagher AG, Satava RM, Webster R, Bass TL, Miller CA. Reliability and validity of Endotower, a virtual reality trainer for angled endoscope navigation. Stud Health Technol Inform. 2002;85:179-184. [PubMed] [Google Scholar]
- 40.Ritter EM, McClusky DA III, Gallagher AG, Enochsson L, Smith CD. Perceptual, visuospatial, and psychomotor abilities correlate with duration of training required on a virtual-reality flexible endoscopy simulator. Am J Surg. 2006;192(3):379-384. doi: 10.1016/j.amjsurg.2006.03.003 [DOI] [PubMed] [Google Scholar]
- 41.Westman B, Ritter EM, Kjellin A, et al. Visuospatial abilities correlate with performance of senior endoscopy specialist in simulated colonoscopy. J Gastrointest Surg. 2006;10(4):593-599. doi: 10.1016/j.gassur.2005.08.014 [DOI] [PubMed] [Google Scholar]
- 42.Arora H, Uribe J, Ralph W, et al. Assessment of construct validity of the endoscopic sinus surgery simulator. Arch Otolaryngol Head Neck Surg. 2005;131(3):217-221. doi: 10.1001/archotol.131.3.217 [DOI] [PubMed] [Google Scholar]
- 43.Carroll JB. Human cognitive abilities: a survey of factor-analytic studies. Cambridge University Press; 1993. doi: 10.1017/CBO9780511571312 [DOI] [Google Scholar]
- 44.Ekstrom RB, Harman HH. Manual for kit of factor-referenced cognitive tests, 1976. In: Ekstrom RB, Harman HH, eds. Educational Testing Service; 1976. [Google Scholar]
- 45.Dikmen SS, Heaton RK, Grant I, Temkin NR. Test-retest reliability and practice effects of expanded Halstead-Reitan Neuropsychological Test Battery. J Int Neuropsychol Soc. 1999;5(4):346-356. doi: 10.1017/S1355617799544056 [DOI] [PubMed] [Google Scholar]
- 46.Toale C, Morris M, Ryan DM, et al. Baseline assessments of psychomotor aptitude are associated with the future operative performance of surgical trainees: a prospective observational cohort study. Ann Surg. 2023;278(1):148-152. doi: 10.1097/SLA.0000000000005565 [DOI] [PubMed] [Google Scholar]
- 47.Harrington CM, Kavanagh DO, Ryan D, et al. Objective scoring of an electronic surgical logbook: analysis of impact and observations within a surgical training body. Am J Surg. 2017;214(5):962-968. doi: 10.1016/j.amjsurg.2017.07.028 [DOI] [PubMed] [Google Scholar]
- 48.Bujang MA, Baharum N. Sample size guideline for correlation analysis. World Journal of Social Science Research. 2016;3:37. doi: 10.22158/wjssr.v3n1p37 [DOI] [Google Scholar]
- 49.Myers RH. Classical and modern regression with applications. Duxbury Press; 1990. [Google Scholar]
- 50.Cohen J. Statistical power analysis for the behavioral sciences. Academic press; 2013. doi: 10.4324/9780203771587 [DOI] [Google Scholar]
- 51.Nally DM, Kavanagh D, Traynor O, Bennett D. Attrition from a national surgical training program: a qualitative analysis. Ann Surg. 2022;275(4):621-628. doi: 10.1097/SLA.0000000000004852 [DOI] [PubMed] [Google Scholar]
- 52.Sutton PA, Beamish AJ, Rashid S, Elsey E, Mohan HM, O’Regan D; Association of Surgeons in Training . Attributes of excellent surgical trainers: an analysis of outstanding trainers. Int J Surg. 2018;52:371-375. doi: 10.1016/j.ijsu.2017.10.007 [DOI] [PubMed] [Google Scholar]
- 53.Roff S, McAleer S. What is educational climate? Med Teach. 2001;23(4):333-334. doi: 10.1080/01421590120063312 [DOI] [PubMed] [Google Scholar]
- 54.Hashimoto DA, Sirimanna P, Gomez ED, et al. Deliberate practice enhances quality of laparoscopic surgical performance in a randomized controlled trial: from arrested development to expert performance. Surg Endosc. 2015;29(11):3154-3162. doi: 10.1007/s00464-014-4042-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Toale C, Morris M, O’Keeffe D, et al. Assessing operative competence in core surgical training: a reliability analysis. Am J Surg. 2023;226(5):588-595. doi: 10.1016/j.amjsurg.2023.06.020 [DOI] [PubMed] [Google Scholar]
- 56.Burgess A, Roberts C, Clark T, Mossman K. The social validity of a national assessment centre for selection into general practice training. BMC Med Educ. 2014;14(1):261. doi: 10.1186/s12909-014-0261-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Worrell FC, Roberson CC. 2014 Standards for educational and psychological testing: implications for ethnic minority youth. American Psychological Association; 2016. [Google Scholar]
- 58.Cook DA, Hatala R. Validation of educational assessments: a primer for simulation and beyond. Adv Simul (Lond). 2016;1:31. doi: 10.1186/s41077-016-0033-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Van Der Vleuten CPM. The assessment of professional competence: developments, research and practical implications. Adv Health Sci Educ Theory Pract. 1996;1(1):41-67. doi: 10.1007/BF00596229 [DOI] [PubMed] [Google Scholar]
- 60.Gardner AK, Dunkin BJ. Evaluation of validity evidence for personality, emotional intelligence, and situational judgment tests to identify successful residents. JAMA Surg. 2018;153(5):409-416. doi: 10.1001/jamasurg.2017.5013 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Abd Elwahab SM, Cosgrave J, Doherty E, Lowery A. Grit personality trait of doctors and medical students in level 4 acute hospital. Surgeon. 2022;20(4):216-224. doi: 10.1016/j.surge.2021.04.010 [DOI] [PubMed] [Google Scholar]
- 62.Ellis R, Brennan PA, Lee AJ, Scrimgeour DS, Cleland J. Differential attainment at MRCS according to gender, ethnicity, age and socioeconomic factors: a retrospective cohort study. J R Soc Med. 2022;115(7):257-272. doi: 10.1177/01410768221079018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Royal College of Surgeons in Ireland . Core surgical training intake report 2020-2023. Accessed February 5, 2024. https://www.rcsi.com/surgery/-/media/feature/media/download-document/surgery/training/core-surgical-training-intake-data-2020-2023.pdf
- 64.Moglia A, Morelli L, Ferrari V, Ferrari M, Mosca F, Cuschieri A. Distribution of innate psychomotor skills recognized as important for surgical specialization in unconditioned medical undergraduates. Surg Endosc. 2018;32(10):4087-4095. doi: 10.1007/s00464-018-6146-8 [DOI] [PubMed] [Google Scholar]
- 65.Moglia A, Ferrari V, Morelli L, et al. Distribution of innate ability for surgery amongst medical students assessed by an advanced virtual reality surgical simulator. Surg Endosc. 2014;28(6):1830-1837. doi: 10.1007/s00464-013-3393-6 [DOI] [PubMed] [Google Scholar]
- 66.Alvand A, Auplish S, Khan T, Gill HS, Rees JL. Identifying orthopaedic surgeons of the future: the inability of some medical students to achieve competence in basic arthroscopic tasks despite training: a randomised study. J Bone Joint Surg Br. 2011;93(12):1586-1591. doi: 10.1302/0301-620X.93B12.27946 [DOI] [PubMed] [Google Scholar]
- 67.Lund S, D’Angelo JD, Baloul M, Yeh VJ, Stulak J, Rivera M. Simulation as soothsayer: simulated surgical skills MMIs during residency interviews are associated with first year residency performance. J Surg Educ. 2022;79(6):e235-e241. doi: 10.1016/j.jsurg.2022.06.002 [DOI] [PubMed] [Google Scholar]
- 68.Buckley CE, Kavanagh DO, Nugent E, Ryan D, Traynor OJ, Neary PC. The impact of aptitude on the learning curve for laparoscopic suturing. Am J Surg. 2014;207(2):263-270. doi: 10.1016/j.amjsurg.2013.08.037 [DOI] [PubMed] [Google Scholar]
- 69.Groenier M, Groenier KH, Miedema HA, Broeders IA. Perceptual speed and psychomotor ability predict laparoscopic skill acquisition on a simulator. J Surg Educ. 2015;72(6):1224-1232. doi: 10.1016/j.jsurg.2015.07.006 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
eAppendix. Assessment tools
eTable 1. Correlation between applicant factors evaluated at the time of selection, and future performance in assessments throughout Core Surgical Training, presented as Pearson correlation coefficients (r) and P-value
eTable 2. Analysis of Variance (ANOVA) demonstrating variance in assessment scores across training locations
eTable 3. Analysis of Variance (ANOVA) demonstrating variance in assessment scores across sub-specialties
Data sharing statement
