Reliability, validity, and feasibility of the Zwisch scale for the assessment of intraoperative performance.

PURPOSE The existing methods for evaluating resident operative performance interrupt the workflow of the attending physician, are resource intensive, and are often completed well after the end of the procedure in question. These limitations lead to low faculty compliance and potential significant recall bias. In this study, we deployed a smartphone-based system, the Procedural Autonomy and Supervisions System, to facilitate assessment of resident performance according to the Zwisch scale with minimal workflow disruption. We aimed to demonstrate that this is a reliable, valid, and feasible method of measuring resident operative autonomy. METHODS Before implementation, general surgery residents and faculty underwent frame-of-reference training to the Zwisch scale. Immediately after any operation in which a resident participated, the system automatically sent a text message prompting the attending physician to rate the resident's level of operative autonomy according to the 4-level Zwisch scale. Of these procedures, 8 were videotaped and independently rated by 2 additional surgeons. The Zwisch ratings of the 3 raters were compared using an intraclass correlation coefficient. Videotaped procedures were also scored using 2 alternative operating room (OR) performance assessment instruments (Operative Performance Rating System and Ottawa Surgical Competency OR Evaluation), against which the item correlations were calculated. RESULTS Between December 2012 and June 2013, 27 faculty used the smartphone system to complete 1490 operative performance assessments on 31 residents. During this period, faculty completed evaluations for 92% of all operations performed with general surgery residents. The Zwisch scores were shown to correlate with postgraduate year (PGY) levels based on sequential pairwise chi-squared tests: PGY 1 vs PGY 2 (χ(2) = 106.9, df = 3, p < 0.001); PGY 2 vs PGY 3 (χ(2) = 22.2, df = 3, p < 0.001); and PGY 3 vs PGY 4 (χ(2) = 56.4, df = 3, p < 0.001). Comparison of PGY 4 to PGY 5 scores were not significantly different (χ(2) = 4.5, df = 3, p = 0.21). For the 8 operations reviewed for interrater reliability, the intraclass correlation coefficient was 0.90 (95% CI: 0.72-0.98, p < 0.01). Correlation of Procedural Autonomy and Supervisions System ratings with both Operative Performance Rating System items (each r > 0.90, all p's < 0.01) and Ottawa Surgical Competency OR Evaluation items (each r > 0.86, all p's < 0.01) was high. CONCLUSIONS The Zwisch scale can be used to make reliable and valid measurements of faculty guidance and resident autonomy. Our data also suggest that Zwisch ratings may be used to infer resident operative performance. Deployed on an automated smartphone-based system, it can be used to feasibly record evaluations for most operations performed by residents. This information can be used to council individual residents, modify programmatic curricula, and potentially inform national training guidelines.

[1]  Daniel B. Jones,et al.  General Surgery Residency Inadequately Prepares Trainees for Fellowship: Results of a Survey of Fellowship Program Directors , 2013, Annals of surgery.

[2]  Joseph B Zwischenberger,et al.  A theory-based model for teaching and assessing residents in the operating room. , 2013, Journal of surgical education.

[3]  R. Bell,et al.  Attitudes, training experiences, and professional expectations of US general surgery residents: a national survey. , 2009, JAMA.

[4]  Brian C. George,et al.  Duration of faculty training needed to ensure reliable or performance ratings. , 2013, Journal of surgical education.

[5]  F. Lewis,et al.  General surgery residency training issues. , 2013, Advances in surgery.

[6]  S. Wilson,et al.  Threats to the surgical residency in the academic medical center. , 2001, Archives of surgery.

[7]  Timothy J. Wood,et al.  The Ottawa Surgical Competency Operating Room Evaluation (O-SCORE): A Tool to Assess Surgical Competence , 2012, Academic medicine : journal of the Association of American Medical Colleges.

[8]  Reed G. Williams,et al.  Feasibility, reliability and validity of an operative performance rating system for evaluating surgery residents. , 2005, Surgery.

[9]  C. W. Cummings,et al.  Competency in surgical residency training: defining and raising the bar. , 2007, Academic medicine : journal of the Association of American Medical Colleges.

[10]  Stuart R. Lipsitz,et al.  Patterns of Technical Error Among Surgical Malpractice Claims: An Analysis of Strategies to Prevent Injury to Surgical Patients , 2007, Annals of surgery.