Crowd-Sourced Assessment of Technical Skills: Differentiating Animate Surgical Skill Through the Wisdom of Crowds.

BACKGROUND Objective quantification of surgical skill is imperative as we enter a healthcare environment of quality improvement and performance-based reimbursement. The gold standard tools are infrequently used due to time-intensiveness, cost inefficiency, and lack of standard practices. We hypothesized that valid performance scores of surgical skill can be obtained through crowdsourcing. METHODS Twelve surgeons of varying robotic surgical experience performed live porcine robot-assisted urinary bladder closures. Blinded video-recorded performances were scored by expert surgeon graders and by Amazon's Mechanical Turk crowdsourcing crowd workers using the Global Evaluative Assessment of Robotic Skills tool assessing five technical skills domains. Seven expert graders and 50 unique Mechanical Turkers (each paid $0.75/survey) evaluated each video. Global assessment scores were analyzed for correlation and agreement. RESULTS Six hundred Mechanical Turkers completed the surveys in less than 5 hours, while seven surgeon graders took 14 days. The duration of video clips ranged from 2 to 11 minutes. The correlation coefficient between the Turkers' and expert graders' scores was 0.95 and Cronbach's Alpha was 0.93. Inter-rater reliability among the surgeon graders was 0.89. CONCLUSION Crowdsourcing surgical skills assessment yielded rapid inexpensive agreement with global performance scores given by expert surgeon graders. The crowdsourcing method may provide surgical educators and medical institutions with a boundless number of procedural skills assessors to efficiently quantify technical skills for use in trainee advancement and hospital quality improvement.

[1]  A. Darzi,et al.  Assessing operative skill , 1999, BMJ.

[2]  A. Darzi,et al.  Objective assessment of technical skills in surgery , 2003, BMJ : British Medical Journal.

[3]  Marlene R. Miller,et al.  Excess length of stay, charges, and mortality attributable to medical injuries during hospitalization. , 2003, JAMA.

[4]  L. Cronbach,et al.  My Current Thoughts on Coefficient Alpha and Successor Procedures , 2004 .

[5]  Catherine Yoon,et al.  Analysis of surgical errors in closed malpractice claims at 4 liability insurers. , 2006, Surgery.

[6]  Ara Darzi,et al.  The surgical efficiency score: a feasible, reliable, and valid method of skills assessment. , 2006, American journal of surgery.

[7]  E. Verdaasdonk,et al.  Objective assessment of technical surgical skills , 2010, The British journal of surgery.

[8]  David Baker,et al.  Algorithm discovery by protein folding game players , 2011, Proceedings of the National Academy of Sciences.

[9]  A. Goh,et al.  Global evaluative assessment of robotic skills: validation of a clinical assessment tool to measure robotic surgical skills. , 2012, The Journal of urology.

[10]  Timothy M. Kowalewski,et al.  Content and construct validation of a robotic surgery curriculum using an electromagnetic instrument tracker. , 2012, Journal of Urology.

[11]  J. Birkmeyer,et al.  Surgical skill and complication rates after bariatric surgery. , 2013, The New England journal of medicine.

[12]  Blake Hannaford,et al.  Future of Robotic Surgery , 2013, Cancer journal.

[13]  Timothy M. Kowalewski,et al.  Crowd-Sourced Assessment of Technical Skills: a novel method to evaluate surgical performance. , 2014, The Journal of surgical research.

[14]  Timothy M. Kowalewski,et al.  Crowd-sourced assessment of technical skills: an adjunct to urology resident surgical simulation training. , 2014, Journal of endourology.