Gaze, Posture and Gesture Recognition to Minimize Focus Shifts for Intelligent Operating Rooms in a Collaborative Support System

This paper describes the design of intelligent, collaborative operating rooms based on highly intuitive, natural and multimodal interaction. Intelligent operating rooms minimize surgeon’s focus shifts by minimizing both the focus spatial offset (distance moved by surgeon’s head or gaze to the new target) and the movement spatial offset (distance surgeon covers physically). These spatio-temporal measures have an impact on the surgeon’s performance in the operating room. I describe how machine vision techniques are used to extract spatio-temporal measures and to interact with the system, and how computer graphics techniques can be used to display visual medical information effectively and rapidly. Design considerations are discussed and examples showing the feasibility of the different approaches are presented.

[1]  Masatsugu Kidode,et al.  Displaying a Moving Image By Multiple Steerable Projectors , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Maribeth Gandy Coleman,et al.  The Gesture Pendant: A Self-illuminating, Wearable, Infrared Computer Vision System for Home Automation Control and Medical Monitoring , 2000, Digest of Papers. Fourth International Symposium on Wearable Computers.

[3]  Sébastien Roy,et al.  Multi-projectors for arbitrary surfaces without explicit calibration nor reconstruction , 2003, Fourth International Conference on 3-D Digital Imaging and Modeling, 2003. 3DIM 2003. Proceedings..

[4]  Yoshiaki Shirai,et al.  Intelligent wheelchair remotely controlled by interactive gestures , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[5]  James M. Rehg,et al.  Shadow Elimination and Blinding Light Suppression for Interactive Projected Displays , 2007, IEEE Trans. Vis. Comput. Graph..

[6]  Fumio Miyazaki,et al.  FAce MOUSe: A novel human-machine interface for controlling the position of a laparoscope , 2003, IEEE Trans. Robotics Autom..

[7]  H Poizner,et al.  Virtual reality-based post-stroke hand rehabilitation. , 2002, Studies in health technology and informatics.

[8]  Kazuhiko Yamamoto,et al.  Robust Face Detection and Japanese Sign Language Hand Posture Recognition for Human-Computer Interaction in an “ Intelligent ” Room † , 2002 .

[9]  Pierre Wellner The DigitalDesk calculator: tangible manipulation on a desk top display , 1991, UIST '91.

[10]  Mark Ashdown,et al.  Steerable Projector Calibration , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[11]  Aditi Majumder,et al.  Registration Techniques for Using Imperfect and Par tially Calibrated Devices in Planar Multi-Projector Displays , 2007, IEEE Transactions on Visualization and Computer Graphics.

[12]  C Baur,et al.  A non-contact mouse for surgeon-computer interaction. , 2004, Technology and health care : official journal of the European Society for Engineering and Medicine.

[13]  Antonio Torralba,et al.  Object Detection and Localization Using Local and Global Features , 2006, Toward Category-Level Object Recognition.

[14]  Paul Lukowicz,et al.  WearIT@work: Toward Real-World Industrial Wearable Computing , 2007, IEEE Pervasive Computing.

[15]  William Newman,et al.  A desk supporting computer-based interaction with paper documents , 1992, CHI.

[16]  R.A. Brooks,et al.  The Intelligent Room project , 1997, Proceedings Second International Conference on Cognitive Technology Humanizing the Information Age.

[17]  Mohan M. Trivedi,et al.  Hierarchical audio-visual cue integration framework for activity analysis in intelligent meeting rooms , 2009, 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[18]  Greg Welch,et al.  A Distributed Cooperative Framework for Continuous Multi-Projector Pose Estimation , 2009, 2009 IEEE Virtual Reality Conference.

[19]  Juan Pablo Wachs,et al.  Recognizing Human Postures and Poses in Monocular Still Images , 2009, IPCV.

[20]  Animesh Kumar,et al.  Towards an intelligent hospital environment: OR of the future. , 2005, Studies in health technology and informatics.

[21]  Alex Pentland,et al.  Staying Alive: A Virtual Reality Visualization Tool for Cancer Patients , 1996 .

[22]  R. Satava Disruptive visions: The operating room of the future , 2003 .

[23]  J. Duysens,et al.  Distraction Affects the Performance of Obstacle Avoidance During Walking , 2003, Journal of motor behavior.

[24]  Luc Van Gool,et al.  Automatic Interactive Calibration of Multi-Projector-Camera Systems , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[25]  Robert L Coleman,et al.  Acquiring laparoscopic skill proficiency: does orientation matter? , 2004, American journal of obstetrics and gynecology.

[26]  Jan Kleindienst,et al.  Integrated Development of Context-Aware Applications in Smart Spaces , 2008, IEEE Pervasive Computing.

[27]  Animesh Kumar,et al.  Towards an intelligent hospital environment , 2005 .

[28]  Binyu Zang,et al.  A High Resolution Video Display System by Seamlessly Tiling Multiple Projectors , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[29]  Steven D Schwaitzberg,et al.  Effects of cognitive distraction on performance of laparoscopic surgical tasks. , 2006, Journal of laparoendoscopic & advanced surgical techniques. Part A.

[30]  Michael W. Eysenck,et al.  Distraction and cognitive performance. , 1989 .

[31]  Mohan M. Trivedi,et al.  Video arrays for real-time tracking of person, head, and face in an intelligent room , 2003, Machine Vision and Applications.

[32]  Gary Bradski,et al.  Computer Vision Face Tracking For Use in a Perceptual User Interface , 1998 .

[33]  Ruigang Yang,et al.  Automatic and Continuous Projector Display Surface Estimation Using Everyday Imagery , 2001, WSCG.

[34]  Gérard G. Medioni,et al.  Electronic pan-tilt-zoom: a solution for intelligent room systems , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[35]  Michael H. Coen,et al.  Design Principles for Intelligent Environments , 1998, AAAI/IAAI.