Speech Recognition for Endoscopic Automatic Positioning System

A novel system for minimally invasive surgery is presented in this paper. The system utilized an Endoscopic Automatic Positioner (EAP) controlled by Speech Recognition Engine to implement the clamping and dynamically positioning of the laparoscope. The motion instructions of the EAP are transformed from voice commands of specific doctor recognized by an improved algorithm named Normalized Average- Dynamic Time Warping (NA-DTW). An embedded platform based on ARM is designed to run the NA-DTW on Windows CE operating system. 1250 groups of experiments from 10 individual speakers demonstrate the performance of DTW. Compared with traditional algorithms, the enhanced algorithm improves the recognition rate from 96.6% to 99.76% and shortens the time of calculation by 51%. The results demonstrate the enhanced algorithm being effective and can satisfy the real time requirement in embedded system.

[1]  Luo Fei Improved Spectral Subtraction Method Based on Spectral Entropy Noise Estimation , 2009 .

[2]  Yoshikazu Miyanaga,et al.  An improved dynamic time warping algorithm employing nonlinear median filtering , 2011, 2011 11th International Symposium on Communications & Information Technologies (ISCIT).

[3]  Björn W. Schuller,et al.  Speech control in surgery: A field analysis and strategies , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[4]  E. Paulus,et al.  Speech Signal Processing , 1997, The Electrical Engineering Handbook - Six Volume Set.

[5]  Aaron E. Rosenberg,et al.  Performance tradeoffs in dynamic time warping algorithms for isolated word recognition , 1980 .

[6]  Wang Hua-qing Improved algorithm of Mel-Frequence Cepstral Coefficients in characteristics extraction based on voice signal , 2008 .

[7]  Waleed H. Abdulla,et al.  Cross-words reference template for DTW-based speech recognition systems , 2003, TENCON 2003. Conference on Convergent Technologies for Asia-Pacific Region.

[8]  Quantized Dynamic Time Warping (DTW) algorithm , 2010, 2010 8th International Conference on Communications.