Principal Component Pyramids using Image Blurring for Nonlinearity Reduction in Hand Shape Recognition

This thesis presents four algorithms for hand shape recognition that share a multistage hierarchical strategy. The hierarchy analyzes a new pattern by projecting it into the different levels of a data pyramid that consists of different principal component spaces. Image blurring is used to reduce the nonlinearity of the manifolds generated by a set of example images; flattening the space in this way helps classify different hand shapes more accurately. The four algorithms use different pattern recognition techniques. The first uses perpendicular distance to measure how far a new pattern lies from the nearest manifold. The second uses supervised multidimensional grids. The third uses unsupervised multidimensional grids to cluster the space into cells of similar objects. The fourth trains a set of multi-layer neural networks with simple architectures at the different levels of the pyramid to map new patterns to the closest class. All four are example-based approaches in which a large set of computer-generated images is used to sample the space densely. Experiments examine the accuracy and performance of the algorithms and the effect of image blurring on manifold nonlinearity, and the results are compared with an exhaustive search. The results show that the algorithms are suitable for real-time applications with high accuracy: they achieve frame rates of more than 10 frames per second and accuracies of up to 98% on test data.
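
The claim that blurring flattens an image manifold can be illustrated with a toy example. The sketch below is not the thesis's method or data: it assumes a one-dimensional "image" containing a single bar, a truncated Gaussian kernel, and a hypothetical measure called `linearity_ratio`. The idea is that on a perfectly straight manifold, shifting the bar by twice the step doubles the Euclidean distance between images, so a ratio near 2 indicates a flatter manifold and a ratio near 1 indicates a strongly curved one.

```python
import math

def bar_image(pos, width=64, bar=4):
    """Toy 1-D 'image': a bar of ones starting at index pos (illustrative only)."""
    return [1.0 if pos <= i < pos + bar else 0.0 for i in range(width)]

def gaussian_blur(signal, sigma):
    """Blur a 1-D signal with a Gaussian kernel truncated at 3*sigma (zero padding)."""
    if sigma <= 0:
        return list(signal)
    radius = int(3 * sigma)
    kernel = [math.exp(-0.5 * (k / sigma) ** 2) for k in range(-radius, radius + 1)]
    total = sum(kernel)
    kernel = [k / total for k in kernel]
    n = len(signal)
    out = []
    for i in range(n):
        acc = 0.0
        for j, w in enumerate(kernel):
            idx = i + j - radius
            if 0 <= idx < n:
                acc += w * signal[idx]
        out.append(acc)
    return out

def distance(a, b):
    """Euclidean distance between two equally sized images."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def linearity_ratio(sigma, start=20, step=6):
    """Hypothetical flatness measure: d(shift by 2*step) / d(shift by step).

    A value near 2 means distance grows linearly with the shift (flat manifold);
    a value near 1 means distance saturates immediately (curved manifold).
    """
    base = gaussian_blur(bar_image(start), sigma)
    near = gaussian_blur(bar_image(start + step), sigma)
    far = gaussian_blur(bar_image(start + 2 * step), sigma)
    return distance(base, far) / distance(base, near)

if __name__ == "__main__":
    print("no blur :", linearity_ratio(0))
    print("sigma=6 :", linearity_ratio(6))
```

Without blur the ratio is 1, because neither shifted bar overlaps the original and both distances are equal. With blur the ratio rises above 1 toward 2, which is the sense in which blurring makes the manifold of shifted images easier to approximate with linear subspaces such as principal component spaces.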
