Preprocessing-free surface material classification using convolutional neural networks pretrained by sparse Autoencoder

Acceleration signals captured during the interaction of a rigid tool with an object surface carry relevant information for surface material classification. Existing methods mostly rely on carefully designed perception-related features or features adapted from audio processing motivated by the observed similarity between acceleration signals and audio signals. In contrast, our proposed method automatically learns features from RAW acceleration data without preprocessing. The approach is based on Convolutional Neural Networks (CNN) trained and tested on RAW data. For better performance and faster convergence of the CNN, we use the weights of a trained sparse Autoencoder (AE) to initialize the weights of the first convolution layers of the CNN. This strategy is named CNN pretrained by sparse AE (ACNN). Our classification results on a publically available Haptic Texture Database demonstrate that the proposed algorithm performs favorably against existing methods.

[1]  Masakazu Matsugu,et al.  Subject independent facial expression recognition with robust face detection using a convolutional neural network , 2003, Neural Networks.

[2]  Léon Bottou,et al.  Stochastic Gradient Descent Tricks , 2012, Neural Networks: Tricks of the Trade.

[3]  Geoffrey E. Hinton,et al.  Binary coding of speech spectrograms using a deep auto-encoder , 2010, INTERSPEECH.

[4]  Jivko Sinapov,et al.  Vibrotactile Recognition and Categorization of Surfaces by a Humanoid Robot , 2011, IEEE Transactions on Robotics.

[5]  S Kullback,et al.  LETTER TO THE EDITOR: THE KULLBACK-LEIBLER DISTANCE , 1987 .

[6]  Gerald E. Loeb,et al.  Bayesian Exploration for Intelligent Identification of Textures , 2012, Front. Neurorobot..

[7]  Joseph M. Romano,et al.  Methods for robotic tool-mediated haptic surface recognition , 2014, 2014 IEEE Haptics Symposium (HAPTICS).

[8]  Honglak Lee,et al.  Unsupervised feature learning for audio classification using convolutional deep belief networks , 2009, NIPS.

[9]  Joseph M. Romano,et al.  Dimensional Reduction of High-Frequency Accelerations for Haptic Rendering , 2010, EuroHaptics.

[10]  Eckehard G. Steinbach,et al.  A haptic texture database for tool-mediated texture recognition and classification , 2014, 2014 IEEE International Symposium on Haptic, Audio and Visual Environments and Games (HAVE) Proceedings.

[11]  Harris Drucker,et al.  Learning algorithms for classification: A comparison on handwritten digit recognition , 1995 .

[12]  Eckehard G. Steinbach,et al.  Surface classification using acceleration signals recorded during human freehand movement , 2015, 2015 IEEE World Haptics Conference (WHC).

[13]  Joseph M. Romano,et al.  Creating Realistic Virtual Textures from Contact Acceleration Data , 2012, IEEE Transactions on Haptics.

[14]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[15]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[16]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[17]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.