Convolutional neural networks with multi-valued neurons

Convolutional neural network (CNN) has been successfully used in many fields including image recognition. CNN is composed of input, convolution, pooling, hidden and output layers, and the weights and biases between layers except the ones between convolution and pooling layers are acquired by learning. In comparison to the conventional neural networks, the learning cost of CNN is higher, and the learning time is longer especially when hidden layer(s) are added. Recently, complex- and quaternion-valued neural networks have drawn much attention. In complex-valued neural networks, inputs, weights, biases and outputs are complex numbers, and in quaternion-valued neural networks, these parameters are quaternions. It has been shown that both methods exhibit excellent accuracy in various applications such as classification and function approximation problems with less computational burden. In this study, we propose CNNs with complex- and quaternion-valued neurons where complex and quaternion numbers are used between pooling, hidden and output layers. We here show that CNNs with complex- and quaternion-valued neurons have higher learning ability in handwritten digit image classification with the MNIST data than the real-valued counterpart.

[1]  Yoshua Bengio,et al.  Convolutional networks for images, speech, and time series , 1998 .

[2]  Igor N. Aizenberg,et al.  Complex-Valued Neural Networks with Multi-Valued Neurons , 2011, Studies in Computational Intelligence.

[3]  Yann LeCun,et al.  The mnist database of handwritten digits , 2005 .

[4]  Kazuyuki Murase,et al.  Single-layered complex-valued neural network for real-valued classification problems , 2009, Neurocomputing.

[5]  Takayuki Okatani,et al.  On Deep Learning , 2015, Journal of the Robotics Society of Japan.

[6]  Hiromi Kusamichi,et al.  A New Scheme for Color Night Vision by Quaternion Neural Network , 2004 .

[7]  Lawrence D. Jackel,et al.  Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[8]  Jason Weston,et al.  A unified architecture for natural language processing: deep neural networks with multitask learning , 2008, ICML '08.

[9]  Tohru Nitta Complex-Valued Neural Networks , 2009, Encyclopedia of Artificial Intelligence.

[10]  Akira Hirose,et al.  Complex-Valued Neural Networks , 2006, Studies in Computational Intelligence.

[11]  Kazuyuki Murase,et al.  Quaternion neuro-fuzzy learning algorithm for generation of fuzzy rules , 2016, Neurocomputing.

[12]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[13]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[14]  Kunihiko Fukushima,et al.  Neocognitron: A new algorithm for pattern recognition tolerant of deformations and shifts in position , 1982, Pattern Recognit..

[15]  Akira Hirose,et al.  Complex-Valued Neural Networks: Advances and Applications , 2013 .