Operational neural networks

Feed-forward, fully connected artificial neural networks or the so-called multi-layer perceptrons are well-known universal approximators. However, their learning performance varies significantly depending on the function or the solution space that they attempt to approximate. This is mainly because of their homogenous configuration based solely on the linear neuron model. Therefore, while they learn very well those problems with a monotonous, relatively simple and linearly separable solution space, they may entirely fail to do so when the solution space is highly nonlinear and complex. Sharing the same linear neuron model with two additional constraints (local connections and weight sharing), this is also true for the conventional convolutional neural networks (CNNs) and it is, therefore, not surprising that in many challenging problems only the deep CNNs with a massive complexity and depth can achieve the required diversity and the learning performance. In order to address this drawback and also to accomplish a more generalized model over the convolutional neurons, this study proposes a novel network model, called operational neural networks (ONNs), which can be heterogeneous and encapsulate neurons with any set of operators to boost diversity and to learn highly complex and multi-modal functions or spaces with minimal network complexity and training data. Finally, the training method to back-propagate the error through the operational layers of ONNs is formulated. Experimental results over highly challenging problems demonstrate the superior learning capabilities of ONNs even with few neurons and hidden layers.

[1]  Sotirios A. Tsaftaris,et al.  Medical Image Computing and Computer Assisted Intervention , 2017 .

[2]  Vijay Vasudevan,et al.  Learning Transferable Architectures for Scalable Image Recognition , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[3]  P. Somogyi,et al.  Neuronal Diversity and Temporal Dynamics: The Unity of Hippocampal Circuit Operations , 2008, Science.

[4]  Alexandros Iosifidis,et al.  Knowledge Transfer for Face Verification Using Heterogeneous Generalized Operational Perceptrons , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[5]  Jian Sun,et al.  BM3D-Net: A Convolutional Neural Network for Transform-Domain Collaborative Filtering , 2018, IEEE Signal Processing Letters.

[6]  Paul Nurse,et al.  Cell Division Intersects with Cell Geometry , 2010, Cell.

[7]  Nicolaos B. Karayiannis,et al.  On the construction and training of reformulated radial basis function neural networks , 2003, IEEE Trans. Neural Networks.

[8]  Z. Nusser Variability in the subcellular distribution of ion channels increases neuronal diversity , 2009, Trends in Neurosciences.

[9]  Gang Hua,et al.  Labeled Faces in the Wild: A Survey , 2016 .

[10]  Moncef Gabbouj,et al.  Multidimensional Particle Swarm Optimization for Machine Learning and Pattern Recognition , 2013, Adaptation, learning, and optimization.

[11]  Jessica A. Cardin,et al.  Neocortical Interneurons: From Diversity, Strength , 2010, Cell.

[12]  Moncef Gabbouj,et al.  A Generic and Robust System for Automated Patient-Specific Classification of ECG Signals , 2009, IEEE Transactions on Biomedical Engineering.

[13]  Simon Haykin,et al.  Neural Networks and Learning Machines , 2010 .

[14]  Alexandros Iosifidis,et al.  Progressive Operational Perceptron with Memory , 2018, ArXiv.

[15]  Moncef Gabbouj,et al.  Face segmentation in thumbnail images by data-adaptive convolutional segmentation networks , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[16]  Jianfei Cai,et al.  Beyond pixels: A comprehensive survey from bottom-up to semantic image segmentation and cosegmentation , 2015, J. Vis. Commun. Image Represent..

[17]  Stamatios Lefkimmiatis,et al.  Non-local Color Image Denoising with Convolutional Neural Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Alexandros Iosifidis,et al.  Heterogeneous Multilayer Generalized Operational Perceptron , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[19]  Karsten Berns,et al.  Kernel Multilayer Perceptron , 2011, 2011 24th SIBGRAPI Conference on Graphics, Patterns and Images.

[20]  Yi Li,et al.  Fully Convolutional Instance-Aware Semantic Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  P. Werbos,et al.  Beyond Regression : "New Tools for Prediction and Analysis in the Behavioral Sciences , 1974 .

[22]  R. Petralia Diversity in the Neuronal Machine—Order and Variability in Interneuronal Microcircuits, I. Soltesz. Oxford University Press (2006), 238 pp. , 2007 .

[23]  W. Pitts,et al.  A Logical Calculus of the Ideas Immanent in Nervous Activity (1943) , 2021, Ideas That Created the Future.

[24]  S. Kiranyaz,et al.  A Generic and Robust System for Automated Patient-Specific Classification of Electrocardiogram Signals , 2008 .

[25]  Andrzej Cichocki,et al.  Neural networks for optimization and signal processing , 1993 .

[26]  Quoc V. Le,et al.  Efficient Neural Architecture Search via Parameter Sharing , 2018, ICML.

[27]  拓海 杉山,et al.  “Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”の学習報告 , 2017 .

[28]  R. Masland Neuronal diversity in the retina , 2001, Current Opinion in Neurobiology.

[29]  Moncef Gabbouj,et al.  Evolutionary artificial neural networks by multi-dimensional particle swarm optimization , 2009, Neural Networks.

[30]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[31]  Mário Marques Fernandes,et al.  ADVANCES IN FACE DETECTION AND FACIAL IMAGE ANALYSIS , 2018 .

[32]  Zongben Xu,et al.  Universal Approximation of Extreme Learning Machine With Adaptive Growth of Hidden Nodes , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[33]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[34]  Camille Couprie,et al.  Semantic Segmentation using Adversarial Networks , 2016, NIPS 2016.

[35]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Ying Li,et al.  A new algorithm of selecting the radial basis function networks center , 2002, Proceedings. International Conference on Machine Learning and Cybernetics.

[37]  Alexandros Iosifidis,et al.  Progressive Operational Perceptrons , 2017, Neurocomputing.

[38]  Nor Ashidi Mat Isa,et al.  Clustered-Hybrid Multilayer Perceptron network for pattern recognition application , 2011, Appl. Soft Comput..

[39]  Erik Learned-Miller,et al.  FDDB: A benchmark for face detection in unconstrained settings , 2010 .

[40]  Vladlen Koltun,et al.  Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.

[41]  Thomas Hofmann,et al.  Greedy Layer-Wise Training of Deep Networks , 2007 .

[42]  Alexandros Iosifidis,et al.  Generalized model of biological neural networks: Progressive operational perceptrons , 2017, 2017 International Joint Conference on Neural Networks (IJCNN).

[43]  Mandar Kulkarni,et al.  Layer-wise training of deep networks using kernel similarity , 2017, ArXiv.

[44]  H. Sebastian Seung,et al.  Natural Image Denoising with Convolutional Networks , 2008, NIPS.

[45]  E. Marder,et al.  Variability, compensation and homeostasis in neuron and network function , 2006, Nature Reviews Neuroscience.

[46]  M. Y. Mashor Hybrid multilayered perceptron networks , 2000, Int. J. Syst. Sci..

[47]  T.,et al.  Training Feedforward Networks with the Marquardt Algorithm , 2004 .

[48]  Serkan Kiranyaz,et al.  Multi-dimensional Particle Swarm Optimization , 2014 .

[49]  Pascal Vincent,et al.  Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion , 2010, J. Mach. Learn. Res..

[50]  Alexei A. Efros,et al.  Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[51]  Wei Yu,et al.  On learning optimized reaction diffusion processes for effective image restoration , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[52]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.