An oblique elliptical basis function network approach for supervised learning applications

Abstract We propose a neural network architecture based on the oblique elliptical basis function for supervised learning problems. In classification, a category can be a disconnected or non-convex region involving several overlapping or disjoint sub-regions of the feature space. Other existing supervised learning methods may have the restriction that only allows decision regions to be convex. Our proposed method overcomes this restriction by employing a rotational self-constructing clustering algorithm to decompose the feature space into a collection of sub-regions which can then be combined to make up individual categories. An unseen instance is classified to a certain category if its similarity to the category exceeds a threshold. The whole framework fits in a five-layer network consisting of input, component-similarity, cluster-similarity, aggregation, and output layers. A similar idea also applies to solving regression problems. A parameter learning algorithm based on least squares estimation is used to derive the weights of the underlying network. Our approach can offer some advantages in practicality. Through the incorporation of rotation, data can be clustered more appropriately than by standard elliptical basis functions. Also, our approach is applicable to single-label classification, multi-label classification, as well as regression problems. A number of experiments are conducted to show the effectiveness of the proposed approach.

[1]  Hongming Zhou,et al.  Extreme Learning Machine for Regression and Multiclass Classification , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[2]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[3]  Enrico Zio,et al.  Two Machine Learning Approaches for Short-Term Wind Speed Time-Series Prediction , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[4]  F ROSENBLATT,et al.  The perceptron: a probabilistic model for information storage and organization in the brain. , 1958, Psychological review.

[5]  Henry Leung,et al.  Prediction of noisy chaotic time series using an optimal radial basis function neural network , 2001, IEEE Trans. Neural Networks.

[6]  Grigorios Tsoumakas,et al.  Multi-Label Classification: An Overview , 2007, Int. J. Data Warehous. Min..

[7]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[8]  Mu-Yen Chen,et al.  Online fuzzy time series analysis based on entropy discretization and a Fast Fourier Transform , 2014, Appl. Soft Comput..

[9]  Gene H. Golub,et al.  Matrix computations , 1983 .

[10]  Grigorios Tsoumakas,et al.  Multi-label classification of music by emotion , 2011 .

[11]  H. Vincent Poor,et al.  Machine Learning Methods for Attack Detection in the Smart Grid , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[12]  Shyi-Ming Chen,et al.  TAIEX Forecasting Using Fuzzy Time Series and Automatically Generated Weights of Multiple Factors , 2012, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[13]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[14]  Jiebo Luo,et al.  Learning multi-label scene classification , 2004, Pattern Recognit..

[15]  Bor-Shing Lin,et al.  Higher-Order-Statistics-Based Radial Basis Function Networks for Signal Enhancement , 2007, IEEE Transactions on Neural Networks.

[16]  Shie-Jue Lee,et al.  A neuro-fuzzy system modeling with self-constructing rule generationand hybrid SVD-based learning , 2003, IEEE Trans. Fuzzy Syst..

[17]  Saso Dzeroski,et al.  An extensive experimental comparison of methods for multi-label learning , 2012, Pattern Recognit..

[18]  Man-Wai Mak,et al.  Elliptical basis function networks and radial basis function networks for speaker verification: a comparative study , 1999, IJCNN'99. International Joint Conference on Neural Networks. Proceedings (Cat. No.99CH36339).

[19]  Johannes Fürnkranz,et al.  Large-Scale Multi-label Text Classification - Revisiting Neural Networks , 2013, ECML/PKDD.

[20]  Aristidis Likas,et al.  Shared kernel models for class conditional density estimation , 2001, IEEE Trans. Neural Networks.

[21]  Geoff Holmes,et al.  Classifier chains for multi-label classification , 2009, Machine Learning.

[22]  Zhi-Hua Zhou,et al.  Multilabel Neural Networks with Applications to Functional Genomics and Text Categorization , 2006, IEEE Transactions on Knowledge and Data Engineering.

[23]  Yoav Freund,et al.  Large Margin Classification Using the Perceptron Algorithm , 1998, COLT.

[24]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machines , 2002 .

[25]  Shie-Jue Lee,et al.  Multilabel Text Categorization Based on Fuzzy Relevance Clustering , 2014, IEEE Transactions on Fuzzy Systems.

[26]  Yoram Singer,et al.  BoosTexter: A Boosting-based System for Text Categorization , 2000, Machine Learning.

[27]  Zuhair Bandar,et al.  Sentence similarity based on semantic nets and corpus statistics , 2006, IEEE Transactions on Knowledge and Data Engineering.

[28]  Grigorios Tsoumakas,et al.  Random K-labelsets for Multilabel Classification , 2022 .

[29]  Bowen Zhou,et al.  Improved Neural Network-based Multi-label Classification with Better Initialization Leveraging Label Co-occurrence , 2016, NAACL.

[30]  Suphakant Phimoltares,et al.  A Very Fast Neural Learning for Classification Using Only New Incoming Datum , 2010, IEEE Transactions on Neural Networks.

[31]  David W. Aha,et al.  Special Issue on Lazy Learning , 1997 .

[32]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[33]  Grigorios Tsoumakas,et al.  MULAN: A Java Library for Multi-Label Learning , 2011, J. Mach. Learn. Res..

[34]  Shou-De Lin,et al.  Generalized k-Labelsets Ensemble for Multi-Label and Cost-Sensitive Classification , 2014, IEEE Transactions on Knowledge and Data Engineering.

[35]  Chris T. Kiranoudis,et al.  Radial Basis Function Neural Networks Classification for the Recognition of Idiopathic Pulmonary Fibrosis in Microscopic Images , 2008, IEEE Transactions on Information Technology in Biomedicine.

[36]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[37]  Sun-Yuan Kung,et al.  Estimation of elliptical basis function parameters by the EM algorithm with application to speaker verification , 2000, IEEE Trans. Neural Networks Learn. Syst..

[38]  Guang-Bin Huang,et al.  Extreme learning machine: a new learning scheme of feedforward neural networks , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[39]  Dimitrios Zissis,et al.  A cloud based architecture capable of perceiving and predicting multiple vessel behaviour , 2015, Appl. Soft Comput..

[40]  Mário A. T. Figueiredo,et al.  Hybrid generative/discriminative training of radial basis function networks , 2006, ESANN.

[41]  Jason Weston,et al.  A kernel method for multi-labelled classification , 2001, NIPS.

[42]  Shyi-Ming Chen,et al.  TAIEX Forecasting Based on Fuzzy Time Series and Fuzzy Variation Groups , 2011, IEEE Transactions on Fuzzy Systems.

[43]  Min-Ling Zhang,et al.  A Review on Multi-Label Learning Algorithms , 2014, IEEE Transactions on Knowledge and Data Engineering.

[44]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[45]  A. Kusiak,et al.  Short-Term Prediction of Wind Farm Power: A Data Mining Approach , 2009, IEEE Transactions on Energy Conversion.

[46]  Min-Ling Zhang,et al.  Ml-rbf: RBF Neural Networks for Multi-Label Learning , 2009, Neural Processing Letters.