Efficient web data classification techniques using semi-supervise learning algorithm

In Organization the data is very important that increase the volume of information that is available on the web and that leads to the design of efficient and accurate web data classification systems. In this paper, we define a framework to improve the performance of a base classifier, by clustering the unlabeled data with labeled data using clustering algorithm (training of samples) labeling of clusters (majority voting for each Hyperspheres) and final generated classified data. We have used construction of BNN based semi-supervised classifier while training and testing of the classifier is performed. We have studied and customized a supervised classification algorithm to form out semi-supervised classification that leads to design a multiclass semi-supervised classifier using geometrical expansion. The experimental result shows provision for the classifier designer followed by training and testing medical disease dataset using pre-decided samples. Our classification model consists of training phase that covers two process clustering and labeling to perform classification task of medical data and the binary neural network is trained. In this we used two techniques normalization and quantization for pre-processing the datasets. Pre-processing impart various outcomes after applying the classification model like number of hypersphere, confusing samples that cannot be learned, training time and label of hypersphere. Comparison has been done for implementation and design of Binary Neural Network Classifier Algorithm with the other existing traditional algorithms. Our classifier evaluates performance in terms of generalization, number of hidden neuron and accuracy etc. The BNN-CA construct three-layered binary neural network (BNN) and can solve any semi-labeled multi-class problem.

[1]  Di Wang,et al.  A constructive unsupervised learning algorithm for clustering binary patterns , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[2]  James T. Kwok,et al.  Constructive algorithms for structure learning in feedforward neural networks for regression problems , 1997, IEEE Trans. Neural Networks.

[3]  Lawrence O. Hall,et al.  Text classification with enhanced semi-supervised fuzzy clustering , 2002, 2002 IEEE World Congress on Computational Intelligence. 2002 IEEE International Conference on Fuzzy Systems. FUZZ-IEEE'02. Proceedings (Cat. No.02CH37291).

[4]  Fei Wang,et al.  Semi-Supervised Classification with Universum , 2008, SDM.

[5]  Sung-Kwon Park,et al.  The geometrical learning of binary neural networks , 1995, IEEE Trans. Neural Networks.

[6]  Daewon Lee,et al.  Equilibrium-Based Support Vector Machine for Semisupervised Classification , 2007, IEEE Transactions on Neural Networks.

[7]  Lawrence Carin,et al.  Semi-Supervised Classification , 2004, Encyclopedia of Database Systems.

[8]  Di Wang,et al.  Fast constructive-covering approach for neural networks , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[9]  John R. Smith,et al.  Semantic Labeling of Multimedia Content Clusters , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[10]  Carey L. Williamson,et al.  Categories and Subject Descriptors: C.4 [Computer Systems Organization]Performance of Systems , 2022 .

[11]  I C G Campbell,et al.  Constructive learning techniques for designing neural network systems , 1998 .

[12]  Di Wang,et al.  A fast modified constructive-covering algorithm for binary multi-layer neural networks , 2006, Neurocomputing.

[13]  Michael K. Ng,et al.  Input Validation for Semi-supervised Clustering , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).