A Novel Classification Method From the Perspective of Fuzzy Social Networks Based on Physical and Implicit Style Features of Data

Many practical scenarios have demanded that we should classify unlabeled data more accurately based on both physical features (e.g., color, distance, or similarity) and implicit style features of data. As most extant classification algorithms classify unlabeled data based only on their physical features, they become weak in achieving expected classification results for many scenarios. To work around this drawback in this paper, a novel classification method (FuCM) from the perspective of fuzzy social network based on both physical and implicit style features of data is proposed. Based on the proposed fuzzy social network and its dynamics about fuzzy influences of nodes, FuCM comprises two stages. In its training stage, after the fuzzy social network has been built, it learns the topological structure, reflecting physical features and implicit style features of data by carrying out fuzzy influence dynamics in the built network. In its prediction stage, both physical and implicit style features of data are effectively integrated to yield the double structure efficiency characterized by fuzzy influences of nodes. FuCM classifies unlabeled data according to the strongest connection measure based on the proposed double structure efficiency. FuCM does not assume that both data distribution and the classification by physical features or by both physical and implicit style features of data must be known in advance. Thus, it is a novel unified classification framework in this sense. In contrast to all the nine comparative methods, FuCM experimentally demonstrates its comparable classification performance on most synthetic, UCI and KEEL datasets, which can be well classified based only on physical features of data. Furthermore, it displays distinctive superiority on five case studies where satisfactory classification certainly depends on both physical and implicit style features.

[1]  Michael R. Lyu,et al.  SoRec: social recommendation using probabilistic matrix factorization , 2008, CIKM '08.

[2]  P. I. Banokin,et al.  Natural language text parsing for social network user sentiment analysis based on fuzzy sets , 2015, 2015 International Conference on Mechanical Engineering, Automation and Control Systems (MEACS).

[3]  Ebin Deni Raj,et al.  A model fuzzy inference system for online social network analysis , 2015, 2015 International Conference on Computing and Network Communications (CoCoNet).

[4]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[5]  Yuji Matsumoto,et al.  Using the Mutual k-Nearest Neighbor Graphs for Semi-supervised Classification on Natural Language Data , 2011, CoNLL.

[6]  Licia Capra,et al.  kNN CF: a temporal social network , 2008, RecSys '08.

[7]  Kaizhu Huang,et al.  Field Support Vector Machines , 2017, IEEE Transactions on Emerging Topics in Computational Intelligence.

[8]  Sean Hughes,et al.  Clustering by Fast Search and Find of Density Peaks , 2016 .

[9]  Liang Zhao,et al.  A nonparametric classification method based on K-associated graphs , 2011, Inf. Sci..

[10]  Yuanyuan Liu,et al.  Fuzzy Double Trace Norm Minimization for Recommendation Systems , 2018, IEEE Transactions on Fuzzy Systems.

[11]  Lukasz A. Kurgan,et al.  Knowledge discovery approach to automated cardiac SPECT diagnosis , 2001, Artif. Intell. Medicine.

[12]  S.T. Sarasamma,et al.  Data Mining Through Fuzzy Social Network Analysis , 2007, NAFIPS 2007 - 2007 Annual Meeting of the North American Fuzzy Information Processing Society.

[13]  Hui Xiong,et al.  An Influence Propagation View of PageRank , 2017, ACM Trans. Knowl. Discov. Data.

[14]  Michio Sugeno,et al.  Fuzzy identification of systems and its applications to modeling and control , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[15]  Keith C. C. Chan,et al.  Fuzzy Clustering in a Complex Network Based on Content Relevance and Link Structures , 2016, IEEE Transactions on Fuzzy Systems.

[16]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[17]  Sebastiano Vigna,et al.  PageRank as a function of the damping factor , 2005, WWW '05.

[18]  Yousef Saad,et al.  Fast Approximate kNN Graph Construction for High Dimensional Data via Recursive Lanczos Bisection , 2009, J. Mach. Learn. Res..

[19]  Jerry M. Mendel,et al.  Fuzzy Opinion Networks: A Mathematical Framework for the Evolution of Opinions and Their Uncertainties Across Social Networks , 2014, IEEE Transactions on Fuzzy Systems.

[20]  Sergey Brin,et al.  Reprint of: The anatomy of a large-scale hypertextual web search engine , 2012, Comput. Networks.

[21]  Zhaohong Deng,et al.  Transductive domain adaptive learning for epileptic electroencephalogram recognition , 2014, Artif. Intell. Medicine.

[22]  M. Harries SPLICE-2 Comparative Evaluation: Electricity Pricing , 1999 .

[23]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[25]  José Augusto Baranauskas,et al.  How Many Trees in a Random Forest? , 2012, MLDM.

[26]  Joshua B. Tenenbaum,et al.  Separating Style and Content with Bilinear Models , 2000, Neural Computation.

[27]  Foster J. Provost,et al.  Classification in Networked Data: a Toolkit and a Univariate Case Study , 2007, J. Mach. Learn. Res..

[28]  Gang Liu,et al.  MetaLDA: A Topic Model that Efficiently Incorporates Meta Information , 2017, 2017 IEEE International Conference on Data Mining (ICDM).

[29]  Barry Smyth,et al.  A comparative study of collaboration-based reputation models for social recommender systems , 2013, User Modeling and User-Adapted Interaction.

[30]  G. Vachtsevanos,et al.  Epileptic Seizures May Begin Hours in Advance of Clinical Onset A Report of Five Patients , 2001, Neuron.

[31]  James M. Keller,et al.  A fuzzy K-nearest neighbor algorithm , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[32]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[33]  Jesús Alcalá-Fdez,et al.  KEEL Data-Mining Software Tool: Data Set Repository, Integration of Algorithms and Experimental Analysis Framework , 2011, J. Multiple Valued Log. Soft Comput..

[34]  Mohammad Hossein Fazel Zarandi,et al.  From type-2 fuzzy rate-based neural networks to social networks' behaviors , 2016, 2016 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[35]  Michael R. Lyu,et al.  Information propagation in social rating networks , 2012, CIKM '12.

[36]  Namrata Tapaswi,et al.  Social network comment classification using fuzzy based classifier technique , 2016, 2016 Symposium on Colossal Data Analysis and Networking (CDAN).

[37]  Pablo M. Granitto,et al.  Solving Nonstationary Classification Problems With Coupled Support Vector Machines , 2011, IEEE Transactions on Neural Networks.

[38]  Kaizhu Huang,et al.  Pattern Field Classification with Style Normalized Transformation , 2011, IJCAI.

[39]  George Nagy,et al.  Style consistent classification of isogenous patterns , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[40]  Zhaohong Deng,et al.  Scalable TSK Fuzzy Modeling for Very Large Datasets Using Minimal-Enclosing-Ball Approximation , 2011, IEEE Transactions on Fuzzy Systems.

[41]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[42]  Ming Shao,et al.  Learning Consensus Representation for Weak Style Classification , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43]  Dimitrios I. Fotiadis,et al.  Epileptic Seizure Detection in EEGs Using Time–Frequency Analysis , 2009, IEEE Transactions on Information Technology in Biomedicine.

[44]  Krishna P. Gummadi,et al.  A measurement-driven analysis of information propagation in the flickr social network , 2009, WWW '09.