Finding quasi core with simulated stacked neural networks

Studying networks is promising for diverse applications. We are often interested in exploring significant substructures in different types of real-life networks. Finding cliques, which denote a complete subgraph of a graph, is one such important problem in network analysis. Interestingly, many real-life networks often contain a significant number of almost (quasi) complete subgraphs, which are not entirely complete due to the presence of noise. Considering these networks as weighted adds further challenges to the problem. Finding quasi-complete subgraphs in weighted graphs has never been formally addressed. In this paper, we propose a stacked neural network model for finding out the largest quasi-complete module (core) in weighted graphs. We show the effectiveness of the proposed approach on DIMACS graphs. We also highlight its utility in analyzing scientific collaboration networks, social networks and biological networks.

[1]  Jure Leskovec,et al.  Learning to Discover Social Circles in Ego Networks , 2012, NIPS.

[2]  Christos Faloutsos,et al.  Graph evolution: Densification and shrinking diameters , 2006, TKDD.

[3]  Eyke Hüllermeier,et al.  Functional Classification of Protein Kinase Binding Sites Using Cavbase , 2007, ChemMedChem.

[4]  Ronald L. Rivest,et al.  Introduction to Algorithms , 1990 .

[5]  Boaz Patt-Shamir,et al.  Distributed discovery of large near-cliques , 2009, PODC '09.

[6]  Sanghamitra Bandyopadhyay,et al.  Analyzing Topological Properties of Protein–Protein Interaction Networks: A Perspective toward Systems Biology , 2010 .

[7]  Philip S. Yu,et al.  A graph-based approach to systematically reconstruct human transcriptional regulatory modules , 2007, ISMB/ECCB.

[8]  Elena Marchiori,et al.  Genetic, Iterated and Multistart Local Search for the Maximum Clique Problem , 2002, EvoWorkshops.

[9]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[10]  Panos M. Pardalos,et al.  On maximum clique problems in very large graphs , 1999, External Memory Algorithms.

[11]  E. A. Akkoyunlu,et al.  The Enumeration of Maximal Cliques of Large Graphs , 1973, SIAM J. Comput..

[12]  David Martin,et al.  Functional classification of proteins for the prediction of cellular function from a protein-protein interaction network , 2003, Genome Biology.

[13]  D. Du,et al.  Theory of Computational Complexity , 2000 .

[14]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[15]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[16]  Gregory Gutin,et al.  A selection of useful theoretical tools for the design and analysis of optimization heuristics , 2009, Memetic Comput..

[17]  Jingsheng Lei,et al.  An Improved Ant Colony Optimization for the Maximum Clique Problem , 2007, Third International Conference on Natural Computation (ICNC 2007).

[18]  Zheng Tang,et al.  An Improved Transiently Chaotic Neural Network with Multiple Chaotic Dynamics for Maximum Clique Problem , 2007, Second International Conference on Innovative Computing, Informatio and Control (ICICIC 2007).

[19]  A. Barabasi,et al.  Quantifying social group evolution , 2007, Nature.

[20]  C. Bron,et al.  Algorithm 457: finding all cliques of an undirected graph , 1973 .

[21]  J. J. Hopfield,et al.  “Neural” computation of decisions in optimization problems , 1985, Biological Cybernetics.

[22]  Jian Pei,et al.  On mining cross-graph quasi-cliques , 2005, KDD '05.

[23]  Shailesh V. Date,et al.  A Probabilistic Functional Network of Yeast Genes , 2004, Science.

[24]  Jiahai Wang,et al.  Maximum neural network with nonlinear self-feedback for maximum clique problem , 2004, Neurocomputing.

[25]  David S. Johnson,et al.  Cliques, Coloring, and Satisfiability , 1996 .

[26]  Sanghamitra Bandyopadhyay,et al.  Bounds on Quasi-Completeness , 2012, IWOCA.

[27]  Etsuji Tomita,et al.  Clique-based data mining for related genes in a biomedical database , 2009, BMC Bioinformatics.

[28]  Patric R. J. Östergård,et al.  A fast algorithm for the maximum clique problem , 2002, Discret. Appl. Math..

[29]  Zheng Tang,et al.  An Improved Chaotic Maximum Neural Network for Maximum Clique Problem , 2007 .

[30]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[31]  David R. Wood,et al.  An algorithm for finding a maximum clique in a graph , 1997, Oper. Res. Lett..

[32]  Rui Xu,et al.  Survey of clustering algorithms , 2005, IEEE Transactions on Neural Networks.

[33]  Panos M. Pardalos,et al.  The maximum clique problem , 1994, J. Glob. Optim..

[34]  Sandra Sudarsky,et al.  Massive Quasi-Clique Detection , 2002, LATIN.

[35]  Yoshiyasu Takefuji,et al.  A parallel neural network computing for the maximum clique problem , 1991, [Proceedings] 1991 IEEE International Joint Conference on Neural Networks.

[36]  Jin Xu,et al.  A simple simulated annealing algorithm for the maximum clique problem , 2007, Inf. Sci..

[37]  J. Barker,et al.  Large-scale temporal gene expression mapping of central nervous system development. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[38]  Yoshiyasu Takefuji,et al.  A parallel improvement algorithm for the bipartite subgraph problem , 1992, IEEE Trans. Neural Networks.

[39]  Stanislav Busygin,et al.  A new trust region technique for the maximum weight clique problem , 2006, Discret. Appl. Math..

[40]  J. Håstad Clique is hard to approximate withinn1−ε , 1999 .

[41]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[42]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43]  Sandhya Rani,et al.  Human Protein Reference Database—2009 update , 2008, Nucleic Acids Res..

[44]  Panos M. Pardalos,et al.  A branch and bound algorithm for the maximum clique problem , 1992, Comput. Oper. Res..

[45]  Jian Pei,et al.  Mining frequent cross-graph quasi-cliques , 2009, TKDD.

[46]  Michael Jünger,et al.  Experiments in quadratic 0–1 programming , 1989, Math. Program..

[47]  Panos M. Pardalos,et al.  Neural Network Models in Combinatorial Optimization , 2013 .

[48]  Sanghamitra Bandyopadhyay,et al.  Mining the Largest Dense Vertexlet in a Weighted Scale-free Graph , 2009, Fundam. Informaticae.