Bayes net graphs to understand co-authorship networks?

Improvements in data collection and the birth of online communities made it possible to obtain very large social networks (graphs). Several communities have been involved in modeling and analyzing these graphs. Usage of graphical models, such as Bayesian Networks (BN), to analyze massive data has become increasingly popular, due to their scalability and robustness to noise. In the literature BNs are primarily used for compact representation of joint distributions and to perform inference, i.e. answer queries about the data. In this work we learn Bayes Nets using the previously proposed SBNS algorithm [14]. We look at the learned networks for the purpose of analyzing the graph structure itself. We also point out a few improvements over the SBNS algorithm. The usefulness of Bayes Net structures to understand social networks is an open area. We discuss possible interpretations using a small subgraph of the Medline publications and hope to provoke some discussion and interest in further analysis.

[1]  Wray L. Buntine Theory Refinement on Bayesian Networks , 1991, UAI.

[2]  Andrew W. Moore,et al.  Using Tarjan's Red Rule for Fast Dependency Tree Construction , 2002, NIPS.

[3]  Heikki Mannila,et al.  Mixture Models and Frequent Sets: Combining Global and Local Methods for 0-1 Data , 2003, SDM.

[4]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[5]  Nir Friedman,et al.  Learning Bayesian Network Structure from Massive Datasets: The "Sparse Candidate" Algorithm , 1999, UAI.

[6]  S. Wasserman,et al.  Logit models and logistic regressions for social networks: III. Valued relations , 1999 .

[7]  P. Pattison,et al.  New Specifications for Exponential Random Graph Models , 2006 .

[8]  Carter T. Butts,et al.  Network inference, error, and informant (in)accuracy: a Bayesian approach , 2003, Soc. Networks.

[9]  D. Lauffenburger,et al.  Network inference , 2005 .

[10]  Geoff Hulten,et al.  Mining complex models from arbitrarily large databases in constant time , 2002, KDD.

[11]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[12]  David Maxwell Chickering,et al.  Learning Bayesian Networks: The Combination of Knowledge and Statistical Data , 1994, Machine Learning.

[13]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[14]  Reka Albert,et al.  Mean-field theory for scale-free random networks , 1999 .

[15]  Anna Goldenberg,et al.  Tractable learning of large Bayes net structures from sparse data , 2004, ICML.

[16]  Peter D. Hoff Random Effects Models for Network Data , 2003 .

[17]  Richard E. Neapolitan,et al.  Learning Bayesian networks , 2007, KDD '07.

[18]  Jacob L. Moreno,et al.  Statistics of Social Configurations , 1938 .

[19]  Stephen E. Fienberg,et al.  Discrete Multivariate Analysis: Theory and Practice , 1976 .

[20]  Martin A. Nowak,et al.  Inferring Cellular Networks Using Probabilistic Graphical Models , 2004 .

[21]  Heikki Mannila,et al.  Beyond Independence: Probabilistic Models for Query Approximation on Binary Transaction Data , 2003, IEEE Trans. Knowl. Data Eng..

[22]  Lise Getoor,et al.  SRL2003 IJCAI 2003 Workshop on Learning Statistical Models from Relational Data , 2003 .

[23]  Gregory F. Cooper,et al.  A Bayesian Method for Constructing Bayesian Belief Networks from Databases , 1991, UAI.

[24]  Paul Erdös,et al.  On random graphs, I , 1959 .

[25]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[26]  Yuchung J. Wang,et al.  Stochastic Blockmodels for Directed Graphs , 1987 .

[27]  Marina Meila,et al.  An Accelerated Chow and Liu Algorithm: Fitting Tree Distributions to High-Dimensional Sparse Data , 1999, ICML.

[28]  M. Newman,et al.  The structure of scientific collaboration networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[29]  Sandra Gonzalez,et al.  Dynamic Social Network Modelling and Analysis: Workshop Summary and Papers Edited by Ronald L. Breiger, Kathleen M. Carley and Philippa Pattison , 2003, J. Artif. Soc. Soc. Simul..

[30]  Tom A. B. Snijders,et al.  Markov Chain Monte Carlo Estimation of Exponential Random Graph Models , 2002, J. Soc. Struct..

[31]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[32]  M. Handcock Center for Studies in Demography and Ecology Assessing Degeneracy in Statistical Models of Social Networks , 2005 .

[33]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[34]  Peter D. Hoff,et al.  Latent Space Approaches to Social Network Analysis , 2002 .