User Interest and Interaction Structure in Online Forums

We present a new similarity measure tailored to posts in an online forum. Our measure takes into account all the available information about user interest and interaction --- the content of posts, the threads in the forum, and the author of the posts. We use this post similarity to build a similarity between users, based on principal coordinate analysis. This allows easy visualization of the user activity as well. Similarity between users has numerous applications, such as clustering or classification. We show that including the author of a post in the post similarity has a smoothing effect on principal coordinate projections. We demonstrate our method on real data drawn from an internal corporate forum, and compare our results to those given by a standard document classification method. We conclude our method gives a more detailed picture of both the local and global network structure.

[1]  James Bennett,et al.  The Netflix Prize , 2007 .

[2]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[3]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[4]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[5]  P. Holland,et al.  An Exponential Family of Probability Distributions for Directed Graphs , 1981 .

[6]  William W. Cohen,et al.  Recommendation as Classification: Using Social and Content-Based Information in Recommendation , 1998, AAAI/IAAI.

[7]  Greg Linden,et al.  Amazon . com Recommendations Item-to-Item Collaborative Filtering , 2001 .

[8]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[9]  Geoffrey E. Hinton,et al.  Restricted Boltzmann machines for collaborative filtering , 2007, ICML '07.

[10]  Mark S. Ackerman,et al.  Competing to Share Expertise: The Taskcn Knowledge Sharing Community , 2021, ICWSM.

[11]  R. Prim Shortest connection networks and some generalizations , 1957 .

[12]  John Riedl,et al.  Application of Dimensionality Reduction in Recommender Systems , 2000 .

[13]  Weiss,et al.  Text Mining , 2010 .

[14]  Ben Taskar,et al.  Link Prediction in Relational Data , 2003, NIPS.

[15]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[16]  Tad Hogg,et al.  Diversity of online community activities , 2008, Hypertext.