Modeling blogger influence in a community

Blogging has become a popular and convenient way to communicate, publish information, share preferences, voice opinions, provide suggestions, report news, and form virtual communities in the Blogosphere. The blogosphere obeys a power law distribution with very few blogs being extremely influential and a huge number of blogs being largely unknown. Regardless of a (multi-author) blog being influential or not, there are influential bloggers. However, the sheer number of such blogs makes it extremely challenging to study each one of them. One way to analyze these blogs is to find influential bloggers and consider them as the community representatives. Influential bloggers can impact fellow bloggers in various ways. In this paper, we study the problem of identifying influential bloggers. We define influential bloggers, investigate their characteristics, discuss the challenges with identification, develop a model to quantify their influence, and pave the way for further research leading to more sophisticated models that enable categorization of various types of influential bloggers. To highlight these issues, we conduct experiments using data from blogs, evaluate multiple facets of the problem, and present a unique and objective evaluation strategy given the subjectivity in defining the influence, in addition to various other analytical capabilities. We conclude with interesting findings and future work.

[1]  P. Lazarsfeld,et al.  Voting: A Study of Opinion Formation in a Presidential Campaign. , 1955 .

[2]  Michael Stefanone,et al.  Writing for Friends and Family: The Interpersonal Nature of Blogs , 2007, J. Comput. Mediat. Commun..

[3]  J. Berry The Influentials: One American in Ten Tells the Other Nine How to Vote, Where to Eat, and What to Buy , 2003 .

[4]  John M. Carroll,et al.  When opinion leaders blog: new forms of citizen interaction , 2006, DG.O.

[5]  Andrew McCallum,et al.  Mining a digital library for influential authors , 2007, JCDL '07.

[6]  Jun'ichi Tatemura,et al.  Discovering Important Bloggers based on Analyzing Blog Threads , 2005 .

[7]  Ray J. Paul,et al.  Visualizing a Knowledge Domain's Intellectual Structure , 2001, Computer.

[8]  Ramanathan V. Guha,et al.  Information diffusion through blogspace , 2004, WWW '04.

[9]  Gene H. Golub,et al.  Matrix computations , 1983 .

[10]  Gene H. Golub,et al.  Matrix computations (3rd ed.) , 1996 .

[11]  Tim O'Reilly,et al.  What is Web 2.0: Design Patterns and Business Models for the Next Generation of Software , 2007 .

[12]  Tim Oates,et al.  Modeling the Spread of Influence on the Blogosphere , 2006 .

[13]  Laks V. S. Lakshmanan,et al.  Learning influence probabilities in social networks , 2010, WSDM '10.

[14]  M. Thelwall Bloggers during the London attacks: Top information sources and topics , 2006 .

[15]  Gilad Mishne,et al.  Deriving wishlists from blogs show us your blog, and we'll tell you what books to buy , 2006, WWW '06.

[16]  P. Bonacich Power and Centrality: A Family of Measures , 1987, American Journal of Sociology.

[17]  Christos Faloutsos,et al.  Cascading Behavior in Large Blog Graphs , 2007 .

[18]  Wei Chen,et al.  Efficient influence maximization in social networks , 2009, KDD.

[19]  D. Watts,et al.  Viral Marketing for the Real World Duncan J. Watts, Jonah Peretti, and Michael Frumin , 2007 .

[20]  T.R. Coffman,et al.  Dynamic classification of groups through social network analysis and HMMs , 2004, 2004 IEEE Aerospace Conference Proceedings (IEEE Cat. No.04TH8720).

[21]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[22]  E. Katz The Two-Step Flow of Communication: An Up-To-Date Report on an Hypothesis , 1957 .

[23]  Yun Chi,et al.  Splog detection using self-similarity analysis on blog temporal dynamics , 2007, AIRWeb '07.

[24]  Jon Kleinberg,et al.  Maximizing the spread of influence through a social network , 2003, KDD '03.

[25]  Qi He,et al.  TwitterRank: finding topic-sensitive influential twitterers , 2010, WSDM '10.

[26]  Kathy E. Gill How can we measure the influence of the blogosphere? , 2004 .

[27]  Andreas Krause,et al.  Cost-effective outbreak detection in networks , 2007, KDD '07.

[28]  Michalis Faloutsos,et al.  On power-law relationships of the Internet topology , 1999, SIGCOMM '99.

[29]  R. Armstrong The Long Tail: Why the Future of Business Is Selling Less of More , 2008 .

[30]  Ee-Peng Lim,et al.  Measuring article quality in wikipedia: models and evaluation , 2007, CIKM '07.

[31]  Hsinchun Chen,et al.  A framework for authorship identification of online messages: Writing-style features and classification techniques , 2006 .

[32]  Mark H. Chignell,et al.  A social hypertext model for finding community in blogs , 2006, HYPERTEXT '06.

[33]  E. Rogers,et al.  Diffusion of innovations , 1964, Encyclopedia of Sport Management.

[34]  Kees Niemöller,et al.  Applied network analysis , 1980 .

[35]  Ramanathan V. Guha,et al.  The predictive power of online chatter , 2005, KDD '05.

[36]  P. Lazarsfeld,et al.  Personal Influence: The Part Played by People in the Flow of Mass Communications , 1956 .

[37]  Philip Yu,et al.  Searching for “ Familiar Strangers ” on Blogosphere : Problems and Challenges , 2007 .

[38]  Huan Liu,et al.  A Social Identity Approach to Identify Familiar Strangers in a Social Network , 2009, ICWSM.

[39]  D. Watts,et al.  Influentials, Networks, and Public Opinion Formation , 2007 .

[40]  Philip S. Yu,et al.  Truth Discovery with Multiple Conflicting Information Providers on the Web , 2007, IEEE Transactions on Knowledge and Data Engineering.

[41]  Qiang Yang,et al.  Exploring in the weblog space by detecting informative and affective articles , 2007, WWW '07.

[42]  E. Rogers,et al.  Communication of Innovations; A Cross-Cultural Approach. , 1974 .

[43]  Henk F. Moed,et al.  Citation Analysis in Research Evaluation , 1899 .

[44]  P. Lazarsfeld,et al.  The people's choice. , 1945 .

[45]  Anat Rachel Shimoni,et al.  Gender, genre, and writing style in formal written texts , 2003 .

[46]  Timothy W. Finin,et al.  SVMs for the Blogosphere: Blog Identification and Splog Detection , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[47]  C. Spearman The proof and measurement of association between two things. , 2015, International journal of epidemiology.

[48]  Jacob Goldenberg,et al.  Talk of the Network: A Complex Systems Look at the Underlying Process of Word-of-Mouth , 2001 .

[49]  R. Merton Social Theory and Social Structure , 1958 .

[50]  Iraklis Varlamis,et al.  BlogRank: ranking weblogs based on connectivity and similarity features , 2006, AAA-IDEA '06.

[51]  Dan Gillmor,et al.  We the media - grassroots journalism by the people, for the people , 2006 .

[52]  R. L. Keeney,et al.  Decisions with Multiple Objectives: Preferences and Value Trade-Offs , 1977, IEEE Transactions on Systems, Man, and Cybernetics.

[53]  Joel Podolny Status Signals: A Sociological Study of Market Competition , 2005 .

[54]  Robert Scoble,et al.  Naked Conversations: How Blogs are Changing the Way Businesses Talk with Customers , 2006 .

[55]  Yun Chi,et al.  Identifying opinion leaders in the blogosphere , 2007, CIKM '07.

[56]  J. Coleman,et al.  Medical Innovation: A Diffusion Study. , 1967 .

[57]  D. Watts,et al.  Viral marketing for the real world , 2007 .

[58]  Rajeev Motwani,et al.  Randomized algorithms , 1996, CSUR.

[59]  L. Guest The People's Choice: How the Voter Makes Up His Mind in a Presidential Campaign. , 1946 .

[60]  Huan Liu,et al.  BlogTrackers: A Tool for Sociologists to Track and Analyze Blogosphere , 2009, ICWSM.

[61]  Ralph L. Keeney,et al.  Decisions with multiple objectives: preferences and value tradeoffs , 1976 .

[62]  M. Kendall A NEW MEASURE OF RANK CORRELATION , 1938 .

[63]  Jimeng Sun,et al.  Social influence analysis in large-scale networks , 2009, KDD.

[64]  P. Lazarsfeld,et al.  6. Katz, E. Personal Influence: The Part Played by People in the Flow of Mass Communications , 1956 .

[65]  Gerald D. Fensterer,et al.  Planning and Assessing Stability Operations: A Proposed Value Focus Thinking Approach , 2012 .

[66]  Matthew Richardson,et al.  Mining knowledge-sharing sites for viral marketing , 2002, KDD.

[67]  Daniel W. Drezner,et al.  The power and politics of blogs , 2007 .