Web Social Mining

With increasing user presence in the Web and Web 2.0, Web social mining becomes an important and challenging task that finds a wide range of new applications relevant to e-commerce and social software. In this entry, we describe three Web social mining topics, namely, social network discovery, social network analysis, and social network applications. The essential concepts, models, and techniques of these Web social mining topics will be surveyed so as to establish the basic foundation for developing novel applications and for conducting research

[1]  David B. Skillicorn,et al.  Structure in the Enron Email Dataset , 2005, Comput. Math. Organ. Theory.

[2]  Linton C. Freeman,et al.  Cliques, Galois lattices, and the structure of human social groups☆ , 1996 .

[3]  Ravi Kumar,et al.  Structure and evolution of blogspace , 2004, CACM.

[4]  Christos Faloutsos,et al.  Graph mining: Laws, generators, and algorithms , 2006, CSUR.

[5]  Shou-De Lin,et al.  Unsupervised link discovery in multi-relational data via rarity analysis , 2003, Third IEEE International Conference on Data Mining.

[6]  Yun Chi,et al.  Identifying opinion leaders in the blogosphere , 2007, CIKM '07.

[7]  Hector Garcia-Molina,et al.  The Eigentrust algorithm for reputation management in P2P networks , 2003, WWW '03.

[8]  Sameer Patil,et al.  Who gets to know what when: configuring privacy permissions in an awareness application , 2005, CHI.

[9]  Tim O'Reilly,et al.  What is Web 2.0: Design Patterns and Business Models for the Next Generation of Software , 2007 .

[10]  Mark S. Granovetter Threshold Models of Collective Behavior , 1978, American Journal of Sociology.

[11]  Jakob Nielsen,et al.  Automating the assignment of submitted manuscripts to reviewers , 1992, SIGIR '92.

[12]  Michael F. Schwartz,et al.  Discovering shared interests using graph analysis , 1993, CACM.

[13]  Amin Saberi,et al.  Exploring the community structure of newsgroups , 2004, KDD.

[14]  Takashi Washio,et al.  An Apriori-Based Algorithm for Mining Frequent Substructures from Graph Data , 2000, PKDD.

[15]  Jörg Sander,et al.  Analysis of SIGMOD's co-authorship graph , 2003, SGMD.

[16]  Henry MacKay Walker,et al.  Variability of referees' ratings of conference papers , 2002, ITiCSE '02.

[17]  Paul Resnick,et al.  Recommender systems , 1997, CACM.

[18]  Paul Resnick,et al.  Reputation systems , 2000, CACM.

[19]  H. Small A Co-Citation Model of a Scientific Specialty: A Longitudinal Study of Collagen Research , 1977 .

[20]  Caroline Haythornthwaite,et al.  Studying Online Social Networks , 2006, J. Comput. Mediat. Commun..

[21]  M. M. Kessler Bibliographic coupling between scientific papers , 1963 .

[22]  Ankur Teredesai,et al.  A Framework for Mining Instant Messaging Services , 2004 .

[23]  Raymond T. Ng,et al.  A Unified Notion of Outliers: Properties and Computation , 1997, KDD.

[24]  Ee-Peng Lim,et al.  Mining Relationship Graphs for Effective Business Objectives , 2002, PAKDD.

[25]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[26]  Jiming Liu,et al.  Community Mining from Signed Social Networks , 2007, IEEE Transactions on Knowledge and Data Engineering.

[27]  Matthew Richardson,et al.  Mining knowledge-sharing sites for viral marketing , 2002, KDD.

[28]  Valdis E. Krebs,et al.  Mapping Networks of Terrorist Cells , 2001 .

[29]  Hsinchun Chen,et al.  Untangling Criminal Networks: A Case Study , 2003, ISI.

[30]  Hsinchun Chen,et al.  Fighting organized crimes: using shortest-path algorithms to identify associations in criminal networks , 2004, Decis. Support Syst..

[31]  Ramakrishnan Srikant,et al.  Mining newsgroups using networks arising from social behavior , 2003, WWW '03.

[32]  Danah Boyd,et al.  Friendster and publicly articulated social networking , 2004, CHI EA '04.

[33]  Bart Selman,et al.  Referral Web: combining social networks and collaborative filtering , 1997, CACM.

[34]  Taher H. Haveliwala Topic-Sensitive PageRank: A Context-Sensitive Ranking Algorithm for Web Search , 2003, IEEE Trans. Knowl. Data Eng..

[35]  Ankur Teredesai,et al.  Extracting Social Networks from Instant Messaging Populations , 2004 .

[36]  Yan Huang,et al.  Discovering Spatial Co-location Patterns: A Summary of Results , 2001, SSTD.

[37]  Kathleen M. Carley,et al.  Exploration of communication networks from the Enron email corpus , 2005 .

[38]  Jacob Goldenberg,et al.  Talk of the Network: A Complex Systems Look at the Underlying Process of Word-of-Mouth , 2001 .

[39]  Ankur Teredesai,et al.  Modeling spread of ideas in online social networks , 2006 .

[40]  Mary J. Culnan,et al.  The intellectual development of management information systems, 1972-1982: a co-citation analysis , 1986 .

[41]  Robert Wilensky,et al.  An algorithm for automated rating of reviewers , 2001, JCDL '01.

[42]  P. Bonacich TECHNIQUE FOR ANALYZING OVERLAPPING MEMBERSHIPS , 1972 .

[43]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[44]  Les Carr,et al.  Trailblazing the literature of hypertext: author co-citation analysis (1989–1998) , 1999, HYPERTEXT '99.

[45]  George Karypis,et al.  Discovering frequent geometric subgraphs , 2007, Inf. Syst..

[46]  Jimeng Sun,et al.  Neighborhood formation and anomaly detection in bipartite graphs , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[47]  Bülent Yener,et al.  Graph Theoretic and Spectral Analysis of Enron Email Data , 2005, Comput. Math. Organ. Theory.

[48]  Darren Leigh,et al.  Social net: using patterns of physical proximity over time to infer shared interests , 2002, CHI Extended Abstracts.

[49]  Prabhakar Raghavan,et al.  A Linear Method for Deviation Detection in Large Databases , 1996, KDD.

[50]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[51]  Allan Borodin,et al.  Link analysis ranking: algorithms, theory, and experiments , 2005, TOIT.

[52]  Matthew Richardson,et al.  Mining the network value of customers , 2001, KDD '01.

[53]  Cliff Lampe,et al.  A face(book) in the crowd: social Searching vs. social browsing , 2006, CSCW '06.

[54]  Ling Liu,et al.  PeerTrust: supporting reputation-based trust for peer-to-peer electronic communities , 2004, IEEE Transactions on Knowledge and Data Engineering.

[55]  Jaswinder Pal Singh,et al.  Computing and using reputations for internet ratings , 2001, EC '01.

[56]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[57]  Munindar P. Singh,et al.  Searching social networks , 2003, AAMAS '03.

[58]  Randy Goebel,et al.  DBconnect: mining research community on DBLP data , 2007, WebKDD/SNA-KDD '07.

[59]  Jiawei Han,et al.  CloseGraph: mining closed frequent graph patterns , 2003, KDD '03.

[60]  Lada A. Adamic,et al.  Friends and neighbors on the Web , 2003, Soc. Networks.

[61]  Yehuda Koren,et al.  Measuring and extracting proximity in networks , 2006, KDD '06.

[62]  Ee-Peng Lim,et al.  Measuring article quality in wikipedia: models and evaluation , 2007, CIKM '07.

[63]  Kathleen M. Carley A Theory of Group Stability , 1991 .

[64]  Phillip Bonacich,et al.  Simultaneous group and individual centralities , 1991 .

[65]  Henry G. Small,et al.  Co-citation in the scientific literature: A new measure of the relationship between two documents , 1973, J. Am. Soc. Inf. Sci..

[66]  James A. Hendler,et al.  Inferring binary trust relationships in Web-based social networks , 2006, TOIT.

[67]  Lawrence B. Holder,et al.  Graph-based Data Mining on Social Networks , 2004 .

[68]  C. Lee Giles,et al.  Self-Organization and Identification of Web Communities , 2002, Computer.

[69]  Christos Faloutsos,et al.  Fast discovery of connection subgraphs , 2004, KDD.

[70]  George Karypis,et al.  Frequent subgraph discovery , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[71]  E. J. Barboni,et al.  Co-Citation Analyses of Science: An Evaluation , 1977 .

[72]  Christos Faloutsos,et al.  Center-piece subgraphs: problem definition and fast solutions , 2006, KDD '06.

[73]  HweeHwa Pang,et al.  Mining social network from spatio-temporal events , 2005 .

[74]  Alex Pentland,et al.  Reality mining: sensing complex social systems , 2006, Personal and Ubiquitous Computing.

[75]  Linton C. Freeman,et al.  The Sociological Concept of "Group": An Empirical Test of Two Models , 1992, American Journal of Sociology.

[76]  Shou-de Lin,et al.  Issues of Verification for Unsupervised Discovery Systems , 2004 .

[77]  Katherine W. McCain,et al.  Visualizing a discipline: an author co-citation analysis of information science, 1972–1995 , 1998 .

[78]  Jiawei Han,et al.  gSpan: graph-based substructure pattern mining , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[79]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[80]  Jon M. Kleinberg,et al.  Group formation in large social networks: membership, growth, and evolution , 2006, KDD '06.

[81]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[82]  Michael J. Pazzani,et al.  Mining for proposal reviewers: lessons learned at the national science foundation , 2006, KDD '06.

[83]  C. Lee Giles,et al.  Efficient identification of Web communities , 2000, KDD '00.

[84]  Lawrence B. Holder,et al.  Graph-Based Data Mining , 2000, IEEE Intell. Syst..

[85]  Jure Leskovec,et al.  The dynamics of viral marketing , 2005, EC '06.

[86]  Alex Pentland,et al.  Sensing and modeling human networks using the sociometer , 2003, Seventh IEEE International Symposium on Wearable Computers, 2003. Proceedings..

[87]  S. Feld The Focused Organization of Social Ties , 1981, American Journal of Sociology.

[88]  Ramanathan V. Guha,et al.  Propagation of trust and distrust , 2004, WWW '04.

[89]  Jun Zhang,et al.  SWIM: fostering social network based information search , 2004, CHI EA '04.