Social networking for scientists using tagging and shared bookmarks: a Web 2.0 application

Web-based social networks, online personal profiles, keyword tagging, and online bookmarking are staples of Web 2.0-style applications. In this paper we report our investigation and implementation of these capabilities as a means for creating communities of like-minded faculty and researchers, particularly at minority serving institutions. Our motivating problem is to provide outreach tools that broaden the participation of these groups in funded research activities, particularly in cyberinfrastructure and e-science. In this paper, we discuss the system design, implementation, social network seeding, and portal capabilities. Underlying our system, and folksonomy systems generally, is a graph- based data model that links external URLs, system users, and descriptive tags. We conclude with a survey of the applicability of clustering and other data mining techniques to these folksonomy graphs.

[1]  Elias Dahlhaus,et al.  Parallel Algorithms for Hierarchical Clustering and Applications to Split Decomposition and Parity Graph Recognition , 2000, J. Algorithms.

[2]  Rui Xu,et al.  Survey of clustering algorithms , 2005, IEEE Transactions on Neural Networks.

[3]  Geoffrey C. Fox,et al.  Web 2.0 for Grids and e-Science , 2003 .

[4]  Tim O'Reilly,et al.  What is Web 2.0: Design Patterns and Business Models for the Next Generation of Software , 2007 .

[5]  Kilian Stoffel,et al.  Parallel k/h-Means Clustering for Large Data Sets , 1999, Euro-Par.

[6]  Geoffrey C. Fox,et al.  High Performance Multi-paradigm Messaging Runtime Integrating Grids and Multicore Systems , 2007, Third IEEE International Conference on e-Science and Grid Computing (e-Science 2007).

[7]  Thomas Hofmann,et al.  Map-Reduce for Machine Learning on Multicore , 2007 .

[8]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[9]  Vipin Kumar,et al.  Chameleon: Hierarchical Clustering Using Dynamic Modeling , 1999, Computer.

[10]  George Karypis,et al.  Hierarchical Clustering Algorithms for Document Datasets , 2005, Data Mining and Knowledge Discovery.

[11]  Chris H. Q. Ding,et al.  K-means clustering via principal component analysis , 2004, ICML.

[12]  Valentin Robu,et al.  The complex dynamics of collaborative tagging , 2007, WWW '07.

[13]  Hugo Zaragoza,et al.  Information Retrieval: Algorithms and Heuristics , 2002, Information Retrieval.

[14]  Ron Shamir,et al.  A clustering algorithm based on graph connectivity , 2000, Inf. Process. Lett..

[15]  K. Rose Deterministic annealing for clustering, compression, classification, regression, and related optimization problems , 1998, Proc. IEEE.

[16]  Andreas Hotho,et al.  Information Retrieval in Folksonomies: Search and Ranking , 2006, ESWC.

[17]  Geoffrey C. Fox,et al.  Cyberinfrastructure and Web 2.0 , 2006, High Performance Computing Workshop.

[18]  Geoffrey C. Fox,et al.  SRG: A Digital Document-Enhanced Service Oriented Research Grid , 2007, 2007 IEEE International Conference on Information Reuse and Integration.