Improving research paper searching with social tagging — A preliminary investigation

The WWW provides an efficient way to store and share information, and search engines and social bookmarking systems are important tools for web resource discovery. This study investigated three indexing approaches applied to CiteULike, a social bookmarking system for tagging academic research papers: Tag only; Title with Abstract; and Tag, Title with Abstract. The three approaches were evaluated using mean values of Normalized Discounted Cumulative Gain (NDCG). The preliminary results showed that indexing using “Tag, Title with Abstract” performed best. This initial evaluation of our implementation suggests that these designs might improve the accuracy and efficiency of web resource searching on social bookmarking systems, not only in academia but also in other domains.
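As a rough illustration of the evaluation measure named above, the following Python sketch computes NDCG for ranked result lists and averages it over queries. It is not the study's implementation. The log2(rank + 1) discount is one common NDCG variant and is an assumption here, as are the graded relevance values and the function names `dcg`, `ndcg`, and `mean_ndcg`. The ideal ordering is taken from the retrieved list only, which is a simplification.

```python
import math


def dcg(relevances):
    """Discounted cumulative gain of a ranked list of graded relevance scores.

    Assumes the common log2(rank + 1) discount; the study may have used
    a different variant.
    """
    return sum(rel / math.log2(rank + 1)
               for rank, rel in enumerate(relevances, start=1))


def ndcg(relevances):
    """DCG divided by the DCG of the same scores sorted best-first."""
    ideal = dcg(sorted(relevances, reverse=True))
    return dcg(relevances) / ideal if ideal > 0 else 0.0


def mean_ndcg(runs):
    """Average NDCG over several queries (one relevance list per query)."""
    if not runs:
        raise ValueError("at least one query result list is required")
    return sum(ndcg(run) for run in runs) / len(runs)


# Hypothetical relevance judgments for two queries under one indexing approach.
example_runs = [[3, 2, 1], [0, 1]]
score = mean_ndcg(example_runs)
```

Under these assumptions, each indexing approach (Tag only; Title with Abstract; Tag, Title with Abstract) would produce its own set of ranked lists for the same queries, and the approach with the highest mean score would be judged best.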
