Existing works in user profiling suffers from two well known problems in IR: polysemy and synonymy. Enriching semantics to terms that represent user interests disambiguate it’s context, polysemous topics, and synonyms. One way of enriching semantics to terms is by grouping related terms together into clusters. This work exploits users’ tweets to build a Contextualized User Interest Profile(CUIP) that consist of clusters of (semantically) related terms and their term-weights. We propose two approaches to build the CUIP: svdCUIP based on Singular Value Decomposition (SVD); and, modsvdCUIP based on modded SVD (modSVD). Experimental results show that the clustering tendency and accuracy of the modsvdCUIP cluster structure is far more superior than the svdCUIP cluster structure.
[1]
Ali S. Hadi,et al.
Finding Groups in Data: An Introduction to Chster Analysis
,
1991
.
[2]
T. Landauer,et al.
Indexing by Latent Semantic Analysis
,
1990
.
[3]
Christoph Meinel,et al.
Web Search Personalization Via Social Bookmarking and Tagging
,
2007,
ISWC/ASWC.
[5]
Jose L. Marzo,et al.
User Modeling, Adaption and Personalization - 19th International Conference, UMAP 2011, Girona, Spain, July 11-15, 2011. Proceedings
,
2011,
UMAP.
[6]
Hong-Gee Kim,et al.
Using folksonomies for building user interest profile
,
2011,
UMAP'11.