Web Clustering Using Social Bookmark Data Regarding User Network

Social Bookmark services are web services which enable users to share their personal bookmark data with other users. In our research, we propose a web clustering method in a social bookmark service using bookmark data modified with user network data. Here, we define bookmark data as a pair of a user and a webpage he bookmarks, and user network data as links between users and their favorite users. In more detail, we use Probabilistic Latent Semantic Indexing (PLSI) which enables to project social bookmark data on latent semantic space and classify them by their semantic attributes. Using PLSI, we can classify webpages according to their topics, and regarding user network, we can append all bookmark data of user, who are reachable for a user, to his bookmark data. To confirm an effectiveness of our proposed clustering method, we have evaluated a clustering examination using real Social Bookmark Data. And finally we have confirmed that our proposed method could have classified webpages in the same topic.