A Web mining approach for finding expertise in research areas

Finding expertise in a research area helps researchers identify the experts working in that area. This paper proposes a web mining approach for finding expertise in scientific research areas. In this approach, Indexing Agents search for and download scientific publications from web sites, typically academic web pages, then extract the citations and store them in a Web Citation Database. Researcher information is stored in a Researcher Database. Data mining techniques are applied to the citation keywords and authors in the Web Citation Database to form document clusters and author clusters. A Multi-Clustering technique is proposed that mines the combined information of the document clusters and author clusters to identify expertise in specified research areas.
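The general idea of going from clustered citations to a list of candidate experts can be illustrated with a minimal sketch. It is not the Multi-Clustering technique itself, whose details the abstract does not give. The toy records, the author names, the Jaccard threshold, and the functions `cluster_documents` and `rank_experts` are all hypothetical. The sketch uses only document clusters and publication counts, and leaves out author clusters entirely.

```python
from collections import Counter

# Hypothetical stand-in for a Web Citation Database:
# each record is (citation keywords, author names).
CITATIONS = [
    ({"web", "mining", "citation"}, ["A", "B"]),
    ({"web", "mining", "agents"}, ["A"]),
    ({"clustering", "partitioning"}, ["C"]),
    ({"citation", "mining", "retrieval"}, ["A", "B"]),
]


def jaccard(a, b):
    """Jaccard similarity between two non-empty keyword sets."""
    return len(a & b) / len(a | b)


def cluster_documents(citations, threshold=0.3):
    """Greedy single-pass clustering on keyword overlap (an assumed method).

    A record joins the first cluster whose seed (first member) is at least
    `threshold` similar; otherwise it starts a new cluster.
    """
    clusters = []  # each cluster is a list of record indices
    for i, (keywords, _) in enumerate(citations):
        for members in clusters:
            seed_keywords = citations[members[0]][0]
            if jaccard(keywords, seed_keywords) >= threshold:
                members.append(i)
                break
        else:
            clusters.append([i])
    return clusters


def rank_experts(citations, clusters, area_keyword):
    """Rank authors by publication count in clusters that mention the area."""
    counts = Counter()
    for members in clusters:
        if any(area_keyword in citations[i][0] for i in members):
            for i in members:
                counts.update(citations[i][1])
    return counts.most_common()


clusters = cluster_documents(CITATIONS)
print(rank_experts(CITATIONS, clusters, "mining"))
```

Ranking by raw publication count is the simplest possible choice here. A fuller treatment would also need the author clusters that the abstract describes.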
