Automatic extraction of social networks by topics of interest

This paper presents an automatic process to extract from internet the scientific community interested in a topic. The process is based on e-mails co-occurrences and it obtains the invisible colleges and subtopics of the community. For this objective, it is built a new framework that uses query results to search engines for integrating different internet bibliography sources. For illustrative purposes, this technique is applied to extract the social network of participants in the Spanish Conference on Software Engineering and Database.

[1]  Ahmed K. Elmagarmid,et al.  Duplicate Record Detection: A Survey , 2007, IEEE Transactions on Knowledge and Data Engineering.

[2]  Kevin Crowston,et al.  The social structure of free and open source software development , 2005, First Monday.

[3]  Bart Selman,et al.  The Hidden Web , 1997, AI Mag..

[4]  Peter Mika,et al.  Flink: Semantic Web technology for the extraction and analysis of social networks , 2005, J. Web Semant..

[5]  P. Jaccard,et al.  Etude comparative de la distribution florale dans une portion des Alpes et des Jura , 1901 .

[6]  H. P. Luhn Key word‐in‐context index for technical literature (kwic index) , 1960 .

[7]  Kôiti Hasida,et al.  POLYPHONET: an advanced social network extraction system from the web , 2006, WWW '06.

[8]  Rafael M. Gasca,et al.  Sistemas de Inteligencia Web basados en redes sociales , 2007 .

[9]  Oliver Mason Programming for corpus linguistics , 2000 .

[10]  Paul Mutton,et al.  Inferring and visualizing social networks on Internet relay chat , 2004, Proceedings. Eighth International Conference on Information Visualisation, 2004. IV 2004..

[11]  Andrew McCallum,et al.  Extracting social networks and contact information from email and the Web , 2004, CEAS.

[12]  P. Mutton Inferring and visualizing social networks on Internet relay chat , 2004 .

[13]  Kôiti Hasida,et al.  Mining social network of conference participants from the Web , 2003, Proceedings IEEE/WIC International Conference on Web Intelligence (WI 2003).