Semantic analytics on social networks: experiences in addressing the problem of conflict of interest detection

In this paper, we describe a Semantic Web application that detects Conflict of Interest (COI) relationships among potential reviewers and authors of scientific papers. This application discovers various 'semantic associations' between the reviewers and authors in a populated ontology to determine a degree of Conflict of Interest. This ontology was created by integrating entities and relationships from two social networks, namely "knows," from a FOAF (Friend-of-a-Friend) social network and "co-author," from the underlying co-authorship network of the DBLP bibliography. We describe our experiences developing this application in the context of a class of Semantic Web applications, which have important research and engineering challenges in common. In addition, we present an evaluation of our approach for real-life COI detection.

[1]  Jon Kleinberg,et al.  Maximizing the spread of influence through a social network , 2003, KDD '03.

[2]  Silvana Castano,et al.  Semantic integration of semistructured and structured data sources , 1999, SGMD.

[3]  Vassilis Christophides,et al.  RQL: a declarative query language for RDF , 2002, WWW.

[4]  Amit P. Sheth,et al.  Managing Semantic Content for the Web , 2002, IEEE Internet Comput..

[5]  Caroline Haythornthwaite,et al.  Studying Online Social Networks , 2006, J. Comput. Mediat. Commun..

[6]  Hsinchun Chen,et al.  Untangling Criminal Networks: A Case Study , 2003, ISI.

[7]  Jörg Sander,et al.  Analysis of SIGMOD's co-authorship graph , 2003, SGMD.

[8]  Lada A. Adamic,et al.  A social network caught in the Web , 2003, First Monday.

[9]  B. Wellman Structural analysis: From method and metaphor to theory and substance. , 1988 .

[10]  Amit P. Sheth From Semantic Search & Integration to Analytics , 2005, Semantic Interoperability and Integration.

[11]  John Townley The Streaming Search Engine That Reads Your Mind , 2005 .

[12]  Les Carr,et al.  Trailblazing the literature of hypertext: author co-citation analysis (1989–1998) , 1999, HYPERTEXT '99.

[13]  Bernardo A. Huberman,et al.  E-Mail as Spectroscopy: Automated Discovery of Community Structure within Organizations , 2005, Inf. Soc..

[14]  Amit P. Sheth,et al.  Semantics for the Semantic Web: The Implicit, the Formal and the Powerful , 2005, Int. J. Semantic Web Inf. Syst..

[15]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[16]  Amit P. Sheth,et al.  Semantic Association Identification and Knowledge Discovery for National Security Applications , 2005, J. Database Manag..

[17]  Jayant Madhavan,et al.  Reference reconciliation in complex information spaces , 2005, SIGMOD '05.

[18]  Bernardo A. Huberman,et al.  Email as spectroscopy: automated discovery of community structure within organizations , 2003 .

[19]  Ian Horrocks,et al.  Querying the Semantic Web: A Formal Approach , 2002, SEMWEB.

[20]  Ross Anderson,et al.  The Use of Information Retrieval Techniques for Intrusion Detection , 1997 .

[21]  Dimitris Plexousakis,et al.  CONFIOUS: Managing the Electronic Submission and Reviewing Process of Scientific Conferences , 2005, WISE.

[22]  Kenneth N. McKay,et al.  Out of the Ordinary: Finding Hidden Threats by Analyzing Unusual Behavior , 2005 .

[23]  Azadeh Shakery,et al.  Toward Entity Retrieval over Structured and Text Data , 2004 .

[24]  Amit P. Sheth Enterprise Applications of Semantic Web: The Sweet Spot of Risk and Compliance , 2005, Industrial Applications of Semantic Web.

[25]  Vipul Kashyap,et al.  Relationships at the Heart of Semantic Web: Modeling, Discovering, and Exploiting Complex Semantic Relationships , 2004 .

[26]  Valter Crescenzi,et al.  RoadRunner: Towards Automatic Data Extraction from Large Web Sites , 2001, VLDB.

[27]  Evelyne de Leuw,et al.  Connecting the dots , 1999, Nature Genetics.

[28]  Amit P. Sheth,et al.  Ρ-Queries: enabling querying for semantic associations on the semantic web , 2003, WWW '03.

[29]  Li Ding,et al.  Social Networking on the Semantic Web , 2005 .

[30]  Amit P. Sheth,et al.  Semantic Enhancement Engine: A Modular Document Enhancement Platform for Semantic Applications over Heterogeneous Content , 2002 .

[31]  Berthier A. Ribeiro-Neto,et al.  A brief survey of web data extraction tools , 2002, SGMD.

[32]  Stephen D. Berkowitz,et al.  An Introduction to Structural Analysis: The Network Approach to Social Research , 1983 .

[33]  Krys J. Kochut,et al.  BRAHMS: A WorkBench RDF Store and High Performance Memory System for Semantic Association Discovery , 2005, SEMWEB.

[34]  Chaomei Chen,et al.  Visualising Semantic Spaces and Author Co-Citation Networks in Digital Libraries , 1999, Inf. Process. Manag..

[35]  Amit P. Sheth,et al.  Discovering informative connection subgraphs in multi-relational graphs , 2005, SKDD.

[36]  Timothy W. Finin,et al.  Swoogle: a search and metadata engine for the semantic web , 2004, CIKM '04.

[37]  Ramanathan V. Guha,et al.  SemTag and seeker: bootstrapping the semantic web via automated semantic annotation , 2003, WWW '03.

[38]  M. Newman,et al.  The structure of scientific collaboration networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[39]  Alan F. Smeaton,et al.  Analysis of papers from twenty-five years of SIGIR conferences: what have we been doing for the last quarter of a century? , 2002, SIGF.

[40]  Joao Antonio Pereira,et al.  Linked: The new science of networks , 2002 .

[41]  Ramanathan V. Guha,et al.  Semantic search , 2003, WWW '03.

[42]  Amit P. Sheth,et al.  Ranking complex relationships on the semantic Web , 2005, IEEE Internet Computing.