Ranking Entities on the Web using Social Network Mining and Ranking Learning

Social networks have garnered much attention recently. Several studies have been undertaken to extract social networks among people, companies, and so on automatically from the web. For use in social sciences, social networks enable analyses of the performance and valuation of companies. This paper describes an attempt to learn ranking of entities from a social network that has been mined from the web. In our approach, we first extract different kinds of relational data from the web. We construct social networks using several relevance measures in addition to text analysis. Subsequently, the relations are integrated to maximize the ranking predictability. We also integrate several relations into a combined-relational network and use the latest ranking learning algorithm to obtain the ranking model. Additionally, we propose the use of centrality scores of companies on the network as features for ranking. We conducted two experiments on a social network among companies to learn the ranking of market capitalization, and on a social network among researchers for ranking of researchers’ productivity. This study specifically examines a new approach to using web information for advanced analysis by integrating multiple relations among named entities.

[1]  Amit P. Sheth,et al.  Semantic analytics on social networks: experiences in addressing the problem of conflict of interest detection , 2006, WWW '06.

[2]  Mitsuru Ishizuka,et al.  Extracting Social Networks Among Various Entities on the Web , 2007, ESWC.

[3]  Yutaka Matsuo,et al.  Real-world oriented information sharing using social networks , 2005, GROUP '05.

[4]  Ralph Grishman,et al.  Discovering Relations among Named Entities from Large Corpora , 2004, ACL.

[5]  Wolfgang Nejdl,et al.  Semantically Rich Recommendations in Social Networks for Sharing, Exchanging and Ranking Semantic Context , 2005, International Semantic Web Conference.

[6]  Paul Buitelaar,et al.  RelExt: A Tool for Relation Extraction from Text in Ontology Extension , 2005, SEMWEB.

[7]  Mitsuru Ishizuka,et al.  Extracting Inter-Firm Networks fromWorldWideWeb , 2007, The 9th IEEE International Conference on E-Commerce Technology and The 4th IEEE International Conference on Enterprise Computing, E-Commerce and E-Services (CEC-EEE 2007).

[8]  Nanda Kambhatla,et al.  Combining Lexical, Syntactic, and Semantic Features with Maximum Entropy Models for Information Extraction , 2004, ACL.

[9]  Dean M. Behrens,et al.  Redundant governance structures: an analysis of structural and relational embeddedness in the steel and semiconductor industries , 2000 .

[10]  Lise Getoor,et al.  Link mining: a survey , 2005, SKDD.

[11]  Andrew McCallum,et al.  Extracting social networks and contact information from email and the Web , 2004, CEAS.

[12]  David A. Cohn,et al.  Creating customized authority lists , 1999, ICML 1999.

[13]  B. Uzzi,et al.  Social Structure and Competition in Interfirm Networks: The Paradox of Embeddedness , 1997 .

[14]  Soumen Chakrabarti,et al.  Learning to rank networked entities , 2006, KDD '06.

[15]  Bart Selman,et al.  The Hidden Web , 1997, AI Mag..

[16]  Peter Mika,et al.  Flink: Semantic Web technology for the extraction and analysis of social networks , 2005, J. Web Semant..

[17]  Peter Mika Ontologies Are Us: A Unified Model of Social Networks and Semantics , 2005, International Semantic Web Conference.

[18]  Vojtech Svátek,et al.  Discovery of Lexical Entries for Non-taxonomic Relations in Ontology Learning , 2004, SOFSEM.

[19]  Nanda Kambhatla,et al.  Combining Lexical, Syntactic, and Semantic Features with Maximum Entropy Models for Information Extraction , 2004, ACL.

[20]  Stefano Battiston,et al.  Inner structure of capital control networks , 2004 .

[21]  Kôiti Hasida,et al.  POLYPHONET: an advanced social network extraction system from the web , 2006, WWW '06.

[22]  Yoshi Fujiwara,et al.  Shareholding Networks in Japan , 2005 .

[23]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[24]  Peter D. Turney Expressing Implicit Semantic Relations without Supervision , 2006, ACL.

[25]  Andrew McCallum,et al.  Disambiguating Web appearances of people in a social network , 2005, WWW '05.

[26]  M. Bengtsson,et al.  Cooperation and competition in relationships between competitors in business networks , 1999 .

[27]  Andrew McCallum,et al.  Integrating Probabilistic Extraction Models and Data Mining to Discover Relations and Patterns in Text , 2006, NAACL.