论文信息 - Construction and Analysis of Web-Based Computer Science Information Networks

Construction and Analysis of Web-Based Computer Science Information Networks

With the rapid development of the Web, huge amounts of information are available on the Web in the form of Web documents, structures, and links. It has been a dream of the database and Web communities to harvest information exhibited on the Web and reconcile the unstructured nature of the Web with the semi-structured schemas of the database paradigm. This is a challenging task. Even though databases are currently used to generate Web content in some sites, the schemas of these databases are rarely consistent across a domain. However, with the recent research in Web structure mining and information network analysis, major progress has been made at discovering Web hidden structures, constructing heterogeneous information networks by integration of information from structured databases and Web contents, and performing in-depth analysis for systematic harvesting of such rich information on the Web.

Jiawei Han

[1] Yizhou Sun,et al. Ranking-based clustering of heterogeneous information networks with star network schema , 2009, KDD.

[2] Yizhou Sun,et al. WINACS: construction and analysis of web-based computer science information networks , 2011, SIGMOD '11.

[3] Philip S. Yu,et al. PathSim , 2011, Proc. VLDB Endow..

[4] Donato Malerba,et al. Mapping web pages to database records via link paths , 2010, CIKM.

[5] Yizhou Sun,et al. Graph Regularized Transductive Classification on Heterogeneous Information Networks , 2010, ECML/PKDD.

[6] Jiawei Han,et al. Mining advisor-advisee relationships from research publication networks , 2010, KDD.

[7] Donato Malerba,et al. Growing parallel paths for entity-page discovery , 2011, WWW.

[8] Yizhou Sun,et al. RankClus: integrating clustering with ranking for heterogeneous information network analysis , 2009, EDBT '09.