Construction and Analysis of Web-Based Computer Science Information Networks

With the rapid development of the Web, huge amounts of information are available on the Web in the form of Web documents, structures, and links. It has been a dream of the database and Web communities to harvest information exhibited on the Web and reconcile the unstructured nature of the Web with the semi-structured schemas of the database paradigm. This is a challenging task. Even though databases are currently used to generate Web content in some sites, the schemas of these databases are rarely consistent across a domain. However, with the recent research in Web structure mining and information network analysis, major progress has been made at discovering Web hidden structures, constructing heterogeneous information networks by integration of information from structured databases and Web contents, and performing in-depth analysis for systematic harvesting of such rich information on the Web.