Building web domain data integration system with user collaboration

With the rapid development of the Internet, the Web is becoming the largest information repository of the world. Major efforts have been made in order to integrate the data of a specific domain on the Web. The traditional methods are largely done by few of system administrators which do not adapt to web scale. The construction of a web domain data integration system (WDDIS) becomes an urgent task The paper describes a new idea which asks the users to help the builders incrementally build WDDIS. It proposes an architecture of WDDIS and describes the mechanism of user collaboration. The approach shifts the enormous endeavors from the producers to the consumers which will promote WDDIS to be constructed quickly and effectively.

[1]  Kevin Chen-Chuan Chang,et al.  Supporting entity search: a large-scale prototype search engine , 2007, SIGMOD '07.

[2]  Kevin Chen-Chuan Chang,et al.  Entity Search Engine: Towards Agile Best-Effort Information Integration over the Web , 2007, CIDR.

[3]  Raghu Ramakrishnan,et al.  DBLife: A Community Information Management Platform for the Database Research Community (Demo) , 2007, CIDR.

[4]  Alon Y. Halevy,et al.  An adaptive query execution system for data integration , 1999, SIGMOD '99.

[5]  Joann J. Ordille,et al.  Querying Heterogeneous Information Sources Using Source Descriptions , 1996, VLDB.

[6]  Matthew Richardson,et al.  Building large knowledge bases by mass collaboration , 2003, K-CAP '03.

[7]  AnHai Doan,et al.  Integrating data from disparate sources: a mass collaboration approach , 2005, 21st International Conference on Data Engineering (ICDE'05).

[8]  Wei-Ying Ma,et al.  Object-level Vertical Search , 2007, CIDR.

[9]  Kevin Chen-Chuan Chang,et al.  EntityRank: Searching Entities Directly and Holistically , 2007, VLDB.

[10]  Jennifer Widom,et al.  The TSIMMIS Project: Integration of Heterogeneous Information Sources , 1994, IPSJ.

[11]  Raghu Ramakrishnan,et al.  Building Structured Web Community Portals: A Top-Down, Compositional, and Incremental Approach , 2007, VLDB.

[12]  Wei-Ying Ma,et al.  Object-level ranking: bringing order to Web objects , 2005, WWW '05.

[13]  Wei-Ying Ma,et al.  Extracting Objects from the Web , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[14]  David J. DeWitt,et al.  NiagaraCQ: a scalable continuous query system for Internet databases , 2000, SIGMOD '00.