An Intermediate View for Data Integration and Management in Cloud Computing

Integrating and managing data in the Cloud is necessary for providing in-depth data services. This paper introduces an intermediate view that provides a highly interactive environment for integrating and managing data distributed across different sources. The intermediate view, which emphasizes the notion of pay-as-you-go data integration and management, comprises a Data Sources Integration Broker, a Data Sources Intermediate View Model, and a Data Sources View Manager. Together these components automatically create mappings between data sources to support integrating, viewing, and querying data source relationships; integrate multiple search/query strategies; and provide view query operations that let users manage data more easily. Finally, we report successful application results obtained with a prototype implementation called MSDSN, which involves the challenging management and sharing of materials science data, to illustrate the feasibility and effectiveness of the intermediate view.
