Accommodating data heterogeneity in ULS systems

Ultra-Large Scale (ULS) systems comprise numerous software elements designed and implemented by independent stakeholders whose requirements may vary widely. Consequently, elements in a ULS system may use different data formats, which complicates integration of elements. Writing code to robustly convert data from one format to another requires time and skills that some programmers may lack. Worse, the stakeholders who control a software element may change the element's data format at any point in the future without warning, causing format incompatibility not foreseen during the ULS system's construction. To address heterogeneity of data formats, we present a new abstraction called "topes". Each tope describes one kind of data, including known formats of that data and rules for transforming values among formats. Labeling the inputs and outputs of software elements with topes raises the level of abstraction so that elements produce and consume certain kinds of data, rather than particular formats.

[1]  Douglas C. Schmidt,et al.  Ultra-large-scale systems , 2006, OOPSLA '06.

[2]  Mary Shaw,et al.  Tool support for data validation by end-user programmers , 2008, 2008 ACM/IEEE 30th International Conference on Software Engineering.

[3]  Hersh Bhasin Microsoft ASP.NET professional projects , 2002 .

[4]  Grady Booch,et al.  Developing the future , 2001, CACM.

[5]  Anne H. H. Ngu,et al.  QoS-aware middleware for Web services composition , 2004, IEEE Transactions on Software Engineering.

[6]  Sundar A. Christopher,et al.  Seven year particulate matter air quality assessment from surface and satellite measurements , 2008 .

[7]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[8]  Ernesto Damiani,et al.  User opinions and rewards in a reuse-based development system , 1999, SSR '99.

[9]  Tantek Çelik,et al.  Microformats: a pragmatic path to the semantic web , 2006, WWW '06.

[10]  Mary Shaw,et al.  Writing good software engineering research papers: minitutorial , 2003, ICSE 2003.

[11]  R. S. Parmar,et al.  Study of size distribution of atmospheric aerosol at Agra , 2001 .

[12]  Chaomei Chen,et al.  Mining the Web: Discovering knowledge from hypertext data , 2004, J. Assoc. Inf. Sci. Technol..

[13]  Yang Liu,et al.  Estimating Regional Spatial and Temporal Variability of PM2.5 Concentrations Using Satellite Data, Meteorology, and Land Use Information , 2009, Environmental health perspectives.

[14]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[15]  Douglas C. Schmidt,et al.  Ultra-Large-Scale Systems: The Software Challenge of the Future , 2006 .

[16]  Mary Shaw,et al.  Semantic anomaly detection in online data sources , 2002, ICSE '02.

[17]  Eser Kandogan,et al.  Koala: capture, share, automate, personalize business processes on the web , 2007, CHI.

[18]  Mary Shaw,et al.  Writing good software engineering research papers , 2003, 25th International Conference on Software Engineering, 2003. Proceedings..

[19]  Mary Shaw,et al.  Topes , 2008, 2008 ACM/IEEE 30th International Conference on Software Engineering.