Propagation and Provenance of Probabilistic and Interval Uncertainty in Cyberinfrastructure-Related Data Processing and Data Fusion

In the past, communications were much slower than computations. As a result, re- searchers and practitioners collected difierent data into huge databases located at a single location such as NASA and US Geological Survey. At present, communications are so much faster that it is possible to keep difierent databases at difierent locations, and automatically select, transform, and collect relevant data when necessary. The corresponding cyberinfrastructure is actively used in many applications. It drastically enhances scientists' ability to discover, reuse and combine a large number of resources, e.g., data and services. Because of this importance, it is desirable to be able to gauge the the uncertainty of the results obtained by using cyberinfrastructure. This problem is made more urgent by the fact that the level of uncertainty associated with cyberinfrastructure resources can vary greatly { and that scientists have much less control over the quality of difierent resources than in the centralized database. Thus, with the cyberinfrastructure promise comes the need to analyze how data uncertainty propagates via this cyberinfrastructure. When the resulting accuracy is too low, it is desirable to produce the provenance of this inac- curacy: to flnd out which data points contributed most to it, and how an improved accuracy of these data points will improve the accuracy of the result. In this paper, we describe algorithms for propagating uncertainty and for flnding the provenance for this uncertainty.

[1]  A. Krishna Sinha,et al.  Geoinformatics : data to knowledge , 2006 .

[2]  Vladik Kreinovich,et al.  Monte-Carlo-Type Techniques for Processing Interval Uncertainty, and Their Potential Engineering Applications , 2006, Reliab. Comput..

[3]  John A. Hole,et al.  Nonlinear high‐resolution three‐dimensional seismic travel time tomography , 1992 .

[4]  Matthew George Averill A lithospheric investigation of the Southern Rio Grande Rift , 2007 .

[5]  Vladik Kreinovich,et al.  Exact Bounds for Interval and Fuzzy Functions Under Monotonicity Constraints, with Potential Applications to Biostratigraphy , 2005, The 14th IEEE International Conference on Fuzzy Systems, 2005. FUZZ '05..

[6]  Maarten J. Korsten,et al.  Measurement Errors and Uncertainty , 2004 .

[7]  C. Zelt,et al.  Modeling wide-angle seismic data for crustal structure: Southeastern Grenville Province , 1994 .

[8]  Larry J. Ruff,et al.  How good are our best models? Jackknifing, bootstrapping, and earthquake depth , 1989 .

[9]  Ann Q. Gates,et al.  Towards Secure Cyberinfrastructure for Sharing Border Information , 2006 .

[10]  Jonathan M. Lees,et al.  Tomographic inversion for three‐dimensional velocity structure at Mount St. Helens using earthquake data , 1989 .

[11]  Vladik Kreinovich,et al.  Trade-Off between Sample Size and Accuracy: Case of Dynamic Measurements under Interval Uncertainty , 2008, Interval / Probabilistic Uncertainty and Non-Classical Logics.

[12]  Vladik Kreinovich,et al.  Eliminating Duplicates under Interval and Fuzzy Uncertainty: An Asymptotically Optimal Algorithm and Its Geospatial Applications , 2004, Reliab. Comput..

[13]  Vladik Kreinovich,et al.  The Multi-layered Interval Categorizer Tesselation-based Model , 2004, GeoInfo.

[14]  Vladik Kreinovich,et al.  For unknown-but-bounded errors, interval estimates are often better than averaging , 1996, SGNM.

[15]  Vladik Kreinovich,et al.  An IDL/ENVI Implementation of the FFT Based Algorithm for Automatic Image Registration , 2003 .

[16]  V. Kreinovich,et al.  How To Take Into Account Dependence Between the Inputs: From Interval Computations to Constraint-Rel , 2006 .

[17]  Colin A. Zelt,et al.  Three‐dimensional seismic refraction tomography: A comparison of two methods applied to data from the Faeroe Basin , 1998 .

[18]  G. William Walster Philosophy and practicalities of interval arithmetic , 1988 .

[19]  Ann Q. Gates,et al.  GEON: Geophysical Data Add the 3rd Dimension in Geospatial Studies , 2004 .

[20]  Vladik Kreinovich,et al.  Estimating Uncertainties for Geophysical Tomography , 1998, Reliab. Comput..

[21]  Vladik Kreinovich,et al.  Images with Uncertainty: Efficient Algorithms for Shift, Rotation, Scaling, and Registration, and their Applications to Geosciences , 2007 .

[22]  Ronald L. Rivest,et al.  Introduction to Algorithms , 1990 .

[23]  Vladik Kreinovich,et al.  Using expert knowledge in solving the seismic inverse problem , 2005, NAFIPS 2005 - 2005 Annual Meeting of the North American Fuzzy Information Processing Society.

[24]  Ann Q. Gates,et al.  A community effort to construct a gravity database for the United States and an associated Web portal , 2006 .

[25]  Vladik Kreinovich,et al.  A new Cauchy-based black-box technique for uncertainty in risk analysis , 2004, Reliab. Eng. Syst. Saf..

[26]  Ann Q. Gates,et al.  Towards automatic detection of erroneous measurement results in a gravity database , 2001, 2001 IEEE International Conference on Systems, Man and Cybernetics. e-Systems and e-Man for Cybernetics in Cyberspace (Cat.No.01CH37236).

[27]  Vladik Kreinovich,et al.  Trade-Off between Sample Size and Accuracy: Case of Static Measurements under Interval Uncertainty , 2008, Interval / Probabilistic Uncertainty and Non-Classical Logics.

[28]  R. Parker Geophysical Inverse Theory , 1994 .

[29]  S. Vavasis Nonlinear optimization: complexity issues , 1991 .

[30]  Semyon G. Rabinovich,et al.  Measurement Errors and Uncertainties: Theory and Practice , 1999 .

[31]  Vladik Kreinovich,et al.  Taylor Model-Type Techniques for Handling Uncertainty in Expert Systems , 2005 .

[32]  Luc Jaulin,et al.  Applied Interval Analysis , 2001, Springer London.

[33]  Vladik Kreinovich,et al.  How to Efficiently Process Uncertainty within a Cyberinfrastructure without Sacrificing Privacy and Confidentiality , 2007, Computational Intelligence in Information Assurance and Security.

[34]  Vladik Kreinovich,et al.  Trade-off between sample size and accuracy: Case of measurements under interval uncertainty , 2009, Int. J. Approx. Reason..

[35]  V. Kreinovich Computational Complexity and Feasibility of Data Processing and Interval Computations , 1997 .

[36]  Aaron A. Velasco,et al.  High‐resolution Rayleigh wave slowness tomography of central Asia , 2005 .