Using Qualitative Hypotheses to Identify Inaccurate Data

Identifying inaccurate data has long been regarded as a significant and difficult problem in AI. In this paper, we present a new method for identifying inaccurate data on the basis of qualitative correlations among related data. First, we introduce the definitions of related data and qualitative correlations among related data. Then we put forward a new concept called support coefficient function (SCF). SCF can be used to extract, represent, and calculate qualitative correlations among related data within a dataset. We propose an approach to determining dynamic shift intervals of inaccurate data, and an approach to calculating possibility of identifying inaccurate data, respectively. Both of the approaches are based on SCF. Finally we present an algorithm for identifying inaccurate data by using qualitative correlations among related data as confirmatory or disconfirmatory evidence. We have developed a practical system for interpreting infrared spectra by applying the method, and have fully tested the system against several hundred real spectra. The experimental results show that the method is significantly better than the conventional methods used in many similar systems.

[1]  Toyoaki Nishida,et al.  A Knowledge Model for Infrared Spectrum Processing , 1994 .

[2]  Raymond Reiter,et al.  A Theory of Diagnosis from First Principles , 1986, Artif. Intell..

[3]  J. H. Perkins,et al.  Expert system based on principal components analysis for the identification of molecular structures from vapor-phase infrared spectra. 2. Identification of carbonyl-containing functionalities , 1992 .

[4]  A. Ralescu,et al.  Simulation, Knowledge-Based Computing, and Fuzzy Statistics , 1987 .

[5]  Kishan G. Mehrotra,et al.  Analyzing Images Containing Multiple Sparse Patterns with Neural Networks , 1991, IJCAI.

[6]  Sterling A. Tomellini,et al.  Descriptive, interactive computer-assisted interpretation of infrared spectra , 1989 .

[7]  Glenn Shafer,et al.  A Mathematical Theory of Evidence , 2020, A Mathematical Theory of Evidence.

[8]  S. Lowry,et al.  Computerized infrared spectral identification of compounds frequently found at hazardous waste sites. , 1986, Analytical chemistry.

[9]  James Bowen,et al.  Lexical Imprecision in Fuzzy Constraint Networks , 1992, AAAI.

[10]  Jean-Thomas Clerc,et al.  Performance analysis of infrared library search systems , 1986 .

[11]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[12]  Richard O. Duda,et al.  Subjective bayesian methods for rule-based inference systems , 1976, AFIPS '76.

[13]  G. Jalsovszky,et al.  Pattern recognition applied to vapour-phase infrared spectra: characteristics of νOH bands , 1988 .

[14]  Edward H. Shortliffe,et al.  A model of inexact reasoning in medicine , 1990 .

[15]  Morton E. Munk,et al.  A neural network approach to infrared spectrum interpretation , 1990 .