Maximum Consistency of Incomplete Data via Non-Invasive Imputation

We present an algorithm to impute missingvalues from given dataalone, and analyse its performance. Theproposed procedure is based onnon-numeric rule based data analysis, and aimsto maximise consistency of imputation from known values. Incontrast to the prevailingstatistical imputation algorithms, it does notmake representationalassumptions or presupposes other modelconstraints. Therefore, it is suitablefor a wide variety of data – sets, and can beused as a pre-processing step beforeresorting to harder numerical methods.

[1]  Ivo Düntsch,et al.  Roughian: Rough information analysis , 2001 .

[2]  Ivo Düntsch,et al.  Rough set data analysis: A road to non-invasive knowledge discovery , 2000 .

[3]  Ivo Düntsch,et al.  Classificatory filtering in decision systems , 2000, Int. J. Approx. Reason..

[4]  Joseph L Schafer,et al.  Analysis of Incomplete Multivariate Data , 1997 .

[5]  Ivo Düntsch,et al.  Simple data filtering in rough set systems , 1998, Int. J. Approx. Reason..

[6]  Enrico Fagiuoli,et al.  2U: An Exact Interval Propagation Algorithm for Polytrees with Binary Variables , 1998, Artif. Intell..

[7]  Ivo Düntsch,et al.  Statistical evaluation of rough set dependency analysis , 1997, Int. J. Hum. Comput. Stud..

[8]  D. Rubin,et al.  Statistical Analysis with Missing Data , 1988 .

[9]  J. Graham,et al.  Analysis with missing data in drug prevention research. , 1994, NIDA research monograph.

[10]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[11]  Peter M. Bentler,et al.  EQS : structural equations program manual , 1989 .

[12]  D. Rubin,et al.  Multiple Imputation for Nonresponse in Surveys , 1989 .

[13]  Robert P. Goldman,et al.  Imputation of Missing Data Using Machine Learning Techniques , 1996, KDD.

[14]  Xiao-Li Meng,et al.  Multiple-Imputation Inferences with Uncongenial Sources of Input , 1994 .

[15]  D. Rubin Multiple Imputation After 18+ Years , 1996 .

[16]  Jerzy W. Grzymala-Busse,et al.  On the Unknown Attribute Values in Learning from Examples , 1991, ISMIS.

[17]  Ivo Diintsch Uncertainty measures of rough set prediction , 2003 .