Pattern classification with missing data: a review

Pattern classification has been successfully applied in many problem domains, such as biometric recognition, document classification or medical diagnosis. Missing or unknown data are a common drawback that pattern recognition techniques need to deal with when solving real-life classification tasks. Machine learning approaches and methods imported from statistical learning theory have been most intensively studied and used in this subject. The aim of this work is to analyze the missing data problem in pattern classification tasks, and to summarize and compare some of the well-known methods used for handling missing values.

[1]  Pieter Abbeel,et al.  Max-margin classification of incomplete data , 2006, NIPS.

[2]  Gene H. Golub,et al.  Missing value estimation for DNA microarray gene expression data: local least squares imputation , 2005, Bioinform..

[3]  Le Gruenwald,et al.  Estimating Missing Values in Related Sensor Data Streams , 2005, COMAD.

[4]  Kai Jiang,et al.  Classification for Incomplete Data Using Classifier Ensembles , 2005, 2005 International Conference on Neural Networks and Brain.

[5]  Alexander J. Smola,et al.  A Second Order Cone programming Formulation for Classifying Missing Data , 2004, NIPS.

[6]  Yoshua Bengio,et al.  Recurrent Neural Networks for Missing or Asynchronous Data , 1995, NIPS.

[7]  Rich Caruana,et al.  Multitask Learning , 1997, Machine-mediated learning.

[8]  Francis L. Merat,et al.  Neural network based sensor array signal processing , 1996, 1996 IEEE/SICE/RSJ International Conference on Multisensor Fusion and Integration for Intelligent Systems (Cat. No.96TH8242).

[9]  Peter Clark,et al.  The CN2 Induction Algorithm , 1989, Machine Learning.

[10]  Geoffrey I. Webb The Problem of Missing Values in Decision Tree Grafting , 1998, Australian Joint Conference on Artificial Intelligence.

[11]  Satosi Watanabe,et al.  Pattern Recognition: Human and Mechanical , 1985 .

[12]  Phil D. Green,et al.  Handling missing data in speech recognition , 1994, ICSLP.

[13]  Hideo Tanaka,et al.  An extension of the BP-algorithm to interval input vectors-learning from numerical data and expert's knowledge , 1991, [Proceedings] 1991 IEEE International Joint Conference on Neural Networks.

[14]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[15]  M. Aldenderfer,et al.  Cluster Analysis. Sage University Paper Series On Quantitative Applications in the Social Sciences 07-044 , 1984 .

[16]  Gustavo E. A. P. A. Batista,et al.  A Study of K-Nearest Neighbour as an Imputation Method , 2002, HIS.

[17]  Xitao Fan,et al.  Missing Data in Disguise and Implications for Survey Data Analysis , 2004 .

[18]  Gustavo E. A. P. A. Batista,et al.  Experimental comparison pf K-NEAREST NEIGHBOUR and MEAN OR MODE imputation methods with the internal strategies used by C4.5 and CN2 to treat missing data , 2003 .

[19]  Aníbal R. Figueiras-Vidal,et al.  Multi-task Neural Networks for Dealing with Missing Inputs , 2007, IWINAC.

[20]  James C. Bezdek,et al.  Fuzzy c-means clustering of incomplete data , 2001, IEEE Trans. Syst. Man Cybern. Part B.

[21]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[22]  Gene H. Golub,et al.  Imputation of missing values in DNA microarray gene expression data , 2004 .

[23]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[24]  Michael A. Proschan,et al.  Sensitivity analysis using an imputation method for missing binary data in clinical trials , 2001 .

[25]  Roger A. Sugden,et al.  Multiple Imputation for Nonresponse in Surveys , 1988 .

[26]  Michael I. Jordan,et al.  Supervised learning from incomplete data via an EM approach , 1993, NIPS.

[27]  Volker Tresp,et al.  Efficient Methods for Dealing with Missing Data in Supervised Learning , 1994, NIPS.

[28]  Mia K. Markey,et al.  Impact of missing data in training artificial neural networks for computer-aided diagnosis , 2004, 2004 International Conference on Machine Learning and Applications, 2004. Proceedings..

[29]  Paola Sebastiani,et al.  c ○ 2001 Kluwer Academic Publishers. Manufactured in The Netherlands. Robust Learning with Missing Data , 2022 .

[30]  David G. Stork,et al.  Pattern Classification , 1973 .

[31]  William T. Scherer,et al.  IMPUTATION TECHNIQUES TO ACCOUNT FOR MISSING DATA IN SUPPORT OF INTELLIGENT TRANSPORTATION SYSTEMS APPLICATIONS , 2003 .

[32]  Robi Polikar,et al.  An ensemble of classifiers approach for the missing feature problem , 2003, Proceedings of the International Joint Conference on Neural Networks, 2003..

[33]  Tze-Yun Leong,et al.  Fuzzy K-means clustering with missing values , 2001, AMIA.

[34]  Leonardo Franco,et al.  Missing data imputation in breast cancer prognosis , 2006 .

[35]  R. Polikar,et al.  An ensemble technique to handle missing data from sensors , 2006, Proceedings of the 2006 IEEE Sensors Applications Symposium, 2006..

[36]  Chee Peng Lim,et al.  A Hybrid Neural Network System for Pattern Classification Tasks with Missing Features , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[37]  Volker Tresp,et al.  Training Neural Networks with Deficient Data , 1993, NIPS.

[38]  Peng Liu,et al.  An Analysis of Missing Data Treatment Methods and Their Application to Health Care Dataset , 2005, ADMA.

[39]  P. Kofman,et al.  Using Multiple Imputation in the Analysis of Incomplete Observations in Finance , 2003 .

[40]  Rudolf Kruse,et al.  Learning in neuro-fuzzy systems with symbolic attributes and missing values , 1999, ICONIP'99. ANZIIS'99 & ANNES'99 & ACNN'99. 6th International Conference on Neural Information Processing. Proceedings (Cat. No.99EX378).

[41]  R.J. Marks,et al.  Set constraint discovery: missing sensor data restoration using autoassociative regression machines , 2002, Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No.02CH37290).

[42]  Thomas Hofmann,et al.  Kernel Methods for Missing Variables , 2005, AISTATS.

[43]  Hisao Ishibuchi,et al.  Classification of fuzzy input patterns by neural networks , 1995, Proceedings of ICNN'95 - International Conference on Neural Networks.

[44]  Dorian Pyle,et al.  Data Preparation for Data Mining , 1999 .

[45]  Joseph L Schafer,et al.  Analysis of Incomplete Multivariate Data , 1997 .

[46]  G. McLachlan,et al.  The EM algorithm and extensions , 1996 .

[47]  J. Ross Quinlan,et al.  Unknown Attribute Values in Induction , 1989, ML.

[48]  Graham K. Rand,et al.  Quantitative Applications in the Social Sciences , 1983 .

[49]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data: Little/Statistical Analysis with Missing Data , 2002 .

[50]  Volker Tresp,et al.  Some Solutions to the Missing Feature Problem in Vision , 1992, NIPS.

[51]  Hideo Tanaka,et al.  Learning from incomplete training data with missing values and medical application , 1993, Proceedings of 1993 International Conference on Neural Networks (IJCNN-93-Nagoya, Japan).

[52]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[53]  Johan A. K. Suykens,et al.  Handling missing values in support vector machine classifiers , 2005, Neural Networks.

[54]  Peter Clark,et al.  The CN2 induction algorithm , 2004, Machine Learning.

[55]  Aníbal R. Figueiras-Vidal,et al.  Exploiting Multitask Learning Schemes Using Private Subnetworks , 2005, IWANN.

[56]  Tariq Samad,et al.  Self–organization with partial data , 1992 .

[57]  Zijian Zheng,et al.  Classifying Unseen Cases with Many Missing Values , 1999, PAKDD.

[58]  Phil D. Green,et al.  Speech enhancement with missing data techniques using recurrent neural networks , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[59]  Sophie Midenet,et al.  Self-Organising Map for Data Imputation and Correction in Surveys , 2002, Neural Computing & Applications.

[60]  Russ B. Altman,et al.  Missing value estimation methods for DNA microarrays , 2001, Bioinform..

[61]  Lawrence Carin,et al.  On Classification with Incomplete Data , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[62]  Jinbo Bi,et al.  Support Vector Classification with Input Data Uncertainty , 2004, NIPS.

[63]  M. Marseguerra,et al.  The AutoAssociative Neural Network in signal analysis: II. Application to on-line monitoring of a simulated BWR component , 2005 .

[64]  Yoshua Bengio,et al.  Pattern Recognition and Neural Networks , 1995 .

[65]  G. DiCesare Imputation, Estimation and Missing Data in Finance , 2006 .

[66]  Amit Gupta,et al.  Estimating Missing Values Using Neural Networks , 1996 .

[67]  Tariq Samad,et al.  Imputation of Missing Data in Industrial Databases , 1999, Applied Intelligence.

[68]  T. Marwala,et al.  Fault classification in structures with incomplete measured data using autoassociative neural networks and genetic algorithm , 2006 .

[69]  Michael R. Berthold,et al.  Missing Values and Learning of Fuzzy Rules , 1998, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[70]  Soo-Young Lee,et al.  Training Algorithm with Incomplete Data for Feed-Forward Neural Networks , 1999, Neural Processing Letters.

[71]  Søren Feodor Nielsen,et al.  1. Statistical Analysis with Missing Data (2nd edn). Roderick J. Little and Donald B. Rubin, John Wiley & Sons, New York, 2002. No. of pages: xv+381. ISBN: 0‐471‐18386‐5 , 2004 .

[72]  C. Ji,et al.  Measurement-based network monitoring: missing data formulation and scalability analysis , 2000, 2000 IEEE International Symposium on Information Theory (Cat. No.00CH37060).

[73]  Peter K. Sharpe,et al.  Dealing with missing values in neural network-based diagnostic systems , 1995, Neural Computing & Applications.

[74]  Bogdan Gabrys Pattern classification for incomplete data , 2000, KES'2000. Fourth International Conference on Knowledge-Based Intelligent Engineering Systems and Allied Technologies. Proceedings (Cat. No.00TH8516).

[75]  Hidetomo Ichihashi,et al.  Fuzzy c-Means Classifier for Incomplete Data Sets with Outliers and Missing Values , 2005, International Conference on Computational Intelligence for Modelling, Control and Automation and International Conference on Intelligent Agents, Web Technologies and Internet Commerce (CIMCA-IAWTIC'06).

[76]  Robert P. W. Duin,et al.  Combining One-Class Classifiers to Classify Missing Data , 2004, Multiple Classifier Systems.

[77]  Thierry Denoeux,et al.  A Neuro-Fuzzy model for missing data reconstruction , 1998 .

[78]  Chong-Ho Choi,et al.  Input Feature Selection by Mutual Information Based on Parzen Window , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[79]  Robert E. Mercer,et al.  Selective transfer of neural network task knowledge , 2000 .

[80]  Michael I. Jordan,et al.  Learning from Incomplete Data , 1994 .

[81]  Bogdan Gabrys,et al.  Neuro-fuzzy approach to processing inputs with missing values in pattern recognition problems , 2002, Int. J. Approx. Reason..

[82]  S. Nordbotten Neural network imputation applied to the Norwegian 1990 population census data , 1996 .

[83]  Lena Kallin Westin Missing data and the preprocessing perceptron , 2004 .

[84]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[85]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..