The Sample and Instance Selection for Data Dimensionality Reduction

The paper proposes tools for data dimensionality reduction containing sample selection method and instance informativity indicators based on the evolutionary search, which is modified to speed up the search through the creation of special operators, taking into account a priori information about the data sample and concentrating search on the most perspective solution areas. This allows preserving the stochastic nature of the search to accelerate the obtainment of acceptable solutions through the introduction of deterministic component in the search strategy. The proposed methods are experimentally studied. On the results of experiments the comparative characteristics and recommendations for the use of the proposed methods are given.

[1]  Roman Szewczyk,et al.  A Mathematical Model of the Thermo-Anemometric Flowmeter , 2015, Sensors.

[2]  Subir Ghosh,et al.  Multivariate analysis, design of experiments, and survey sampling , 2000 .

[3]  A. Chaudhuri,et al.  Survey sampling : theory and methods , 1992 .

[4]  Sergey Subbotin,et al.  The Instance and Feature Selection for Neural Network Based Diagnosis of Chronic Obstructive Bronchitis , 2015, Applications of Computational Intelligence in Biomedical Technology.

[5]  S. A. Subbotin,et al.  The training set quality measures for neural network learning , 2010, Optical Memory and Neural Networks.

[6]  Chi-Keong Goh,et al.  Computational Intelligence in Expensive Optimization Problems , 2010 .

[7]  El-Ghazali Talbi,et al.  Metaheuristics - From Design to Implementation , 2009 .

[8]  Sergey A. Subbotin The sample properties evaluation for pattern recognition and intelligent diagnosis , 2014, The 10th International Conference on Digital Technologies 2014.

[9]  H. Russell Bernard,et al.  Social Research Methods: Qualitative and Quantitative Approaches , 2000 .

[10]  Sergei A. Subbotin Methods of sampling based on exhaustive and evolutionary search , 2013, Automatic Control and Computer Sciences.

[11]  Morris H. Hansen,et al.  Sample survey methods and theory , 1955 .

[12]  Roman Szewczyk,et al.  Precision increase in automated digital image measurement systems of geometric values , 2016 .

[13]  T. Zaiko,et al.  Training sample reduction based on association rules for neuro-fuzzy networks synthesis , 2014, Optical Memory and Neural Networks.