International Journal of Approximate Reasoning Diverse Reduct Subspaces Based Co-training for Partially Labeled Data

Rough set theory is an effective supervised learning model for labeled data. However, it is often the case that practical problems involve both labeled and unlabeled data, which is outside the realm of traditional rough set theory. In this paper, the problem of attribute reduction for partially labeled data is first studied. With a new definition of discernibility matrix, a Markov blanket based heuristic algorithm is put forward to compute the optimal reduct of partially labeled data. A novel rough co-training model is then proposed, which could capitalize on the unlabeled data to improve the performance of rough classifier learned only from few labeled data. The model employs two diverse reducts of partially labeled data to train its base classifiers on the labeled data, and then makes the base classifiers learn from each other on the unlabeled data iteratively. The classifiers constructed in different reduct subspaces could benefit from their diversity on the unlabeled data and significantly improve the performance of the rough co-training model. Finally, the rough co-training model is theoretically analyzed, and the upper bound on its performance improvement is given. The experimental results show that the proposed model outperforms other representative models in terms of accuracy and even compares favorably with rough classifier trained on all training data labeled.

[1]  S. K. Michael Wong,et al.  Rough Sets: Probabilistic versus Deterministic Approach , 1988, Int. J. Man Mach. Stud..

[2]  Rajkumar Roy,et al.  Advances in Soft Computing: Engineering Design and Manufacturing , 1998 .

[3]  R. Słowiński Intelligent Decision Support: Handbook of Applications and Advances of the Rough Sets Theory , 1992 .

[4]  Daphne Koller,et al.  Toward Optimal Feature Selection , 1996, ICML.

[5]  Qinghua Hu,et al.  Neighborhood rough set based heterogeneous feature subset selection , 2008, Inf. Sci..

[6]  Anna Maria Radzikowska,et al.  A comparative study of fuzzy rough sets , 2002, Fuzzy Sets Syst..

[7]  Witold Pedrycz,et al.  Gaussian kernel based fuzzy rough sets: Model, uncertainty measures and applications , 2010, Int. J. Approx. Reason..

[8]  S. K. Tso,et al.  Applying rough-set concept to neural-network-based transient-stability classification of power systems , 2000 .

[9]  Andrzej Skowron,et al.  Tolerance Approximation Spaces , 1996, Fundam. Informaticae.

[10]  Yiyu Yao,et al.  A Comparative Study of Fuzzy Sets and Rough Sets , 1998 .

[11]  Yiyu Yao,et al.  Attribute reduction in decision-theoretic rough set models , 2008, Inf. Sci..

[12]  Theresa Beaubouef,et al.  Information-Theoretic Measures of Uncertainty for Rough Sets and Rough Relational Databases , 1998, Inf. Sci..

[13]  S. Tsumoto,et al.  Rough Set Theory and Granular Computing , 2003 .

[14]  Dominik Slezak,et al.  The investigation of the Bayesian rough set model , 2005, Int. J. Approx. Reason..

[15]  Dominik Ślęzak Approximate Markov Boundaries and Bayesian Networks: Rough Set Approach , 2003 .

[16]  Andrzej Skowron,et al.  Rough sets and Boolean reasoning , 2007, Inf. Sci..

[17]  Liang Ding,et al.  Collaborative statistical learning with rough feature reduction for visual target classification , 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).

[18]  Feng Chong Improved Algorithm of Attribute Reduction Based on Discernibility Matrix , 2007 .

[19]  Duoqian Miao,et al.  A Rough Set Approach to Classifying Web Page Without Negative Examples , 2007, PAKDD.

[20]  Min Chen,et al.  Semi-supervised Rough Cost/Benefit Decisions , 2009, Fundam. Informaticae.

[21]  Salvatore Greco,et al.  Monotonic Variable Consistency Rough Set Approaches , 2009, Int. J. Approx. Reason..

[22]  W. Li,et al.  Hybrid approaches to attribute reduction based on indiscernibility and discernibility relation , 2011, Int. J. Approx. Reason..

[23]  Irena Koprinska,et al.  Co-training with a Single Natural Feature Set Applied to Email Classification , 2004, IEEE/WIC/ACM International Conference on Web Intelligence (WI'04).

[24]  Tu Bao Ho,et al.  Nonhierarchical document clustering based on a tolerance rough set model , 2002, Int. J. Intell. Syst..

[25]  Hung Son Nguyen,et al.  A Tolerance Rough Set Approach to Clustering Web Search Results , 2004, PKDD.

[26]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[27]  Yiyu Yao,et al.  A Decision Theoretic Framework for Approximating Concepts , 1992, Int. J. Man Mach. Stud..

[28]  Andrzej Skowron,et al.  Rough sets: Some extensions , 2007, Inf. Sci..

[29]  David H. Bailey,et al.  Algorithms and applications , 1988 .

[30]  Rayid Ghani,et al.  Analyzing the effectiveness and applicability of co-training , 2000, CIKM '00.

[31]  Zhi-Hua Zhou,et al.  Analyzing Co-training Style Algorithms , 2007, ECML.

[32]  Xiaojin Zhu,et al.  Introduction to Semi-Supervised Learning , 2009, Synthesis Lectures on Artificial Intelligence and Machine Learning.

[33]  Janusz Zalewski,et al.  Rough sets: Theoretical aspects of reasoning about data , 1996 .

[34]  Salvatore Greco,et al.  Rough sets theory for multicriteria decision analysis , 2001, Eur. J. Oper. Res..

[35]  D. Dubois,et al.  ROUGH FUZZY SETS AND FUZZY ROUGH SETS , 1990 .

[36]  T. Lin Granulation and nearest neighborhoods: rough set approach , 2001 .

[37]  K. Thangavel,et al.  Dimensionality reduction based on rough set theory: A review , 2009, Appl. Soft Comput..

[38]  Yiyu Yao,et al.  Constructive and Algebraic Methods of the Theory of Rough Sets , 1998, Inf. Sci..

[39]  Yiyu Yao,et al.  Probabilistic rough set approximations , 2008, Int. J. Approx. Reason..

[40]  Dominik Slezak,et al.  Rough Sets and Bayes Factor , 2005, Trans. Rough Sets.

[41]  Lars Kai Hansen,et al.  Neural Network Ensembles , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[42]  T. Y. Lin,et al.  NEIGHBORHOOD SYSTEMS AND APPROXIMATION IN RELATIONAL DATABASES AND KNOWLEDGE BASES , 1989 .

[43]  Tsau Young Neighborhood Systems: Mathematical Models of Information Granulations: , 2003 .

[44]  Lirong Jian,et al.  Rough Set Theory , 2010 .

[45]  Qinghua Hu,et al.  Neighborhood classifiers , 2008, Expert Syst. Appl..

[46]  Urszula Wybraniec-Skardowska,et al.  Extensions and Intentions in the Ruogh Set Theory , 1998, Inf. Sci..

[47]  Qinghua Hu,et al.  EROS: Ensemble rough subspaces , 2007, Pattern Recognit..

[48]  Ying Sai,et al.  A comparison of two types of rough sets induced by coverings , 2009, Int. J. Approx. Reason..

[49]  Robert P. W. Duin,et al.  Experiments with Classifier Combining Rules , 2000, Multiple Classifier Systems.

[50]  Yiyu Yao,et al.  Probabilistic approaches to rough sets , 2003, Expert Syst. J. Knowl. Eng..

[51]  Andrzej Skowron,et al.  Rudiments of rough sets , 2007, Inf. Sci..

[52]  Rajkumar Roy,et al.  Advances in Soft Computing , 2018, Lecture Notes in Computer Science.

[53]  Witold Pedrycz,et al.  Granular Computing - The Emerging Paradigm , 2007 .

[54]  Roman Słowiński,et al.  The Use of Rough Sets and Fuzzy Sets in MCDM , 1999 .

[55]  Li Wei-min Algorithm for attribute reduction based on improved discernibility matrix , 2007 .

[56]  Salvatore Greco,et al.  Monotonic Variable Consistency Rough Set Approaches , 2009, Int. J. Approx. Reason..

[57]  Malcolm J. Beynon,et al.  Reducts within the variable precision rough sets model: A further investigation , 2001, Eur. J. Oper. Res..

[58]  Wojciech Ziarko,et al.  Variable Precision Rough Set Model , 1993, J. Comput. Syst. Sci..

[59]  Andrzej Skowron,et al.  The Discernibility Matrices and Functions in Information Systems , 1992, Intelligent Decision Support.

[60]  Maria-Florina Balcan,et al.  Co-Training and Expansion: Towards Bridging Theory and Practice , 2004, NIPS.

[61]  Yiyu Yao,et al.  Two views of the theory of rough sets in finite universes , 1996, Int. J. Approx. Reason..

[62]  Josef Kittler,et al.  Combining classifiers: A theoretical framework , 1998, Pattern Analysis and Applications.

[63]  Yiyu Yao,et al.  Decision-Theoretic Rough Set Models , 2007, RSKT.

[64]  Aleksander Ohrn,et al.  ROSETTA -- A Rough Set Toolkit for Analysis of Data , 1997 .

[65]  Masahiro Inuiguchi,et al.  Variable-precision dominance-based rough set approach and attribute reduction , 2009, Int. J. Approx. Reason..

[66]  Z. Bo Variable precision rough sets model based on(α,τ) limited similarity relation , 2009 .

[67]  Miao Duo-qian,et al.  Information-based algorithm for reduction of knowledge , 1997, 1997 IEEE International Conference on Intelligent Processing Systems (Cat. No.97TH8335).

[68]  Yiyu Yao Granular Computing using Neighborhood Systems , 1999 .

[69]  L. Polkowski Rough Sets: Mathematical Foundations , 2013 .

[70]  Yiyu Yao,et al.  Relational Interpretations of Neigborhood Operators and Rough Set Approximation Operators , 1998, Inf. Sci..

[71]  Avrim Blum,et al.  The Bottleneck , 2021, Monopsony Capitalism.

[72]  Yiyu Yao,et al.  Three-way decisions with probabilistic rough sets , 2010, Inf. Sci..

[73]  Roberto Barbuti,et al.  Timed P Automata , 2009, Electron. Notes Theor. Comput. Sci..

[74]  Yan Zhou,et al.  Enhancing Supervised Learning with Unlabeled Data , 2000, ICML.

[75]  Zhi-Hua Zhou,et al.  Improve Computer-Aided Diagnosis With Machine Learning Techniques Using Undiagnosed Samples , 2007, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[76]  Jiri Matas,et al.  On Combining Classifiers , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[77]  Qinghua Hu,et al.  Hybrid attribute reduction based on a novel fuzzy-rough model and information granulation , 2007, Pattern Recognit..

[78]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .