A study of applying data mining approach to the information disclosure for Taiwan's stock market investors

The financial literature and practices have shown the importance of corporate governance for decades, not only for firm's management but also for investor protection. Information disclosure plays a key role in all of the governance mechanisms. With good information disclosure, the asymmetric information and the agency cost between the insider and the outsider of firms can be reduced effectively. However, the information disclosure status of listed companies is hard to be evaluated or judged by investors before the annual official announcement is reported in the next year. The main purpose of this study is to explore the hidden knowledge of information disclosure status among the listed companies in Taiwan's stock market. In this paper, we employed decision tree-based mining techniques to explore the classification rules of information transparency levels of the listed firms in Taiwan's stock market. Moreover, the multi-learner model constructed by boosting ensemble approach with decision tree algorithm has been applied. The numerical results show that the classification accuracy has been improved by using multi-leaner model in terms of less Type I and Type II errors. In particular, the extracted rules from the data mining approach can be developed as a computer model for the prediction or the classification of good/poor information disclosure potential and like expert systems.

[1]  Alex Alves Freitas,et al.  Data mining with an ant colony optimization algorithm , 2002, IEEE Trans. Evol. Comput..

[2]  Richard J. Roiger,et al.  Data Mining: A Tutorial Based Primer , 2002 .

[3]  B. Lev,et al.  Value-Relevance of Nonfinancial Information: The Wireless Communications Industry , 1996 .

[4]  J. Forker,et al.  Corporate Governance and Disclosure Quality , 1992 .

[5]  Brian J. Bushee,et al.  Corporate Disclosure Practices, Institutional Investors, and Stock Return Volatility , 2000 .

[6]  Cheng-Lung Huang,et al.  A GA-based feature selection and parameters optimizationfor support vector machines , 2006, Expert Syst. Appl..

[7]  Gregory Piatetsky-Shapiro,et al.  Knowledge Discovery in Databases: An Overview , 1992, AI Mag..

[8]  Amanda J. C. Sharkey,et al.  On Combining Artificial Neural Nets , 1996, Connect. Sci..

[9]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[10]  Wei Du,et al.  Molecular classification of cancer types from microarray data using the combination of genetic algorithms and support vector machines , 2003, FEBS letters.

[11]  L. Eng,et al.  Corporate governance and voluntary disclosure , 2003 .

[12]  Chun-Lang Chang A study of applying data mining to early intervention for developmentally-delayed children , 2007, Expert Syst. Appl..

[13]  William Nick Street,et al.  An intelligent system for customer targeting: a data mining approach , 2004, Decis. Support Syst..

[14]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[15]  Filippo Menczer,et al.  Optimal ensemble construction via meta-evolutionary ensembles , 2006, Expert Syst. Appl..

[16]  Bikki Jaggi,et al.  Association between independent non-executive directors, family control and financial disclosures in Hong Kong , 2000 .

[17]  Richard A. Johnson,et al.  Applied Multivariate Statistical Analysis , 1983 .

[18]  Sandeep A. Patel,et al.  Measuring transparency and disclosure at firm-level in emerging markets , 2002 .

[19]  David West,et al.  Neural network credit scoring models , 2000, Comput. Oper. Res..

[20]  R. Ball,et al.  An empirical evaluation of accounting income numbers , 1968 .

[21]  Ching-Hsue Cheng,et al.  Entropy-based fuzzy rough classification approach for extracting classification rules , 2006, Expert Syst. Appl..

[22]  James T. Kwok,et al.  Mining customer product ratings for personalized marketing , 2003, Decis. Support Syst..