Comparing numerical data and text information from annual reports using self-organizing maps

Abstract: More and more companies now provide their accounting information in electronic form, either in large commercial databases or on the web. This information is of great interest to various stakeholders, i.e., stockholders, creditors, auditors, financial analysts, and management, who need to be able to extract both quantitative and qualitative information about the companies they are interested in. Annual reports contain information in both numerical and symbolic form. So far, only the numerical information has been analyzed with the help of computers. However, technology has evolved, and neural networks in the form of self-organizing maps (SOMs) in particular provide a new tool for analyzing text information as well. In this paper, we compare results on quantitative data with results on qualitative data from annual reports. We use smart encoding, SOMs, and document histograms to compare the performance of forest companies worldwide. First, we cluster the companies according to quantitative information on the one hand and qualitative information on the other. Second, we compare the results produced by the two clustering approaches. The comparison shows that the clusters obtained from the quantitative data differ from those obtained from the qualitative data.
