Combining data and text mining techniques for analysing financial reports

There is a vast amount of financial information on companies’ financial performance available to investors today. While automatic analysis of financial figures is common, it has been difficult to automatically extract meaning from the textual part of financial reports. The textual part of an annual report contains richer information than the financial ratios. In this paper, we combine data mining methods for analyzing quantitative and qualitative data from financial reports, in order to see if the textual part of the report contains some indication about future financial performance. The quantitative analysis has been performed using selforganizing maps, and the qualitative analysis using prototype-matching text clustering. The analysis is performed on the quarterly reports of three leading companies in the telecommunications sector.

[1]  Jorma Laaksonen,et al.  SOM_PAK: The Self-Organizing Map Program Package , 1996 .

[2]  Hannu Vanharanta,et al.  Financial Benchmarking of Telecommunications Companies , 2001 .

[3]  Hannu Vanharanta,et al.  Knowledge discovery from text documents based on paragraph maps , 2000, Proceedings of the 33rd Annual Hawaii International Conference on System Sciences.

[4]  Hannu Vanharanta,et al.  Comparing numerical data and text information from annual reports using self-organizing maps , 2001, Int. J. Account. Inf. Syst..

[5]  Robert G. Insley,et al.  Performance and Readability: A Comparison of Annual Reports of Profitable and Unprofitable Corporations , 1993 .

[6]  J. Thomas,et al.  Discourse in the Marketplace: The Making of Meaning in Annual Reports , 1997 .

[7]  Barbro Back,et al.  Analysing Financial Performance with Quarterly Data Using Self- Organising Maps , 2001 .

[8]  Albert H. Segars,et al.  The President's Letter to Stockholders: An Examination of Corporate Communication Strategy , 1992 .

[9]  Hannu Vanharanta,et al.  Prototype matching finding meaning in the books of the Bible , 2001, Proceedings of the 34th Annual Hawaii International Conference on System Sciences.

[10]  Hannu Vanharanta,et al.  Validation of Text Clustering Based on Document Contents , 2001, MLDM.

[11]  Janne Lehtinen,et al.  Financial ratios in an international comparison : validity and reliability , 1996 .

[12]  Dean W. Wichern,et al.  Business Statistics: Decision Making with Data , 1996 .

[13]  ScienceDirect Accounting, management, and information technologies , 2000 .

[14]  Kaisa Sere,et al.  Managing Complexity in Large Data Bases Using Self-Organizing Maps , 1996 .

[15]  Carlos Serrano-Cinca,et al.  Self-organizing neural networks for the analysis and representation of data: Some financial cases , 1993, Neural Computing & Applications.

[16]  Dorothy A. Winsor,et al.  Owning Corporate Texts , 1993 .