Mining Textual Contents of Financial Reports

The message, stylistic focus, language and readability of financial reports are good indicators of a company's outlook and development. These indicators can guide companies' decision makers toward more efficient actions in a dynamic business environment. In this paper, we study the language and contents of quarterly financial reports using automated linguistic and text mining methods. We compare the results of a linguistic analysis of quarterly reports by means of collocational networks with the results of a text mining analysis of the same reports by means of prototype matching. The study covers quarterly reports from three leading companies in the telecommunications sector. Our results are mixed: for some reports, the closest matches are reports with similar collocational networks, while for others they are not.
