Transforming Textual Patterns into Knowledge

Business Intelligence (BI) can benefit greatly from the bulk of knowledge that stays hidden in the large amount of textual information existing in the organizational environment. Text Mining (TM) is a technology that provides the support to extract patterns from texts. After interpreting these patterns, a business analyst can reach useful insights to improve the organizational knowledge. Although text represents the largest part of the available information in a company, just a small part of all Knowledge Discovery (KD) applications are in TM. By means of a case study, this This chapter appears in the book, Business Intelligence in the Digital Economy, edited by Mahesh S. Raisinghani. Copyright © 2004, Idea Group Inc. Copying or distributing in print or electronic forms without written permission of Idea Group Inc. is prohibited. 701 E. Chocolate Avenue, Suite 200, Hershey PA 17033-1240, USA Tel: 717/533-8845; Fax 717/533-8661; URL-http://www.idea-group.com IDEA GROUP PUBLISHING 208 do Prado, de Oliveira, Ferneda, Wives, Silva, and Loh Copyright © 2004, Idea Group Inc. Copying or distributing in print or electronic forms without written permission of Idea Group Inc. is prohibited. chapter shows an alternative to how TM can contribute to BI. Also, a discussion on future trends and some conclusions are presented that support the effectiveness of TM as source of relevant knowledge. INTRODUCTION In order to help companies to remain competitive, Business Intelligence (BI) requires adequate tools to transform the large amount of information existing in the organizational environment into models that accurately represent the reality. By analyzing and interpreting these patterns, a business analyst can reach interesting insights that lead to a better understanding of the business, to improve internal processes, and to enable more accurate decision making. In this environment, Knowledge Discovery in Databases (KDD) has been shown to be an important approach, providing the methodologies to extract models from large data sets by means of relatively user-friendly tools. This chapter discusses the application of Text Mining (TM) in the BI realm. TM is part of the broader field of KDD that, departing from a data set relevant to the solution of a specific problem, looks for interesting and previously unknown patterns. After human-specialized interpretation of these patterns, important knowledge that leads to the solution of that problem can be discovered. KDD comprises the preliminary tasks of data collection, cleansing, and transformation, the core task of data mining, and the post-processing pattern interpretation activity. Data Mining (DM) is concerned with the extraction of patterns from structured data. In our point of view, TM is situated at the same level of DM, but is addressed to extract patterns from textual information. The patterns issued by a TM application can provide feedback to the business specialist, triggering the mental process that leads to insights on the business problems. The next section draws the relations between TM and BI, making clear at which point TM can interfere proactively in the BI process. It also discusses the weak use of TM in an environment in which the majority of the available information is represented in textual form. Next, the methodology applied to guide the case study is presented. After that, a description of some procedures applied to identify clusters in a set of texts is shown. The subsequent section is devoted to discuss the case study carried out in a Brazilian public news agency. Future trends on TM and some conclusions are presented in the last two sections of the chapter. 19 more pages are available in the full version of this document, which may be purchased using the "Add to Cart" button on the product's webpage: www.igi-global.com/chapter/transforming-textual-patternsinto-knowledge/6072?camid=4v1 This title is available in InfoSci-Books, Business Intelligence, InfoSci-Business Technologies, Business, Administration, and Management, InfoSci-Select, InfoSci-Business and Management, InfoSci-Select, InfoSci-Select. Recommend this product to your librarian: www.igi-global.com/e-resources/libraryrecommendation/?id=1