Data Refining for Text Mining Process in Aviation Safety Data

Successful data mining is an iterative process during which data will be refined and adjusted to achieve more accurate mining results. Most important tools in the text mining context are list of stop words and list of synonyms. The size and richness of the lists mentioned depend on the structure of the language used in the text to be mined. English, for example, is an “easy” language for search technologies, because with a couple of exceptions, the stem of the word is not conjugated and terms are formed using several words instead of creating compounds. This requires special attention to definitions when processing morphologically rich languages like Finnish. This chapter introduces the need and realisation of refining the source data for a successful data mining process based onto the results achieved from first mining round.

[1]  J. Drexhage Proposal for a DIRECTIVE OF THE EUROPEAN PARLIAMENT AND OF THE COUNCIL on the reduction of the impact of certain plastic products on the environment , 2002 .

[2]  Lisa Singh,et al.  A Component-Based Data Management and Knowledge Discovery Framework for Aviation Studies , 2006 .

[3]  Lisa Singh,et al.  Experience Report: A Component-Based Data Management and Knowledge Discovery Framework for Aviation Studies , 2006, Int. J. Inf. Technol. Web Eng..

[4]  Clemens Niemi A Finnish Grammar , 2009 .

[5]  Padhraic Smyth,et al.  Knowledge Discovery and Data Mining: Towards a Unifying Framework , 1996, KDD.

[6]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[7]  Li Cao,et al.  LSSVM with Fuzzy Pre-processing Model Based Aero Engine Data Mining Technology , 2007, ADMA.

[8]  Zohreh Nazeri Application of Aviation Safety Data Mining Workbench at American Airlines , 2003 .

[9]  Richard T. Watson,et al.  Data Management, Databases and Organizations , 2008 .

[10]  Hannu Vanharanta,et al.  Data mining of text as a tool in authorship attribution , 2001, SPIE Defense + Commercial Sensing.

[11]  Ben J Hicks,et al.  SPIE - The International Society for Optical Engineering , 2001 .

[12]  Antonina Kloptchenko,et al.  Text Mining Based on the Prototype Matching Method , 2003 .

[13]  Dursun Delen,et al.  Seeding the survey and analysis of research literature with text mining , 2008, Expert Syst. Appl..

[14]  Jeffrey W. Seifert,et al.  Data Mining: An Overview , 2004 .