Differentiating between data-mining and text-mining terminology

When a new discipline emerges, it usually takes some time and a great deal of academic discussion before concepts and terms become standardized. Text mining is one such new discipline. In a groundbreaking article, Untangling text data mining, Hearst (1999) tackled the problem of clarifying text-mining concepts and terminology. This article is aimed at building on Hearst's ideas by pointing out some inconsistencies and inaccuracies, and suggesting an improved and extended categorization of data-mining and text-mining approaches.

[1]  Eduard Hovy,et al.  Automated Text Summarization in SUMMARIST , 1997, ACL 1997.

[2]  Rudolf F. Albrecht,et al.  Knowledge Discovery in Literature Data Bases , 1998 .

[3]  Patrick Brézillon,et al.  Lecture Notes in Artificial Intelligence , 1999 .

[4]  Frederick E. Petry,et al.  Extraction and representation of contextual information for knowledge discovery in texts , 2003, Inf. Sci..

[5]  Charles Halliman Business Intelligence Using Smart Techniques : Environmental Scanning Using Text Mining and Competitor Analysis Using Scenarios and Manual Simulation , 2001 .

[6]  Tetsuya Nasukawa,et al.  Text analysis and knowledge mining system , 2001, IBM Syst. J..

[7]  Martin Rajman,et al.  Text Mining: Natural Language techniques and Text Mining applications , 1998 .

[8]  Tony Cornford,et al.  Project Research in Information Systems: A Student's Guide , 1996 .

[9]  S. Sumathi,et al.  Data Warehousing, Data Mining, and OLAP , 2006 .

[10]  Christina Alexandris,et al.  Greek Verb Semantic Processing for Stock Market Text Mining , 2000, Natural Language Processing.

[11]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[12]  Hsinchun Chen,et al.  Knowledge Management Systems: A Text Mining Perspective , 2001 .

[13]  Lucy Marshall,et al.  Finding needles in the haystack : Mining meets the Web , 1999 .

[14]  Peter Rob,et al.  Database systems - design, implementation, and management (2. ed.) , 1995 .

[15]  Marti A. Hearst Untangling Text Data Mining , 1999, ACL.

[16]  Christopher R. Westphal,et al.  Data Mining Solutions: Methods and Tools for Solving Real-World Problems , 1998 .

[17]  Bhavani Thuraisingham,et al.  Data Mining: Technologies, Techniques, Tools, and Trends , 1998 .

[18]  Michael Hehenberger,et al.  Text-based knowledge discovery: search and mining of life-sciences documents. , 2002, Drug discovery today.

[19]  Dan Sullivan,et al.  Document Warehousing and Text Mining: Techniques for Improving Business Operations, Marketing, and Sales , 2001 .

[20]  Hajo Hippner,et al.  Text Mining , 2006, Informatik-Spektrum.

[21]  Peter Rob,et al.  Database systems : design, implementation, and management , 2000 .