Exploring the Field of Text Mining

Text mining is the technique of automatically deducing nonobvious but statistically supported novel information from text data sources written in natural language. In today's era of big data and cloud computing, huge amounts of text data are generated online. Because the volume of internet data is growing exponentially, text mining is becoming essential for extracting business intelligence. Next-generation computing will see text mining take its place among other disruptive technologies such as the semantic web, mobile computing, big data, and cloud computing. To be most effective, text mining needs proven techniques. Although structured data mining is an active and mature field, unstructured text mining has only recently emerged, and its challenges differ from those of structured data analytics. In this paper, I survey text mining techniques and a range of interesting and important applications of text mining that can increase business revenue. I give several examples of text mining to show how it can help extract business intelligence. Combined with machine learning techniques, text mining can effectively address new challenges in extracting business intelligence from text data.
