Selection criteria for text mining approaches

Text mining include several techniques like categorization of text, clustering, etc.Text mining techniques can be used to finding useful information from documents.We propose some criteria to evaluate the effectiveness of text mining techniques.These proposed criteria can facilitate the selection of appropriate technique. Text mining techniques include categorization of text, summarization, topic detection, concept extraction, search and retrieval, document clustering, etc. Each of these techniques can be used in finding some non-trivial information from a collection of documents. Text mining can also be employed to detect a document's main topic/theme which is useful in creating taxonomy from the document collection. Areas of applications for text mining include publishing, media, telecommunications, marketing, research, healthcare, medicine, etc. Text mining has also been applied on many applications on the World Wide Web for developing recommendation systems. We propose here a set of criteria to evaluate the effectiveness of text mining techniques in an attempt to facilitate the selection of appropriate technique.

[1]  Elke Achtert,et al.  Finding Hierarchies of Subspace Clusters , 2006, PKDD.

[2]  Sotiris B. Kotsiantis,et al.  Supervised Machine Learning: A Review of Classification Techniques , 2007, Informatica.

[3]  Ben Shneiderman,et al.  Discovering interesting usage patterns in text collections: integrating text mining with visualization , 2007, CIKM '07.

[4]  Hazem M. Hajj,et al.  A Framework for Emotion Mining from Text in Online Social Networks , 2010, 2010 IEEE International Conference on Data Mining Workshops.

[5]  Andreas Krause,et al.  Data association for topic intensity tracking , 2006, ICML '06.

[6]  Hans-Peter Kriegel,et al.  Clustering high-dimensional data: A survey on subspace clustering, pattern-based clustering, and correlation clustering , 2009, TKDD.

[7]  Stan Matwin,et al.  Feature Engineering for Text Classification , 1999, ICML.

[8]  Bei Yu,et al.  A cross-collection mixture model for comparative text mining , 2004, KDD.

[9]  Gaurav Sharma,et al.  A survey on text mining techniques , 2014 .

[10]  Haesun Park,et al.  Cluster-Preserving Dimension Reduction Methods for Document Classification , 2008 .

[11]  Sanjeev Dhawan,et al.  A Framework for Polarity Classification and Emotion Mining from Text , 2014 .

[12]  Stan Matwin,et al.  Statistical Phrases in Automated Text Categorization , 2000 .

[13]  Abderrafih Lehmam Essential summarizer: innovative automatic text summarization software in twenty languages , 2010, RIAO.

[14]  G. Tholkappia Arasu,et al.  Enhancing Text Clustering Using Concept-based Mining Model , 2012 .

[15]  Yi Zhang,et al.  A Text Mining Model for Strategic Alliance Discovery , 2012, 2012 45th Hawaii International Conference on System Sciences.

[16]  John Mylopoulos,et al.  Text Mining Through Semi Automatic Semantic Annotation , 2006, PAKM.

[17]  Rizwan Ahmad Document Topic Generation in Text Mining by Using Cluster Analysis with EROCK , 2010 .

[18]  Patrick Lambrix,et al.  Selecting an Ontology for Biomedical Text Mining , 2009, BioNLP@HLT-NAACL.

[19]  V. M. Navaneethakumar,et al.  A Consistent Web Documents Based Text Clustering Using Concept Based Mining Model , 2012 .

[20]  Jesualdo Tomás Fernández-Breis,et al.  Searching research papers using clustering and text mining , 2013, CONIELECOMP 2013, 23rd International Conference on Electronics, Communications and Computing.

[21]  Sumit Vashishta,et al.  Efficient Retrieval of Text for Biomedical Domain using Data Mining Algorithm , 2011 .

[22]  Gurpreet Singh Lehal,et al.  A Survey of Text Mining Techniques and Applications , 2009 .

[23]  Vincent D. Blondel,et al.  Automatic discovery of similar words , 2004 .

[24]  Kulthida Tuamsuk,et al.  Data Mining and Its Applications for Knowledge Management: A Literature Review from 2007 to 2012 , 2012, ArXiv.

[25]  Farhad Soleimanian Gharehchopogh,et al.  Study on Information Extraction Methods from Text Mining and Natural Language Processing Perspectives , 2012 .

[26]  Carlotta Domeniconi,et al.  Text Clustering with Local Semantic Kernels , 2008 .

[27]  Sanjeev Dhawan,et al.  Emotion Mining Techniques in Social Networking Sites , 2014 .

[28]  Santosh Kumar Rath,et al.  Data Mining : a Healthy Tool for Your Information Retrieval and Text Mining , 2011 .

[29]  Kaizhong Zhang,et al.  An Index Structure for Data Mining and Clustering , 2000, Knowledge and Information Systems.

[30]  Yue Xu,et al.  Deploying Approaches for Pattern Refinement in Text Mining , 2006, Sixth International Conference on Data Mining (ICDM'06).