论文信息 - Identifying context of text documents using Naïve Bayes classification and Apriori association rule mining

Identifying context of text documents using Naïve Bayes classification and Apriori association rule mining

Huge amount of unstructured data is available in the form of text documents. Ranking these text documents by considering their context will be very useful in information retrieval. We propose classification of abstracts by considering their context using Naïve Bayes classifier and Apriori association rule algorithm - i.e. Context Based Naive Bayesian and Apriori (CBNBA). In proposed approach, we initially classify the documents using Naïve Bayes. We find the context of an abstract by looking for associated terms which help us understand the focus of the abstract and interpret the information beyond simple keywords. The results indicate that context based classification increases accuracy of classification to great extent and in turn discovers different contexts of the documents. Further this approach can found to be very useful for applications beyond abstract classification where word speaks very little and lead to ambiguous state but context can lead you to right decision/classification.

[2] Hae-Chang Rim,et al. Some Effective Techniques for Naive Bayes Text Classification , 2006, IEEE Transactions on Knowledge and Data Engineering.

[3] Ramakrishnan Srikant,et al. Fast algorithms for mining association rules , 1998, VLDB 1998.

[4] David D. Lewis,et al. Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval , 1998, ECML.

[5] Rakesh Agarwal,et al. Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[6] José Francisco Martínez Trinidad,et al. Finding Maximal Sequential Patterns in Text Document Collections and Single Documents , 2010, Informatica.

[7] A. Chickinsky. Intelligent Searching using Sentence Context , 2008, 2008 IEEE Conference on Technologies for Homeland Security.

[8] David R. Karger,et al. Tackling the Poor Assumptions of Naive Bayes Text Classifiers , 2003, ICML.

[9] Xing Zhang,et al. A new approach to classification based on association rule mining , 2006, Decis. Support Syst..