RETRACTED ARTICLE: In text mining: detection of topic and sub-topic using multiple spider hunting model

In this electronic era, people communicate and share data rapidly through social media. Within a fraction of a second, millions of text messages arrive through WhatsApp, Facebook, Twitter, email, and similar channels. It is hard to pick out the relevant data and information from such a massive volume of text documents. Instead of reading every document in full, there is a need to determine the topic and subtopics of a corpus. Existing techniques take considerable time to detect the topic and subtopics of a corpus, so we propose a dynamic multiple spider hunting algorithm. Because it uses multiple spiders, this technique can recognize the desired artifacts in minimal time and achieves superior performance compared to other techniques.
