n-stage Latent Dirichlet Allocation: A Novel Approach for LDA

The steadily growing volume of data makes analysis increasingly difficult. For textual data, natural language processing offers many models and methods to address this problem, and topic modeling is one of them. Topic modeling uncovers the semantic structure of a text document, and Latent Dirichlet Allocation (LDA) is the most widely used topic modeling method. This article explains in detail the proposed n-stage LDA method, which enables LDA to be used more effectively. Studies on English and Turkish data demonstrate the positive effect of the method. Because the method focuses on reducing the number of words in the dictionary, it can be applied independently of language. The open-source code of the method and an example are available at https://github.com/anil1055/n-stage_LDA
