Author-Topic Modeling of DESIDOC Journal of Library and Information Technology (2008-2017), India

This study presents a method to analyze textual data and applying it to the field of Library and Information Science. This paper subsumes a special case of Latent Dirichlet Allocation and Author-Topic models where each article has one unique author and each author has one unique topic. Topic Modeling Toolkit is used to perform the author-topic modeling. The study further which considers topics and their changes over time by taking into account both the word co-occurrence pattern and time. 393 full-text articles were downloaded from DESIDOC Journal of Library and Information Technology and were analyzed accordingly. 16 core topics have been identified throughout the period of ten years. These core topics can be considered as the core area of research in the journal from 2008 to 2017. This paper further identifies top five authors associated with the representative articles for each studied year. These authors can be treated as the subject-experts for the modeled topics as indicated. The results of the study can serve as a platform to determine the research trend; core areas of research; and the subject-experts related to those core areas in the field the Library and Information Science in India.

[1]  Tiantian Wang,et al.  THC-DAT: a document analysis tool based on topic hierarchy and context information , 2016, Libr. Hi Tech.

[2]  Stuart J. Barnes,et al.  Mining meaning from online ratings and reviews: Tourist satisfaction analysis using latent dirichlet allocation , 2017 .

[3]  Jay Lee,et al.  Recent advances and trends in predictive manufacturing systems in big data environment , 2013 .

[4]  Thomas L. Griffiths,et al.  Learning author-topic models from text corpora , 2010, TOIS.

[5]  Kun Lu,et al.  Topic scientific community in science: a combined perspective of scientific collaboration and topics , 2017, Scientometrics.

[6]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[7]  Younghee Noh,et al.  Imagining Library 4.0: Creating a Model for Future Libraries , 2015 .

[8]  Alexander Mehler,et al.  Enhancing document modeling by means of open topic models: Crossing the frontier of classification schemes in digital libraries by example of the DDC , 2009, Libr. Hi Tech.

[9]  Katrina Fenlon,et al.  Building topic models in a federated digital library through selective document exclusion , 2011, ASIST.

[10]  Kun Lu,et al.  Vocabulary size and its effect on topic representation , 2017, Inf. Process. Manag..

[11]  K. C. Garg,et al.  Bibliometrics of Library and Information Science research in India during 2004-2015 , 2017 .

[12]  Lin-Chih Chen An effective LDA-based time topic model to improve blog search performance , 2017, Inf. Process. Manag..

[13]  Luca Cagliero,et al.  Discovering cross-topic collaborations among researchers by exploiting weighted association rules , 2018, Scientometrics.

[14]  S. Godfrey Winster,et al.  Event identification in social media through latent dirichlet allocation and named entity recognition , 2014, Proceedings of IEEE International Conference on Computer Communication and Systems ICCCS14.

[15]  Shaowen Yao,et al.  An overview of topic modeling and its current applications in bioinformatics , 2016, SpringerPlus.

[16]  Olessia Koltsova,et al.  Mapping the public agenda with topic modeling: The case of the Russian livejournal , 2013 .

[17]  Hideaki Takeda,et al.  Topic Representation of Researchers' Interests in a Large-Scale Academic Database and Its Application to Author Disambiguation , 2015, IEICE Trans. Inf. Syst..

[18]  Hai Jin,et al.  Future Generation Computer Systems , 2022 .

[19]  Saeid Nahavandi,et al.  Unsupervised mining of long time series based on latent topic model , 2013, Neurocomputing.

[20]  Vishal Dattatray Bapte DESIDOC Journal of Library and Information Technology (DJLIT): A Bibliometric Analysis of Cited References , 2017 .

[21]  Shu Fang,et al.  Empirical study of constructing a knowledge organization system of patent documents using topic modeling , 2014, Scientometrics.

[22]  Cassidy R. Sugimoto,et al.  The shifting sands of disciplinary development: Analyzing North American Library and Information Science dissertations using latent Dirichlet allocation , 2011, J. Assoc. Inf. Sci. Technol..

[23]  Lu Yang,et al.  Towards big topic modeling , 2017, Inf. Sci..

[24]  Juyoung Kang,et al.  Analyzing the discriminative attributes of products using text mining focused on cosmetic reviews , 2018, Inf. Process. Manag..

[25]  Leah G. Nichols A topic model approach to measuring interdisciplinarity at the National Science Foundation , 2014, Scientometrics.

[26]  Saeedeh Momtazi,et al.  Unsupervised Latent Dirichlet Allocation for supervised question classification , 2018, Inf. Process. Manag..

[27]  Carlos G. Figuerola,et al.  Mapping the evolution of library and information science (1978–2014) using topic modeling on LISA , 2017, Scientometrics.

[28]  Tingcan Ma,et al.  Topic based research competitiveness evaluation , 2018, Scientometrics.

[29]  Jung-Hwan Bae,et al.  Twitter Issue Tracking System by Topic Modeling Techniques , 2014 .

[30]  Min Chen,et al.  iDoctor: Personalized and professionalized medical recommendations based on hybrid matrix factorization , 2017, Future Gener. Comput. Syst..

[31]  Zijian Wang,et al.  Collective topical PageRank: a model to evaluate the topic-dependent academic impact of scientific papers , 2018, Scientometrics.

[32]  Kun Lu,et al.  Measuring author research relatedness: A comparison of word-based, topic-based, and author cocitation approaches , 2012, J. Assoc. Inf. Sci. Technol..