Enhancing user search experience in digital libraries with rotated latent semantic indexing

This study investigates a semi‐automatic method for creation of topical labels representing the topical concepts in information objects. The method is called rotated latent semantic indexing (rLSI). rLSI has found application in text mining but has not been used for topical labels generation in digital libraries (DLs). The present study proposes a theoretical model and an evaluation framework which are based on the LSA theory of meaning and investigates rLSI in a DL environment. The proposed evaluation framework for rLSI topical labels is focused on human‐information search behavior and satisfaction measures. The experimental systems that utilize those topical labels were built for the purposes of evaluating user satisfaction with the search process. A new instrument was developed for this study and the experiment showed high reliability of the measurement scales and confirmed the construct validity. Data was collected through the information search tasks performed by 122 participants using two experimental systems. A quantitative method of analysis, partial least squares structural equation modeling (PLS‐SEM), was used to test a set of research hypotheses and to answer research questions. The results showed a not significant, indirect effect of topical label type on both guidance and satisfaction. The conclusion of the study is that topical labels generated using rLSI provide the same levels of alignment, guidance, and satisfaction with the search process as topical labels created by the professional indexers using best practices.

[1]  Hao-Ren Ke,et al.  A study on mental models of taggers and experts for article indexing based on analysis of keyword usage , 2014, J. Assoc. Inf. Sci. Technol..

[2]  Steven J. Miller Metadata for Digital Collections: A How-To-Do-It Manual , 2011 .

[3]  Verónica Dahl,et al.  Meaning in Context , 2005, CONTEXT.

[4]  Mark D. Smucker,et al.  Information representation , 2011, Interactive Information Seeking, Behaviour and Retrieval.

[5]  Anna Sidorova,et al.  Uncovering the Intellectual Core of the Information Systems Discipline , 2008, MIS Q..

[6]  Nicholas Evangelopoulos,et al.  Review Articles: TOPICS IN REAL ESTATE RESEARCH, 1973-2010: A LATENT SEMANTIC ANALYSIS , 2013 .

[7]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[8]  Gerald J. Kowalski,et al.  Information Retrieval Systems , 1997, The Information Retrieval Series.

[9]  Julian Warner Human Information Retrieval , 2009 .

[10]  Ankush Gupta,et al.  Latent Semantic Indexing based Intelligent Information Retrieval System for Digital Libraries , 2006, J. Comput. Inf. Technol..

[11]  Shailesh S. Kulkarni,et al.  The Use of Latent Semantic Analysis in Operations Management Research , 2014, Decis. Sci..

[12]  Danielle S. McNamara,et al.  Handbook of latent semantic analysis , 2007 .

[13]  James D. Anderson,et al.  The nature of indexing: how humans and machines analyze messages and texts for retrieval - Part II: Machine indexing, and the allocation of human versus machine effort , 2001, Inf. Process. Manag..

[14]  Victor R. Prybutok,et al.  Latent Semantic Analysis: five methodological recommendations , 2012, Eur. J. Inf. Syst..

[15]  Daniel Tunkelang,et al.  Faceted Search , 2009, Synthesis Lectures on Information Concepts, Retrieval, and Services.

[16]  S. T. Dumais,et al.  Human factors and behavioral science: Statistical semantics: Analysis of the potential performance of key-word information systems , 1983, The Bell System Technical Journal.

[17]  Nicholas Evangelopoulos,et al.  Topics in Real Estate Research, 1973-2010: A Latent Semantic Analysis , 2013 .

[18]  T. Landauer LSA as a Theory of Meaning , 2007 .

[19]  Muh-Chyun Tang Browsing and searching in a faceted information space: A naturalistic study of PubMed users' interaction with a display tool , 2007, J. Assoc. Inf. Sci. Technol..