Identifying search patterns in an image-based digital library

Three months of clickstream transaction log data for an image-based digital library were analyzed for patterns in user session behavior. After extensive log cleaning of the data files, k-means cluster analysis was applied using selected session-related statistics. The outcomes reveal largely uniform session behaviors (97.37%) consisting of a limited number of actions, with a higher number of queries than other session types, but little browsing of individual results. Given the specialized nature of the digital collections within the studied system, most searchers appear to engage in purposeful, directed searching. The remaining session clusters represent higher degrees of browsing over a longer time with fewer queries per session. The session behaviors for this environment are markedly different than those revealed by transaction log studies of other information retrieval environments.

[1]  Amanda Spink,et al.  Searching the Web: the public and their queries , 2001 .

[2]  Xiaohui Liu,et al.  The role of human factors in stereotyping behavior and perception of digital library users: a robust clustering approach , 2007, User Modeling and User-Adapted Interaction.

[3]  Joonho Lee,et al.  End user searching: A Web log analysis of NAVER, a Korean Web search engine , 2005 .

[4]  Panagiotis Germanakos,et al.  Investigating the Relation between Users' Cognitive Style and Web Navigation Behavior with K-means Clustering , 2012, ER Workshops.

[5]  Michalis Sfakakis,et al.  User Behavior Tendencies on Data Collections in a Digital Library , 2002, ECDL.

[6]  Sally Jo Cunningham,et al.  A transaction log analysis of a digital library , 2000, International Journal on Digital Libraries.

[7]  Jin Zhang,et al.  Identifying Web search session patterns using cluster analysis: A comparison of three search environments , 2009, J. Assoc. Inf. Sci. Technol..

[8]  Bernard J. Jansen Searching for digital images on the web , 2008, J. Documentation.

[9]  Gary Marchionini,et al.  Exploratory search , 2006, Commun. ACM.

[10]  Amanda Spink,et al.  Classifying the user intent of web queries using k-means clustering , 2010, Internet Res..

[11]  Hsiao-Tieh Pu,et al.  An analysis of failed queries for web image retrieval , 2008, J. Inf. Sci..

[12]  Mimi Recker,et al.  Teaching Analytics: A Clustering and Triangulation Study of Digital Library User Data , 2012, J. Educ. Technol. Soc..

[13]  Minsoo Park,et al.  Understanding science and technology information users through transaction log analysis , 2013, Libr. Hi Tech.

[14]  Dick Stenmark Identifying clusters of user behavior in intranet search engine log files , 2008, J. Assoc. Inf. Sci. Technol..

[15]  Giorgio Maria Di Nunzio,et al.  Web log analysis: a review of a decade of studies about information acquisition, inspection and interpretation of user interaction , 2011, Data Mining and Knowledge Discovery.

[16]  Daqing He,et al.  Detecting session boundaries from Web user logs , 2000 .

[17]  Dolf Trieschnigg In Search of Cinderella : A Transaction Log Analysis of Folktale Searchers , 2013 .

[18]  Michael D. Cooper,et al.  Using clustering techniques to detect usage patterns in a Web-based information system , 2001, J. Assoc. Inf. Sci. Technol..