Improving efficacy of library Services: ARIMA modelling for predicting book borrowing for optimizing resource utilization

Introduction Academic libraries are an essential component of colleges and universities throughout the world and play a central role in academic life (1). Regular library usage by students improves critical thinking (2,3), student learning and engagement, collaborative learning, student-faculty interactions, and academic challenges (4,5). Even the academic performance of students improves with library usage (6,7). Academic librarians face immense pressure to meet the needs of ever-increasing users and commitments for modernization and improvements (8). Further, with the advent of information-communication revolution, academic libraries are exploring service developments to support a series of new scenarios such as publication and scholarly communications; extensive use of digital resources; increasing heterogeneous student population; ICT-based learning and distance learning. Hence, libraries with limited resources, particularly in terms of staffing face enormous challenges in management (9). Librarians need to predict library usage over time, as this helps in planning the resources required to meet the usage. The future usage might not be stationary and there could be periods of huge demand, interleaved with long periods. A model for predicting the library usage should take into account this variability to be accurate in its prediction. Hence, seasonal prediction models are likely to improve the accuracy of prediction. The study proposes a prediction model that forecasts library usage for a period of 12 months. Time series analysis allows examining temporal dependency of data and generating predictions. Time series consists of two components: trend and seasonality. Trend depicts a linear or non-linear component that is non-repetitive within the time range, whereas seasonality repeats at systematic intervals over time (10). Time series analysis using Autoregressive Integrated Moving Average (ARIMA) models are very effective in modelling the time-dependent data as they structure time dependence between the observations (11). The predictions made using ARIMA models are more accurate than those obtained by other statistical methods (12-15). Earlier, ARIMA models are successfully forecasted household electric consumption (16); software evolution (17); incidence of several infectious diseases (12,18-25) and use of health facilities (26,27). However, such predictions are not available in the context of library usage. Hence, in this study, the authors used Box-Jenkings ARIMA modelling to monitor and predict book borrowing in Sri Venkateswara College of Engineering and Technology, Chittoor. The data pertaining to book borrowing from 1998 to 2013 fit into an ARIMA model using Box-Jenkings approach (28). The fitted model has been used to predict book borrowing for the year 2014 with 12-steps ahead and 1-step ahead approach. Root mean squared error (RMSE) was used to compare the predictive power of the two approaches. Further, both approaches have been tested using Wilcoxon signed rank test (29). Objectives of the study * To detect any seasonal pattern in the book borrowing services and identify the underlying factors associated with it; * To identify trends in the distribution of book borrowing over time; and * To validate the usage of ARIMA model with seasonal information to accurate forecasting of book borrowing. Methodology The study was conducted at the college library of Sri Venkateswara College of Engineering and Technology, Chittoor established in 1998. For the analyses, monthly data on book borrowing for the period 1998-2013 was used to fit an ARIMA model, which later used to predict out-of-sample book borrowing for the year 2014. Since estimation procedures are available only for stationary series, Augmented Dickey-Fuller (ADF) unit root test and Box-Jenkins approach on ARIMA modelling of time series, used to check the stationary series of the data, which was ascertained by observing the plot of book borrowing that involves a four-step process (11,15,24,28-33). …

[1]  Claire Creaser The role of the academic library , 2014 .

[2]  Miguel Goulão,et al.  Software Evolution Prediction Using Seasonal Time Analysis: A Comparative Study , 2012, 2012 16th European Conference on Software Maintenance and Reengineering.

[3]  Jin Ru-feng Forecasting incidence of intestinal infectious diseases in mainland China with ARIMA model and GM(1,1) model , 2008 .

[4]  David Bawden The library in the twenty-first century: new services for the information age: Peter Brophy; Library Association Publishing, London, 240pp, price £39.95, ISBN 1-85604-75-4 , 2001, Int. J. Inf. Manag..

[5]  Peraphon Sophatsathit,et al.  Resource Allocation with Exponential Model Prediction for Server , 2015 .

[6]  A. Earnest,et al.  Using autoregressive integrated moving average (ARIMA) models to predict and monitor the number of beds occupied during a SARS outbreak in a tertiary hospital in Singapore , 2005, BMC health services research.

[7]  Pasapitch Chujai,et al.  Time Series Analysis of Household Electric Consumption with ARIMA and ARMA Models , 2022 .

[8]  Thomas Hill Statistics: Methods and Applications , 2005 .

[9]  Roch Roy,et al.  Stochastic modeling of empirical time series of childhood infectious diseases data before and after mass vaccination , 2006, Emerging themes in epidemiology.

[10]  P. Young,et al.  Time series analysis, forecasting and control , 1972, IEEE Transactions on Automatic Control.

[11]  L. Braskamp,et al.  Involvement and growth of students in three academic programs , 1988 .

[12]  W. Hu,et al.  Climate variation and incidence of Ross river virus in Cairns, Australia: a time-series analysis. , 2001, Environmental health perspectives.

[13]  Ying Zhang,et al.  Time series analysis of a Web search engine transaction log , 2009, Inf. Process. Manag..

[14]  P. Milner,et al.  Ten-year follow-up of ARIMA forecasts of attendances at accident and emergency departments in the Trent region. , 1997, Statistics in medicine.

[15]  Leon Hirsch Applied Time Series Analysis For The Social Sciences , 2016 .

[16]  John C. Smart,et al.  Educational expenditures and student engagement: When does money matter? , 2006 .

[17]  Richard A. Berk,et al.  Applied Time Series Analysis for the Social Sciences , 1980 .

[19]  Ethelene Whitmire Development of Critical Thinking Skills: An Analysis of Academic Library Experiences and Other Measures , 1998 .

[20]  Weizhong Yang,et al.  Forecasting incidence of hemorrhagic fever with renal syndrome in China using ARIMA model , 2011, BMC infectious diseases.

[21]  Walter Vandaele,et al.  Statistical Methods for Comparative Studies Techniques for Bias Reduction , 1980 .

[22]  Ying-Wong Cheung,et al.  Lag Order and Critical Values of the Augmented Dickey-Fuller Test , 1995 .

[23]  Richard M. Heiberger,et al.  Statistical Analysis and Data Display: An Intermediate Course with Examples in S-Plus, R, and SAS , 2004 .

[24]  Peter Brophy The library in the twenty-first century : new services for the information age , 2001 .

[25]  Robert M. Gonyea,et al.  The Role of the Academic Library in Promoting Student Engagement in Learning , 2003 .

[26]  Terrence A. Brooks Naive vs. Sophisticated Methods of Forecasting Public Library Circulations. , 1984 .

[27]  U Helfenstein,et al.  The use of transfer function models, intervention analysis and related time series methods in epidemiology. , 1991, International journal of epidemiology.

[28]  G. Box,et al.  On a measure of lack of fit in time series models , 1978 .

[29]  B. Asha Stochastic Model for the Electronic Access of an Article , 2008 .

[30]  Displays for Direct Comparison of ARIMA Models , 2002 .

[31]  Patrick Barkey Patterns of Student Use of a College Library , 1965 .

[32]  Richard Weber,et al.  A Forecasting Methodology Using Support Vector Regression and Dynamic Feature Selection , 2006, J. Inf. Knowl. Manag..

[33]  R. Farmer,et al.  Models for forecasting hospital bed requirements in the acute sector. , 1990, Journal of epidemiology and community health.

[34]  R. Allard Use of time-series analysis in infectious disease surveillance. , 1998, Bulletin of the World Health Organization.

[35]  D Prothero,et al.  Finding causes of seasonal diseases using time series analysis. , 1981, International journal of epidemiology.

[36]  F F Nobre,et al.  Dynamic linear model and SARIMA: a comparison of their forecasting performance in epidemiology , 2001, Statistics in medicine.

[37]  Kenneth D. Mandl,et al.  Time series modeling for syndromic surveillance , 2003, BMC Medical Informatics Decis. Mak..

[38]  John Ignatius Griffin,et al.  Statistics; methods and applications , 1963 .

[39]  Patricia B. Knapp,et al.  The Monteith college library experiment , 1966 .

[40]  Lee Rainie,et al.  Library Services in the Digital Age. , 2013 .

[41]  S B Thacker,et al.  An evaluation of influenza mortality surveillance, 1962-1979. I. Time series forecasts of expected pneumonia and influenza deaths. , 1981, American journal of epidemiology.

[42]  U Helfenstein,et al.  Box-Jenkins modelling of some viral infectious diseases. , 1986, Statistics in medicine.

[43]  George D. Kuh,et al.  Student Experiences With Information Technology And Their Relationship To Other Aspects Of Student Engagement , 2005 .