Data science from a library and information science perspective

Data science is a relatively new field that has gained considerable attention in recent years. It requires a wide range of knowledge and skills from different disciplines, including mathematics and statistics, computer science and information science. The purpose of this paper is to present the results of a study that explored the field of data science from the library and information science (LIS) perspective.

Research publications on data science were analyzed on the basis of papers indexed in the Web of Science database. The following research questions were posed: What are the main tendencies in publication years, document types, countries of origin, source titles, authors of publications, affiliations of the article authors and the most cited articles related to data science in the field of LIS? What are the main themes discussed in the publications from the LIS perspective?

The highest contribution to data science comes from the computer science research community. The contribution of the information science and library science community is quite small. However, there has been a continuous increase in articles since 2015. The main document types are journal articles, followed by conference proceedings and editorial material. The top three journals that publish data science papers from the LIS perspective are the Journal of the American Medical Informatics Association, the International Journal of Information Management and the Journal of the Association for Information Science and Technology. The top five publishing countries are the USA, China, England, Australia and India. The most cited article has received 112 citations. The analysis revealed that the data science field is interdisciplinary by nature: in addition to LIS, the papers belonged to several other research areas. The reviewed articles fell into six broad categories: data science education and training; knowledge and skills of the data professional; the role of libraries and librarians in the data science movement; tools, techniques and applications of data science; data science from the knowledge management perspective; and data science from the perspective of health sciences.

This study analyzed only research papers in the Web of Science database and therefore covers only a portion of the scientific papers published in the field of LIS. In addition, only publications with the term “data science” in the topic area of the Web of Science database were analyzed. Several relevant studies are therefore not discussed in this paper, either because they are not indexed in the Web of Science database or because they were related to other keywords such as “e-science,” “e-research,” “data service,” “data curation” or “research data management.”

The field of data science has not previously been explored using bibliographic analysis of publications from the LIS perspective. This paper helps to better understand the field of data science and the perspectives for information professionals.
