Building Updated Research Agenda by Investigating Papers Indexed on Google Scholar: A Natural Language Processing Approach

Under many circumstances, scholars need to identify new research directions by going through many different databases to identify the research gap and identify areas which have not yet been studied thus far. Checking all the electronic databases is tiresome, and one often misses the important pieces. In this paper, we propose to shorten the time required for identifying the research gap by using web scraping and natural language processing. We tested this approach by reviewing three distinct areas: (i) safety awareness, (ii) housing price, (iii) sentiment and artificial intelligence from 1988 to 2019. Tokenisation was used to parse the titles of the publications indexed on Google Scholar. We then ranked the collocations from the highest to the lowest frequency. Thus, we determined the sets of keywords that had not been stated in the title and identified the initial idea as a research void.

[1]  Nuno Horta,et al.  Expert Systems With Applications , 2022 .

[2]  Canmian Liu,et al.  Impacts of Haze on Housing Prices: An Empirical Analysis Based on Data from Chengdu (China) , 2018, International journal of environmental research and public health.

[3]  Habin Lee,et al.  Data properties and the performance of sentiment classification for electronic commerce applications , 2017, Inf. Syst. Frontiers.

[4]  Loes M. M. Braun,et al.  Natural Language Processing in Radiology: A Systematic Review. , 2016, Radiology.

[5]  R. Li,et al.  Sustainable Construction Safety Knowledge Sharing: A Partial Least Square-Structural Equation Modeling and A Feedforward Neural Network Approach , 2019, Sustainability.

[6]  J. Vehí,et al.  Artificial Intelligence for Diabetes Management and Decision Support: Literature Review , 2018, Journal of medical Internet research.

[7]  Xiao Wang,et al.  World Cup 2014 in the Twitter World: A big data analysis of sentiments in U.S. sports fans' tweets , 2015, Comput. Hum. Behav..

[8]  M. Elsinga,et al.  Making a home out of a temporary dwelling: a literature review and building transformation case studies , 2018, Intelligent Buildings International.

[9]  Victoria I. Marín,et al.  Systematic review of research on artificial intelligence applications in higher education – where are the educators? , 2019, International Journal of Educational Technology in Higher Education.

[10]  Xiaocui Yin,et al.  How do housing prices affect consumption in China? New evidence from a continuous wavelet analysis , 2018, PloS one.

[11]  R. Li,et al.  Ranking of Risks for Existing and New Building Works , 2019, Sustainability.

[12]  Constantinos Alexiou,et al.  Homeownership motivation, rationality, and housing prices: Evidence from gloom, boom, and bust-and-boom economies , 2018, International Journal of Finance & Economics.

[13]  R. Li,et al.  Have Housing Prices Gone With the Smelly Wind? Big Data Analysis on Landfill in Hong Kong , 2018 .

[14]  D. Tetemke,et al.  Awareness of Occupational Hazards and Utilization of Safety Measures among Welders in Aksum and Adwa Towns, Tigray Region, Ethiopia, 2013 , 2019, Journal of environmental and public health.