Science foresight using life-cycle analysis, text mining and clustering: A case study on natural ventilation

Abstract: Science foresight comprises a range of methods that analyze past, present and expected research trends and use this information to predict the future status of different fields of science and technology. Because it can identify high-potential development directions, science foresight can be a useful tool to support the management and planning of future research activities. Science foresight analysts can choose from a large variety of approaches. There is, however, relatively little information about how these approaches can be applied effectively. This paper describes a three-step methodological framework for science foresight based on published research papers, consisting of (i) life-cycle analysis, (ii) text mining and (iii) knowledge gap identification by means of automated clustering. The three steps are connected through the research methodology of each paper, as identified by text mining. The potential of combining these three steps in one framework is illustrated by analyzing scientific literature on wind catchers, a natural ventilation concept that has received considerable attention from academia but has seen little application in practice. The identified knowledge gaps show that the automated foresight analysis is indeed able to find uncharted research areas. Results from a sensitivity analysis further show the importance of using full texts for text mining instead of only the title, keywords and abstract. The paper concludes with a reflection on the methodological framework and gives directions for its intended use in future studies.
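As a rough illustration of how the three steps could fit together, the sketch below uses a tiny invented corpus. It is not the authors' implementation: plain keyword matching stands in for text mining, and a simple topic-by-method cross-tabulation stands in for automated clustering. The paper list, keyword lists and function names are all hypothetical.

```python
from collections import Counter
from itertools import product

# Hypothetical toy corpus: (year, text) pairs invented for illustration only.
PAPERS = [
    (2008, "wind tunnel experiment on a one-sided wind catcher"),
    (2010, "cfd simulation of a two-sided wind catcher"),
    (2011, "wind tunnel experiment and cfd simulation of multi-opening wind catcher"),
    (2012, "cfd simulation of evaporative cooling in a wind tower"),
]

# Assumed keyword lists; a real study would derive these from the texts.
METHODS = {"experimental": ["experiment", "wind tunnel"], "numerical": ["cfd", "simulation"]}
TOPICS = {"ventilation": ["wind catcher"], "cooling": ["evaporative", "cooling"]}


def tag(text, vocabulary):
    """Return the set of labels whose keywords occur in the text."""
    return {label for label, words in vocabulary.items() if any(w in text for w in words)}


def cumulative_counts(papers):
    """Step (i): cumulative number of papers per year, the raw input to a life-cycle curve."""
    per_year = Counter(year for year, _ in papers)
    total, out = 0, {}
    for year in sorted(per_year):
        total += per_year[year]
        out[year] = total
    return out


def knowledge_gaps(papers):
    """Steps (ii)-(iii): tag each paper, then list topic/method pairs with no papers."""
    covered = set()
    for _, text in papers:
        covered |= set(product(tag(text, TOPICS), tag(text, METHODS)))
    return sorted(set(product(TOPICS, METHODS)) - covered)


if __name__ == "__main__":
    print(cumulative_counts(PAPERS))
    print(knowledge_gaps(PAPERS))
```

In this toy setting, a topic/method pair with no papers plays the role of a candidate knowledge gap. A fuller analysis would presumably fit a growth curve to the cumulative counts and cluster term vectors built from full texts rather than short titles.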
