Validation of a web mining technique to measure innovation in the Canadian nanotechnology-related community

In this exploratory study, we examine a methodology that uses web mining to source data for analysing innovation and commercialisation processes in Canadian nanotechnology firms. Seventy-nine websites were extracted and analysed using keywords related to four core concepts (R&D, intellectual property, collaboration and external financing) that are especially important for the commercialisation of nanotechnology. To validate the methodology, we compare our web mining results with those from a classic questionnaire-based survey. The indicators from the two methods are correlated at r=0.306 (p-value=0.007) for R&D, r=0.368 (p-value=0.002) for intellectual property, r=0.222 (p-value=0.071) for collaboration and r=0.222 (p-value=0.067) for external financing. We conclude that some of the data extracted by our web mining technique can be used as proxies for specific variables obtained from more classical methods.
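The validation step described above can be illustrated with a minimal sketch: count concept-related keywords in a firm's website text, then correlate the resulting indicator with the matching survey variable. The keyword lists, function names and the choice of the Pearson coefficient are assumptions made for illustration only; the study's actual keywords and statistical procedure are not given in this abstract.

```python
import math
import re

# Hypothetical keyword lists for the four concepts; the study's real
# keywords are not stated in the abstract.
KEYWORDS = {
    "rd": ["research", "development", "laboratory"],
    "ip": ["patent", "trademark", "licence"],
    "collaboration": ["partner", "collaboration", "alliance"],
    "financing": ["investor", "venture capital", "grant"],
}


def keyword_counts(text, keywords=KEYWORDS):
    """Count keyword occurrences per concept in a page's text.

    Matching is case-insensitive and anchored at a word start, so
    "patent" also matches "patents".
    """
    lowered = text.lower()
    return {
        concept: sum(
            len(re.findall(r"\b" + re.escape(term), lowered)) for term in terms
        )
        for concept, terms in keywords.items()
    }


def pearson_r(xs, ys):
    """Pearson correlation between two equal-length numeric sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)
```

In use, one would build a list of web-derived counts for a concept (one value per firm) and a list of the corresponding survey responses for the same firms, then pass both to `pearson_r`. The sketch does not compute a p-value; a statistics package would be needed for significance testing.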
