Patent Keyword Extraction for Sustainable Technology Management

Recently, sustainable growth and development have become important issues for governments and corporations. However, maintaining sustainable development is very difficult, largely because the sociocultural and political backgrounds that shape it change over time [1]. As these backgrounds change, the technologies for sustainability change as well, so governments and companies attempt to predict and manage technology using patent analyses; even so, rapidly changing technology markets remain very difficult to predict. The best way to gain insight into technology management in such a market is to build a technology management direction and strategy that is flexible and adaptable to the volatile environment through continuous monitoring and analysis. Quantitative patent analysis using text mining is an effective method for sustainable technology management. Many studies have used text mining and word-based patent analyses to extract keywords and remove noise words. Because the extracted keywords are considered to have a significant effect on the subsequent analysis, researchers need to check carefully whether they are valid. However, most prior studies assume that the extracted keywords are appropriate without evaluating their validity. Therefore, the criteria used to extract keywords need to change. Until now, these criteria have focused on how well a patent can be classified according to its technical characteristics within the collected patent data set, typically using term frequency–inverse document frequency (TF-IDF) weights that are calculated by comparing the words across patents. However, this approach is not suitable for analyzing a single patent. Therefore, we need keyword selection criteria and an extraction method capable of representing the technical characteristics of a single patent without comparing it with other patents.
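To illustrate why corpus-based weighting breaks down for a single patent, the following minimal sketch computes term frequency–inverse document frequency (TF-IDF) weights in plain Python. It assumes the unsmoothed variant tf × log(N/df), which the text does not specify; the function name `tf_idf` and the toy documents are hypothetical and are not taken from the study.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumes the unsmoothed form tf * log(N / df); other variants exist.
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        tf = Counter(doc)
        weights.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in tf.items()
        })
    return weights


# Hypothetical toy "patents", already tokenized.
corpus = [
    "solar cell efficiency layer".split(),
    "battery cell electrode layer".split(),
    "wind turbine blade".split(),
]

corpus_weights = tf_idf(corpus)      # weights relative to the whole set
single_weights = tf_idf(corpus[:1])  # the same patent analyzed alone
```

With the three-document corpus, a term unique to one document ("solar") outweighs a term shared by two documents ("cell"). When the first document is weighted alone, N = df = 1 for every term, so every weight is log(1) = 0 and the scheme cannot rank keywords at all.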
In this study, we propose a methodology for extracting valid keywords from a single patent document using relevant papers and their authors' keywords. We evaluated the validity of the proposed method and its practical performance in a statistical verification experiment. First, by comparing the document similarity between papers and patents that contain the same search terms in their titles, we verified that extracting patent keywords from relevant papers and their authors' keywords is valid. Second, we confirmed that the proposed method improves precision by about 17.4% over the existing method. We expect the outcome of this study to increase the reliability and validity of text-mining-based patent analysis research and to improve the quality of such studies.
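The two quantities used in the evaluation, document similarity and precision, can be sketched as follows. This is an illustration under stated assumptions, not the study's procedure: cosine similarity over raw term counts is assumed as the similarity measure, precision is assumed to mean the share of extracted keywords that appear in a reference keyword set, and the function names and sample keywords are hypothetical.

```python
import math
from collections import Counter


def cosine_similarity(tokens_a, tokens_b):
    """Cosine similarity between two token lists, using raw term counts."""
    a, b = Counter(tokens_a), Counter(tokens_b)
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    # An empty document has no direction; treat its similarity as 0.
    return dot / norm if norm else 0.0


def precision(extracted, reference):
    """Fraction of extracted keywords that also occur in the reference set."""
    extracted = set(extracted)
    if not extracted:
        return 0.0
    return len(extracted & set(reference)) / len(extracted)


# Hypothetical example: keywords extracted from a patent, compared with
# the author keywords of a related paper.
patent_keywords = ["solar", "cell", "apparatus"]
author_keywords = ["solar", "cell", "efficiency"]
score = precision(patent_keywords, author_keywords)
```

In this toy case two of the three extracted keywords match the author keywords, so `score` is 2/3. A relative improvement such as the one reported would be computed by comparing this kind of score between two extraction methods on the same documents.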

[1] Guangquan Zhang, et al. Topic-based technological forecasting based on patent data: A case study of Australian patents from 2000 to 2014, 2017.

[2] Bokyoung Kang, et al. Novelty-focused patent mapping for technology opportunity analysis, 2015.

[3] Chao-Fu Hong, et al. Extracting the significant-rare keywords for patent analysis, 2009, Expert Syst. Appl.

[4] Ichiro Sakata, et al. Detecting potential technological fronts by comparing scientific papers and patents, 2011.

[5] Jaehyun Choi, et al. A Predictive Model of Technology Transfer Using Patent Analysis, 2015.

[6] Fiona E. Murray. Innovation as co-evolution of scientific and technological networks: exploring tissue engineering, 2002.

[7] In-Cheol Kim, et al. Comparative Influences of Precipitation and River Stage on Groundwater Levels in Near-River Areas, 2015.

[8] Yuen-Hsien Tseng, et al. Text mining techniques for patent analysis, 2007, Inf. Process. Manag.

[9] Alan C. Brent, et al. Sustainable development: a conceptual framework for the technology management field of knowledge and a departure for further research, 2012.

[10] D. Leonard-Barton. Core capabilities and core rigidities: A paradox in managing new product development, 1992.

[11] Shih-Ming Pi, et al. Development of a Patent Retrieval and Analysis Platform - A hybrid approach, 2011, Expert Syst. Appl.

[12] Martin G. Moehrle. Measures for textual patent similarities: a guided way to select appropriate approaches, 2010, Scientometrics.

[13] Sungjoo Lee, et al. Keyword selection and processing strategy for applying text mining to patent analysis, 2015, Expert Syst. Appl.

[14] Michele Grimaldi, et al. The patent portfolio value analysis: A new framework to leverage patent information for strategic technology planning, 2015.

[15] Aviv Segev, et al. Analyzing multilingual knowledge innovation in patents, 2013, Expert Syst. Appl.

[16] Calvin S. Weng, et al. A New Comprehensive Patent Analysis Approach for New Product Design in Mechanical Engineering, 2011.

[17] Sunghae Jun, et al. A Novel Forecasting Methodology for Sustainable Management of Defense Technology, 2015.

[18] Kumiko Miyazaki, et al. Evaluating the effectiveness of keyword search strategy for patent identification, 2013.

[19] Sunghae Jun, et al. A Patent Analysis for Sustainable Technology Management, 2016.

[20] Sungjoo Lee, et al. An approach to discovering new technology opportunities: Keyword-based patent map approach, 2009.

[21] Jun Ye, et al. Cosine similarity measures for intuitionistic fuzzy sets and their applications, 2011, Math. Comput. Model.

[22] Daniel A. McAdams, et al. Biologically Meaningful Keywords for Functional Terms of the Functional Basis, 2011.

[23] Ulrich Lichtenthaler, et al. Determinants of patent value: Insights from a simulation analysis, 2010.

[24] Sang-Chan Park, et al. Visualization of patent analysis for emerging technology, 2008, Expert Syst. Appl.

[25] Holger Ernst, et al. Patent information for strategic technology management, 2003.

[26] Tadeusz Gerstaenkorn, et al. Correlation of intuitionistic fuzzy sets, 1991.