Patent analysis and classification prediction of biomedicine industry: SOM-KPCA-SVM model

This paper proposed the application of a combinatorial model of machine learning to patent quality classification and forecasting in the biomedical industry. The model consists of three methods: Self-Organizing Map (SOM), Kernel Principal Component Analysis (KPCA) and Support Vector Machine (SVM), and names it SOM-KPCA-SVM model. The model proposed in this paper is implemented in two steps. First, the SOM groups the patent data and defines the patent level. Second, the patent data is reduced by KPCA to decrease noise, and SVM is applied to KPCA’s patent data to derive the classification results. The study collected 11,251 biopharmaceutical patent data from the patent transaction news. After training the patent quality model, 2196 historical patents were used to verify the performance of the training model. The accuracy of the match between experimental results and actual transaction status reached 84.13%. Therefore, the proposed patent quality method as a preliminary screening solution automatically and effectively evaluates the quality of patents. This method saves valuable time for reviewing experts, facilitates the rapid identification of high-quality patents, and can be used for the development of commercialization and mass customization of products.

[1]  Amy J. C. Trappey,et al.  A patent quality analysis for innovative technology and product development , 2012, Adv. Eng. Informatics.

[2]  Bjørn L. Basberg,et al.  Patents and the measurement of technological change: A survey of the literature☆ , 1987 .

[3]  Jae-Jun Kim,et al.  The structure and knowledge flow of building information modeling based on patent citation network analysis , 2018 .

[4]  Tao Huang,et al.  Patent classification system using a new hybrid genetic algorithm support vector machine , 2010, Appl. Soft Comput..

[5]  K. Hussinger,et al.  Information ambiguity, patents and the market value of innovative assets , 2019, Research Policy.

[6]  Samee U. Khan,et al.  A literature review on the state-of-the-art in patent analysis , 2014 .

[7]  Pei-Chun Lee,et al.  Patent litigation precaution method: analyzing characteristics of US litigated and non-litigated patents from 1976 to 2010 , 2012, Scientometrics.

[8]  Gupeng Zhang,et al.  Private value of patent right and patent infringement: An empirical study based on patent renewal data of China , 2014 .

[9]  Chao-Fu Hong,et al.  A Proposed IPC-Based Clustering and Applied to Technology Strategy Formulation , 2012, ACIIDS.

[10]  Olivier Chapelle,et al.  Model Selection for Support Vector Machines , 1999, NIPS.

[11]  Gülgün Kayakutlu,et al.  Patent value analysis using support vector machines , 2014, Soft Comput..

[12]  Amy J. C. Trappey,et al.  Intelligent patent recommendation system for innovative design collaboration , 2013, J. Netw. Comput. Appl..

[13]  Mika Liukkonen,et al.  Cluster analysis by self-organizing maps: An application to the modelling of water quality in a treatment process , 2013, Appl. Soft Comput..

[14]  Kristen A. Severson,et al.  Opportunities and challenges of real‐time release testing in biopharmaceutical manufacturing , 2017, Biotechnology and bioengineering.

[15]  M. Trajtenberg A Penny for Your Quotes : Patent Citations and the Value of Innovations , 1990 .

[16]  Wentao Hu,et al.  The fault feature extraction and classification of gear using principal component analysis and kernel principal component analysis based on the wavelet packet transform , 2014 .

[17]  Panos M. Pardalos,et al.  Robust chance-constrained support vector machines with second-order moment information , 2018, Ann. Oper. Res..

[18]  D. Harhoff,et al.  Citation Frequency and the Value of Patented Innovation , 1997 .

[19]  Subhashini Venugopalan,et al.  Topic based classification and pattern identification in patents , 2015 .

[20]  David M. Diamond,et al.  Current status on behavioral and biological markers of PTSD: A search for clarity in a conflicting literature , 2013, Neuroscience & Biobehavioral Reviews.

[21]  David H. Hsu,et al.  Resources as dual sources of advantage: Implications for valuing entrepreneurial‐firm patents , 2013 .

[22]  C. Thelwell Biological Standards for Potency Assignment to Fibrinolytic Agents Used in Thrombolytic Therapy , 2014, Seminars in Thrombosis & Hemostasis.

[23]  Aviv Segev,et al.  Identification of trends from patents using self-organizing maps , 2012, Expert Syst. Appl..

[24]  Yonghee Cho,et al.  Industrial technology roadmap as a decision making tool to support public R&D planning , 2014, Proceedings of PICMET '14 Conference: Portland International Center for Management of Engineering and Technology; Infrastructure and Service Integration.

[25]  Chui-Yu Chiu,et al.  Application of the Honeybee Mating Optimization Algorithm to Patent Document Classification in Combination with the Support Vector Machine , 2013 .

[26]  Sungjoo Lee,et al.  Keyword selection and processing strategy for applying text mining to patent analysis , 2015, Expert Syst. Appl..

[27]  Kwangsoo Kim,et al.  Creating patents on the new technology using analogy-based patent mining , 2014, Expert Syst. Appl..

[28]  André Aleman,et al.  The biological and psychological basis of neuroticism: Current status and future directions , 2013, Neuroscience & Biobehavioral Reviews.

[29]  S. Hodgins,et al.  Sexual Risk Behaviors in the Adolescent Offspring of Parents with Bipolar Disorder: Prospective Associations with Parents’ Personality and Externalizing Behavior in Childhood , 2016, Journal of abnormal child psychology.

[30]  Jiancheng Guan,et al.  The impact of university–industry collaboration networks on innovation in nanobiopharmaceuticals , 2013 .

[31]  Joe Tidd,et al.  Development of novel products through intraorganizational and interorganizational networks the case of home automation , 1995 .

[32]  Kazuyuki Motohashi,et al.  Patent statistics: A good indicator for innovation in China? Patent subsidy program impacts on patent quality , 2015 .

[33]  Yongtae Park,et al.  How to assess patent infringement risks: a semantic patent claim analysis using dependency relationships , 2013, Technol. Anal. Strateg. Manag..

[34]  Timo Fischer,et al.  Testing patent value indicators on directly observed patent value—An empirical analysis of Ocean Tomo patent auctions , 2014 .