Patent Big Data Analysis using Fuzzy Learning

Big data has had an immense effect on most social and industrial fields. It has three main characteristics, namely volume, variety, and velocity. Volume refers to the tremendous size of big data, variety pertains to its heterogeneous sources including numbers, text, and figures, and velocity refers to the rapid speed of data growth. Patent documents follow the characteristics of big data. A patent contains various results about the developed technology such as title, abstract, citations, figures, and drawings. In general, the volume of patent documents related to a target technology is very large. Moreover, a massive number of patent applications are submitted to the patent offices in every country daily. Patent data are analyzed for R&D planning by many institutes and companies. In this study, we propose a methodology for technology analysis applied to patent big data. Additionally, we employ fuzzy learning based on the fuzzy rule-based system for patent big data analysis. We study the fuzzy models for classification, regression, and clustering and group the patents by the fuzzy classification model. Using a fuzzy regression model, we build a technological relationship between subtechnologies. Lastly, we develop a fuzzy clustering model for technology clustering. To illustrate how our research may be applied to a practical domain, we employ a case study using the patent documents related to the three-dimensional printing technology.

[1]  Yuen-Hsien Tseng,et al.  Text mining techniques for patent analysis , 2007, Inf. Process. Manag..

[2]  Alan L. Porter,et al.  Forecasting and Management of Technology , 1991 .

[3]  John Cotton,et al.  Introductory statistics. 3rd ed. , 1978 .

[4]  Sunghae Jun,et al.  Examining technological innovation of Apple using patent analysis , 2013, Ind. Manag. Data Syst..

[5]  Hichem Frigui,et al.  Multiple Instance Mamdani Fuzzy Inference , 2015, Int. J. Fuzzy Log. Intell. Syst..

[6]  Michael W. Berry,et al.  Text mining : applications and theory , 2010 .

[7]  Peter Dalgaard,et al.  Introductory statistics with R , 2002, Statistics and computing.

[8]  Sunghae Jun,et al.  Extracting Key Technology Using Advanced Fuzzy Clustering , 2013 .

[9]  Witold Pedrycz,et al.  A Competent Memetic Algorithm for Learning Fuzzy Cognitive Maps , 2015, IEEE Transactions on Fuzzy Systems.

[10]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[11]  Sunghae Jun A Big Data Learning for Patent Analysis , 2013 .

[12]  Sheldon M. Ross Introductory Statistics , 1995 .

[13]  R. H. Myers Classical and modern regression with applications , 1986 .

[14]  Francisco Herrera,et al.  A learning process for fuzzy control rules using genetic algorithms , 1998, Fuzzy Sets Syst..

[15]  Sunghae Jun,et al.  Technology Forecasting using Matrix Map and Patent Clustering , 2012, Ind. Manag. Data Syst..

[16]  Sunghae Jun,et al.  Methodology of technological evolution for three-dimensional printing , 2016, Ind. Manag. Data Syst..

[17]  Jerry M. Mendel,et al.  On a Novel Way of Processing Data that Uses Fuzzy Sets for Later Use in Rule-Based Regression and Pattern Classification , 2014, Int. J. Fuzzy Log. Intell. Syst..

[18]  Lotfi A. Zadeh,et al.  Fuzzy Sets , 1996, Inf. Control..

[19]  Hisao Ishibuchi,et al.  Effect of rule weights in fuzzy rule-based classification systems , 2001, IEEE Trans. Fuzzy Syst..

[20]  Pasi Rikkonen,et al.  Future prospects of alternative agro-based bioenergy use in Finland—Constructing scenarios with quantitative and qualitative Delphi data , 2009 .

[21]  Kurt Hornik,et al.  Misc Functions of the Department of Statistics, ProbabilityTheory Group (Formerly: E1071), TU Wien , 2015 .

[22]  Jian Pei,et al.  2012- Data Mining. Concepts and Techniques, 3rd Edition.pdf , 2012 .

[23]  Marco Russo,et al.  Genetic fuzzy learning , 2000, IEEE Trans. Evol. Comput..

[24]  Alan L. Porter,et al.  Forecasting and Management of Technology: Porter/Forecasting and Management Technology 2E , 2011 .

[25]  Ebrahim Mamdani,et al.  Applications of fuzzy algorithms for control of a simple dynamic plant , 1974 .

[26]  Sunghae Jun,et al.  Graphical causal inference and copula regression model for apple keywords by text mining , 2015, Adv. Eng. Informatics.

[27]  C. L. Philip Chen,et al.  Fuzzy Restricted Boltzmann Machine for the Enhancement of Deep Learning , 2015, IEEE Transactions on Fuzzy Systems.

[28]  Sheldon M. Ross,et al.  Introduction to Probability and Statistics for Engineers and Scientists , 1987 .

[29]  Lala Septem Riza,et al.  frbs: Fuzzy Rule-Based Systems for Classification and Regression in R , 2015 .

[30]  Yuen-Hsien Tseng,et al.  TEXT MINING FOR PATENT MAP ANALYSIS , 2005 .