Multiple regression analysis of a patent’s citation frequency and quantitative characteristics: the case of Japanese patents

Although many studies have been conducted to clarify the factors that affect the citation frequency of “academic papers,” there are few studies where the citation frequency of “patents” has been predicted on the basis of statistical analysis, such as regression analysis. Assuming that a patent based on a variety of technological bases tends to be an important patent that is cited more often, this study examines the influence of the number of cited patents’ classifications and compares it with other factors, such as the numbers of inventors, classifications, pages, and claims. Multiple linear, logistic, and zero-inflated negative binomial regression analyses using these factors are performed. Significant positive correlations between the number of classifications of cited patents and the citation frequency are observed for all the models. Moreover, the multiple regression analyses demonstrate that the number of classifications of cited patents contributes more to the regression than do other factors. This implies that, if confounding between factors is taken into account, it is the diversity of classifications assigned to backward citations that more largely influences the number of forward citations.

[1]  William E. Snizek,et al.  Textual and nontextual characteristics of scientific papers: Neglected science indicators , 2005, Scientometrics.

[2]  Lutz Bornmann,et al.  Multiple publication on a single research study: Does it pay? The influence of number of research articles on total citation counts in biomedicine , 2007, J. Assoc. Inf. Sci. Technol..

[3]  Jeong-Dong Lee,et al.  An in-depth empirical analysis of patent citation counts using zero-inflated count data model: The case of KIST , 2007, Scientometrics.

[4]  Masao Utiyama,et al.  Overview of the Patent Translation Task at the NTCIR-7 Workshop , 2008, NTCIR.

[5]  Francis Narin,et al.  Patents as indicators for the evaluation of industrial research output , 1995, Scientometrics.

[6]  Keita Tsuji,et al.  Analysis of the relationship between citation frequency of patents and diversity of their backward citations for Japanese patents , 2011, Scientometrics.

[7]  Peter Groves,et al.  International Patent Classification (IPC) , 2011 .

[8]  魏屹东,et al.  Scientometrics , 2018, Encyclopedia of Big Data.

[9]  Bradford L. Barham,et al.  UNIVERSITIES AND AGRICULTURAL BIOTECHNOLOGY PATENT PRODUCTION , 2000 .

[10]  P. Shapira,et al.  Effects of international collaboration and knowledge moderation on China's nanotechnology research impacts , 2012 .

[11]  A. F. J. Van Raan,et al.  On determinants of citation scores: a case study in chemical engineering , 1994 .

[12]  Ronald N. Kostoff,et al.  The difference between highly and poorly cited medical articles in the journal Lancet , 2007, Scientometrics.

[13]  Wolfgang Glänzel,et al.  Coauthorship Patterns and Trends in the Sciences (1980-1998): A Bibliometric Study With Implications for Database Indexing and Search Strategies , 2002, Libr. Trends.

[14]  Makoto Iwayama,et al.  Overview of the Patent Mining Task at the NTCIR-7 Workshop , 2008, NTCIR.

[15]  D. Harhoff,et al.  Citation Frequency and the Value of Patented Inventions , 1999, Review of Economics and Statistics.

[16]  Lutz Bornmann,et al.  What do citation counts measure? A review of studies on citing behavior , 2008, J. Documentation.