A patent is an intellectual property right that protects a new invention. It covers how things work, what they do, how they do it, what they are made of, and how they are made. The owner of a granted patent can take legal action to stop others from making, using, importing, or selling the invention without permission. When applying for a patent, inventors often have difficulty identifying similar patents, yet citations of related patents, referred to as the prior art, should be included in the application. We propose a patent search engine that identifies related patents, together with a system that predicts business trends by analyzing patents. In the proposed system, we carry out query-independent clustering of patent documents to generate topic clusters using Latent Dirichlet Allocation (LDA). From these clusters, we retrieve query-specific patents based on relevance, thereby maximizing the query likelihood. Ranking is based on relevance and recency, which can be performed using the BM25F algorithm. We analyze topic-company trends and forecast the future of a technology using a time series model, ARIMA. We evaluate the proposed methods on the USPTO patent database. The experimental results show that the proposed techniques perform well compared to the corresponding baseline methods.
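To make the ranking step concrete, the following is a minimal sketch of BM25F-style scoring over two patent fields. It is not the system's implementation. The field names, field weights, length-normalization parameters, the value of `K1`, the non-negative IDF variant, and the toy `PATENTS` records are all assumptions chosen for illustration. The recency component mentioned above is not modeled here.

```python
import math
from collections import Counter

# Hypothetical parameters: the source does not specify fields or weights.
FIELD_WEIGHTS = {"title": 3.0, "abstract": 1.0}   # boost per field
FIELD_B = {"title": 0.5, "abstract": 0.75}        # length normalization per field
K1 = 1.2                                          # term-frequency saturation


def tokenize(text):
    """Lowercase whitespace tokenizer (no stemming, for brevity)."""
    return text.lower().split()


def build_stats(docs):
    """Return collection size, average field lengths, and document frequencies."""
    n = len(docs)
    avg_len = {
        f: (sum(len(tokenize(d[f])) for d in docs) / n) or 1.0
        for f in FIELD_WEIGHTS
    }
    df = Counter()
    for d in docs:
        terms = set()
        for f in FIELD_WEIGHTS:
            terms.update(tokenize(d[f]))
        df.update(terms)
    return n, avg_len, df


def bm25f_score(query, doc, n, avg_len, df):
    """Combine weighted, length-normalized field frequencies, then saturate."""
    score = 0.0
    for term in set(tokenize(query)):
        if df[term] == 0:
            continue
        pseudo_tf = 0.0
        for f, weight in FIELD_WEIGHTS.items():
            tokens = tokenize(doc[f])
            norm = 1.0 - FIELD_B[f] + FIELD_B[f] * len(tokens) / avg_len[f]
            pseudo_tf += weight * tokens.count(term) / norm
        # Non-negative IDF variant (an assumption, not taken from the source).
        idf = math.log(1.0 + (n - df[term] + 0.5) / (df[term] + 0.5))
        score += idf * pseudo_tf / (K1 + pseudo_tf)
    return score


def rank(query, docs):
    """Return ids of documents with a positive score, best first."""
    n, avg_len, df = build_stats(docs)
    scored = [(bm25f_score(query, d, n, avg_len, df), d["id"]) for d in docs]
    scored.sort(key=lambda pair: -pair[0])
    return [doc_id for s, doc_id in scored if s > 0]


# Toy records, invented for illustration only.
PATENTS = [
    {"id": "P1", "title": "battery charging circuit",
     "abstract": "a circuit for charging a lithium battery"},
    {"id": "P2", "title": "image sensor",
     "abstract": "a sensor with a battery backup"},
    {"id": "P3", "title": "wireless antenna",
     "abstract": "an antenna for mobile devices"},
]

print(rank("battery", PATENTS))
```

In this sketch a match in the title contributes more than a match in the abstract, so a patent that mentions the query term in both fields ranks above one that mentions it only in the abstract. In the pipeline described above, such scoring would presumably be applied to the patents retrieved from the relevant topic clusters rather than to the whole collection.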