Computer-Aided Comparison of Thesauri Extracted from Complementary Patent Classes as a Means to Identify Relevant Field Parameters

Patents are gaining a growing importance as a complementary source of technical information, since the information they disclose is not accessible in scientific and technical literature. Text mining technologies are emerging as a possible solution to increase the efficiency of patent analysis activities; besides, most of the existing systems are derived from general purpose applications that marginally leverage patents peculiarities. The authors are developing algorithm and tools fully dedicated to patent mining, i.e. information extraction from patent literature. The present paper aims at the identification of relevant technical parameters for a certain domain, through the comparison of thesauri automatically extracted from the given field of application and from its complementary patent classes.

[1]  Mervyn Bregonje,et al.  Patents: A unique source for scientific technical information in chemistry related industry? , 2005 .

[2]  David Bawden,et al.  Thesaurus Construction and Use: A Practical Manual , 2000 .

[3]  Sharon A. Caraballo Automatic construction of a hypernym-labeled noun hierarchy from text , 1999, ACL.

[4]  U. Miller,et al.  Thesaurus construction: problems and their roots , 1997, Inf. Process. Manag..

[5]  Davide Russo,et al.  Functional Modeling for TRIZ-Based Evolutionary Analyses , 2009 .

[6]  James R. Curran,et al.  Improvements in Automatic Thesaurus Extraction , 2002, ACL 2002.

[7]  Davide Russo,et al.  Computer-Aided Patent Analysis: finding invention peculiarities , 2007, IFIP CAI.

[8]  Frank Keller,et al.  Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics: HLT-NAACL 2004 , 2004, HLT-NAACL 2004.

[9]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[10]  Soon-Young Huh,et al.  Automatic generation of concept hierarchies using WordNet , 2008, Expert Syst. Appl..

[11]  Noel León-Rovira Trends in Computer Aided Innovation - Second IFIP Working Conference on Computer Aided Innovation, October 8-9 2007, Michigan, USA , 2007, IFIP CAI.

[12]  Davide Russo,et al.  Computer-aided analysis of patents and search for TRIZ contradictions , 2007 .

[13]  James Curran,et al.  Ensemble Methods for Automatic Thesaurus Extraction , 2002, EMNLP.

[14]  Hinrich Schütze,et al.  A Cooccurrence-Based Thesaurus and Two Applications to Information Retrieval , 1994, Inf. Process. Manag..

[15]  Stuart C. Shapiro,et al.  Encyclopedia of artificial intelligence, vols. 1 and 2 (2nd ed.) , 1992 .

[16]  Jesper W. Schneider,et al.  Verification of bibliometric methods' applicability for thesaurus construction , 2005, SIGF.

[17]  S. Cunningham,et al.  Technology futures analysis: Toward integration of the field and new methods , 2004 .

[18]  Kentaro Torisawa,et al.  Acquiring Hyponymy Relations from Web Documents , 2004, NAACL.

[19]  Denis Cavallucci,et al.  Parameter network as a means for driving problem solving process , 2007, Int. J. Comput. Appl. Technol..

[20]  Barr and Feigenbaum Edward A. Avron The Handbook of Artificial Intelligence , 1981 .

[21]  James H. Martin,et al.  Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition , 2000 .