Automatic Term Extraction

This chapter focuses on computational approaches to the automatic extraction of terms from domain-specific corpora. It presents the subtasks of Automatic Term Extraction in detail: corpus compilation, unithood, termhood, variant detection, and system evaluation.
