GETTING ON WITH CORPUS COMPILATION: FROM THEORY TO PRACTICE

The aim of this paper is threefold: first, to present theoretical guidelines on the design and compilation of a specialized corpus in compliance with Corpus Linguistics standards; second, to provide a comprehensive description of the stages followed in the creation of the Telecommunication Engineering Corpus and of its characteristics; and finally, to report the immediate results obtained from a basic analysis of the corpus, namely statistical information and a frequency list. The paper is addressed in particular to ESP practitioners who have to compile a specialized corpus for the first time without a working knowledge of the subject domain involved.
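
For readers new to corpus work, the kind of basic analysis mentioned above (token and type counts plus a frequency list) can be sketched in a few lines of Python. This is only an illustrative sketch, not the procedure used for the Telecommunication Engineering Corpus: the function names `tokenize` and `corpus_stats`, the simple regular-expression tokenizer, and the two sample sentences are all assumptions made for the example.

```python
import re
from collections import Counter


def tokenize(text):
    # Hypothetical, simplified tokenizer: lowercase the text and keep
    # alphabetic runs, allowing internal apostrophes or hyphens.
    return re.findall(r"[a-z]+(?:['-][a-z]+)*", text.lower())


def corpus_stats(texts):
    # Flatten all texts into one token list and count each word form.
    tokens = [tok for text in texts for tok in tokenize(text)]
    freq = Counter(tokens)
    n_tokens = len(tokens)   # running words
    n_types = len(freq)      # distinct word forms
    ttr = n_types / n_tokens if n_tokens else 0.0  # type/token ratio
    return {"tokens": n_tokens, "types": n_types, "ttr": ttr, "freq": freq}


# Invented sample sentences, standing in for real corpus files.
sample = [
    "The antenna transmits the signal.",
    "The signal reaches the receiver.",
]
stats = corpus_stats(sample)
print("tokens:", stats["tokens"], "types:", stats["types"])
for word, count in stats["freq"].most_common(3):
    print(word, count)
```

In practice, dedicated concordancing software would normally be used for this step; the sketch only shows what the resulting figures represent.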