Advantages and disadvantages in the use of Internet as a corpus: the case of the Online Dictionaries of Spanish Valladolid-UVa

This paper first discusses some of the consequences of technological development for lexicography, especially with regard to the different types of empirical basis that can be used in dictionary projects. It then lists the most important advantages and disadvantages of using the Internet as a corpus and compares them with the usefulness of "traditional" corpora. As an example, the paper shows how the Internet is used as the main empirical source for selecting lemmata and meaning items in the Online Dictionaries of Spanish Valladolid-UVa. The methods and tools employed in the project are discussed, together with the competences, knowledge and skills required of the lexicographers. Finally, the paper offers some general conclusions as well as recommendations and hypotheses for future lexicographical work and research.
