Big Data Versus Small Data: The Case of ‘Gripe’ (Flu) in Spanish

Abstract: Big data is a broad term for data sets so large and complex that traditional data processing applications are inadequate. A new field, Predictive Analytics, tries to extract value from such big (unstructured) data. In Corpus Linguistics, researchers usually deal with small data. In this paper, we compare the amount and quality of information on a single topic (flu) in Twitter and in MultiMedica (a corpus of medical texts).

Esteban Moro | Antonio Moreno-Sandoval

[1] Antonio Moreno-Sandoval, et al. Design and Annotation of MultiMedica – A Multilingual Text Corpus of the Biomedical Domain, 2013.

[2] Athanasios V. Vasilakos, et al. Big data: From beginning to future, 2016, Int. J. Inf. Manag.

[3] Adam Kilgarriff, et al. The Sketch Engine: ten years on, 2014.