The term Big Data can be defined as analysis a large volume of data in unstructured databases, organizations of different segment and size, has been employing the Big Data Analytics philosophy as support tool strategic to anticipate valuable insights and trends on the behavior of consumers and their expectations, thus gaining a competitive advantage in the market in which they operate. However, to extract information from values as the goal of turning that volume of data into predictive information or insights is still a big challenge in Big Data. The main objective of this work is to present an implementation of a Big Data project, using data originated from social networks, as well as text mining techniques and machine learning, through the implementation of K-Means and SVM algorithms, with intention of identifying patterns from dengue outbreaks, through analyzes that show insights of probable outbreaks of dengue in a particular region of Brazil. The results obtained indicate that the implemented project had a satisfactory yield if compared of the data collected of the Ministry of Health of Brazil, thus indicating a potential for utilization of its purpose. It is observed that the main advantage of the analyzes in Big Data is related to the possibility of use the unstructured data that can be obtained in social networks, e-commerce sites, among others and structured data obtained from traditional databases and from this union, to extract information from values that can be used to benefit of organizations, thus allowing know the future behaviors and thereby act in a preventive way.
[1]
J. Manyika.
Big data: The next frontier for innovation, competition, and productivity
,
2011
.
[2]
Gurpreet Singh Lehal,et al.
A Survey of Text Mining Techniques and Applications
,
2009
.
[3]
P. Tauil,et al.
[Critical aspects of dengue control in Brazil].
,
2002,
Cadernos de saude publica.
[4]
Jacqueline S. Coberly,et al.
Tweeting Fever: Are Tweet Extracts a Valid Surrogate Data Source for Dengue Fever?
,
2013,
Online Journal of Public Health Informatics.
[5]
Giovanini Evelim Coelho,et al.
Challenges in the control of Aedes aegypti.
,
2012,
Revista do Instituto de Medicina Tropical de Sao Paulo.
[6]
P. Gething,et al.
Refining the Global Spatial Limits of Dengue Virus Transmission by Evidence-Based Consensus
,
2012,
PLoS neglected tropical diseases.
[7]
Cláudia T. Codeço,et al.
Surveillance of dengue vectors using spatio-temporal Bayesian modeling
,
2015,
BMC Medical Informatics and Decision Making.
[8]
A. Ruffino-Netto,et al.
[Dengue in a urban locality of southeastern Brazil: epidemiological aspects].
,
1994,
Revista de saude publica.