EXTRACT ASSOCIATION RULES TO MINIMIZE THE EFFECTS OF DENGUE BY USING A TEXT MINING TECHNIQUE

Nowadays, dengue (vector-borne tropical viral diseases) has been become the greatest scourge of humankind and there consequence has more impact than any other pathogen in shaping the human genome. Generally in Pakistan specifically in Punjab (Pakistan) Dengue is emerging as one of the major public-health problem. Federal & Provincial Health Governments are taking all possible steps on ―War-Footing‖ to recover such type of diseases. They are determines in making special possibilities to face such type of problems in advance. From WWW, digital libraries, World Health Organization (WHO) and other news sources it is estimated that about 2.5 billion people, or 40 percent of the world’s population, live in areas where there is a risk of dengue transmission because Dengue flourishes in urban poor areas, suburbs and the countryside but also affects more affluent neighborhoods in tropical and subtropical countries. As of November 2011, it has killed over 300 people in the last several months and over 14,000 are infected by this mosquito-borne disease. Majority of the people infected are from the Lahore area in Punjab, Pakistan. As a matter of fact, if a virus attacks on somewhere, what will be its next target in geographically aspects, because dengue virus will be spread from one place to other from contaminated water and mosquito. Therefore by using above said data sources, performing some preprocessing techniques such as transformation, filtration, stemming and indexing of the documents and then applying data mining techniques our system will not only helps to identify geographical spreading patterns of the viruses but it also helps to suggest proactively next geographical location where virus has most probability to attach so that government can take remedy measures.

[1]  Martin Rajman,et al.  Text Mining: Natural Language techniques and Text Mining applications , 1998 .

[2]  Kjetil Nørvåg,et al.  Mining Association Rules in Temporal Document Collections , 2006, ISMIS.

[3]  Hany Mahgoub,et al.  Mining Association Rules from Unstructured Documents , 2008 .

[4]  George Buchanan,et al.  Scalable browsing for large collections: a case study , 2000, DL '00.

[5]  Ramakrishnan Srikant,et al.  Discovering Trends in Text Databases , 1997, KDD.

[6]  Heikki Mannila,et al.  Discovery of Frequent Episodes in Event Sequences , 1997, Data Mining and Knowledge Discovery.

[7]  Mika Klemettinen,et al.  Applying data mining techniques for descriptive phrase extraction in digital document collections , 1998, Proceedings IEEE International Forum on Research and Technology Advances in Digital Libraries -ADL'98-.

[8]  Mika Klemettinen,et al.  Mining in the Phrasal Frontier , 1997, PKDD.

[9]  R. Agarwal Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[10]  Fawzy A. Torkey,et al.  A Text Mining Technique Using Association Rules Extraction , 2008 .

[11]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[12]  Jan. Paralic,et al.  Text Mining for Documents Annotation and Ontology Support , 2003 .

[13]  Ido Dagan,et al.  Knowledge Discovery in Textual Databases (KDT) , 1995, KDD.

[14]  Haym Hirsh,et al.  Mining Associations in Text in the Presence of Background Knowledge , 1996, KDD.