A Study and Analysis of Gene Drug Association for Diabetic Gene - A Text Mining Approach

The explosive growth of genomic data has lead to the growth of published literature of genomic text submitted by researchers all over the world. The literature database like Medline has grown tremendously in the upcoming years. Analyzing the literature for finding association of biological components has become an important domain area. The analysis of gene drug association from text has become an important research area in the field of biomedical text mining. This paper provides with a detailed study on various methods used for gene drug association discovery. The methodology for analysis of gene, drug associations related to diabetic disease using Dictionary-based term identification approach consists of two main important phases. The first phase makes use of the preprocessing text mining techniques like stop word removal, tokenization and stemming. The second phase corpus is constructed for analyzing gene drugs associations using dictionary based approach.

[1]  Karen M. Daniels,et al.  Gaussian Kernel Width Exploration in Support Vector Clustering , 2004 .

[2]  Teruyoshi Hishiki,et al.  Extraction of Gene-Disease Relations from Medline Using Domain Dictionaries and Machine Learning , 2005, Pacific Symposium on Biocomputing.

[3]  Padmini Srinivasan,et al.  Mining MEDLINE for implicit links between dietary substances and diseases , 2004, ISMB/ECCB.

[4]  Marc Weeber,et al.  Advances in Literature-Based Discovery , 2001 .

[5]  Martijn J. Schuemie,et al.  A dictionary to identify small molecules and drugs in free text , 2009, Bioinform..

[6]  Dimitris Kanellopoulos,et al.  Association Rules Mining: A Recent Overview , 2006 .

[7]  Jonathan D. Wren,et al.  Extending the mutual information measure to rank inferred literature relationships , 2004, BMC Bioinformatics.

[8]  Anton Yuryev,et al.  Extracting human protein interactions from MEDLINE using a full-sentence parser , 2004, Bioinform..

[9]  Tanja Urbancic,et al.  Discovering Hidden Knowledge from Biomedical Literature , 2007, Informatica.

[10]  A. Valencia,et al.  Text Mining for Drugs and Chemical Compounds: Methods, Tools and Applications , 2011, Molecular informatics.

[11]  Ah-Hwee Tan,et al.  Text Mining: The state of the art and the challenges , 2000 .

[12]  Doheon Lee,et al.  Inferring hidden relationships from biological literature with multi-level context terms , 2011, DTMBIO '11.

[13]  Andreas Hotho,et al.  A Brief Survey of Text Mining , 2005, LDV Forum.

[14]  Padmini Srinivasan,et al.  Text mining: Generating hypotheses from MEDLINE , 2004, J. Assoc. Inf. Sci. Technol..

[15]  Jacob de Vlieg,et al.  Literature Mining for the Discovery of Hidden Connections between Drugs, Genes and Diseases , 2010, PLoS Comput. Biol..