Soft Fuzzy Model for Mining Amino Acid Associations in Peptide Sequences of Mycobacterium tuberculosis Complex

Analysis of biological data plays an important role in the medical and bioinformatics industries. However, uncertainty in biological information remains a largely unavoidable challenge. Existing association rule mining algorithms are inadequate for handling uncertainty in molecular data. Variation in sequence length leads to variation in the degree of relationships among amino acids. Ignoring such parameters introduces uncertainty, because the objects and their patterns depend on them: the degree of relationships among the amino acids present in molecular sequences depends on parameters such as length range and species. In this article, a soft fuzzy set approach is proposed for mining fuzzy amino acid associations in peptide sequences of the Mycobacterium tuberculosis complex (MTBC). The approach incorporates the degree of relationships among the amino acids present in the peptide sequences, while soft sets model the relationships of amino acids with parameters such as length range and species. The amino acid associations obtained in the present study, together with their relationships to various parameters in the peptide sequences of MTBC, are expected to be useful in developing signatures that provide better insight into the structures, functions and interactions of proteins.
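The idea of combining fuzzy memberships with soft-set parameters can be illustrated with a minimal Python sketch. It is not the model used in the article, whose definitions are not given here. The sketch assumes that the fuzzy membership of an amino acid in a peptide is its relative frequency, that the fuzzy support of a pair is the mean of the minimum of the two memberships, and that the only parameter is a length range of fixed width. The function names, the bucket width and the toy peptides are all hypothetical.

```python
from collections import Counter, defaultdict
from itertools import combinations


def memberships(peptide):
    """Assumed fuzzy membership: relative frequency of each amino acid."""
    counts = Counter(peptide)
    return {aa: n / len(peptide) for aa, n in counts.items()}


def length_range(peptide, width=5):
    """Hypothetical parameter: label a non-empty peptide by its length bucket."""
    low = (len(peptide) // width) * width
    return f"{low}-{low + width - 1}"


def fuzzy_pair_support(peptides):
    """Mean over peptides of min(mu_a, mu_b) for every amino acid pair."""
    totals = defaultdict(float)
    for peptide in peptides:
        mu = memberships(peptide)
        for a, b in combinations(sorted(mu), 2):
            totals[(a, b)] += min(mu[a], mu[b])
    return {pair: total / len(peptides) for pair, total in totals.items()}


def soft_fuzzy_supports(peptides, width=5):
    """Map each parameter value (length range) to its fuzzy pair supports."""
    groups = defaultdict(list)
    for peptide in peptides:
        groups[length_range(peptide, width)].append(peptide)
    return {param: fuzzy_pair_support(seqs) for param, seqs in groups.items()}


# Toy peptides, invented for illustration only.
toy = ["AAGL", "AGGL", "AGLKAGLKAA"]
result = soft_fuzzy_supports(toy)
for param, supports in sorted(result.items()):
    print(param, supports)
```

Grouping by parameter before computing support is what lets the same amino acid pair receive a different degree of association in each length range, which is the dependency the abstract describes. A species label could be added as a second parameter in the same way.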
