Wavelet Transform for Detection of Conserved Motifs inProtein Sequences with Ten Bit Physico-ChemicalProperties

Detection of common motifs among proteins with low sequence identities provides important clues to the function of the proteins or to classify unknown proteins into proper families. Hence motif identification in protein sequences is essential for annotation of proteins from the sequence database among proteins with less than 30% homology. In the present work we have detected conserved regions in protein sequences using digital signal processing methods such as discrete Fourier transform (DFT) and wavelet transform with ten bit numerical representation of amino acids based on physico-chemical properties. The resulting ten bit numerical representation of each residue of the protein sequence has significant correlation with its biological activity. The conserved motifs are identified in peak regions from the DFT spectrum and wavelet spectrum. It is found that the new ten bit numerical representation using wavelet transform shows improved result than DFT. We have used wavelet transform to decompose protein sequences represented numerically by different indices such as positive charge, negative charge, polarity, charge, medium volume, small volume, aliphatic, aromatic chain and alicyclic character of the amino acids. The decomposed signals are then plotted to identify similar regions across all the proteins. Results indicate that wavelet transform using ten bit binary representation of physico-chemical properties is a promising approach for conserved motif detection. The proposed techniques are not only fast but also give the better interpretation of conserved motifs in protein sequences.

[1]  Satoru Kuhara,et al.  The hydrophobic cores of proteins predicted by wavelet analysis , 1999, Bioinform..

[2]  Kevin Barraclough,et al.  I and i , 2001, BMJ : British Medical Journal.

[3]  Charles Elkan,et al.  Fitting a Mixture Model By Expectation Maximization To Discover Motifs In Biopolymer , 1994, ISMB.

[4]  Irena Cosic,et al.  An Overview of Protein Sequence Comparisons Using Wavelets , 2001 .

[5]  Amos Bairoch,et al.  The PROSITE database , 2005, Nucleic Acids Res..

[6]  S. Krane,et al.  Mutation in collagen‐I that confers resistance to the action of collagenase results in failure of recovery from CCl4‐induced liver fibrosis, persistence of activated hepatic stellate cells, and diminished hepatocyte regeneration , 2003, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[7]  S. Friedman Molecular Regulation of Hepatic Fibrosis, an Integrated Cellular Response to Tissue Injury* , 2000, The Journal of Biological Chemistry.

[8]  Ruth Nussinov,et al.  MUSTA - A General, Efficient, Automated Method for Multiple Structure Alignment and Detection of Common Motifs: Application to Proteins , 2001, J. Comput. Biol..

[9]  D. Brutlag,et al.  Highly specific protein sequence motifs for genome analysis. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Qiang Fang,et al.  Protein sequence comparison based on the wavelet transform approach. , 2002, Protein engineering.

[11]  D. Brenner,et al.  Erratum: Liver fibrosis (Journal of Clinical Investigation (2005) 115 (209-218) DOI:10.1172/JCI200524282) , 2005 .

[12]  C. Wong,et al.  Interleukin-3, -5, and granulocyte macrophage colony-stimulating factor-induced adhesion molecule expression on eosinophils by p38 mitogen-activated protein kinase and nuclear factor-[kappa] B. , 2003, American journal of respiratory cell and molecular biology.

[13]  J. Iredale,et al.  Mechanisms of spontaneous resolution of rat liver fibrosis. Hepatic stellate cell apoptosis and reduced hepatic expression of metalloproteinase inhibitors. , 1998, The Journal of clinical investigation.

[14]  S. Mallat A wavelet tour of signal processing , 1998 .

[15]  Youqing Xu,et al.  Lycopene attenuates alcoholic apoptosis in HepG2 cells expressing CYP2E1. , 2003, Biochemical and biophysical research communications.

[16]  Sai Gu,et al.  [Recent developments in the investigation of anti-liver fibrosis compositions of herbs]. , 2005, Zhonghua gan zang bing za zhi = Zhonghua ganzangbing zazhi = Chinese journal of hepatology.

[17]  Ramon Bataller,et al.  Human hepatic stellate cells express CCR5 and RANTES to induce proliferation and migration. , 2003, American journal of physiology. Gastrointestinal and liver physiology.

[18]  A. Nanji,et al.  Animal Models Are Designed to Address Specific Questions How Do Alcohol and Acetaldehyde Directly Affect the Liver ? , 2004 .

[19]  Robert E Mann,et al.  The Epidemiology of Alcoholic Liver Disease , 2018, Alcohol research & health : the journal of the National Institute on Alcohol Abuse and Alcoholism.

[20]  A. D. McLachlan,et al.  Profile analysis: detection of distantly related proteins. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[21]  Paul S Haber,et al.  Gene expression profiling of alcoholic liver disease in the baboon (Papio hamadryas) and human liver. , 2003, The American journal of pathology.

[22]  J. Iredale,et al.  Apoptosis of hepatic stellate cells: involvement in resolution of biliary fibrosis and regulation by soluble growth factors , 2001, Gut.

[23]  Harvey F Lodish,et al.  Role of Ras signaling in erythroid differentiation of mouse fetal liver cells: functional analysis by a flow cytometry-based novel culture system. , 2003, Blood.

[24]  Kai Fan,et al.  [Study on a recombinant keratinocyte growth factor variant in treating experimental rat liver fibrosis]. , 2005, Zhonghua gan zang bing za zhi = Zhonghua ganzangbing zazhi = Chinese journal of hepatology.

[25]  M. Arthur,et al.  Reversibility of liver fibrosis and cirrhosis following treatment for hepatitis C. , 2002, Gastroenterology.

[26]  Kuhara,et al.  Prediction of Hydrophobic Cores of Proteins Using Wavelet Analysis. , 1997, Genome informatics. Workshop on Genome Informatics.

[27]  Magnus Ingelman-Sundberg,et al.  Alcoholic Liver Disease in Rats Fed Ethanol as Part of Oral or Intragastric Low-Carbohydrate Liquid Diets , 2004, Experimental biology and medicine.

[28]  Arun Krishnan,et al.  Rapid detection of conserved regions in protein sequences using wavelets , 2004, Silico Biol..

[29]  I. Dodd,et al.  Improved detection of helix-turn-helix DNA-binding motifs in protein sequences. , 1990, Nucleic acids research.

[30]  W. H. Cantrell Tuning analysis for the high-Q class-E power amplifier , 2000 .

[31]  W. Marsden I and J , 2012 .

[32]  David A. Brenner,et al.  The Role of Focal Adhesion Kinase-Phosphatidylinositol 3-Kinase-Akt Signaling in Hepatic Stellate Cell Proliferation and Type I Collagen Expression* , 2003, The Journal of Biological Chemistry.

[33]  S. Friedman,et al.  Cytochrome P450 2E1-derived Reactive Oxygen Species Mediate Paracrine Stimulation of Collagen I Protein Synthesis by Hepatic Stellate Cells* , 2002, The Journal of Biological Chemistry.

[34]  Li Yang,et al.  Association of differentially expressed genes with activation of mouse hepatic stellate cells by high-density cDNA microarray. , 2004, World journal of gastroenterology.

[35]  C. Lieber,et al.  Alcoholic fatty liver: its pathogenesis and mechanism of progression to inflammation and fibrosis. , 2004, Alcohol.

[36]  Denise Gorse,et al.  Wavelet transforms for the characterization and detection of repeating motifs. , 2002, Journal of molecular biology.

[37]  A. Nanji,et al.  Liver asialoglycoprotein receptor levels correlate with severity of alcoholic liver damage in rats. , 2004, Journal of applied physiology.

[38]  W Lehmann,et al.  The role of angiogenesis in a murine tibial model of distraction osteogenesis. , 2004, Bone.

[39]  C. Lieber,et al.  Relationships Between Nutrition, Alcohol Use, and Liver Disease , 2003, Alcohol research & health : the journal of the National Institute on Alcohol Abuse and Alcoholism.

[40]  Xia Lu Study of Kanglaite-induced apoptosis on human pancreatic cancer cells by cDNA microarray , 2004 .

[41]  Amos Bairoch,et al.  The PROSITE database, its status in 1997 , 1997, Nucleic Acids Res..

[42]  N. Nalini,et al.  Glycine prevents hepatic fibrosis by preventing the accumulation of collagen in rats with alcoholic liver injury. , 2004, Polish journal of pharmacology.

[43]  R. Sauer,et al.  Transcription factors: structural families and principles of DNA recognition. , 1992, Annual review of biochemistry.