Regulatory link mapping between organisms

BackgroundIdentification of gene regulatory networks is useful in understanding gene regulation in any organism. Some regulatory network information has already been determined experimentally for model organisms, but much less has been identified for non-model organisms, and the limited amount of gene expression data available for non-model organisms makes inference of regulatory networks difficult.ResultsThis paper proposes a method to determine the regulatory links that can be mapped from a model to a non-model organism. Mapping a regulatory network involves mapping the transcription factors and target genes from one genome to another. In the proposed method, Basic Local Alignment Search Tool (BLAST) and InterProScan are used to map the transcription factors, whereas BLAST along with transcription factor binding site motifs and the GALF-P tool are used to map the target genes. Experiments are performed to map the regulatory network data of S. cerevisiae to A. thaliana and analyze the results. Since limited information is available about gene regulatory network links, gene expression data is used to analyze results. A set of rules are defined on the gene expression experiments to identify the predicted regulatory links that are well supported.ConclusionsCombining transcription factors mapped using BLAST and subfamily classification, together with target genes mapped using BLAST and binding site motifs, produced the best regulatory link predictions. More than two-thirds of these predicted regulatory links that were analyzed using gene expression data have been verified as correctly mapped regulatory links in the target genome.

[1]  Satoru Miyano,et al.  Identification of Genetic Networks from a Small Number of Gene Expression Patterns Under the Boolean Network Model , 1998, Pacific Symposium on Biocomputing.

[2]  S Fuhrman,et al.  Reveal, a general reverse engineering algorithm for inference of genetic network architectures. , 1998, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[3]  Susumu Goto,et al.  The KEGG databases at GenomeNet , 2002, Nucleic Acids Res..

[4]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[5]  T. Wagner,et al.  A ribozyme-mediated, gene "knockdown" strategy for the identification of gene function in zebrafish. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Xiaoping Zhou,et al.  A Systems Biology Approach to Transcription Factor Binding Site Prediction , 2010, PloS one.

[7]  M. Campbell,et al.  PANTHER: a library of protein families and subfamilies indexed by function. , 2003, Genome research.

[8]  Michael Hecker,et al.  Gene regulatory network inference: Data integration in dynamic models - A review , 2009, Biosyst..

[9]  Xin Chen,et al.  PlantTFDB: a comprehensive plant transcription factor database , 2007, Nucleic Acids Res..

[10]  Kwong-Sak Leung,et al.  TFBS identification based on genetic algorithm with combined representations and adaptive post-processing , 2008, Bioinform..

[11]  Patricia A. Evans,et al.  Mapping a Regulatory Network Between Organisms , 2010 .

[12]  Eugene W. Myers,et al.  Basic local alignment search tool. Journal of Molecular Biology , 1990 .

[13]  Satoru Miyano,et al.  Inferring Gene Regulatory Networks from Time-Ordered Gene Expression Data Using Differential Equations , 2002, Discovery Science.

[14]  Rolf Apweiler,et al.  InterProScan - an integration platform for the signature-recognition methods in InterPro , 2001, Bioinform..

[15]  Satoru Miyano,et al.  Combining microarrays and biological knowledge for estimating gene networks via Bayesian networks , 2003, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003.

[16]  Ying Xu,et al.  Mapping of microbial pathways through constrained mapping of orthologous genes , 2004, Proceedings. 2004 IEEE Computational Systems Bioinformatics Conference, 2004. CSB 2004..

[17]  Seung Yon Rhee,et al.  Biological Databases for Plant Research , 2005, Plant Physiology.

[18]  Marcel J. T. Reinders,et al.  A Comparison of Genetic Network Models , 2000, Pacific Symposium on Biocomputing.

[19]  Chunguang Zhou,et al.  Reconstruction of Gene Regulatory Networks Based on Two-Stage Bayesian Network Structure Learning Algorithm , 2009 .

[20]  Siu-Ming Yiu,et al.  MotifVoter: a novel ensemble method for fine-grained integration of generic motif finders , 2008, Bioinform..

[21]  Pooja Jain,et al.  The YEASTRACT database: a tool for the analysis of transcription regulatory associations in Saccharomyces cerevisiae , 2005, Nucleic Acids Res..

[22]  B. Steensel Mapping of genetic and epigenetic regulatory networks using microarrays , 2005, Nature Genetics.

[23]  H Matsuno,et al.  Hybrid Petri net representation of gene regulatory network. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[24]  Robert D. Finn,et al.  InterPro: the integrative protein signature database , 2008, Nucleic Acids Res..

[25]  Satoru Miyano,et al.  Inferring Gene Regulatory Networks from Time-Ordered Gene Expression Data of Bacillus Subtilis Using Differential Equations , 2002, Pacific Symposium on Biocomputing.

[26]  J. Villard,et al.  Transcription regulation and human diseases. , 2004, Swiss medical weekly.

[27]  Patricia A. Evans,et al.  Transcription Factor mapping between Bacteria Genomes , 2009, Int. J. Funct. Informatics Pers. Medicine.

[28]  Eyad Almasri,et al.  Incorporating Literature Knowledge in Bayesian Network for Inferring Gene Networks with Gene Expression Data , 2008, ISBRA.

[29]  C. Daub,et al.  BMC Systems Biology , 2007 .

[30]  J. Priestley,et al.  ASSESSMENT OF EVALUATION METHODS FOR BINARY CLASSIFICATION MODELING , 2003 .

[31]  E. Grotewold,et al.  Genome wide analysis of Arabidopsis core promoters , 2005, BMC Genomics.