Discovering regulatory motifs in the Plasmodium genome using comparative genomics

MOTIVATION Understanding gene regulation in Plasmodium, the causative agent of malaria, is an important step in deciphering its complex life cycle as well as leading to possible new targets for therapeutic applications. Very little is known about gene regulation in Plasmodium, and in particular, few regulatory elements have been identified. Such discovery has been significantly hampered by the high A-T content of some of the genomes of Plasmodium species, as well as the challenge in associating discovered regulatory elements to gene regulatory cascades due to Plasmodium's complex life cycle. RESULTS We report a new method of using comparative genomics to systematically discover motifs in Plasmodium without requiring any functional data. Different from previous methods, our method does not depend on sequence alignments, and thus is particularly suitable for highly divergent genomes. We applied our method to discovering regulatory motifs between the human parasite, P.falciparum, and its rodent-infectious relative, P.yoelii. We also tested our procedure against comparisons between P.falciparum and the primate-infectious, P.knowlesi. Our computational effort leads to an initial catalog of 38 distinct motifs, corresponding to over 16 200 sites in the Plasmodium genome. The functionality of these motifs was further supported by their defined distribution within the genome as well as a correlation with gene expression patterns. This initial map provides a systematic view of gene regulation in Plasmodium, which can be refined as additional genomes become available. AVAILABILITY The new algorithm, named motif discovery using orthologous sequences (MDOS), is available at http://www.ics.uci.edu/ approximately xhx/project/mdos/.

[1]  Patricia De la Vega,et al.  Discovery of Gene Function by Expression Profiling of the Malaria Parasite Life Cycle , 2003, Science.

[2]  K. Lindblad-Toh,et al.  Systematic discovery of regulatory motifs in human promoters and 3′ UTRs by comparison of several mammals , 2005, Nature.

[3]  M Lanzer,et al.  Control of gene expression in Plasmodium falciparum. , 1998, Molecular and biochemical parasitology.

[4]  Manuel Llinás,et al.  Mechanisms of gene regulation in Plasmodium. , 2007, The American journal of tropical medicine and hygiene.

[5]  Elizabeth A. Winzeler,et al.  Applied systems biology and malaria , 2006, Nature Reviews Microbiology.

[6]  N. Slonim,et al.  A universal framework for regulatory element discovery across all genomes and data types. , 2007, Molecular cell.

[7]  Renu Tuteja,et al.  Malaria − an overview , 2007, The FEBS journal.

[8]  B. Birren,et al.  Sequencing and comparison of yeast species to identify genes and regulatory elements , 2003, Nature.

[9]  Jonathan E. Allen,et al.  Genome sequence of the human malaria parasite Plasmodium falciparum , 2002, Nature.

[10]  Li Li,et al.  PlasmoDB: the Plasmodium genome resource. A database integrating experimental and computational data , 2003, Nucleic Acids Res..

[11]  Olivier Elemento,et al.  Fast and systematic genome-wide discovery of conserved regulatory elements using a non-alignment based approach , 2005, Genome Biology.

[12]  W. Miller,et al.  Finding cis-regulatory elements using comparative genomics: some lessons from ENCODE data. , 2007, Genome research.

[13]  Manuel Llinás,et al.  Mining the malaria transcriptome. , 2005, Trends in parasitology.

[14]  C. Janse,et al.  Plasmodium post-genomics: better the bug you know? , 2006, Nature Reviews Microbiology.

[15]  R. Wilson,et al.  The transcriptome: malariologists ride the wave. , 2004, BioEssays : news and reviews in molecular, cellular and developmental biology.

[16]  Yingyao Zhou,et al.  Evidence-Based Annotation of the Malaria Parasite's Genome Using Comparative Expression Profiling , 2008, PloS one.

[17]  Colin N. Dewey,et al.  Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures , 2007, Nature.

[18]  C. Stoeckert,et al.  OrthoMCL: identification of ortholog groups for eukaryotic genomes. , 2003, Genome research.

[19]  Yingyao Zhou,et al.  Global analysis of transcript and protein levels across the Plasmodium falciparum life cycle. , 2004, Genome research.

[20]  Bindu Gajria,et al.  PlasmoDB: The Plasmodium Genome Resource , 2005 .

[21]  Jonathan E. Allen,et al.  Genome sequence and comparative analysis of the model rodent malaria parasite Plasmodium yoelii yoelii , 2002, Nature.

[22]  David S Roos,et al.  The genomics of malaria infection. , 2004, Trends in parasitology.

[23]  L. Fulton,et al.  Finding Functional Features in Saccharomyces Genomes by Phylogenetic Footprinting , 2003, Science.

[24]  Benedict Paten,et al.  The discovery, positioning and verification of a set of transcription-associated motifs in vertebrates , 2005, Genome Biology.

[25]  Yingyao Zhou,et al.  In silico discovery of transcription regulatory elements in Plasmodium falciparum , 2008, BMC Genomics.

[26]  Todd M. Gierahn,et al.  Regulatory motifs uncovered among gene expression clusters in Plasmodium falciparum. , 2007, Molecular and biochemical parasitology.