Identification of Transcriptional Regulatory Elements in Chemosensory Receptor Genes by Probabilistic Segmentation

Genome sequencing has allowed many gene regulatory elements to be identified through cross-species comparisons . However, the expression of genes in multigene families can diverge rapidly between related species . An alternative approach to characterizing multigene families utilizes the fact that genes within the group are likely to share aspects of their regulation. Here, we use a statistical approach, probabilistic segmentation , to identify sequences that are overrepresented in the regions upstream of C. elegans chemosensory receptor genes. Although each of these elements is present in only a subset of the genes, their distribution across and within the promoters of chemosensory receptor genes makes it possible to detect them. Many of the motifs show positional preference with respect to the translational start site and correspond to the binding sites of known families of transcription factors. We verified one motif, the E-box sequence WWYCACSTGYY, by showing that it directs expression of reporter genes to the ADL chemosensory neurons. Thus, probabilistic segmentation can be used to identify functional regulatory elements with no previous knowledge of gene expression or regulation. This approach may be of particular value for rapidly evolving genes in the immune system and the nervous system.

[1]  B. Birren,et al.  Sequencing and comparison of yeast species to identify genes and regulatory elements , 2003, Nature.

[2]  C. Dulac,et al.  A Novel Family of Putative Pheromone Receptors in Mammals with a Topographically Organized and Sexually Dimorphic Distribution , 1997, Cell.

[3]  David J. Anderson,et al.  Atypical expansion in mice of the sensory neuron-specific Mrg G protein-coupled receptor family , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[4]  T. Insel,et al.  Oxytocin receptor distribution reflects social organization in monogamous and polygamous voles. , 1992, Annals of the New York Academy of Sciences.

[5]  J. Thomas,et al.  egl-4 acts through a transforming growth factor-beta/SMAD pathway in Caenorhabditis elegans to regulate multiple neuronal circuits in response to sensory cues. , 2000, Genetics.

[6]  H. Bussemaker,et al.  Building a dictionary for genomes: identification of presumptive regulatory sites by statistical analysis. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[7]  H. Robertson Two large families of chemoreceptor genes in the nematodes Caenorhabditis elegans and Caenorhabditis briggsae reveal extensive gene duplication, diversification, movement, and intron loss. , 1998, Genome research.

[8]  D. Baltimore,et al.  The “initiator” as a transcription control element , 1989, Cell.

[9]  T. Insel,et al.  Species Differences in Central Oxytocin Receptor Gene Expression: Comparative Analysis of Promoter Sequences , 1996, Journal of neuroendocrinology.

[10]  Gennifer E. Merrihew,et al.  High Genetic Diversity in the Chemoreceptor Superfamily of Caenorhabditis elegans , 2005, Genetics.

[11]  G. Crooks,et al.  WebLogo: a sequence logo generator. , 2004, Genome research.

[12]  Cori Bargmann,et al.  odr-10 Encodes a Seven Transmembrane Domain Olfactory Receptor Required for Responses to the Odorant Diacetyl , 1996, Cell.

[13]  Oliver Hobert,et al.  CisOrtho: A program pipeline for genome-wide identification of transcription factor target genes using phylogenetic footprinting , 2004, BMC Bioinformatics.

[14]  T. Inoue,et al.  Targets of TGF-beta signaling in Caenorhabditis elegans dauer formation. , 2000, Developmental biology.

[15]  P. Sengupta,et al.  The DAF-7 TGF-beta signaling pathway regulates chemosensory receptor gene expression in C. elegans. , 2002, Genes & development.

[16]  E. Norwitz,et al.  Activin A augments GnRH-mediated transcriptional activation of the mouse GnRH receptor gene. , 2002, Endocrinology.

[17]  L. Pennacchio,et al.  Identification of a novel enhancer of brain expression near the apoE gene cluster by comparative genomics. , 2004, Biochimica et biophysica acta.

[18]  James H. Thomas,et al.  Targets of TGF-β Signaling in Caenorhabditis elegans Dauer Formation , 2000 .

[19]  Chi V. Dang,et al.  A strategy for identifying transcription factor binding sites reveals two classes of genomic c-Myc target sites , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[20]  O. Hobert,et al.  Genomic cis-regulatory architecture and trans-acting regulators of a single interneuron-specific gene battery in C. elegans. , 2004, Developmental cell.

[21]  T. Insel,et al.  Oxytocin Receptor Distribution Reflects Social Organization in Monogamous and Polygamous Voles , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[22]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[23]  E. Serra,et al.  Promoter-specific binding of Rap1 revealed by genome-wide maps of protein-DNA association , 2001, Nature Genetics.

[24]  Cori Bargmann,et al.  Social feeding in Caenorhabditis elegans is induced by neurons that detect aversive stimuli , 2002, Nature.

[25]  H. Robertson,et al.  The large srh family of chemoreceptor genes in Caenorhabditis nematodes reveals processes of genome evolution involving large duplications and deletions and intron gains and losses. , 2000, Genome research.

[26]  A. Hart,et al.  Feeding status and serotonin rapidly and reversibly modulate a Caenorhabditis elegans chemosensory circuit. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[27]  W. Miller,et al.  Identification of a coordinate regulator of interleukins 4, 13, and 5 by cross-species sequence comparisons. , 2000, Science.

[28]  Cori Bargmann,et al.  Sensory experience and sensory activity regulate chemosensory receptor gene expression in Caenorhabditis elegans , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[29]  Cori Bargmann,et al.  Divergent seven transmembrane receptors are candidate chemosensory receptors in C. elegans , 1995, Cell.

[30]  A. Fainsod,et al.  Isolation and characterization of target sequences of the chicken CdxA homeobox gene. , 1993, Nucleic acids research.