Spliceosomal small nuclear RNA genes in 11 insect genomes.

The removal of introns from the primary transcripts of protein-coding genes is accomplished by the spliceosome, a large macromolecular complex of which small nuclear RNAs (snRNAs) are crucial components. Following the recent sequencing of the honeybee (Apis mellifera) genome, we used various computational methods, ranging from sequence similarity search to RNA secondary structure prediction, to identify putative snRNA genes (including their promoters) and to examine their pattern of conservation among 11 available insect genomes (A. mellifera, Tribolium castaneum, Bombyx mori, Anopheles gambiae, Aedes aegypti, and six Drosophila species). We identified candidates for all nine spliceosomal snRNA genes in all the analyzed genomes. All species contain a similar number of snRNA genes, with the exceptions of A. aegypti, whose genome contains more U1, U2, and U5 genes, and A. mellifera, whose genome contains fewer U2 and U5 genes. We found that snRNA genes are generally more closely related to homologs within the same genus than to those in other genera. Within each insect species, the promoter regions of all spliceosomal snRNA genes share similar sequence motifs that are likely to correspond to the PSEA (proximal sequence element A), the binding site for the snRNA activating protein complex, but these promoter elements vary in sequence among the five insect families surveyed here. In contrast to the other insect species investigated, dipteran genomes are characterized by rapid evolution (or loss) of components of the U12 spliceosome and a striking loss of U12-type introns.
