Comparative genomic analysis of the arthropod muscle myosin heavy chain genes allows ancestral gene reconstruction and reveals a new type of 'partially' processed pseudogene

BackgroundAlternative splicing of mutually exclusive exons is an important mechanism for increasing protein diversity in eukaryotes. The insect Mhc (myosin heavy chain) gene produces all different muscle myosins as a result of alternative splicing in contrast to most other organisms of the Metazoa lineage, that have a family of muscle genes with each gene coding for a protein specialized for a functional niche.ResultsThe muscle myosin heavy chain genes of 22 species of the Arthropoda ranging from the waterflea to wasp and Drosophila have been annotated. The analysis of the gene structures allowed the reconstruction of an ancient muscle myosin heavy chain gene and showed that during evolution of the arthropods introns have mainly been lost in these genes although intron gain might have happened in a few cases. Surprisingly, the genome of Aedes aegypti contains another and that of Culex pipiens quinquefasciatus two further muscle myosin heavy chain genes, called Mhc3 and Mhc4, that contain only one variant of the corresponding alternative exons of the Mhc1 gene. Mhc3 transcription in Aedes aegypti is documented by EST data. Mhc3 and Mhc4 inserted in the Aedes and Culex genomes either by gene duplication followed by the loss of all but one variant of the alternative exons, or by incorporation of a transcript of which all other variants have been spliced out retaining the exon-intron structure. The second and more likely possibility represents a new type of a 'partially' processed pseudogene.ConclusionBased on the comparative genomic analysis of the alternatively spliced arthropod muscle myosin heavy chain genes we propose that the splicing process operates sequentially on the transcript. The process consists of the splicing of the mutually exclusive exons until one exon out of the cluster remains while retaining surrounding intronic sequence. In a second step splicing of introns takes place. A related mechanism could be responsible for the splicing of other genes containing mutually exclusive exons.

[1]  Mark Johnston,et al.  Yeast genome duplication was followed by asynchronous differentiation of duplicated genes , 2003, Nature.

[2]  Stephen M. Mount,et al.  The genome sequence of Drosophila melanogaster. , 2000, Science.

[3]  N. Davidson,et al.  Differential processing of RNA transcribed from the single-copy Drosophila myosin heavy chain gene produces four mRNAs that encode two polypeptides. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Roderic D. M. Page,et al.  TreeView: an application to display phylogenetic trees on personal computers , 1996, Comput. Appl. Biosci..

[5]  Inna Dubchak,et al.  Comparative genome sequencing of Drosophila pseudoobscura: chromosomal, gene, and cis-element evolution. , 2005, Genome research.

[6]  H. Sweeney,et al.  Two Conserved Lysines at the 50/20-kDa Junction of Myosin Are Necessary for Triggering Actin Activation* , 2001, The Journal of Biological Chemistry.

[7]  D. Swank,et al.  The converter domain modulates kinetic properties of Drosophila myosin. , 2003, American journal of physiology. Cell physiology.

[8]  E V Koonin,et al.  Origin of alternative splicing by tandem exon duplication. , 2001, Human molecular genetics.

[9]  E. L. George,et al.  Functional Domains of the Drosophila melanogaster Muscle Myosin Heavy-Chain Gene Are Encoded by Alternatively Spliced Exons , 1989, Molecular and cellular biology.

[10]  J. Spudich,et al.  Enzymatic activities correlate with chimaeric substitutions at the actin-binding face of myosin , 1994, Nature.

[11]  A. Kumar,et al.  Genetic complexity of the human geranylgeranyltransferase I beta-subunit gene: a multigene family of pseudogenes derived from mis-spliced transcripts. , 1998, Gene.

[12]  D. Swank,et al.  The myosin converter domain modulates muscle performance , 2002, Nature Cell Biology.

[13]  J. Sellers,et al.  Identification and analysis of the myosin superfamily in Drosophila: a database approach , 2004, Journal of Muscle Research & Cell Motility.

[14]  Cecilia Saccone,et al.  Pseudogenes in metazoa: origin and features. , 2004, Briefings in functional genomics & proteomics.

[15]  Terrence S. Furey,et al.  The UCSC Genome Browser Database: update 2006 , 2005, Nucleic Acids Res..

[16]  Ronald D Vale,et al.  The Molecular Motor Toolbox for Intracellular Transport , 2003, Cell.

[17]  N. Satoh,et al.  A genomewide survey of developmentally relevant genes in Ciona intestinalis , 2003, Development Genes and Evolution.

[18]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.

[19]  Peer Bork,et al.  Common exon duplication in animals and its role in alternative splicing. , 2002, Human molecular genetics.

[20]  Dimitris Anastassiou,et al.  Variable window binding for mutually exclusive alternative splicing , 2006, Genome Biology.

[21]  M. Schliwa,et al.  Molecular motors , 2003, Nature.

[22]  Jian Wang,et al.  The Genome Sequence of the Malaria Mosquito Anopheles gambiae , 2002, Science.

[23]  R. Milligan,et al.  Fine tuning a molecular motor: the location of alternative domains in the Drosophila myosin head. , 1997, Journal of molecular biology.

[24]  S. Rosenfeld,et al.  Kinetic Tuning of Myosin via a Flexible Loop Adjacent to the Nucleotide Binding Pocket* , 1998, The Journal of Biological Chemistry.

[25]  Yoshiaki Nagamura,et al.  The genome sequence of silkworm, Bombyx mori. , 2004, DNA research : an international journal for rapid publication of reports on genes and genomes.

[26]  D. Black Protein Diversity from Alternative Splicing A Challenge for Bioinformatics and Post-Genome Biology , 2000, Cell.

[27]  F. Ayala,et al.  Pseudogenes: are they "junk" or functional DNA? , 2003, Annual review of genetics.

[28]  Florian Odronitz,et al.  Drawing the tree of eukaryotic life based on the analysis of 2,269 manually annotated myosins from 328 species , 2007, Genome Biology.

[29]  J. Coulombe-Huntington,et al.  Intron loss and gain in Drosophila. , 2007, Molecular biology and evolution.

[30]  Evgeny M. Zdobnov,et al.  Genome Sequence of Aedes aegypti, a Major Arbovirus Vector , 2007, Science.

[31]  B. Graveley Alternative splicing: increasing diversity in the proteomic world. , 2001, Trends in genetics : TIG.

[32]  S. Bernstein,et al.  Alternative RNA splicing generates transcripts encoding a thorax-specific isoform of Drosophila melanogaster myosin heavy chain , 1986, Molecular and cellular biology.

[33]  Manfred Schliwa,et al.  Molecular motors , 2003, Nature.

[34]  Michael Grüninger,et al.  Introduction , 2002, CACM.

[35]  Ting Wang,et al.  The UCSC Genome Browser Database: update 2009 , 2008, Nucleic Acids Res..

[36]  Lee Rowen,et al.  The organization and evolution of the dipteran and hymenopteran Down syndrome cell adhesion molecule (Dscam) genes. , 2004, RNA.

[37]  J. Berg,et al.  A millennial myosin census. , 2001, Molecular biology of the cell.

[38]  M. Kollmar,et al.  Crystal structure of the motor domain of a class‐I myosin , 2002, The EMBO journal.

[39]  R. Rosenfeld Nature , 2009, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[40]  D. Manstein,et al.  Modulation of actin affinity and actomyosin adenosine triphosphatase by charge changes in the myosin motor domain. , 1998, Biochemistry.

[41]  Melanie A. Huntley,et al.  Evolution of genes and genomes on the Drosophila phylogeny , 2007, Nature.

[42]  M. Levine,et al.  A genomewide survey of developmentally relevant genes in Ciona intestinalis , 2003, Development Genes and Evolution.

[43]  Rodrigo Lopez,et al.  Multiple sequence alignment with the Clustal series of programs , 2003, Nucleic Acids Res..

[44]  Oliver Tn,et al.  Tails of unconventional myosins. , 1999 .

[45]  P. Bork,et al.  Vertebrate-Type Intron-Rich Genes in the Marine Annelid Platynereis dumerilii , 2005, Science.

[46]  Brenton R Graveley,et al.  Mutually Exclusive Splicing of the Insect Dscam Pre-mRNA Directed by Competing Intronic RNA Secondary Structures , 2005, Cell.

[47]  P Chambon,et al.  Organization and expression of eucaryotic split genes coding for proteins. , 1981, Annual review of biochemistry.

[48]  Thangavel Alphonse Thanaraj,et al.  ASD: the Alternative Splicing Database , 2004, Nucleic Acids Res..

[49]  T. N. Oliver,et al.  Tails of unconventional myosins , 1999, Cellular and Molecular Life Sciences CMLS.

[50]  Kenneth C. Holmes,et al.  Introduction: one contribution of 14 to a discussion meeting issue 'Myosin, muscle and motility' , 2004 .