Manual superscaffolding of honey bee (Apis mellifera) chromosomes 12–16: implications for the draft genome assembly version 4, gene annotation, and chromosome structure

The euchromatic arms of the five smallest telocentric chromosomes in the honey bee genome draft Assembly v4 were manually connected into superscaffolds. This effort reduced chromosomes 12–16 from 30, 21, 25, 42, and 21 mapped scaffolds to five, four, five, six, and five superscaffolds, respectively, and incorporated 178 unmapped contigs and scaffolds totalling 2.6 Mb, a 6.4% increase in length. The superscaffolds extend from the genetically mapped location of the centromere to their identified distal telomeres on the long arms. Only two major misassemblies of 146 kb and 65 kb sections were identified in this 23% of the mapped assembly. Nine duplicate gene models on chromosomes 15 and 16 were made redundant, while another 15 gene models were improved, most spectacularly the MAD (MAX dimerization protein) gene which extends across 11 scaffolds for at least 400 kb.

[1]  J. Cornuet,et al.  A third-generation microsatellite-based linkage map of the honey bee, Apis mellifera, and its comparison with the sequence-based physical map , 2007, Genome Biology.

[2]  G. Weinstock,et al.  Creating a honey bee consensus gene set , 2007, Genome Biology.

[3]  A. Clark,et al.  Heterogeneity in regional GC content and differential usage of codons and amino acids in GC-poor and GC-rich regions of the genome of Apis mellifera. , 2006, Molecular biology and evolution.

[4]  Christine G Elsik,et al.  Community annotation: procedures, protocols, and supporting tools. , 2006, Genome research.

[5]  S. Forêt,et al.  Function and evolution of a gene family encoding odorant binding-like proteins in a social insect, the honey bee (Apis mellifera). , 2006, Genome research.

[6]  H. Robertson,et al.  Canonical TTAGG-repeat telomeres and telomerase in the honey bee, Apis mellifera. , 2006, Genome research.

[7]  A. Clark,et al.  Thrice Out of Africa: Ancient and Recent Expansions of the Honey Bee, Apis mellifera , 2006, Science.

[8]  Ying Wang,et al.  Insights into social insects from the genome of the honeybee Apis mellifera , 2006, Nature.

[9]  D. D. de Graaf,et al.  Genomic and transcriptional analysis of protein heterogeneity of the honeybee venom allergen Api m 6 , 2006, Insect molecular biology.

[10]  R. Gutell,et al.  Characteristics of the nuclear (18S, 5.8S, 28S and 5S) and mitochondrial (12S and 16S) rRNA genes of Apis mellifera (Insecta: Hymenoptera): structure, organization, and retrotransposable elements , 2006, Insect molecular biology.

[11]  F. Delsuc,et al.  Phylogenomics: the beginning of incongruence? , 2006, Trends in genetics : TIG.

[12]  C. D. Sauer,et al.  Pteropsin: a vertebrate-like non-visual opsin expressed in the honey bee brain. , 2005, Insect biochemistry and molecular biology.

[13]  A. Coulson,et al.  Genomics in C. elegans: so many genes, such a little worm. , 2005, Genome research.

[14]  J. Cornuet,et al.  Whole-Genome Scan in Thelytokous-Laying Workers of the Cape Honeybee (Apis mellifera capensis): Central Fusion, Reduced Recombination Rates and Centromere Mapping Using Half-Tetrad Analysis , 2004, Genetics.

[15]  J. Cornuet,et al.  A Microsatellite-Based Linkage Map of the Honeybee, Apis mellifera L. , 2004, Genetics.

[16]  E. Birney,et al.  Apollo: a sequence annotation editor , 2002, Genome Biology.

[17]  S. Lewis,et al.  The generic genome browser: a building block for a model organism system database. , 2002, Genome research.

[18]  G. Robinson,et al.  Annotated expressed sequence tags and cDNA microarrays for studies of brain and behavior in the honey bee. , 2002, Genome research.

[19]  Stephen M. Mount,et al.  The genome sequence of Drosophila melanogaster. , 2000, Science.

[20]  P. Hengen Optimizing multiplex and LA-PCR with betaine. , 1997, Trends in biochemical sciences.

[21]  M. Beye,et al.  Characterization of honeybee (Apis mellifera L.) chromosomes using repetitive DNA probes and fluorescence in situ hybridization. , 1995, The Journal of heredity.

[22]  Russell Higuchi,et al.  Effective amplification of long targets from cloned inserts and human genomic DNA. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[23]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[24]  C. Marck,et al.  'DNA Strider': a 'C' program for the fast analysis of DNA and protein sequences on the Apple Macintosh family of computers. , 1988, Nucleic acids research.

[25]  P. Gallant Myc/Max/Mad in invertebrates: the evolution of the Max network. , 2006, Current topics in microbiology and immunology.

[26]  M. P. Cummings,et al.  PAUP* Phylogenetic analysis using parsimony (*and other methods) Version 4 , 2000 .