Patterns of Insertion and Deletion in Mammalian Genomes

Nucleotide insertions and deletions (indels) are responsible for gaps in the sequence alignments. Indel is one of the major sources of evolutionary change at the molecular level. We have examined the patterns of insertions and deletions in the 19 mammalian genomes, and found that deletion events are more common than insertions in the mammalian genomes. Both the number of insertions and deletions decrease rapidly when the gap length increases and single nucleotide indel is the most frequent in all indel events. The frequencies of both insertions and deletions can be described well by power law.

[1]  M T Clegg,et al.  Evolution of a noncoding region of the chloroplast genome. , 1993, Molecular phylogenetics and evolution.

[2]  H. Maki,et al.  Mutational specificity of the dnaE173 mutator associated with a defect in the catalytic subunit of DNA polymerase III of Escherichia coli. , 1991, Journal of molecular biology.

[3]  Anna Wetterbom,et al.  Comparative Genomic Analysis of Human and Chimpanzee Indicates a Key Role for Indels in Primate Evolution , 2006, Journal of Molecular Evolution.

[4]  Reed A. Cartwright,et al.  Logarithmic gap costs decrease alignment accuracy , 2006, BMC Bioinformatics.

[5]  Mouse Genome Sequencing Consortium Initial sequencing and comparative analysis of the mouse genome , 2002, Nature.

[6]  Xun Gu,et al.  The size distribution of insertions and deletions in human and rodent pseudogenes suggests the logarithmic gap penalty for sequence alignment , 1995, Journal of Molecular Evolution.

[7]  W. Murphy,et al.  Resolution of the Early Placental Mammal Radiation Using Bayesian Phylogenetics , 2001, Science.

[8]  Mark Gerstein,et al.  Patterns of nucleotide substitution, insertion and deletion in the human genome inferred from pseudogenes. , 2003, Nucleic acids research.

[9]  C Sander,et al.  Mapping the Protein Universe , 1996, Science.

[10]  D. Haussler,et al.  Human-mouse alignments with BLASTZ. , 2003, Genome research.

[11]  Wilfried W. de Jong,et al.  Causes of more frequent deletions than insertions in mutations and protein evolution , 1981, Nature.

[12]  D. Haussler,et al.  Evolution's cauldron: Duplication, deletion, and rearrangement in the mouse and human genomes , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Martin S. Taylor,et al.  Occurrence and consequences of coding sequence insertions and deletions in Mammalian genomes. , 2004, Genome research.

[14]  G. Gonnet,et al.  Empirical and structural models for insertions and deletions in the divergent evolution of proteins. , 1993, Journal of molecular biology.

[15]  Terrence S. Furey,et al.  The UCSC Genome Browser Database , 2003, Nucleic Acids Res..

[16]  D. Haussler,et al.  Aligning multiple genomic sequences with the threaded blockset aligner. , 2004, Genome research.

[17]  Tadashi Imanishi,et al.  Comparative sequencing of human and chimpanzee MHC class I regions unveils insertions/deletions as the major path to genomic divergence , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[18]  N. Saitou,et al.  Evolutionary rates of insertion and deletion in noncoding nucleotide sequences of primates. , 1994, Molecular biology and evolution.

[19]  B Qian,et al.  Distribution of indel lengths , 2001, Proteins.

[20]  Lisa M. D'Souza,et al.  Genome sequence of the Brown Norway rat yields insights into mammalian evolution , 2004, Nature.

[21]  R. Britten,et al.  Majority of divergence between closely related DNA samples is due to indels , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[22]  D Graur,et al.  Patterns and rates of indel evolution in processed pseudogenes from humans and murids. , 1997, Gene.

[23]  R. Britten,et al.  Divergence between samples of chimpanzee and human DNA sequences is 5%, counting indels , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Byungkook Lee,et al.  Frequency of gaps observed in a structurally aligned protein pair database suggests a simple gap penalty function. , 2004, Nucleic acids research.

[25]  Tim J. P. Hubbard,et al.  SCOP: a Structural Classification of Proteins database , 1999, Nucleic Acids Res..

[26]  Dan Graur,et al.  Deletions in processed pseudogenes accumulate faster in rodents than in humans , 1989, Journal of Molecular Evolution.