Transposable elements are found in a large number of human protein-coding genes.

To study the genome-wide impact of transposable elements (TEs) on the evolution of protein-coding regions, we examined 13 799 human genes and found 533 (approximately 4%) cases of TEs within protein-coding regions. The majority of these TEs (approximately 89.5%) reside within 'introns' and were recruited into coding regions as novel exons. We found that TE integration often has an effect on gene function. In particular, there were two mouse genes whose coding regions consist largely of TEs, suggesting that TE insertion might create new genes. Thus, there is increasing evidence for an important role of TEs in gene evolution. Because many TEs are taxon-specific, their integration into coding regions could accelerate species divergence.

[1]  A. Smit Interspersed repeats and other mementos of transposable elements in mammalian genomes. , 1999, Current opinion in genetics & development.

[2]  M. Batzer,et al.  Alu repeats and human disease. , 1999, Molecular genetics and metabolism.

[3]  W. Makałowski,et al.  Genomic scrap yard: how genomes utilize all that junk. , 2000, Gene.

[4]  M. Suzuki,et al.  Primary structure and functional expression of a novel non-selective cation channel. , 1998, Biochemical and biophysical research communications.

[5]  M. C. Hu,et al.  Human HPK1, a novel human hematopoietic progenitor kinase that activates the JNK/SAPK kinase cascade. , 1996, Genes & development.

[6]  J. Brosius,et al.  On "genomenclature": a comprehensive (and respectful) taxonomy for pseudogenes and other "junk DNA". , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[7]  P Bork,et al.  EST comparison indicates 38% of human mRNAs contain possible alternative splice forms , 2000, FEBS letters.

[8]  Z. Gu,et al.  Evolutionary analyses of the human genome , 2001, Nature.

[9]  M. Furuichi,et al.  Regulation of Expression of the Human MTH1 Gene Encoding 8-Oxo-dGTPase , 1997, The Journal of Biological Chemistry.

[10]  R. Coffman,et al.  Lungkine, a novel CXC chemokine, specifically expressed by lung bronchoepithelial cells. , 1999, Journal of immunology.

[11]  D. Labuda,et al.  Alu sequences in the coding regions of mRNA: a source of protein variability. , 1994, Trends in genetics : TIG.