Fine-scale structural variation of the human genome

Inversions, deletions and insertions are important mediators of disease and disease susceptibility. We systematically compared the human genome reference sequence with a second genome (represented by fosmid paired-end sequences) to detect intermediate-sized structural variants >8 kb in length. We identified 297 sites of structural variation: 139 insertions, 102 deletions and 56 inversion breakpoints. Using combined literature, sequence and experimental analyses, we validated 112 of the structural variants, including several that are of biomedical relevance. These data provide a fine-scale structural variation map of the human genome and the requisite sequence precision for subsequent genetic studies of human disease.

[1]  B. S. Baker,et al.  Segmental aneuploidy and the genetic gross structure of the Drosophila genome. , 1972, Genetics.

[2]  J. Cartron,et al.  Genetic basis of the RhD-positive and RhD-negative blood group polymorphism as determined by Southern analysis. , 1991, Blood.

[3]  H. Hobbs,et al.  Molecular definition of the extreme size polymorphism in apolipoprotein(a). , 1993, Human molecular genetics.

[4]  E. Eichler,et al.  Length of uninterrupted CGG repeats determines instability in the FMR1 gene , 1994, Nature Genetics.

[5]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[6]  E. Lee,et al.  GLUTATHIONE S-TRANSFERASE THETA (GSTT1) GENETIC POLYMORPHISM AMONG CHINESE, MALAYS AND INDIANS IN INGAPORE , 1995 .

[7]  E. Lee,et al.  Glutathione S transferase-theta (GSTT1) genetic polymorphism among Chinese, Malays and Indians in Singapore. , 1995, Pharmacogenetics.

[8]  M Ingelman-Sundberg,et al.  Frequent distribution of ultrarapid metabolizers of debrisoquine in an ethiopian population carrying duplicated and multiduplicated functional CYP2D6 alleles. , 1996, The Journal of pharmacology and experimental therapeutics.

[9]  M. Olson,et al.  Multiple-complete-digest restriction fragment mapping: generating sequence-ready maps for large-scale DNA sequencing. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[10]  S. Warren,et al.  Emerin deletion reveals a common X-chromosome inversion mediated by inverted repeats , 1997, Nature Genetics.

[11]  M Ingelman-Sundberg,et al.  Frequent occurrence of CYP2D6 gene duplication in Saudi Arabians. , 1997, Pharmacogenetics.

[12]  S Holloway,et al.  A chromosomal duplication map of malformations: regions of suspected haplo- and triplolethality--and tolerance of segmental aneuploidy--in humans. , 1999, American journal of human genetics.

[13]  E. Eichler,et al.  The mosaic structure of human pericentromeric DNA: a strategy for characterizing complex regions of the human genome. , 2000, Genome research.

[14]  U. Brinkmann,et al.  Characterization of the glutathione S-transferase GSTT1 deletion: discrimination of all genotypes by polymerase chain reaction indicates a trimodular genotype-phenotype correlation. , 2000, Pharmacogenetics.

[15]  D. Nickerson,et al.  Variation is the spice of life , 2001, Nature Genetics.

[16]  Ajay N. Jain,et al.  Assembly of microarrays for genome-wide measurement of DNA copy number , 2001, Nature Genetics.

[17]  T. Hoogenboezem,et al.  Duplication of the CYP21A2 gene complicates mutation analysis of steroid 21-hydroxylase deficiency: characteristics of three unusual haplotypes , 2002, Human Genetics.

[18]  M. Adams,et al.  Recent Segmental Duplications in the Human Genome , 2002, Science.

[19]  D. Botstein,et al.  Discovering genotypes underlying human phenotypes: past successes for mendelian disease, future approaches for complex disease , 2003, Nature Genetics.

[20]  J. V. Moran,et al.  ATLAS: a system to selectively identify human-specific L1 insertions. , 2003, American journal of human genetics.

[21]  P. Buckland,et al.  Polymorphically duplicated genes: their relevance to phenotypic variation in humans , 2003, Annals of medicine.

[22]  B. Roe,et al.  Refinement of a chimpanzee pericentric inversion breakpoint to a segmental duplication cluster , 2003, Genome Biology.

[23]  Kenny Q. Ye,et al.  Large-Scale Copy Number Polymorphism in the Human Genome , 2004, Science.

[24]  C. Ponting,et al.  Finishing the euchromatic sequence of the human genome , 2004 .

[25]  E. Lander,et al.  Finishing the euchromatic sequence of the human genome , 2004 .

[26]  J. Bonfield,et al.  Finishing the euchromatic sequence of the human genome , 2004, Nature.

[27]  L. Feuk,et al.  Detection of large-scale variation in the human genome , 2004, Nature Genetics.

[28]  E. Eichler,et al.  Shotgun sequence assembly and recent segmental duplications within the human genome , 2004, Nature.

[29]  H. Stefánsson,et al.  A common inversion under selection in Europeans , 2005, Nature Genetics.

[30]  B. Rovin,et al.  The Influence of CCL 3 L 1 Gene – Containing Segmental Duplications on HIV-1 / AIDS Susceptibility , 2009 .