AluScan: a method for genome-wide scanning of sequence and structure variations in the human genome

Background: To complement next-generation sequencing technologies, there is a pressing need for efficient pre-sequencing capture methods with reduced costs and DNA requirement. The Alu family of short interspersed nuclear elements is the most abundant type of transposable element in the human genome and a recognized source of genome instability. With over one million Alu elements distributed throughout the genome, they are well positioned to facilitate genome-wide sequence amplification and the capture of regions likely to harbor genetic variation hotspots of biological relevance.

Results: Here we report on the use of inter-Alu PCR with an enhanced range of amplicons, in conjunction with next-generation sequencing, to generate an Alu-anchored scan, or 'AluScan', of DNA sequences between Alu transposons. Alu consensus sequence-based 'H-type' PCR primers, which elongate outward from the head of an Alu element, are combined with 'T-type' primers, which elongate from the poly-A containing tail, to achieve a large amplicon range. To illustrate the method, glioma DNA was compared with white blood cell control DNA of the same patient by means of AluScan. The more than 10 Mb of sequence obtained, derived from more than 8,000 genes spread over all the chromosomes, revealed a highly reproducible capture of genomic sequences enriched in genic sequences and cancer candidate gene regions. With only sub-microgram amounts of sample DNA required, the power of AluScan as a discovery tool for genetic variations was demonstrated by the identification of 357 instances of loss of heterozygosity, 341 somatic indels, 274 somatic SNVs, and seven potential somatic SNV hotspots between control and glioma DNA.

Conclusions: AluScan, implemented with just a small number of H-type and T-type inter-Alu PCR primers, provides an effective capture of a diversity of genome-wide sequences for analysis.
The method enables examination of gene-enriched regions containing exons, introns, and intergenic sequences with modest capture and sequencing costs, computational workload, and DNA sample requirement. It is therefore particularly well suited to accelerating the discovery of somatic mutations, as well as the analysis of disease-predisposing germline polymorphisms, because it makes possible the comparative genome-wide scanning of DNA sequences from large human cohorts.
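
The tumor-versus-control comparison described in the Results can be illustrated with a minimal sketch. It assumes that each site has already been genotyped in both samples and that genotypes are given as unphased pairs of alleles. The function name `classify_site` and the classification rules are illustrative assumptions, not the calling criteria used in the study.

```python
def classify_site(control_gt, tumor_gt):
    """Classify one genomic site from unphased genotypes such as ('A', 'G').

    Hypothetical rules, for illustration only:
      - LOH: control is heterozygous and tumor retains only one of its alleles.
      - somatic_SNV: tumor carries an allele that is absent from the control.
    """
    control = frozenset(control_gt)
    tumor = frozenset(tumor_gt)

    if control == tumor:
        return "unchanged"
    # Heterozygous control, homozygous tumor holding one of the control alleles.
    if len(control) == 2 and len(tumor) == 1 and tumor < control:
        return "LOH"
    # Any allele seen only in the tumor is treated as a somatic change.
    if tumor - control:
        return "somatic_SNV"
    # Fallback, e.g. a missing genotype call in the tumor sample.
    return "other"


if __name__ == "__main__":
    # Hypothetical (control, tumor) genotype pairs.
    sites = [
        (("A", "G"), ("A", "A")),
        (("C", "C"), ("C", "T")),
        (("G", "T"), ("T", "G")),
    ]
    for control_gt, tumor_gt in sites:
        print(control_gt, tumor_gt, classify_site(control_gt, tumor_gt))
```

In practice, such calls would also depend on read depth and base quality thresholds, which this sketch leaves out.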
