Generation of Long Insert Pairs Using a Cre-LoxP Inverse PCR Approach

Large insert mate pair reads have a major impact on the overall success of de novo assembly and the discovery of inherited and acquired structural variants. The positional information of mate pair reads generally improves genome assembly by resolving repeat elements and/or ordering contigs. Currently available methods for building such libraries have one or more of limitations, such as relatively small insert size; unable to distinguish the junction of two ends; and/or low throughput. We developed a new approach, Cre-LoxP Inverse PCR Paired-End (CLIP-PE), which exploits the advantages of (1) Cre-LoxP recombination system to efficiently circularize large DNA fragments, (2) inverse PCR to enrich for the desired products that contain both ends of the large DNA fragments, and (3) the use of restriction enzymes to introduce a recognizable junction site between ligated fragment ends and to improve the self-ligation efficiency. We have successfully created CLIP-PE libraries up to 22 kb that are rich in informative read pairs and low in small fragment background. These libraries have demonstrated the ability to improve genome assemblies. The CLIP-PE methodology can be implemented with existing and future next-generation sequencing platforms.

[1]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[2]  R. Hoess,et al.  Bacteriophage P1 site-specific recombination. II. Recombination between loxP and the bacterial chromosome. , 1981, Journal of molecular biology.

[3]  M. Adams,et al.  High throughput direct end sequencing of BAC clones. , 1999, Nucleic acids research.

[4]  E. Liu,et al.  Gene identification signature (GIS) analysis for transcriptome characterization and genome annotation , 2005, Nature Methods.

[5]  Peter Winter,et al.  Gene expression analysis of plant host–pathogen interactions by SuperSAGE , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[6]  E. Liu,et al.  Next-generation DNA sequencing of paired-end tags (PET) for transcriptome and genome analyses. , 2009, Genome research.

[7]  N. Sternberg,et al.  Bacteriophage P1 site-specific recombination. I. Recombination between loxP sites. , 1981, Journal of molecular biology.

[8]  S. Salzberg,et al.  Versatile and open software for comparing large genomes , 2004, Genome Biology.

[9]  A. Gnirke,et al.  High-quality draft assemblies of mammalian genomes from massively parallel sequence data , 2010, Proceedings of the National Academy of Sciences.

[10]  Chee Seng Chan,et al.  Comprehensive long-span paired-end-tag mapping reveals characteristic patterns of structural variations in epithelial cancer genomes. , 2011, Genome research.

[11]  E. Eichler,et al.  A genome-wide survey of structural variation between human and chimpanzee. , 2005, Genome research.

[12]  N. Sternberg,et al.  Bacteriophage P1 cloning system for the isolation, amplification, and recovery of DNA fragments as large as 100 kilobase pairs. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[13]  E. Birney,et al.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs. , 2008, Genome research.

[14]  R. Hoess,et al.  Interaction of the bacteriophage P1 recombinase Cre with the recombining site loxP. , 1984, Proceedings of the National Academy of Sciences of the United States of America.