Haplo2Ped: a tool using haplotypes as markers for linkage analysis

BackgroundGenerally, SNPs are abundant in the genome; however, they display low power in linkage analysis because of their limited heterozygosity. Haplotype markers, on the other hand, which are composed of many SNPs, greatly increase heterozygosity and have superiority in linkage statistics.ResultsHere we developed Haplo2Ped to automatically transform SNP data into haplotype markers and then to compute the logarithm (base 10) of odds (LOD) scores of regional haplotypes that are homozygous within the disease co-segregation haploid group. The results are reported as a hypertext file and a 3D figure to help users to obtain the candidate linkage regions. The hypertext file contains parameters of the disease linked regions, candidate genes, and their links to public databases. The 3D figure clearly displays the linkage signals in each chromosome. We tested Haplo2Ped in a simulated SNP dataset and also applied it to data from a real study. It successfully and accurately located the causative genomic regions. Comparison of Haplo2Ped with other existing software for linkage analysis further indicated the high effectiveness of this software.ConclusionsHaplo2Ped uses haplotype fragments as mapping markers in whole genome linkage analysis. The advantages of Haplo2Ped over other existing software include straightforward output files, increased accuracy and superior ability to deal with pedigrees showing incomplete penetrance. Haplo2Ped is freely available at: http://bighapmap.big.ac.cn/software.html.

[1]  J. Witte,et al.  Genetic dissection of complex traits , 1996, Nature Genetics.

[2]  J. Ott,et al.  Some statistical properties of the lod method and the method of scoring known recombination events in linkage analysis. , 1978, Cytogenetics and cell genetics.

[3]  Eric S. Lander,et al.  Resolution of quantitative traits into Mendelian factors by using a complete linkage map of restriction fragment length polymorphisms , 1988, Nature.

[4]  BMC Bioinformatics , 2005 .

[5]  Miao Sun,et al.  Copy-number mutations on chromosome 17q24.2-q24.3 in congenital generalized hypertrichosis terminalis with or without gingival hyperplasia. , 2009, American journal of human genetics.

[6]  Daniel F. Gudbjartsson,et al.  Allegro, a new computer program for multipoint linkage analysis , 2000, Nature genetics.

[7]  E. Lander,et al.  Genetic dissection of complex traits: guidelines for interpreting and reporting linkage results , 1995, Nature Genetics.

[8]  M. Daly,et al.  Genetic Mapping in Human Disease , 2008, Science.

[9]  Lusheng Wang,et al.  Identification of linked regions using high-density SNP genotype data in linkage analysis , 2008, Bioinform..

[10]  Wei Chen,et al.  SNP@Evolution: a hierarchical database of positive selection on the human genome , 2009, BMC Evolutionary Biology.

[11]  Tayfun Ozcelik,et al.  Mutations in the very low-density lipoprotein receptor VLDLR cause cerebellar hypoplasia and quadrupedal locomotion in humans , 2008, Proceedings of the National Academy of Sciences.

[12]  Yan Li,et al.  A novel frame-shift mutation of GLI3 causes non-syndromic and complex digital anomalies in a Chinese family. , 2011, Clinica chimica acta; international journal of clinical chemistry.

[13]  G. Abecasis,et al.  Merlin—rapid analysis of dense genetic maps using sparse gene flow trees , 2002, Nature Genetics.

[14]  Emily L. Webb,et al.  SNPLINK: multipoint linkage analysis of densely distributed SNP data incorporating automated linkage disequilibrium removal , 2005, Bioinform..