Comparative linkage analysis and visualization of high-density oligonucleotide SNP array data

BackgroundThe identification of disease-associated genes using single nucleotide polymorphisms (SNPs) has been increasingly reported. In particular, the Affymetrix Mapping 10 K SNP microarray platform uses one PCR primer to amplify the DNA samples and determine the genotype of more than 10,000 SNPs in the human genome. This provides the opportunity for large scale, rapid and cost-effective genotyping assays for linkage analysis. However, the analysis of such datasets is nontrivial because of the large number of markers, and visualizing the linkage scores in the context of genome maps remains less automated using the current linkage analysis software packages. For example, the haplotyping results are commonly represented in the text format.ResultsHere we report the development of a novel software tool called CompareLinkage for automated formatting of the Affymetrix Mapping 10 K genotype data into the "Linkage" format and the subsequent analysis with multi-point linkage software programs such as Merlin and Allegro. The new software has the ability to visualize the results for all these programs in dChip in the context of genome annotations and cytoband information. In addition we implemented a variant of the Lander-Green algorithm in the dChipLinkage module of dChip software (V1.3) to perform parametric linkage analysis and haplotyping of SNP array data. These functions are integrated with the existing modules of dChip to visualize SNP genotype data together with LOD score curves. We have analyzed three families with recessive and dominant diseases using the new software programs and the comparison results are presented and discussed.ConclusionsThe CompareLinkage and dChipLinkage software packages are freely available. They provide the visualization tools for high-density oligonucleotide SNP array data, as well as the automated functions for formatting SNP array data for the linkage analysis programs Merlin and Allegro and calling these programs for linkage analysis. The results can be visualized in dChip in the context of genes and cytobands. In addition, a variant of the Lander-Green algorithm is provided that allows parametric linkage analysis and haplotyping.

[1]  L Kruglyak,et al.  Efficient multipoint linkage analysis through reduction of inheritance space. , 2001, American journal of human genetics.

[2]  Sreenivasan Ravi Book Review: Mathematical and statistical methods for genetic analysis, 2nd edition , 2005 .

[3]  G. Abecasis,et al.  Merlin—rapid analysis of dense genetic maps using sparse gene flow trees , 2002, Nature Genetics.

[4]  Daniel F. Gudbjartsson,et al.  Allegro, a new computer program for multipoint linkage analysis , 2000, Nature genetics.

[5]  L Kruglyak,et al.  Parametric and nonparametric linkage analysis: a unified multipoint approach. , 1996, American journal of human genetics.

[6]  Eric S. Lander,et al.  Faster Multipoint Linkage Analysis Using Fourier Transforms , 1998, J. Comput. Biol..

[7]  E. Lander,et al.  Construction of multilocus genetic linkage maps in humans. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[8]  S. Riazuddin,et al.  Mutations of ESPN cause autosomal recessive deafness and vestibular dysfunction , 2004, Journal of Medical Genetics.

[9]  K Lange,et al.  A multipoint method for detecting genotyping errors and mutations in sibling-pair linkage data. , 2000, American journal of human genetics.

[10]  C Cannings,et al.  Mathematical and Statistical Methods for Genetic Analysis (2nd ed) , 2004, Heredity.

[11]  Jing Huang,et al.  Parallel genotyping of over 10,000 SNPs using a one-primer assay on a high-density oligonucleotide array. , 2004, Genome research.

[12]  D. Schaid Mathematical and Statistical Methods for Genetic Analysis , 1999 .

[13]  Cheng Li,et al.  DNA-Chip Analyzer (dChip) , 2003 .

[14]  J R O'Connell,et al.  PedCheck: a program for identification of genotype incompatibilities in linkage analysis. , 1998, American journal of human genetics.

[15]  M. Daly,et al.  Genomewide linkage analysis of bipolar disorder by use of a high-density single-nucleotide-polymorphism (SNP) genotyping assay: a comparison with microsatellite marker assays and finding of significant linkage to chromosome 6q22. , 2004, American journal of human genetics.

[16]  S. P. Fodor,et al.  Large-scale genotyping of complex DNA , 2003, Nature Biotechnology.

[17]  Jeanette C Papp,et al.  Detection and integration of genotyping errors in statistical genetics. , 2002, American journal of human genetics.

[18]  C. Li,et al.  Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Eleftheria Zeggini,et al.  Whole-genome scan, in a complex disease, using 11,245 single-nucleotide polymorphisms: comparison with microsatellites. , 2004, American journal of human genetics.

[20]  E. Lander,et al.  Genetic dissection of complex traits: guidelines for interpreting and reporting linkage results , 1995, Nature Genetics.

[21]  M. Daly,et al.  Rapid multipoint linkage analysis of recessive traits in nuclear families, including homozygosity mapping. , 1995, American journal of human genetics.

[22]  Cheng Li,et al.  dChipSNP: significance curve and clustering of SNP-array-based loss-of-heterozygosity data , 2004, Bioinform..

[23]  M. Goldstein,et al.  Analysis of Gene Expression Data , 2022 .

[24]  E. Mugnaini,et al.  The Deaf Jerker Mouse Has a Mutation in the Gene Encoding the Espin Actin-Bundling Proteins of Hair Cell Stereocilia and Lacks Espins , 2000, Cell.