Prioritizing regions of candidate genes for efficient mutation screening

The availability of the complete sequence of the human genome has dramatically facilitated the search for disease‐causing sequence variations. As a result, the rate‐limiting step has shifted from the discovery and characterization of candidate genes to the actual screening of human populations and the subsequent interpretation of observed variations. In this study, we tested the hypothesis that some segments of candidate genes are more likely than others to contain disease‐causing variations and that these segments can be predicted bioinformatically. A bioinformatic technique, prioritization of annotated regions (PAR), was developed to predict the likelihood that a specific coding region of a gene will harbor a disease‐causing mutation, based on conserved protein functional domains and protein secondary structures. The method was evaluated by applying it to 710 genes that collectively harbor 4,498 previously identified mutations. Nearly 50% of the genes were recognized as disease‐associated after screening only 9% of the complete coding sequence. The PAR technique identified 90% of the genes as containing at least one mutation while using less than 40% of the screening resources that traditional approaches would require. These results suggest that prioritization strategies such as PAR can accelerate disease‐gene identification through more efficient use of screening resources. Hum Mutat 27(2), 195–200, 2006. © 2006 Wiley‐Liss, Inc.
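
The kind of evaluation summarized above (fraction of genes recognized versus fraction of coding sequence screened) can be illustrated with a small sketch. This is not the PAR implementation: the `Region` record, its `score` field, and the `screening_curve` function are hypothetical names introduced here, and the scores are assumed to come from some upstream annotation step (for example, domain or secondary-structure predictions) that the abstract does not specify.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Region:
    """Hypothetical record for one coding region of a candidate gene."""
    gene: str
    start: int          # 0-based start position (inclusive)
    end: int            # end position (exclusive)
    score: float        # assumed priority score; higher is screened first
    has_mutation: bool  # whether a known mutation falls in this region


def screening_curve(regions):
    """Screen regions in descending score order.

    Returns a list of (fraction_of_sequence_screened,
    fraction_of_genes_with_a_mutation_found) pairs, one per region.
    """
    total_length = sum(r.end - r.start for r in regions)
    all_genes = {r.gene for r in regions}
    ordered = sorted(regions, key=lambda r: r.score, reverse=True)

    screened = 0
    detected = set()
    curve = []
    for region in ordered:
        screened += region.end - region.start
        if region.has_mutation:
            detected.add(region.gene)
        curve.append((screened / total_length, len(detected) / len(all_genes)))
    return curve


# Toy data, invented purely for illustration.
toy_regions = [
    Region("GENE_A", 0, 10, 0.9, True),
    Region("GENE_A", 10, 100, 0.1, False),
    Region("GENE_B", 0, 20, 0.8, True),
    Region("GENE_B", 20, 100, 0.2, False),
]

for seq_fraction, gene_fraction in screening_curve(toy_regions):
    print(f"{seq_fraction:.2f} of sequence screened, "
          f"{gene_fraction:.2f} of genes detected")
```

In this toy input, both genes are detected after the two highest-scoring regions have been screened, which is the effect a prioritization strategy aims for. Real performance depends entirely on how well the scores predict where mutations occur.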
