Automated detection of hereditary syndromes using data mining.

Computer-based data mining applied to clinical family history data can algorithmically generate highly accurate, clinically oriented recognizers of hereditary disease patterns. In the example of hereditary colon cancer, the factors that the data mining selected as relevant for assessing hereditary colon cancer were statistically significant (P < 0.05). All final patterns of hereditary colon cancer formulated by the recognizer were independently confirmed by a clinical expert. Applied to previously analyzed family histories, the recognizer identified the definitive hereditary histories and correctly responded negatively both to the putative hereditary histories and to situations of empirically elevated colon cancer risk. This capability facilitates the selection of patients for DNA studies in search of gene mutations. When genetic mutations are included as parameters in a patient database for a genetic disease, the process yields an expert system that characterizes variations in clinical disease presentation in terms of genetic mutations. Such information can greatly improve the efficiency of gene testing.
