Finding association rules on heterogeneous genome data.

A novel approach for discovery of knowledge from genome data, which has been recently watched with interest in the research area of database, is applied to finding unified rules spreading over sequence, structure, and function of protein. As the result of experiments using data extracted from PDB, SWISS-PROT, and PROSITE, some association rules stating sequential/structural/functional aspects of two kinds of endopeptidases were found.

[1]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.