Pattern Recognition of Relational Structures

A new representation which abstracts relational characteristics of a class of structured data is introduced in this paper. The representation, called primitive relational structure, naturally becomes an element of Boolean algebra, the operations of which reflect the structural similarity and dissimularity of any two objects. Then on the Boolean algebra, distance and probability measures are defined. Further, to render a feasible scheme for estimating structural probability distribution where sample size of data class is relatively small in real world application, a second order approximation scheme of higher order probability on discrete-valued data is adopted. In such a scheme the optimal subset of features for the representation of the probability distributions are extracted by optimising certain information measures defined on the set of relations. The objective function for optimisation can be formulated to yield either (a) distributions that best approximate the high order probability of an ensemble or (b) distributions that lead to optimal discrimination between classes. Thus with the distance and probability measures defined, both unsupervised and supervised classification on PRS can be achieved by algorithms adapted respectively from (a) a discrete-value data clustering algorithm and (b) an error-probability minimax classification scheme. The proposed method has been applied to the analysis of structural and measurable patterns of discrete-time systems.

[1]  C. N. Liu,et al.  Approximating discrete probability distributions with dependence trees , 1968, IEEE Trans. Inf. Theory.

[2]  King-Sun Fu,et al.  A Clustering Procedure for Syntactic Patterns , 1977, IEEE Transactions on Systems, Man, and Cybernetics.

[3]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[4]  Andrew K. C. Wong,et al.  DECA: A Discrete-Valued Data Clustering Algorithm , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Claude Berge,et al.  Graphs and Hypergraphs , 2021, Clustering.

[6]  Andrew K. C. Wong,et al.  A Decision-Directed Clustering Algorithm for Discrete Data , 1977, IEEE Transactions on Computers.

[7]  King-Sun Fu,et al.  Attributed Grammar-A Tool for Combining Syntactic and Statistical Approaches to Pattern Recognition , 1980, IEEE Transactions on Systems, Man, and Cybernetics.

[8]  Philip M. Lewis,et al.  Approximating Probability Distributions to Reduce Storage Requirements , 1959, Information and Control.

[9]  Godfried T. Toussaint,et al.  Bibliography on estimation of misclassification , 1974, IEEE Trans. Inf. Theory.

[10]  Martin E. Hellman,et al.  Probability of error, equivocation, and the Chernoff bound , 1970, IEEE Trans. Inf. Theory.

[11]  Andrew K. C. Wong,et al.  Random Graphs: Structural-Contextual Dichotomy , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  A. Wong,et al.  Classification of discrete data with feature space transformation , 1978 .

[13]  Benjamin S. Duran,et al.  Cluster Analysis , 2020, Marketing Analytics.

[14]  L. Goldfarb,et al.  Towards analysis of structural and measurable patterns of systems states , 1978, 1978 IEEE Conference on Decision and Control including the 17th Symposium on Adaptive Processes.

[15]  King-Sun Fu,et al.  Syntactic Pattern Recognition And Applications , 1968 .

[16]  P. Halmos Lectures on Boolean Algebras , 1963 .

[17]  King-Sun Fu,et al.  Error-Correcting Isomorphisms of Attributed Relational Graphs for Pattern Analysis , 1979, IEEE Transactions on Systems, Man, and Cybernetics.

[18]  Solomon Kullback,et al.  Approximating discrete probability distributions , 1969, IEEE Trans. Inf. Theory.

[19]  Andrew K. C. Wong,et al.  Graph Optimal Monomorphism Algorithms , 1980, IEEE Transactions on Systems, Man, and Cybernetics.

[20]  King-Sun Fu,et al.  Digital pattern recognition , 1976, Communication and cybernetics.