Inferring a Graph from Path Frequency

We consider the problem of inferring a graph (and a sequence) from the numbers of occurrences of vertex-labeled paths, which is closely related to the pre-image problem for graphs in machine learning: to reconstruct a graph from its feature space representation. We show that this problem can be solved in polynomial time in the size of an output graph if graphs are trees of bounded degree and the lengths of given paths are bounded by a constant. On the other hand, we show that this problem is strongly NP-hard even for planar graphs of bounded degree.

[1]  Robert B. Nachbar,et al.  Molecular Evolution: Automated Manipulation of Hierarchical Chemical Topology and Its Application to Average Molecular Structures , 2000, Genetic Programming and Evolvable Machines.

[2]  Alexander Zien,et al.  Learning to Find Graph Pre-images , 2004, DAGM-Symposium.

[3]  Bernhard Schölkopf,et al.  Kernel Methods in Computational Biology , 2005 .

[4]  H. M. Vinkers,et al.  SYNOPSIS: SYNthesize and OPtimize System in Silico. , 2003, Journal of medicinal chemistry.

[5]  Tatsuya Akutsu,et al.  Extensions of marginalized graph kernels , 2004, ICML.

[6]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[7]  Petra Mutzel,et al.  Computational Molecular Biology , 1996 .

[8]  Yoshihiro Yamanishi,et al.  Extraction of leukemia specific glycan motifs in humans by computational glycomics. , 2005, Carbohydrate research.

[9]  David S. Johnson,et al.  Computers and In stractability: A Guide to the Theory of NP-Completeness. W. H Freeman, San Fran , 1979 .

[10]  Eleazar Eskin,et al.  The Spectrum Kernel: A String Kernel for SVM Protein Classification , 2001, Pacific Symposium on Biocomputing.

[11]  Bernhard Schölkopf,et al.  Learning to Find Pre-Images , 2003, NIPS.

[12]  Hisashi Kashima,et al.  Marginalized Kernels Between Labeled Graphs , 2003, ICML.

[13]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[14]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[15]  Takao Asano An O(n log log n) Time Algorithm for Constructing a Graph of Maximum Connectivity with Prescribed Degrees , 1995, J. Comput. Syst. Sci..

[16]  Nello Cristianini,et al.  Kernel Methods for Pattern Analysis , 2004 .

[17]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[18]  Vijay Raghavan Bounded degree graph inference from walks , 1991, COLT '91.

[19]  J. Lauri,et al.  Topics in Graph Automorphisms and Reconstruction , 2003 .

[20]  Tatsuya Akutsu,et al.  Inferring a Chemical Structure from a Feature Vector Based on Frequency of Labeled Paths and Small Fragments , 2007, APBC.

[21]  Pavel A. Pevzner,et al.  Computational molecular biology : an algorithmic approach , 2000 .

[22]  Thomas Gärtner,et al.  On Graph Kernels: Hardness Results and Efficient Alternatives , 2003, COLT.

[23]  Hiroshi Nagamochi A Detachment Algorithm for Inferring a Graph from Path Frequency , 2006, COCOON.

[24]  Jason Weston,et al.  A General Regression Framework for Learning String-to-String Mappings , 2006 .

[25]  Alon Itai,et al.  On an Algorithm of Zemlyachenko for Subtree Isomorphism , 1999, Inf. Process. Lett..

[26]  Satoru Miyano,et al.  Inferring a Tree from Walks , 1996, Theor. Comput. Sci..

[27]  Jason Weston,et al.  A general regression technique for learning transductions , 2005, ICML '05.