We propose a novel approach to drawing graphs that simultaneously optimizes two criteria (i) preserving proximity relations as measured by some embedding objective, and (ii) minimizing edge-crossings, to create a clear representation of the underlying graph structure. Frequently, the nodes of the graph represent objects that have their own intrinsic properties with associated distances or similarity measures. In particular, we investigate graphing of spoligoforests to visualize the genetic relatedness between strains of the Mycobacterium tuberculosis complex using multiple genetic markers. It is often desirable, that drawings of such graphs map nodes from high-dimensional feature space to low-dimensional vectors that preserve these pairwise distances. This desired quality is frequently expressed as a function of the embedding and then optimized, eg. Multidimensional Scaling (MDS), the goal is to minimize the difference between the actual distances and Euclidean distances between all nodes in the embedding.
[1]
Vladimir N. Vapnik,et al.
The Nature of Statistical Learning Theory
,
2000,
Statistics for Engineering and Information Science.
[2]
David S. Johnson,et al.
Crossing Number is NP-Complete
,
1983
.
[3]
Thomas Puppe.
Spectral Graph Drawing: A Survey
,
2008
.
[4]
Yehuda Koren,et al.
Graph Drawing by Stress Majorization
,
2004,
GD.
[5]
Yehuda Koren,et al.
On Spectral Graph Drawing
,
2003,
COCOON.
[6]
Olvi L. Mangasarian,et al.
Nonlinear Programming
,
1969
.
[7]
Mikhail Belkin,et al.
Laplacian Eigenmaps for Dimensionality Reduction and Data Representation
,
2003,
Neural Computation.
[8]
Stephen P. Boyd,et al.
Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers
,
2011,
Found. Trends Mach. Learn..