Robust estimation of tree structured models

Consider the problem of learning undirected graphical models on trees from corrupted data. Recently [13] showed that it is possible to recover trees from noisy binary data up to a small equivalence class of possible trees. Their other paper on the Gaussian case follows a similar pattern. By framing this as a special phylogenetic recovery problem we largely generalize these two settings. Using the framework of linear latent tree models we discuss tree identifiability for binary data under a continuous corruption model. For the Ising and the Gaussian tree model we also provide a characterisation of when the Chow-Liu algorithm consistently learns the underlying tree from the noisy data.

[1]  Susan A. Murphy,et al.  Monographs on statistics and applied probability , 1990 .

[2]  P. Buneman The Recovery of Trees from Measures of Dissimilarity , 1971 .

[3]  David Edwards,et al.  Selecting high-dimensional mixed graphical models using minimal AIC or BIC forests , 2010, BMC Bioinformatics.

[4]  Michael I. Jordan Graphical Models , 2003 .

[5]  S. Jeffery Evolution of Protein Molecules , 1979 .

[6]  William Norrie Everitt,et al.  A note on positive definite matrices , 1958, Proceedings of the Glasgow Mathematical Association.

[7]  Giuseppe Scarpa,et al.  A tree-structured Markov random field model for Bayesian image segmentation , 2003, IEEE Trans. Image Process..

[8]  Elchanan Mossel,et al.  Robust Estimation of Latent Tree Graphical Models: Inferring Hidden States With Inexact Parameters , 2011, IEEE Transactions on Information Theory.

[9]  Ignas Budvytis,et al.  Semi-Supervised Video Segmentation Using Tree Structured Graphical Models , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  N. Saitou,et al.  The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.

[11]  C. N. Liu,et al.  Approximating discrete probability distributions with dependence trees , 1968, IEEE Trans. Inf. Theory.

[12]  Anand D. Sarwate,et al.  Predictive Learning on Hidden Tree-Structured Ising Models , 2018, J. Mach. Learn. Res..

[13]  Piotr Zwiernik Latent Tree Models , 2017, Handbook of Graphical Models.

[14]  Constantine Caramanis,et al.  Robust estimation of tree structured Gaussian Graphical Model , 2019, ICML.

[15]  Constantine Caramanis,et al.  Robust Estimation of Tree Structured Ising Models , 2020, ArXiv.

[16]  Piotr Zwiernik,et al.  Semialgebraic Statistics and Latent Tree Models , 2015 .

[17]  Guy Bresler,et al.  Learning a Tree-Structured Ising Model in Order to Make Predictions , 2016, The Annals of Statistics.

[18]  Le Song,et al.  Spectral Methods for Learning Multivariate Latent Tree Structure , 2011, NIPS.

[19]  Nevin Lianwen Zhang,et al.  Hierarchical latent class models for cluster analysis , 2002, J. Mach. Learn. Res..

[20]  Klaus Peter Schliep,et al.  phangorn: phylogenetic analysis in R , 2010, Bioinform..

[21]  Vincent Y. F. Tan,et al.  Learning Latent Tree Graphical Models , 2010, J. Mach. Learn. Res..

[22]  Joseph T. Chang,et al.  Full reconstruction of Markov models on evolutionary trees: identifiability and consistency. , 1996, Mathematical biosciences.