Community Detection with Secondary Latent Variables

Community detection refers to recovering a (latent) label on which the distribution of the observed graph depends. Recent work has also investigated the impact of additionally knowing the value of another variable at each vertex that is correlated with the vertex label (side information), while assuming side information is independent of the graph edges conditioned on the label. This work extends the scope of community detection in two ways. First, we consider a side information that does not form a Markov chain with the label and graph, and analyze the detection threshold of semidefinite programming subject to knowledge of this side information, which is a non-label latent variable on which the graph edges also depend. In the second part of the work, we consider aside from vertex labels a second latent variable that is unknown both in realization and in distribution. We then investigate the performance of the semidefinite programming community detection as a function of the (unknown) composition of the nuisance latent variable. In both cases, it is shown that semidefinite programming can achieve exact recovery down to the optimal (information theoretic) threshold.

[1]  Emmanuel Abbe,et al.  Community detection and stochastic block models: recent developments , 2017, Found. Trends Commun. Inf. Theory.

[2]  Aria Nosratinia,et al.  Exact Recovery in Community Detection With Continuous-Valued Side Information , 2019, IEEE Signal Processing Letters.

[3]  Aria Nosratinia,et al.  Community Detection with Side Information via Semidefinite Programming , 2019, 2019 IEEE International Symposium on Information Theory (ISIT).

[4]  Bruce E. Hajek,et al.  Achieving Exact Cluster Recovery Threshold via Semidefinite Programming: Extensions , 2015, IEEE Transactions on Information Theory.

[5]  Bruce E. Hajek,et al.  Achieving Exact Cluster Recovery Threshold via Semidefinite Programming , 2016, IEEE Trans. Inf. Theory.

[6]  Aria Nosratinia,et al.  Exact Recovery by Semidefinite Programming in the Binary Stochastic Block Model with Partially Revealed Side Information , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[7]  Bruce E. Hajek,et al.  Achieving exact cluster recovery threshold via semidefinite programming , 2015, 2015 IEEE International Symposium on Information Theory (ISIT).

[8]  Aria Nosratinia,et al.  Community Detection With Side Information: Exact Recovery Under the Stochastic Block Model , 2018, IEEE Journal of Selected Topics in Signal Processing.

[9]  Aria Nosratinia,et al.  Side Information in Recovering a Single Community: Information Theoretic Limits , 2018, 2018 IEEE International Symposium on Information Theory (ISIT).