Learning without recall in directed circles and rooted trees

This work investigates the case of a network of agents that attempt to learn some unknown state of the world amongst the finitely many possibilities. At each time step, agents all receive random, independently distributed private signals whose distributions are dependent on the unknown state of the world. However, it may be the case that some or any of the agents cannot distinguish between two or more of the possible states based only on their private observations, as when several states result in the same distribution of the private signals. In our model, the agents form some initial belief (probability distribution) about the unknown state and then refine their beliefs in accordance with their private observations, as well as the beliefs of their neighbors. An agent learns the unknown state when her belief converges to a point mass that is concentrated at the true state. A rational agent would use the Bayes' rule to incorporate her neighbors' beliefs and own private signals over time. While such repeated applications of the Bayes' rule in networks can become computationally intractable; in this paper, we show that in the canonical cases of directed star, circle or path networks and their combinations, one can derive a class of memoryless update rules that replicate that of a single Bayesian agent but replace the self beliefs with the beliefs of the neighbors. This way, one can realize an exponentially fast rate of learning similar to the case of Bayesian (fully rational) agents. The proposed rules are a special case of the Learning without Recall approach that we develop in a companion paper, and it has the advantage that while preserving essential features of the Bayesian inference, they are made tractable. In particular, the agents can rely on the observational abilities of their neighbors and their neighbors' neighbors etc. to learn the unknown state; even though they themselves cannot distinguish the truth.

[1]  P. DeMarzo,et al.  Persuasion Bias, Social Influence, and Uni-Dimensional Opinions , 2001 .

[2]  Shahin Shahrampour,et al.  Distributed Detection: Finite-Time Analysis and Impact of Network Topology , 2014, IEEE Transactions on Automatic Control.

[3]  Shahin Shahrampour,et al.  Online Learning of Dynamic Parameters in Social Networks , 2013, NIPS.

[4]  Kamiar Rahnama Rad,et al.  Distributed parameter estimation in networks , 2010, 49th IEEE Conference on Decision and Control (CDC).

[5]  Ali Jadbabaie,et al.  Non-Bayesian Social Learning , 2011, Games Econ. Behav..

[6]  Elchanan Mossel,et al.  Asymptotic learning on Bayesian social networks , 2012, Probability Theory and Related Fields.

[7]  D. Blackwell,et al.  Merging of Opinions with Increasing Information , 1962 .

[8]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[9]  C. Chamley Rational Herds: Economic Models of Social Learning , 2003 .

[10]  J. Norris Appendix: probability and measure , 1997 .

[11]  Manuel Mueller-Frank,et al.  A general framework for rational learning in social networks: Framework for rational learning , 2013 .

[12]  V. Borkar,et al.  Asymptotic agreement in distributed estimation , 1982 .

[13]  Manuel Mueller-Frank,et al.  A general framework for rational learning in social networks , 2011 .

[14]  Douglas Gale,et al.  Bayesian learning in social networks , 2003, Games Econ. Behav..

[15]  T. Javidi,et al.  Social learning and distributed hypothesis testing , 2014, 2014 IEEE International Symposium on Information Theory.

[16]  Ehud Lehrer,et al.  Merging and learning , 1996 .

[17]  Shahin Shahrampour,et al.  Exponentially fast parameter estimation in networks using distributed dual averaging , 2013, 52nd IEEE Conference on Decision and Control.