Do GANs learn the distribution? Some Theory and Empirics