Unsupervised Double Articulation of Natural Speech in a Video Game Environment -- Toward a Constructive Understanding of Language Acquisition