Composition 2.0: Toward a Multilingual and Multimodal Framework