Orchestrating and sharing large multimodal data for transparent and reproducible research