META-GUI: Towards Multi-modal Conversational Agents on Mobile GUI