Information fusion for mental disorders detection: multimodal BERT against fusioning multiple BERTs