Hierarchical topic trajectory model for video annotation retrieval considering cross-modal co-occurrences