论文引用

George Awad, Heiko Schuldt, Luca Rossetto et al.,

2018,

MMM

With the widespread use of smartphones as recording devices and the massive growth in bandwidth, the number and volume of video collections has increased significantly in the last years. This poses no...

A Crossmodal Approach to Multimodal Fusion in Video Hyperlinking

Guillaume Gravier, Christian Raymond, Vedran Vukotic,

2018,

IEEE MultiMedia

With the recent resurgence of neural networks and the proliferation of massive amounts of unlabeled multimodal data, recommendation systems and multimodal retrieval systems based on continuous represe...

Technische Universität Chemnitz and Hochschule Mittweida at TRECVID Instance Search 2017

Stefan Kahl, Maximilian Eibl, Marc Ritter et al.,

2017,

TRECVID

With our submission to the 2017 TRECVID Instance Search task (Awad et al., 2017b), we focused on the usage of dedicated CNN models. We limited the available video training data to only three sources: ...

The NVIDIA AI City Challenge

David C. Anastasiu, Siwei Lyu, Jerry Zeyu Gao et al.,

2017 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computed, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI)

Web image analysis has witnessed an AI renaissance. The ILSVRC benchmark has been instrumental in providing a corpus and standardized evaluation. The NVIDIA AI City Challenge is envisioned to provide ...

Informedia @ TRECVID 2017

Alexander G. Hauptmann, Shizhe Chen, Qin Jin et al.,

2017,

TRECVID

We report on our system used in the TRECVID 2017 Multimedia Event Detection (MED) and Ad-hoc Video Search (AVS) tasks. On the MED task, the CMU team submitted runs in 010Ex settings for the Pre-specif...

Interactive Video Retrieval in the Age of Deep Learning

Luca Rossetto, Klaus Schöffmann, Werner Bailer et al.,

2019,

ICMR

We present a tutorial focusing on video retrieval tasks, where state-of-the-art deep learning approaches still benefit from interactive decisions of users. The tutorial covers general introduction to ...

How Experts Search Different than Novices – An Evaluation of the Divexplore Video Retrieval System at Video Browser Showdown 2018

Klaus Schöffmann, Bernd Münzer, Sabrina Kletz et al.,

2018 IEEE International Conference on Multimedia & Expo Workshops (ICMEW)

We present a modern interactive video retrieval tool, called diveXplore, that has been used for several iterations of the Video Browser Showdown (VBS) competition with great success – 2nd place for th...

Evaluation of automatic video captioning using direct assessment

George Awad, Alan F. Smeaton, Yvette Graham et al.,

2017,

PloS one

We present Direct Assessment, a method for manually assessing the quality of automatically-generated captions for video. Evaluating the accuracy of video captions is particularly difficult because for...

Cmu-ucr-bosch @ Trecvid 2017: Video to Text Retrieval

Florian Metze, Samarjit Das, Niluthpol C. Mithun et al.,

2017,

TRECVID

We participated in the matching and ranking subtask in TRECVid challenge 2017. The task here was to return a ranked list of the most likely text descriptions that correspond to each video. We adopted ...

NTT Communication Science Laboratories and National Institute of Informatics at TRECVID 2017 Instance Search

Kunio Kashino, Xiaomeng Wu, Hidehisa Nagano et al.,

2017,

TRECVID

We describe our approaches that were tested in the TRECVID 2017 Instance Search (INS) task. In this year’s INS, shots including a person at a location were to be retrieved after a location query and p...

Neighbourhood Structure Preserving Cross-Modal Embedding for Video Hyperlinking

Chong-Wah Ngo, Benoit Huet, Yanbin Hao et al.,

2020,

IEEE Transactions on Multimedia

Video hyperlinking is a task aiming to enhance the accessibility of large archives, by establishing links between fragments of videos. The links model the aboutness between fragments for efficient tra...

Video Description

Wei Liu, Syed Zulqarnain Gilani, Ajmal Mian et al.,

2018,

ACM Comput. Surv.

Video description is the automatic generation of natural language sentences that describe the contents of a given video. It has applications in human-robot interaction, helping the visually impaired a...

Effective video hyperlinking by means of enriched feature sets and monomodal query combinations

Elena Baralis, Daniele Apiletti, Benoit Huet et al.,

2019,

International Journal of Multimedia Information Retrieval

Video content has been increasing at an unprecedented rate in recent years, bringing the need for improved tools providing efficient access to specific contents of interest. Within the management of v...

Interactive Search or Sequential Browsing? A Detailed Analysis of the Video Browser Showdown 2018

Ralph Gasser, Klaus Schöffmann, Werner Bailer et al.,

2019,

ACM Trans. Multim. Comput. Commun. Appl.

This work summarizes the findings of the 7th iteration of the Video Browser Showdown (VBS) competition organized as a workshop at the 24th International Conference on Multimedia Modeling in Bangkok. T...

The diveXplore System at the Video Browser Showdown 2018 - Final Notes

Klaus Schöffmann, Bernd Münzer, Andreas Leibetseder et al.,

2018,

ArXiv

This short paper provides further details of the diveXplore system (formerly known as CoViSS), which has been used by team ITEC1 for the Video Browser Showdown (VBS) 2018. In particular, it gives a sh...

Sloth Search System at the Video Browser Showdown 2018 - Final Notes

Sanparith Marukatat, Sitapa Rujikietgumjorn, Nattachai Watcharapinchai,

2018,

ArXiv

This short paper provides further details of the Sloth Search System, which was developed by the NECTEC team for the Video Browser Showdown (VBS) 2018.

ITI-CERTH participation in TRECVID 2018

Yiannis Kompatsiaris, Konstantinos Ioannidis, Stefanos Vrochidis et al.,

2017,

TRECVID

This paper provides an overview of the runs submitted to TRECVID 2018 by ITI-CERTH. ITI-CERTH participated in the Ad-hoc Video Search (AVS), Instance Search (INS) and Activities in Extended Video (Act...

The Helmholtz Method: Using Perceptual Compression to Reduce Machine Learning Complexity

Bo Li, Gerald Friedland, Ruoxi Jia et al.,

2018,

ArXiv

This paper proposes a fundamental answer to a frequently asked question in multimedia computing and machine learning: Do artifacts from perceptual compression contribute to error in the machine learni...

IRISA at TrecVid 2017: Beyond Crossmodal and Multimodal Models for Video Hyperlinking

Guillaume Gravier, Rémi Bois, Gabriel Sargent et al.,

2017,

TRECVID

This paper presents the runs that were submitted to the TRECVid Challenge 2017 for the Video Hyperlinking task. The goal of the task is to propose a list of video segments, called targets, to compleme...

Kobe University, NICT and University of Siegen at TRECVID 2017 AVS Task

Kimiaki Shirahama, Marcin Grzegorzek, Kuniaki Uehara et al.,

2017,

TRECVID

This paper presents the result of the TRECVID 2017 AVS task by kobe nict siegen team. Consisting of three research institutes Kobe University, NICT and University of Siegen. We submitted the following...