Natural language querying for video databases

The video databases have become popular in various areas due to the recent advances in technology. Video archive systems need user-friendly interfaces to retrieve video frames. In this paper, a user interface based on natural language processing (NLP) to a video database system is described. The video database is based on a content-based spatio-temporal video data model. The data model is focused on the semantic content which includes objects, activities, and spatial properties of objects. Spatio-temporal relationships between video objects and also trajectories of moving objects can be queried with this data model. In this video database system, a natural language interface enables flexible querying. The queries, which are given as English sentences, are parsed using link parser. The semantic representations of the queries are extracted from their syntactic structures using information extraction techniques. The extracted semantic representations are used to call the related parts of the underlying video database system to return the results of the queries. Not only exact matches but similar objects and activities are also returned from the database with the help of the conceptual ontology module. This module is implemented using a distance-based method of semantic similarity search on the semantic domain-independent ontology, WordNet.

[1]  Shih-Fu Chang,et al.  VideoQ: an automated content based video search system using visual cues , 1997, MULTIMEDIA '97.

[2]  Troels Andreasen,et al.  Perspectives on ontology‐based querying , 2007, Int. J. Intell. Syst..

[3]  KucuktuncOnur,et al.  A Natural Language-Based Interface for Querying a Video Database , 2007 .

[4]  Katsumi Tanaka,et al.  OVID: Design and Implementation of a Video-Object Database System , 1993, IEEE Trans. Knowl. Data Eng..

[5]  Rune Hjelsvold,et al.  Integrated video archive tools , 1995, MULTIMEDIA '95.

[6]  Mohand-Said Hacid,et al.  Modeling and querying video databases , 1998, Proceedings. 24th EUROMICRO Conference (Cat. No.98EX204).

[7]  Duane Szafron,et al.  Multimedia Extensions To Database Query Languages , 1997 .

[8]  Jay F. Nunamaker,et al.  A natural language approach to content-based video indexing and retrieval for interactive e-learning , 2004, IEEE Transactions on Multimedia.

[9]  Carl Vogel,et al.  The Topology of WordNet: Some Metrics , 2004 .

[10]  Özgür Ulusoy,et al.  BilVideo: Design and Implementation of a Video Database Management System , 2005, Multimedia Tools and Applications.

[11]  Max J. Egenhofer,et al.  Determining Semantic Similarity among Entity Classes from Different Ontologies , 2003, IEEE Trans. Knowl. Data Eng..

[12]  Troels Andreasen,et al.  Ontology-Based Querying , 2000, FQAS.

[13]  Alan F. Smeaton,et al.  User Interface Issues for Browsing Digital Video , 1999, BCS-IRSG Annual Colloquium on IR Research.

[14]  Grace Hui Yang,et al.  VideoQA: question answering on news video , 2003, MULTIMEDIA '03.

[15]  Daniel Dominic Sleator,et al.  Parsing English with a Link Grammar , 1995, IWPT.

[16]  W. Eric L. Grimson,et al.  Answering Questions about Moving Objects in Surveillance Videos , 2003, New Directions in Question Answering.

[17]  Adnan Yazici,et al.  Spatio-temporal querying in video databases , 2004, Inf. Sci..

[18]  Peter Thanisch,et al.  MASQUE/SQL: an efficient and portable natural language query interface for relational databases , 1993 .

[19]  Philip Resnik,et al.  Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language , 1999, J. Artif. Intell. Res..

[20]  Ramesh C. Jain,et al.  Knowledge-guided parsing in video databases , 1993, Electronic Imaging.

[21]  Yorick Wilks,et al.  Extracting relational facts for indexing and retrieval of crime-scene photographs , 2003, Knowl. Based Syst..

[22]  Arun K. Majumdar,et al.  Video model for dynamic objects , 2006, Inf. Sci..

[23]  Mohand-Said Hacid,et al.  A database approach for modeling and querying video data , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[24]  S. Raman,et al.  Natural language interface to video database , 2001, Nat. Lang. Eng..

[25]  Ted Pedersen,et al.  Maximizing Semantic Relatedness to Perform Word Sense Disambiguation , 2005 .

[26]  K. Selçuk Candan,et al.  The Advanced Video Information System: data structures and query processing , 1996, Multimedia Systems.

[27]  Funda Durupinar,et al.  Intelligent Indexing, Querying and Reconstruction of Crime Scene Photographs , 2004 .

[28]  Graeme Hirst,et al.  Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures , 2004 .

[29]  Mohand-Said Hacid,et al.  A Database Approach for Modeling and Querying Video Data , 2000, IEEE Trans. Knowl. Data Eng..

[30]  Nihan Kesim Cicekli,et al.  Natural Language Interface on a Video Data Model , 2005, Databases and Applications.

[31]  Mitchell P. Marcus,et al.  Text Chunking using Transformation-Based Learning , 1995, VLC@ACL.

[32]  Arbee L. P. Chen,et al.  Content-Based Query Processing for Video Databases , 2000, IEEE Trans. Multim..

[33]  Peter Thanisch,et al.  Natural language interfaces to databases – an introduction , 1995, Natural Language Engineering.

[34]  Martha Palmer,et al.  Verb Semantics and Lexical Selection , 1994, ACL.

[35]  B. C. Brookes,et al.  Information Sciences , 2020, Cognitive Skills You Need for the 21st Century.

[36]  Rune Hjelsvold,et al.  Modelling and Querying Video Data , 1994, VLDB.

[37]  Stephen W. Smoliar,et al.  Video parsing, retrieval and browsing: an integrated and content-based solution , 1997, MULTIMEDIA '95.

[38]  John M. Gauch,et al.  The vision digital video library , 1997, Inf. Process. Manag..

[39]  Özgür Ulusoy,et al.  A Natural Language-Based Interface for Querying a Video Database , 2007, IEEE MultiMedia.

[40]  Jimmy J. Lin,et al.  Omnibase: Uniform Access to Heterogeneous Data for Question Answering , 2002, NLDB.