ClassMiner: Mining Medical Video Content Structure and Events Towards Efficient Access and Scalable Skimming

To achieve more efficient video indexing and access, we introduce a video content structure and event mining framework. A video shot segmentation and key-frame selection strategy are first utilized to parse the continuous video stream into physical units. Video shot grouping, group merging, and scene clustering schemes are then proposed to organize the video shots into a hierarchical structure using clustered scenes, scenes, groups, and shots, in increasing granularity from top to bottom. Then, audio and video processing techniques are integrated to mine event information, such as dialog, presentation and clinical operation, among the detected scenes. Finally, the acquired video content structure and events are integrated to construct a scalable video skimming tool which can be used to visualize the video content hierarchy and event information for efficient access. Experimental results are also presented to evaluate the performance of the proposed algorithms.

[1]  Atreyi Kankanhalli,et al.  Automatic partitioning of full-motion video , 1993, Multimedia Systems.

[2]  Boon-Lock Yeo,et al.  Time-constrained clustering for segmentation of video into story units , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[3]  HongJiang Zhang,et al.  Automatic video scene extraction by shot grouping , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[4]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[5]  Jianping Fan,et al.  Automatic model-based semantic object extraction algorithm , 2001, IEEE Trans. Circuits Syst. Video Technol..

[6]  John R. Kender,et al.  Video scene segmentation via continuous video coherence , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[7]  Thomas S. Huang,et al.  Constructing table-of-content for videos , 1999, Multimedia Systems.

[8]  Chengcui Zhang,et al.  Multimedia Data Mining for Traffic Video Sequences , 2001, MDM/KDD.

[9]  Jianping Fan,et al.  Towards facial feature extraction and verification for omni-face detection in video/images , 2002, Proceedings. International Conference on Image Processing.

[10]  Jiawei Han,et al.  Mining MultiMedia Data , 1999 .

[11]  Shih-Fu Chang,et al.  Clustering methods for video browsing and annotation , 1996, Electronic Imaging.

[12]  Christos Faloutsos,et al.  VideoGraph: a new tool for video mining and classification , 2001, JCDL '01.

[13]  Bhavani Thuraisingham,et al.  Managing and Mining Multimedia Databases , 2001, Int. J. Artif. Intell. Tools.

[14]  Kuang Ping-jian,et al.  An Overview of Data Mining and Knowledge Discovery in Database , 2002 .

[15]  Jianping Fan,et al.  Mining of Video Database , 2003 .

[16]  Philip S. Yu,et al.  Data Mining: An Overview from a Database Perspective , 1996, IEEE Trans. Knowl. Data Eng..

[17]  Tomasz Imielinski,et al.  Database Mining: A Performance Perspective , 1993, IEEE Trans. Knowl. Data Eng..

[18]  Jianping Fan,et al.  MultiView: Multilevel video content representation and retrieval , 2001, J. Electronic Imaging.

[19]  Ramasamy Uthurusamy,et al.  Data mining and knowledge discovery in databases , 1996, CACM.

[20]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[21]  Jianping Fan,et al.  A hierarchical access control model for video database systems , 2003, TOIS.

[22]  Qian Huang,et al.  Classification of audio events in broadcast news , 1998, 1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175).

[23]  Christian Wellekens,et al.  DISTBIC: A speaker-based segmentation for audio data indexing , 2000, Speech Commun..

[24]  Jianping Fan,et al.  Hierarchical video summarization for medical data , 2001, IS&T/SPIE Electronic Imaging.