FaceCloud: Heterogeneous Cloud Visualization of Multiplex Networks for Multimedia Archive Exploration

Multimedia data is by nature heterogeneous, conveying semantic information through multiple cues. Text analysis of closed captions already brought us understanding of the spoken information. Today's advances in computer vision now enable us to look for relevant semantic information from the visual content of real-world archives. Combining these two levels of extracted information to make sense of an archive still remains a challenge. Multiplex net- works, which model multiple families of interactions in a graph, can capture and combine both sources of semantics. We can leverage on these objects to extract hierarchies and integrate them in an interactive heterogeneous "visual cloud". Inspired by word clouds, these clouds allow to grasp visual and textual semantic information captured from a multimedia collection all at once. The interaction then enables direct access to the relevant video. We demonstrate our system with the exploration of a Japanese news archive.

[1]  Hongan Wang,et al.  Visualization of large hierarchical data by circle packing , 2006, CHI.

[2]  William Ribarsky,et al.  Multimedia Analysis + Visual Analytics = Multimedia Analytics , 2010, IEEE Computer Graphics and Applications.

[3]  Duy-Dinh Le,et al.  Face Retrieval in Large-Scale News Video Datasets , 2013, IEICE Trans. Inf. Syst..

[4]  Martin Wattenberg,et al.  Participatory Visualization with Wordle , 2009, IEEE Transactions on Visualization and Computer Graphics.

[5]  Shin'ichi Satoh,et al.  Topic Threading for Structuring a Large-Scale News Video Archive , 2004, CIVR.

[6]  Florian Heimerl,et al.  Visual Movie Analytics , 2016, IEEE Transactions on Multimedia.

[7]  Mahadev Satyanarayanan,et al.  OpenFace: A general-purpose face recognition library with mobile applications , 2016 .

[8]  Guy Melançon,et al.  Entanglement in Multiplex Networks: Understanding Group Cohesion in Homophily Networks , 2014, Social Network Analysis.

[9]  Carlo Tomasi,et al.  Good features to track , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[11]  Tamara Munzner,et al.  Detangler: Visual Analytics for Multiplex Networks , 2015, Comput. Graph. Forum.

[12]  Duy-Dinh Le,et al.  Indexing Faces in Broadcast News Video Archives , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.