Post-Hoc Interactive Analytics of Errors in the Context of a Person Discovery Task

Part of the research effort in automatic person discovery in multimedia content consists in analyzing the errors made by algorithms. However exploring the space of models relating algorithmic errors in person discovery to intrinsic properties of associated shots (e.g. person facing the camera) - coined as post-hoc analysis in this paper - requires data curation and statistical model tuning, which can be cumbersome. In this paper we present a visual and interactive tool that facilitates this exploration. A case study is conducted with multimedia researchers to validate the tool. Real data obtained from the MediaEval person discovery task was used for this experiment. Our approach yielded novel insight that was completely unsuspected previously.

[1]  Cynthia A. Brewer,et al.  ColorBrewer.org: An Online Tool for Selecting Colour Schemes for Maps , 2003 .

[2]  G. McLachlan On Bootstrapping the Likelihood Ratio Test Statistic for the Number of Components in a Normal Mixture , 1987 .

[3]  Thomas Tamisier,et al.  A Visual Analytics Approach to Finding Factors Improving Automatic Speaker Identifications , 2015, ICMI.

[4]  Enrico Bertini,et al.  INFUSE: Interactive Feature Selection for Predictive Modeling of High Dimensional Data , 2014, IEEE Transactions on Visualization and Computer Graphics.

[5]  Claude Barras,et al.  Multimodal person discovery in broadcast TV: lessons learned from MediaEval 2015 , 2017, Multimedia Tools and Applications.

[6]  T. J. Jankun-Kelly,et al.  Guided analysis of hurricane trends using statistical processes integrated with interactive parallel coordinates , 2009, 2009 IEEE Symposium on Visual Analytics Science and Technology.

[7]  Francisco J. Valverde-Albacete,et al.  100% Classification Accuracy Considered Harmful: The Normalized Information Transfer Factor Explains the Accuracy Paradox , 2014, PloS one.

[8]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[9]  P. Simpson,et al.  Statistical methods in cancer research , 2001, Journal of surgical oncology.

[10]  Hanspeter Pfister,et al.  LineUp: Visual Analysis of Multi-Attribute Rankings , 2013, IEEE Transactions on Visualization and Computer Graphics.

[11]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[12]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.