The HyperBagGraph DataEdron: An Enriched Browsing Experience of Multimedia Datasets

Traditional verbatim browsers give back information in a linear way according to a ranking performed by a search engine that may not be optimal for the surfer. The latter may need to assess the pertinence of the information retrieved, particularly when she wants to explore other facets of a multi-facetted information space. For instance, in a multimedia dataset different facets such as keywords, authors, publication category, organisations and figures can be of interest. The facet simultaneous visualisation can help to gain insights on the information retrieved and call for further searches. Facets are cooccurence networks, modeled by HyperBag-Graphs - families of multisets - and are in fact linked not only to the publication itself, but to any chosen reference. These references allow to navigate inside the dataset and perform visual queries. We explore here the case of scientific publications based on Arxiv searches.

[1]  Ronald Fagin,et al.  Degrees of acyclicity for hypergraphs and relational database schemes , 1983, JACM.

[2]  Jerome H. Friedman,et al.  On Bias, Variance, 0/1—Loss, and the Curse-of-Dimensionality , 2004, Data Mining and Knowledge Discovery.

[3]  Piotr Indyk,et al.  Approximate nearest neighbors: towards removing the curse of dimensionality , 1998, STOC '98.

[4]  Hans-Jörg Schek,et al.  A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces , 1998, VLDB.

[5]  Susan T. Dumais,et al.  PivotPaths: Strolling through Faceted Information Spaces , 2012, IEEE Transactions on Visualization and Computer Graphics.

[6]  Stéphane Marchand-Maillet,et al.  Hypergraph Modeling and Visualisation of Complex Co-occurence Networks , 2018, Electron. Notes Discret. Math..

[7]  David A. Bader,et al.  A performance evaluation of open source graph databases , 2014, PPAA '14.

[8]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[9]  Jean-Marie Le Goff,et al.  Interactive graph query language for multidimensional data in Collaboration Spotting visual analytics framework , 2017, ArXiv.

[10]  Stéphane Marchand-Maillet,et al.  Adjacency and Tensor Representation in General Hypergraphs.Part 2: Multisets, Hb-graphs and Related e-adjacency Tensors , 2018, ArXiv.

[11]  Charu C. Aggarwal,et al.  On the Surprising Behavior of Distance Metrics in High Dimensional Spaces , 2001, ICDT.

[12]  M. Newman,et al.  Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[13]  Jian Zhao,et al.  Interactive Exploration of Implicit and Explicit Relations in Faceted Datasets , 2013, IEEE Transactions on Visualization and Computer Graphics.