Sequential Pattern Mining using FCA and Pattern Structures for Analyzing Visitor Trajectories in a Museum

This paper presents our work on mining visitor trajectories in Hecht Museum (Haifa, Israel), within the framework of CrossCult Eu-ropean Project about cultural heritage. We present a theoretical and practical research work about the characterization of visitor trajectories and the mining of these trajectories as sequences. The mining process is based on two approaches in the framework of FCA, namely the mining of subsequences without any constraint and the mining of frequent contiguous subsequences. Both approaches are based on pattern structures. In parallel, a similarity measure allows us to build a hierarchical classification which is used for interpretation and characterization of the trajectories w.r.t. four well-known visiting styles.

[1]  Johannes Gehrke,et al.  Sequential PAttern mining using a bitmap representation , 2002, KDD.

[2]  Aleksey Buzmakov,et al.  A Hybrid Classification Approach based on FCA and Emerging Patterns - An application for the classification of biological inhibitors , 2012, CLA.

[3]  Bernhard Ganter,et al.  Formal Concept Analysis: Mathematical Foundations , 1998 .

[4]  Amedeo Napoli,et al.  A Proposition for Sequence Mining Using Pattern Structures , 2017, ICFCA.

[5]  Tsvi Kuflik,et al.  The Influence of a Location-Aware Mobile Guide on Museum Visitors' Behavior , 2013, Interact. Comput..

[6]  Sergei O. Kuznetsov,et al.  Concept Stability for Constructing Taxonomies of Web-site Users , 2009, ArXiv.

[7]  Qiming Chen,et al.  PrefixSpan,: mining sequential patterns efficiently by prefix-projected pattern growth , 2001, Proceedings 17th International Conference on Data Engineering.

[8]  Tsvi Kuflik,et al.  Analyzing Museum Visitors' Behavior Patterns , 2007, User Modeling.

[9]  Chedy Raïssi,et al.  On measuring similarity for sequences of itemsets , 2014, Data Mining and Knowledge Discovery.

[10]  Jinyan Li,et al.  Efficient mining of emerging patterns: discovering trends and differences , 1999, KDD '99.

[11]  Jiawei Han,et al.  BIDE: efficient mining of frequent closed sequences , 2004, Proceedings. 20th International Conference on Data Engineering.

[12]  Xifeng Yan,et al.  CloSpan: Mining Closed Sequential Patterns in Large Datasets , 2003, SDM.

[13]  Roque Marín,et al.  ClaSP: An Efficient Algorithm for Mining Frequent Closed Sequences , 2013, PAKDD.

[14]  Chedy Raïssi,et al.  On mining complex sequential data by means of FCA and pattern structures , 2015, Int. J. Gen. Syst..

[15]  Tsvi Kuflik,et al.  Analysis and Prediction of Museum Visitors' Behavioral Pattern Types , 2012, Ubiquitous Display Environments.

[16]  Maguelonne Teisseire,et al.  A contribution to the discovery of multidimensional patterns in healthcare trajectories , 2014, Journal of Intelligent Information Systems.

[17]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .