Intelligent state machine for social ad hoc data management and reuse

Recent advances in information technology have turned out World Wide Web to be the main platform for interactions where participants—users and corresponding events—are triggered. Although the participants vary in accordance with scenarios, a considerable size of data will be generated. This phenomenon indeed causes the complexity in information retrieval, management, and resuse, and meanwhile, turns down the value of this data. In this research, we attempt to achieve efficient management of user-generated data and its derivative contexts (i.e., social ad hoc data) for human supports. The correlations among data, contexts, and their hybridization are specifically concentrated. An intelligent state machine is proposed to outline the relations of data and contexts, and applied to further identify their usage scenarios. The performance and feasibility can be revealed by the experiments that were conducted on the data collected from open social networks (e.g., Facebook, Twitter, etc.) in the past few years with size around 500 users and 8,000,000 shared contents from them.

[1]  Christos Faloutsos,et al.  Fast discovery of connection subgraphs , 2004, KDD.

[2]  Eamonn J. Keogh,et al.  On the Need for Time Series Data Mining Benchmarks: A Survey and Empirical Demonstration , 2002, Data Mining and Knowledge Discovery.

[3]  Fabrizio Silvestri,et al.  Boosting the performance of Web search engines: Caching and prefetching query results by exploiting historical usage data , 2006, TOIS.

[4]  Christoforos N. Hadjicostis,et al.  Designs of Bisimilar Petri Net Controllers With Fault Tolerance Capabilities , 2008, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[5]  Qun Jin,et al.  LONET: An interactive search network for intelligent lecture path generation , 2013, TIST.

[6]  Wil M. P. van der Aalst,et al.  Mining Social Networks: Uncovering Interaction Patterns in Business Processes , 2004, Business Process Management.

[7]  Shonali Krishnaswamy,et al.  Mining data streams: a review , 2005, SGMD.

[8]  Sankar K. Pal,et al.  Data mining in soft computing framework: a survey , 2002, IEEE Trans. Neural Networks.

[9]  G. Nolan,et al.  Computational solutions to large-scale data management and analysis , 2010, Nature Reviews Genetics.

[10]  Jari Saramäki,et al.  Small But Slow World: How Network Topology and Burstiness Slow Down Spreading , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  Joshua B. Tenenbaum,et al.  The Large-Scale Structure of Semantic Networks: Statistical Analyses and a Model of Semantic Growth , 2001, Cogn. Sci..

[12]  Dawid Weiss,et al.  A survey of Web clustering engines , 2009, CSUR.

[13]  J. M. Saul,et al.  Parallel controller synthesis using Petri nets , 1995 .

[14]  Kasper Hornbæk,et al.  Social design feedback: evaluations with users in online ad-hoc groups , 2013, Human-centric Computing and Information Sciences.

[15]  Bengt Jonsson,et al.  Generating models of infinite-state communication protocols using regular inference with abstraction , 2015, Formal Methods Syst. Des..

[16]  Peter Mika,et al.  Ontologies are us: A unified model of social networks and semantics , 2005, J. Web Semant..

[17]  Mike Wright,et al.  Petri net-based modelling of workflow systems: An overview , 2001, Eur. J. Oper. Res..

[18]  Yixin Chen,et al.  Multi-Dimensional Regression Analysis of Time-Series Data Streams , 2002, VLDB.

[19]  Kwang-Ting Cheng,et al.  Automatic generation of functional vectors using the extended finite state machine model , 1996, TODE.

[20]  Mike Thelwall,et al.  A web crawler design for data mining , 2001, J. Inf. Sci..

[21]  Lars Michael Kristensen,et al.  Coloured Petri Nets and CPN Tools for modelling and validation of concurrent systems , 2007, International Journal on Software Tools for Technology Transfer.

[22]  RadhaKanta Mahapatra,et al.  Business data mining - a machine learning perspective , 2001, Inf. Manag..

[23]  Thorsten Joachims,et al.  Accurately interpreting clickthrough data as implicit feedback , 2005, SIGIR '05.

[24]  Qun Jin,et al.  A human-centric integrated approach to web information search and sharing , 2011, Human-centric Computing and Information Sciences.

[25]  David J. Faulds,et al.  Social media: The new hybrid element of the promotion mix , 2009 .

[26]  Ana R. Cavalli,et al.  New approaches for passive testing using an Extended Finite State Machine specification , 2003, Inf. Softw. Technol..

[27]  Miroslaw Malek,et al.  Current solutions for Web service composition , 2004, IEEE Internet Computing.

[28]  Luis Gomes,et al.  From UML state machines to Petri nets: History attribute translation strategies , 2011, IECON 2011 - 37th Annual Conference of the IEEE Industrial Electronics Society.

[29]  Jia Zhang,et al.  WS-Net: a Petri-net based specification model for Web services , 2004, Proceedings. IEEE International Conference on Web Services, 2004..

[30]  Yian-Kui Liu,et al.  Expected value of fuzzy variable and fuzzy expected value models , 2002, IEEE Trans. Fuzzy Syst..

[31]  Sekhar Ranjan Bhadra Chaudhuri,et al.  SOFT COMPUTING APPROACH IN PREDICTION OF A TIME SERIES DATA , 2008 .

[32]  Wendy A. Kellogg,et al.  Social translucence: an approach to designing systems that support social processes , 2000, TCHI.

[33]  Andrew McCallum,et al.  Extracting social networks and contact information from email and the Web , 2004, CEAS.

[34]  Wu Meng,et al.  Application of Support Vector Machines in Financial Time Series Forecasting , 2007 .

[35]  Alok N. Choudhary,et al.  Real-time disease surveillance using Twitter data: demonstration on flu and cancer , 2013, KDD.

[36]  Jaideep Srivastava,et al.  Event detection from time series data , 1999, KDD '99.

[37]  Doo-Hwan Bae,et al.  Software modeling and analysis using a hierarchical object-oriented Petri net , 2000, Inf. Sci..

[38]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[39]  M. Harada,et al.  Finding authoritative people from the Web , 2004, Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries, 2004..

[40]  Nuno Constantino Castro,et al.  Time Series Data Mining , 2009, Encyclopedia of Database Systems.