State Variation Mining: On Information Divergence with Message Importance in Big Data

Information transfer which reveals the state variation of variables usually plays a vital role in big data analytics and processing. In fact, the measures for information transfer could reflect the system change by use of the variable distributions, similar to KL divergence and Renyi divergence. Furthermore, in terms of the information transfer in big data, small probability events usually dominate the importance of the total message to some degree. Therefore, it is significant to design an information transfer measure based on the message importance which emphasizes the small probability events. In this paper, we propose a message importance transfer measure (MITM) and investigate its characteristics and applications on three aspects. First, the message importance transfer capacity based on MITM is presented to offer an upper bound for the information transfer process with disturbance. Then, we extend the MITM to the continuous case and discuss the robustness by using it to measuring information distance. Finally, we utilize the MITM to guide the queue length selection in the caching operation of mobile edge computing.

[1]  Shao-Lun Huang,et al.  An information-theoretic approach to universal feature selection in high-dimensional inference , 2017, 2017 IEEE International Symposium on Information Theory (ISIT).

[2]  Tapani Ristaniemi,et al.  Multi-objective optimization for computation offloading in mobile-edge computing , 2017, 2017 IEEE Symposium on Computers and Communications (ISCC).

[3]  Ger Koole,et al.  An Explicit Solution for the Value Function of a Priority Queue , 2004, Queueing Syst. Theory Appl..

[4]  B. Krishna Kumar,et al.  On multiserver feedback retrial queues with balking and control retrial rate , 2006, Ann. Oper. Res..

[5]  Jingrui He,et al.  Rare Category Detection on Time-Evolving Graphs , 2015, 2015 IEEE International Conference on Data Mining.

[6]  Haim H. Permuter,et al.  Universal Estimation of Directed Information , 2010, IEEE Transactions on Information Theory.

[7]  Schreiber,et al.  Measuring information transfer , 2000, Physical review letters.

[8]  Shaofeng Zou,et al.  Linear-complexity exponentially-consistent tests for universal outlying sequence detection , 2017, 2017 IEEE International Symposium on Information Theory (ISIT).

[9]  Pingyi Fan,et al.  Non-Parametric Message Importance Measure: Storage Code Design and Transmission Planning for Big Data , 2017, IEEE Transactions on Communications.

[10]  Pingyi Fan,et al.  Message Importance Measure and Its Application to Minority Subset Detection in Big Data , 2016, 2016 IEEE Globecom Workshops (GC Wkshps).

[11]  G. Crooks On Measures of Entropy and Information , 2015 .

[12]  Pingyi Fan,et al.  Amplifying Inter-Message Distance: On Information Divergence Measures in Big Data , 2017, IEEE Access.

[13]  Kate Smith-Miles,et al.  A Comprehensive Survey of Data Mining-based Fraud Detection Research , 2010, ArXiv.

[14]  Alfred O. Hero,et al.  On Local Intrinsic Dimension Estimation and Its Applications , 2010, IEEE Transactions on Signal Processing.

[15]  Umesh Vaidya,et al.  Causality preserving information transfer measure for control dynamical system , 2016, 2016 IEEE 55th Conference on Decision and Control (CDC).

[16]  Jingrui He,et al.  Graph-Based Rare Category Detection , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[17]  Aleksandra Zięba,et al.  Counterterrorism Systems of Spain and Poland: Comparative Studies , 2015 .