Streaming generalized cross entropy

We propose a new method to combine adaptive processes with a class of entropy estimators for the case of streams of data. Starting from a first estimation obtained from a batch of initial data, model parameters are estimated at each step by combining the prior knowledge with the new observation (or a block of observations). This allows to extend the maximum entropy technique to a dynamical setting, also distinguishing between entropic contributions of the signal and the error. Furthermore, it provides a suitable approximation of standard GME problems when the exacted solutions are hard to evaluate. We test this method by performing numerical simulations at various sample sizes and batch dimensions. Moreover, we extend this analysis exploring intermediate cases between streaming GCE and standard GCE, i.e. considering blocks of observations of different sizes to update the estimates, and incorporating collinearity effects as well. The role of time in the balance between entropic contributions of signal and errors is further explored considering a variation of the Streaming GCE algorithm, namely, Weighted Streaming GCE. Finally, we discuss the results: in particular, we highlight the main characteristics of this method, the range of application, and future perspectives.

[1]  Enrico Ciavolino,et al.  Entropy-Based Estimators in the Presence of Multicollinearity and Outliers , 2013 .

[2]  Ray J. Solomonoff,et al.  A Formal Theory of Inductive Inference. Part II , 1964, Inf. Control..

[3]  Dan Simon,et al.  Optimal State Estimation: Kalman, H∞, and Nonlinear Approaches , 2006 .

[4]  Amos Golan Foundations of Info-Metrics: Modeling, Inference, and Imperfect Information , 2017 .

[5]  M. .. Moore Statistical Mechanics: A Set of Lectures , 1974 .

[6]  C. Jarzynski Nonequilibrium Equality for Free Energy Differences , 1996, cond-mat/9610209.

[7]  Amos Golan Information and Entropy Econometrics - A Review and Synthesis , 2008 .

[8]  Enrico Ciavolino,et al.  GME Estimation of Spatial Structural Equations Models , 2011, J. Classif..

[9]  Aleksandr Yakovlevich Khinchin,et al.  Mathematical foundations of information theory , 1959 .

[10]  H. He,et al.  Efficient Reinforcement Learning Using Recursive Least-Squares Methods , 2011, J. Artif. Intell. Res..

[11]  Renato Zanetti,et al.  Recursive Update Filtering for Nonlinear Estimation , 2012, IEEE Transactions on Automatic Control.

[12]  Enrico Ciavolino,et al.  Generalized cross entropy method for analysing the SERVQUAL model , 2015 .

[13]  Ray J. Solomonoff,et al.  A Formal Theory of Inductive Inference. Part I , 1964, Inf. Control..

[14]  L BergerAdam,et al.  A maximum entropy approach to natural language processing , 1996 .

[15]  B. Konopelchenko,et al.  Zeros and amoebas of partition functions , 2016, Reviews in Mathematical Physics.

[16]  F. Pukelsheim The Three Sigma Rule , 1994 .

[17]  F. Daum Nonlinear filters: beyond the Kalman filter , 2005, IEEE Aerospace and Electronic Systems Magazine.

[18]  R. Baierlein Probability Theory: The Logic of Science , 2004 .

[19]  L. Amusa,et al.  Examination of Entropy balancing technique for estimating some standard measures of treatment effects: A simulation study , 2019 .

[20]  Adam L. Berger,et al.  A Maximum Entropy Approach to Natural Language Processing , 1996, CL.

[21]  C. Ray Smith,et al.  Maximum Entropy and Bayesian Methods , 1992 .

[22]  Bernard Widrow,et al.  Neural nets for adaptive filtering and adaptive pattern recognition , 1988, Computer.

[23]  Enrico Ciavolino,et al.  The GME estimator for the regression model with a composite indicator as explanatory variable , 2015 .

[24]  Roderick C. Dewar,et al.  Maximum Entropy Production as an Inference Algorithm that Translates Physical Assumptions into Macroscopic Predictions: Don't Shoot the Messenger , 2009, Entropy.

[25]  Enrico Ciavolino,et al.  Modelling the quality of work in the Italian social co-operatives combining NPCA-RSM and SEM-GME approaches , 2015 .

[26]  Ximing Wu A Weighted Generalized Maximum Entropy Estimator with a Data-driven Weight , 2009, Entropy.

[27]  Amjad D. Al-Nasser,et al.  Comparing generalised maximum entropy and partial least squares methods for structural equation models , 2009 .

[28]  G. Crooks Entropy production fluctuation theorem and the nonequilibrium work relation for free energy differences. , 1999, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[29]  Nozer D. Singpurwalla,et al.  Foundations of Info-Metrics (Modeling, Inference and Imperfect Information) , 2019, Technometrics.

[30]  Thomas M. Cover,et al.  Elements of Information Theory: Cover/Elements of Information Theory, Second Edition , 2005 .

[31]  D. Bertsekas COMBINED PRIMAL-DUAL AND PENALTY METHODS FOR CONSTRAINED MINIMIZATION* , 1975 .

[32]  Comparison of regression models under multi-collinearity , 2018 .

[33]  Andreas Holzinger,et al.  Interactive Knowledge Discovery and Data Mining in Biomedical Informatics , 2014, Lecture Notes in Computer Science.

[34]  D. Haar,et al.  Statistical Physics , 1971, Nature.

[35]  Armando J. Pinho,et al.  On Entropy-Based Data Mining , 2014, Interactive Knowledge Discovery and Data Mining in Biomedical Informatics.

[36]  J. J. Dahlgaard,et al.  Simultaneous Equation Model based on the generalized maximum entropy for studying the effect of management factors on enterprise performance , 2009 .

[37]  Enrico Ciavolino,et al.  A Generalized Maximum Entropy (GME) estimation approach to fuzzy regression model , 2016, Appl. Soft Comput..

[38]  Thomas Lukasiewicz MAXIMUM ENTROPY , 2000 .

[39]  Douglas J. Miller,et al.  Maximum entropy econometrics: robust estimation with limited data , 1996 .

[40]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .