A Note on Covariances for Categorical Data
暂无分享,去创建一个
Generalization of the covariance concept is discussed for mixed categorical and numerical data. Gini's definition of variance for categorical data gives us a starting point to address this issue. The value difference in the original definition is changed to a vector in value space, giving a new definition of covariance for categorical and numerical data. It leads to reasonable correlation coefficients when applied to typical contingency tables.
[1] B. Margolin,et al. An Analysis of Variance for Categorical Data , 1971 .
[2] Takashi Okada. Sum of Squares Decomposition for Categorical Data , 1999 .
[3] Jan Komorowski,et al. Principles of Data Mining and Knowledge Discovery , 2001, Lecture Notes in Computer Science.
[4] Takashi Okada,et al. Rule Induction in Cascade Model Based on Sum of Squares Decomposition , 1999, PKDD.