Paper information - tt: Treelet Transform with Stata

tt: Treelet Transform with Stata

The treelet transform is a recent data reduction technique from the field of machine learning. Sharing many similarities with principal component analysis, the treelet transform can reduce a multidimensional dataset to the projections on a small number of directions or components that account for much of the variation in the original data. However, in contrast to principal component analysis, the treelet transform produces sparse components. This can greatly simplify interpretation. I describe the tt Stata add-on for performing the treelet transform. The add-on includes a Mata implementation of the treelet transform algorithm alongside other functionality to aid in the practical application of the treelet transform. I demonstrate an example of a basic exploratory data analysis using the tt add-on.

Anders Gorst-Rasmussen

[1] Isabelle Guyon, et al. A Stability Based Method for Discovering Structure in Clustered Data, 2001, Pacific Symposium on Biocomputing.

[2] Daniel Gervini, et al. Criteria for Evaluating Dimension-Reducing Components for Multivariate Data, 2004.

[3] Ann B. Lee, et al. Treelets--An adaptive multi-scale basis for sparse unordered data, 2007, 0707.0481.

[4] Claus Dethlefsen, et al. Exploring dietary patterns by using the treelet transform, 2011, American Journal of Epidemiology.

[5] R. Tibshirani, et al. Sparse Principal Component Analysis, 2006.