Biplots: quantitative data

Biplots provide visualizations of two things, usually, but not necessarily, in two dimensions. This paper deals exclusively with biplots for quantitative data X; qualitative data or data in the form of counts will be addressed in a subsequent paper. Data X may represent either (1) a matrix with n rows representing samples/cases and columns representing p quantitative variables or (2) a two-way table whose rows and columns both represent classifying variables. Data sets of both types (1) and (2) are considered. Plotting symbols are usually points (typically for samples and distinguished by shape and/or color) and lines (typically for variables which may be calibrated or treated as arrowed vectors). Furthermore, variables may be nonlinear in both regularity of calibration and/or curvature. Interpretation is through distance, inner-products, and sometimes area. Biplots may be improved by judicious shifts of axes, by scaling and by rotation. Nearly always, biplots give approximations to X and measures, incorporated in the biplot, expressing the degree of approximation are discussed. These aspects are illustrated with reference to examples from principal component analysis, nonlinear biplots, biplots for biadditive models, canonical variate analysis and the analysis of distance between grouped samples. WIREs Comput Stat 2015, 7:42–62. doi: 10.1002/wics.1338 For further resources related to this article, please visit the WIREs website. Conflict of interest: The authors have declared no conflicts of interest for this article.

[1]  J. Gower Adding a point to vector diagrams in multivariate analysis , 1968 .

[2]  John C. Gower,et al.  Better biplots , 2009, Comput. Stat. Data Anal..

[3]  Camille Jordan Mémoire sur les covariants des formes binaires. , 1876 .

[4]  K. Gabriel,et al.  The biplot graphic display of matrices with application to principal component analysis , 1971 .

[5]  M. Greenacre Correspondence analysis of multivariate categorical data by weighted least-squares , 1988 .

[6]  L. G. Underhill The coefficient of variation biplot , 1990 .

[7]  I. Jolliffe Principal Component Analysis , 2002 .

[8]  C. Jordan,et al.  Mémoire sur les formes bilinéaires. , 1874 .

[9]  Approximating a symmetric matrix , 1990 .

[10]  S. Gardner-Lubbe,et al.  Measures of fit in principal component and canonical variate analyses , 2008 .

[11]  Karl Pearson F.R.S. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[12]  G. W. Stewart,et al.  On the Early History of the Singular Value Decomposition , 1993, SIAM Rev..

[13]  E. Schmidt Zur Theorie der linearen und nichtlinearen Integralgleichungen , 1907 .

[14]  Niel J. le Roux,et al.  Analysing Your Multivariate Data as a Pictorial: A Case for Applying Biplot Methodology? , 2006 .

[15]  K. R. Clarke,et al.  Non‐parametric multivariate analyses of changes in community structure , 1993 .

[16]  John C. Gower,et al.  Understanding Biplots: Gower/Understanding Biplots , 2011 .

[17]  H. Hotelling Analysis of a complex of statistical variables into principal components. , 1933 .

[18]  David W. Scott,et al.  Multivariate Density Estimation: Theory, Practice, and Visualization , 1992, Wiley Series in Probability and Statistics.

[19]  John C. Gower,et al.  Nonlinearity effects in multidimensional scaling , 2005 .

[20]  R. Clarke,et al.  Theory and Applications of Correspondence Analysis , 1985 .

[21]  C. Eckart,et al.  The approximation of one matrix by another of lower rank , 1936 .

[22]  C. Braak Canonical Correspondence Analysis: A New Eigenvector Technique for Multivariate Direct Gradient Analysis , 1986 .

[23]  J. Gower,et al.  Metric and Euclidean properties of dissimilarity coefficients , 1986 .

[24]  R. Fisher,et al.  STUDIES IN CROP VARIATION , 2009 .