Compositional data naturally arises from the scientific analysis of the chemical
composition of archaeological material such as ceramic and glass artefacts. Data of this
type can be explored using a variety of techniques, from standard multivariate methods
such as principal components analysis and cluster analysis, to methods based upon the
use of log-ratios. The general aim is to identify groups of chemically similar artefacts
that could potentially be used to answer questions of provenance.
This paper will demonstrate work in progress on the development of a documented
library of methods, implemented using the statistical package R, for the analysis of
compositional data. R is an open source package that makes available very powerful
statistical facilities at no cost. We aim to show how, with the aid of statistical software
such as R, traditional exploratory multivariate analysis can easily be used alongside, or
in combination with, specialist techniques of compositional data analysis.
The library has been developed from a core of basic R functionality, together with
purpose-written routines arising from our own research (for example that reported at
CoDaWork'03). In addition, we have included other appropriate publicly available
techniques and libraries that have been implemented in R by other authors. Available
functions range from standard multivariate techniques through to various approaches to
log-ratio analysis and zero replacement. We also discuss and demonstrate a small
selection of relatively new techniques that have hitherto been little-used in
archaeometric applications involving compositional data. The application of the library
to the analysis of data arising in archaeometry will be demonstrated; results from
different analyses will be compared; and the utility of the various methods discussed
[1]
Brian D. Ripley,et al.
Modern Applied Statistics with S Fourth edition
,
2002
.
[2]
Mike Baxter,et al.
Principal component and correspondence analysis of compositional data: some similarities
,
1990
.
[3]
Christina Gloeckner,et al.
Modern Applied Statistics With S
,
2003
.
[4]
Mike Baxter,et al.
A REVIEW OF SUPERVISED AND UNSUPERVISED PATTERN RECOGNITION IN ARCHAEOMETRY
,
2006
.
[5]
J. Aitchison,et al.
Biplots of Compositional Data
,
2002
.
[6]
J. B. I. Garrigós,et al.
Alteration and Contamination of Archaeological Ceramics: the Perturbation Problem
,
1999
.
[7]
William N. Venables,et al.
S Programming
,
2000
.
[8]
C. M. Jackson,et al.
Compositional data analysis of archaeological glass: problems and possible solutions
,
2003
.
[9]
M. Greenacre.
Ratio maps and correspondence analysis
,
2002
.
[10]
John Aitchison,et al.
The Statistical Analysis of Compositional Data
,
1986
.
[11]
V. Pawlowsky-Glahn,et al.
Dealing with Zeros and Missing Values in Compositional Data Sets Using Nonparametric Imputation
,
2003
.
[12]
Mike Baxter,et al.
Exploratory Multivariate Analysis in Archaeology
,
1994
.
[13]
M. Baxter,et al.
Compositional analysis of archaeological glasses
,
2005
.
[14]
C. M. Jackson,et al.
Compositional Data Analysis of Some Alkaline Glasses
,
2005
.