An open source implementation of the Modern Analog Technique (MAT) within the R computing environment

The purpose of this paper is to introduce an analytical solution for Quaternary geoscientists applying the modern analog technique (MAT) to fossil biological assemblages. We present a package called MATTOOLS that implements the MAT and offers new calibration techniques related to Monte Carlo simulation and response operating curves (ROC) that are used in assessing the critical thresholds of biological assemblage dissimilarity. The MATTOOLS solution to the MAT has the advantage of operating in the R language environment, a free open-source high-level language with hundreds of functions for statistical analysis and visualization. MATTOOLS therefore offers an easily extensible solution for individual research endeavors. We review current solutions for MAT calculations and provide an example of modern calibration using MATTOOLS.

[1]  E. C. Grunsky,et al.  R: a data analysis and statistical programming environment-an emerging tool for the geosciences , 2002 .

[2]  K. Gajewski,et al.  Modern pollen data from North America and Greenland for multi-scale paleoenvironmental applications , 2005 .

[3]  T. Ager Late Quaternary vegetation and climate history of the central Bering land bridge from St. Michael Island, western Alaska , 2003, Quaternary Research.

[4]  Rachid Cheddadi,et al.  The climate in Western Europe during the last Glacial/Interglacial cycle derived from pollen and insect remains , 1993 .

[5]  K. Gajewski,et al.  Modern Analogues of Late-Quaternary Pollen Spectra from the Western Interior of North America , 1989 .

[6]  John W. Williams Variations in tree cover in North America since the last glacial maximum , 2003 .

[7]  Yoshiro Saito,et al.  Heinrich event imprints in the Okinawa Trough: evidence from oxygen isotope and planktonic foraminifera , 2001 .

[8]  D. Raup,et al.  Handbook of paleontological techniques , 1965 .

[9]  T. Cronin,et al.  Quantitative analysis of Ostracoda and water masses around Japan; application to Pliocene and Pleistocene paleoceanography , 1993 .

[10]  H. Bauch,et al.  Surface ocean temperatures in the north‐east Atlantic during the last 500 000 years: evidence from foraminiferal census data , 2003 .

[11]  J. Duplessy,et al.  Improving past sea surface temperature estimates based on planktonic fossil faunas , 1998 .

[12]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[13]  Patrick J. Bartlein,et al.  Paleoclimatic interpretation of the Elk Lake pollen record , 1993 .

[14]  J. Grimalt,et al.  Western Mediterranean planktonic foraminifera events and millennial climatic variability during the last 70 kyr , 2003 .

[15]  D. Gavin,et al.  Pollen‐vegetation calibration for tundra communities in the Arctic Foothills, northern Alaska , 2003 .

[16]  C. Metz,et al.  Maximum likelihood estimation of receiver operating characteristic (ROC) curves from continuously-distributed data. , 1998, Statistics in medicine.

[17]  J. Chambers,et al.  The New S Language , 1989 .

[18]  M. Maslin,et al.  Glacial North Atlantic: Sea-surface conditions reconstructed by GLAMAP 2000 , 2003 .

[19]  M. Davis Palynology after Y2K—Understanding the Source Area of Pollen in Sediments , 2000 .

[20]  Eugene R. Wahl,et al.  A general framework for determining cutoff values to select pollen analogs with dissimilarity metrics in the modern analog technique , 2004 .

[21]  Michael C. Mozer,et al.  Optimizing Classifier Performance via an Approximation to the Wilcoxon-Mann-Whitney Statistic , 2003, ICML.

[22]  W. Peltier,et al.  Comparison of North-American pollen-based temperature and global lake-status with CCCma AGCM2 output at 6 ka , 2004 .

[23]  C A Roe,et al.  Statistical Comparison of Two ROC-curve Estimates Obtained from Partially-paired Datasets , 1998, Medical decision making : an international journal of the Society for Medical Decision Making.

[24]  D. Gavin,et al.  A statistical approach to evaluating distance metrics and analog assignments for pollen records , 2003, Quaternary Research.

[25]  U. Pflaumann,et al.  SIMMAX : a modern analog technique to deduce Atlantic Sea Surface Temperatures from planktonic foraminifera in deep sea sediments , 1996 .

[26]  J. Overpeck,et al.  Quantitative Interpretation of Fossil Pollen Spectra: Dissimilarity Coefficients and the Method of Modern Analogs , 1985, Quaternary Research.

[27]  Iain Colin Prentice,et al.  Multidimensional scaling as a research tool in quaternary palynology: A review of theory and methods , 1980 .

[28]  W. Dean,et al.  Elk Lake, Minnesota : evidence for rapid climate change in the north-central United States , 1993 .

[29]  E. DeLong,et al.  Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. , 1988, Biometrics.

[30]  K. Gajewski,et al.  Comparison of marine and terrestrial Holocene climatic reconstructions from northeastern North America , 1999 .

[31]  F. Serrano,et al.  Sea surface temperature during the Quaternary at ODP Sites 976 and 975 (western Mediterranean) , 2000 .

[32]  Rainer Gersonde,et al.  New software package available for quantitative paleoenvironmental reconstructions , 1999 .

[33]  John W. Williams,et al.  DISSIMILARITY ANALYSES OF LATE-QUATERNARY VEGETATION AND CLIMATE IN EASTERN NORTH AMERICA , 2001 .

[34]  H. Birks,et al.  D.G. Frey and E.S. Deevey Review 1: Numerical tools in palaeolimnology – Progress, potentialities, and problems , 1998 .

[35]  Rainer Gersonde,et al.  PaleoTools - extended software package for quantitative paleoenvironmental reconstructions , 1999 .

[36]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[37]  I. Fung,et al.  The climate of North America and adjacent ocean waters ca. 6 ka , 2000 .

[38]  Stephen T. Jackson,et al.  MODERN ANALOGS IN QUATERNARY PALEOECOLOGY: Here Today, Gone Yesterday, Gone Tomorrow? , 2004 .

[39]  Peter N. Schweitzer ANALOG: a program for estimating paleoclimate parameters using the method of modern analogs , 1994 .

[40]  J. Copas,et al.  Overestimation of the receiver operating characteristic curve for logistic regression , 2002 .