No Surprises? The Reliability and Validity of Test Pit Sampling

In attempts to implement probabilistic survey designs in areas of reduced surface visibility, archaeologists have turned to shovel testing or Test Pit Sampling (TPS). Characteristically TPS involves excavation of small, systematically spaced test pits within larger survey units as a method of searching for archaeological materials that would otherwise go undiscovered. While TPS has been the subject of considerable study most studies have been theoretical in nature. As a result, while the characteristics of TPS are understood generally, it is not known how well the method functions in known archaeological contexts. This article describes the results of research directed at estimating the reliability and validity of the test pit method when carried out on known archaeological sites under varying conditions of artifact density and spatial clustering. Split-half correlations and logistic regressions show that TPS is reliable in the sense that it produces replicable results, but is biased against discovery of small, low-density sites, especially when these sites exhibit high degrees of spatial clustering of artifacts. A model relating TPS to regional survey in general is presented and a means of estimating potential biases of the method is illustrated.

[1]  Harold L. Dibble,et al.  A Comparative Study of Basic Edge Angle Measurement Techniques , 1980, American Antiquity.

[2]  R. Gould,et al.  Foundations of northeast archaeology , 1984 .

[3]  S. Siegel,et al.  Nonparametric Statistics for the Behavioral Sciences , 2022, The SAGE Encyclopedia of Research Design.

[4]  P. Fish Consistency in Archaeological Measurement and Classification: A Pilot Study , 1978, American Antiquity.

[5]  A. E. Maxwell,et al.  Fundamental statistics in psychology and education , 1943 .

[6]  J. L. Chartkoff Transect Interval Sampling in Forests , 1978, American Antiquity.

[7]  J. Guilford Fundamental statistics in psychology and education , 1943 .

[8]  Francis P. McManamon Prehistoric Land Use on Outer Cape Cod , 1982 .

[9]  Arthur S. Keene,et al.  Archaeological hammers and theories , 1983 .

[10]  R. Prentice,et al.  A generalization of the probit and logit methods for dose response curves. , 1976, Biometrics.

[11]  George L. Cowgill,et al.  Some Sampling and Reliability Problems in Archaeology , 1970 .

[12]  A. Ammerman Surveys and Archaeological Research , 1981 .

[13]  W. Dixon,et al.  BMDP statistical software , 1983 .

[14]  G. A. Ferguson,et al.  Statistical analysis in psychology and education , 1960 .

[15]  Edward G. Carmines,et al.  Reliability and Validity Assessment , 1979 .

[16]  Diana Alexander The Limitations of Traditional Surveying Techniques in a Forested Environment , 1983 .

[17]  Dena F. Dincauze,et al.  Retrospective Assessment of Archaeological Survey Contracts in Massachusetts, 1970-1979 , 1980 .

[18]  David L. Clarke,et al.  Models in archaeology , 1972 .

[19]  W. Lovis Quarter Sections and Forests: An Example of Probability Sampling in the Northeastern Woodlands , 1976, American Antiquity.

[20]  B. Nicholson A Comparative Evaluation of Four Sampling Techniques and of the Reliability of Microdebitage As A Cultural Indicator in Regional Surveys , 1983 .

[21]  E. C. Pielou An introduction to mathematical ecology , 1970 .

[22]  Robert C. Dunnell,et al.  7 – The Siteless Survey: A Regional Scale Data Collection Strategy , 1983 .

[23]  Michael B. Schiffer,et al.  Conservation archaeology. A guide for cultural resource management studies , 1977 .

[24]  Fred T. Plog,et al.  10 – Decision Making in Modern Surveys , 1978 .

[25]  Jack D. Nance,et al.  Regional Sampling in Archaeological Survey: The Statistical Perspective , 1983 .

[26]  C. Spearman CORRELATION CALCULATED FROM FAULTY DATA , 1910 .

[27]  W. Brown SOME EXPERIMENTAL RESULTS IN THE CORRELATION OF MENTAL ABILITIES1 , 1910 .

[28]  B. Hole Sampling in Archaeology: A Critique , 1980 .

[29]  M. R. Novick,et al.  Statistical Theories of Mental Test Scores. , 1971 .

[30]  M. J. Allen Introduction to Measurement Theory , 1979 .

[31]  B. Mark Lynch,et al.  Site Artifact Density and the Effectiveness of Shovel Probes , 1980, Current Anthropology.

[32]  Michael J. Shott,et al.  Design and Evaluation of Shovel-Test Sampling in Regional Archaeological Survey , 1983 .

[33]  B. J. Winer Statistical Principles in Experimental Design , 1992 .

[34]  David L. Asch,et al.  Sampling in Archaeology , 1975 .

[35]  H. Wobst We Can't See the Forest for the Trees: Sampling and the Shapes of Archaeological Distributions , 1983 .

[36]  Francis P. Mcmanamon,et al.  Discovering Sites Unseen , 1984 .

[37]  J. Nance Non-Site Sampling in the Lower Cumberland River Valley, Kentucky , 1980 .

[38]  Michael B. Schiffer,et al.  Advances in Archaeological Method and Theory , 1978 .

[39]  J. Nance Regional Subsampling and Statistical Inference in Forested Habitats , 1979, American Antiquity.

[40]  Jeremy A. Sabloff,et al.  The Early Mesoamerican Village , 2019 .

[41]  Randal H Mcguire,et al.  A consideration of observational error in lithic use wear analysis , 1982 .

[42]  Michael B. Schiffer,et al.  The design of archaeological surveys , 1978 .

[43]  G. H. Slusser,et al.  Statistical analysis in psychology and education , 1960 .

[44]  K. Schuessler,et al.  Analyzing social data : a statistical orientation , 1971 .

[45]  D. Kleinbaum,et al.  Applied Regression Analysis and Other Multivariate Methods , 1978 .