Observer ratings: validity and value as a tool for animal welfare research

Ratings by human observers have long been used by animal scientists and veterinarians to assess certain physical traits (e.g. body fat), and can also be applied to the assessment of behaviour and a variety of welfare-relevant variables (e.g. pain responsiveness, alopecia/barbering). Observer ratings offer many advantages, not just practical (e.g. lower cost) but also scientific: they can integrate multimodal information across time and situations, and they can be used for constructs that are otherwise very difficult to assess (e.g. nest quality). Because observer ratings involve subjective judgements, some researchers may question whether they can be trusted to reflect reality in an unbiased manner. In this paper, I present evidence from a range of zoo, laboratory and farm animal studies demonstrating that observer ratings can be both reliable and valid. They have been shown to predict important biological phenomena such as reproductive success in rhinoceroses and cheetahs. Bias is indeed a risk, particularly when the ratings could reflect on the observers' own care of the animals or on their institution; however, this risk can be minimized through sound experimental design, including blinding and careful phrasing of the questions the observers need to answer. I review the steps involved in validating an observer rating scheme, and also discuss both study design issues (e.g. selecting the terms to be rated and appropriate observers) and the statistical issues some schemes may raise (e.g. ordinal data are not normally distributed).
