HIGH-THROUGHPUT DATA ANALYSIS IN BEHAVIOR GENETICS* BY ANAT SAKOV,

In recent years, a growing need has arisen in different fields for the development of computational systems for automated analysis of large amounts of data (high-throughput). Dealing with nonstandard noise structure and outliers, that could have been detected and corrected in manual analysis, must now be built into the system with the aid of robust methods. We discuss such problems and present insights and solutions in the context of behavior genetics, where data consists of a time series of locations of a mouse in a circular arena. In order to estimate the location, velocity and acceleration of the mouse, and identify stops, we use a nonstandard mix of robust and resistant methods: LOWESS and repeated running median. In addition, we argue that protection against small deviations from experimental protocols can be handled automatically using statistical methods. In our case, it is of biological interest to measure a rodent’s distance from the arena’s wall, but this measure is corrupted if the arena is not a perfect circle, as required in the protocol. The problem is addressed by estimating robustly the actual boundary of the arena and its center using a nonparametric regression quantile of the behavioral data, with the aid of a fast algorithm developed for that purpose.

[1]  Hans-Georg Müller,et al.  Functional Data Analysis , 2016 .

[2]  Pietro Perona,et al.  High-throughput Ethomics in Large Groups of Drosophila , 2009, Nature Methods.

[3]  Molly E. Lutcavage,et al.  Filtering and interpreting location errors in satellite telemetry of marine animals , 2008 .

[4]  Y. Honda,et al.  Choice of Degree of Smoothing in Fitting Nonparametric Regression Models for Temperature-mortality Relation in Japan Based on a Priori Knowledge , 2008 .

[5]  P. Mitra,et al.  Analysis of the Trajectory of Drosophila melanogaster in a Circular Open Field Arena , 2007, PloS one.

[6]  Andrew D. Steele,et al.  The power of automated high-resolution behavior analysis revealed by its application to mouse models of Huntington's and prion diseases , 2007, Proceedings of the National Academy of Sciences.

[7]  R. Koenker,et al.  Regression Quantiles , 2007 .

[8]  I. Vaughan L. Clarkson,et al.  A statistical analysis of the Delogne-Kåsa method for fitting circles , 2006, Digit. Signal Process..

[9]  R. Koenker Quantile Regression: Fundamentals of Quantile Regression , 2005 .

[10]  Michael Vinther,et al.  Validation of a digital video tracking system for recording pig locomotor behaviour , 2005, Journal of Neuroscience Methods.

[11]  Jian-Gang Wang,et al.  Estimating the eye gaze from one eye , 2005, Comput. Vis. Image Underst..

[12]  Anat Sakov,et al.  Genotype-environment interactions in mouse behavior: a way out of the problem. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Jean-René Martin,et al.  Centrophobism/thigmotaxis, a new role for the mushroom bodies in Drosophila. , 2005, Journal of neurobiology.

[14]  Anat Sakov,et al.  New replicable anxiety-related measures of wall vs center behavior of mice in the open field. , 2004, Journal of applied physiology.

[15]  Anat Sakov,et al.  The dynamics of spatial behavior: how can robust smoothing techniques help? , 2004, Journal of Neuroscience Methods.

[16]  J. Crabbe,et al.  Genetic animal models of anxiety , 2003, Neurogenetics.

[17]  Eric Nestler,et al.  In need of high-throughput behavioral systems. , 2002, Drug discovery today.

[18]  Y. T. Chan,et al.  Estimation of Circle Parameters by Centroiding , 2002 .

[19]  V. Karimdki Effective circle fitting for particle trajectories , 2002 .

[20]  A. J Spink,et al.  The EthoVision video tracking system—A tool for behavioral phenotyping of transgenic mice , 2001, Physiology & Behavior.

[21]  L P Noldus,et al.  EthoVision: A versatile video tracking system for automation of behavioral experiments , 2001, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[22]  Ilan Golani,et al.  SEE: a tool for the visualization and analysis of rodent exploratory behavior , 2001, Neuroscience & Biobehavioral Reviews.

[23]  V. Bolivar,et al.  List of transgenic and knockout mice: behavioral profiles , 2000, Mammalian Genome.

[24]  Yoav Benjamini,et al.  Statistical discrimination of natural modes of motion in rat exploratory behavior , 2000, Journal of Neuroscience Methods.

[25]  J. Crabbe,et al.  Genetics of mouse behavior: interactions with laboratory environment. , 1999, Science.

[26]  M. C. Jones,et al.  Local Linear Quantile Regression , 1998 .

[27]  David Eilam,et al.  Stopping behavior: constraints on exploration in rats (Rattus norvegicus) , 1993, Behavioural Brain Research.

[28]  M. Braga,et al.  Exploratory Data Analysis , 2018, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..

[29]  P. J. Green,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[30]  Chul E. Kim,et al.  Digital Disks , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  W. Cleveland Robust Locally Weighted Regression and Smoothing Scatterplots , 1979 .

[32]  S. Shapiro Properties of transforms for the detection of curves in noisy pictures , 1978 .

[33]  J. Archer Tests for emotionality in rats and mice: a review. , 1973, Animal behaviour.

[34]  C. S. Hall,et al.  Emotional behavior in the rat. III. The relationship between emotionality and ambulatory activity. , 1936 .