Statistical Learning With Time Series Dependence: An Application to Scoring Sleep in Mice

We develop methodology that combines statistical learning methods with generalized Markov models, thereby enhancing the former to account for time series dependence. Our methodology can accommodate very general and very long-term time dependence structures in an easily estimable and computationally tractable fashion. We apply our methodology to the scoring of sleep behavior in mice. As methods currently used to score sleep in mice are expensive, invasive, and labor intensive, there is considerable interest in developing high-throughput automated systems which would allow many mice to be scored cheaply and quickly. Previous efforts at automation have been able to differentiate sleep from wakefulness, but they are unable to differentiate the rare and important state of rapid eye movement (REM) sleep from non-REM sleep. Key difficulties in detecting REM are that (i) REM is much rarer than non-REM and wakefulness, (ii) REM looks similar to non-REM in terms of the observed covariates, (iii) the data are noisy, and (iv) the data contain strong time dependence structures crucial for differentiating REM from non-REM. Our new approach (i) shows improved differentiation of REM from non-REM sleep and (ii) accurately estimates aggregate quantities of sleep in our application to video-based sleep scoring of mice. Supplementary materials for this article are available online.

[1]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[2]  Nengjun Yi,et al.  The Collaborative Cross, a community resource for the genetic analysis of complex traits , 2004, Nature Genetics.

[3]  J. Friedman Special Invited Paper-Additive logistic regression: A statistical view of boosting , 2000 .

[4]  Yan Zhu,et al.  Aging Impairs the Unfolded Protein Response to Sleep Deprivation and Leads to Proapoptotic Signaling , 2008, The Journal of Neuroscience.

[5]  M. Steriade Brain Electrical Activity and Sensory Processing during Waking and Sleep States , 2005 .

[6]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[7]  Paul Franken,et al.  Behavioral / Systems / Cognitive Differential Effects of GABAB Receptor Subtypes ,-Hydroxybutyric Acid , and Baclofen on EEG Activity and Sleep Regulation , 2010 .

[8]  Seiji Nishino,et al.  The Transcriptional Repressor DEC2 Regulates Sleep Length in Mammals , 2009, Science.

[9]  Jiangang Gao,et al.  A review of current large‐scale mouse knockout efforts , 2010, Genesis.

[10]  Jin H. Kim,et al.  Nonstationary hidden Markov model , 1995, Signal Process..

[11]  Edilio Borroni,et al.  Orexin receptor 2 expression in the posterior hypothalamus rescues sleepiness in narcoleptic mice , 2011, Proceedings of the National Academy of Sciences.

[12]  S. Haggarty,et al.  Zebrafish Behavioral Profiling Links Drugs to Biological Targets and Rest/Wake Regulation , 2010, Science.

[13]  T. G. Overmire,et al.  HOMEOSTATIC REGULATION. , 1963, BSCS pamphlets.

[14]  Seiji Nishino,et al.  Specificity of direct transition from wake to REM sleep in orexin/ataxin-3 transgenic narcoleptic mice , 2009, Experimental Neurology.

[15]  Dorret I. Boomsma,et al.  GENETIC ANALYSIS OF MORNINGNESS AND EVENINGNESS , 2001, Chronobiology international.

[16]  Petar M. Djuric,et al.  An MCMC sampling approach to estimation of nonstationary hidden Markov models , 2002, IEEE Trans. Signal Process..

[17]  Hrishikesh Deshpande,et al.  Pattern Recognition of Sleep in Rodents Using Piezoelectric Signals Generated by Gross Body Movements , 2007, IEEE Transactions on Biomedical Engineering.

[18]  Padhraic Smyth,et al.  Markov monitoring with unknown states , 1994, IEEE J. Sel. Areas Commun..

[19]  D Chollet,et al.  The Homeostatic Regulation of Sleep Need Is under Genetic Control , 2001, The Journal of Neuroscience.

[20]  Ulf Böckenholt,et al.  Comments on: Latent Markov models: a review of the general framework for the analysis of longitudinal data with covariates , 2014 .

[21]  J. Harrow,et al.  A conditional knockout resource for the genome-wide study of mouse gene function , 2011, Nature.

[22]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[23]  Yves Dauvilliers,et al.  Age-related changes in sleep in inbred mice are genotype dependent , 2012, Neurobiology of Aging.

[24]  Greg Maislin,et al.  Role of Homer Proteins in the Maintenance of Sleep-Wake States , 2012, PloS one.

[25]  A C Heath,et al.  Evidence for genetic influences on sleep disturbance and sleep pattern in twins. , 1990, Sleep.

[26]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[27]  M. Tafti,et al.  Genetics of sleep and sleep disorders. , 2003, Frontiers in bioscience : a journal and virtual library.

[28]  David Mease,et al.  Evidence Contrary to the Statistical View of Boosting , 2008, J. Mach. Learn. Res..

[29]  Lee Friedman,et al.  Ventilatory behavior during sleep among A/J and C57BL/6J mouse strains. , 2004, Journal of applied physiology.

[30]  Shane T. Jensen,et al.  Characterization of the bout durations of sleep and wakefulness , 2010, Journal of Neuroscience Methods.

[31]  Leonard McMillan,et al.  High-Resolution Genetic Mapping Using the Mouse Diversity Outbred Population , 2012, Genetics.

[32]  Shane T. Jensen,et al.  Assessing REM sleep in mice using video data. , 2012, Sleep.

[33]  A. Malafosse,et al.  Genetic determinants of sleep regulation in inbred mice. , 1999, Sleep.

[34]  Stephen E. Levinson,et al.  Continuously variable duration hidden Markov models for automatic speech recognition , 1986 .

[35]  M Partinen,et al.  Genetic and environmental determination of human sleep. , 1983, Sleep.

[36]  C. Saper,et al.  Hypothalamic regulation of sleep and circadian rhythms , 2005, Nature.

[37]  Elissa J. Chesler,et al.  The Collaborative Cross at Oak Ridge National Laboratory: developing a powerful resource for systems genetics , 2008, Mammalian Genome.

[38]  A. Buja,et al.  Loss Functions for Binary Class Probability Estimation and Classification: Structure and Applications , 2005 .

[39]  R. Stickgold,et al.  Sleep, Learning, and Dreams: Off-line Memory Reprocessing , 2001, Science.

[40]  Dimitris N. Metaxas,et al.  Novel method for high-throughput phenotyping of sleep in mice. , 2007, Physiological genomics.

[41]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[42]  P. Bühlmann,et al.  Variable Length Markov Chains: Methodology, Computing, and Software , 2004 .

[43]  David Mease,et al.  Boosted Classification Trees and Class Probability/Quantile Estimation , 2007, J. Mach. Learn. Res..

[44]  Thomas G. Dietterich,et al.  Training conditional random fields via gradient tree boosting , 2004, ICML.

[45]  N. Limnios,et al.  Semi-Markov Models and Applications , 2011 .

[46]  Andrew McCallum,et al.  Efficiently Inducing Features of Conditional Random Fields , 2002, UAI.

[47]  J. Stoker,et al.  The Department of Health and Human Services. , 1999, Home healthcare nurse.

[48]  L. J. Savage Elicitation of Personal Probabilities and Expectations , 1971 .

[49]  S. V. Vaseghi State duration modelling in hidden Markov models , 1995, Signal Process..

[50]  Lisa E. Gralinski,et al.  The Genome Architecture of the Collaborative Cross Mouse Genetic Reference Population , 2012, Genetics.