Life patterns : structure from wearable sensors

In this thesis I develop and evaluate computational methods for extracting life’s patterns from wearable sensor data. Life patterns are the reoccurring events in daily behavior, such as those induced by the regular cycle of night and day, weekdays and weekends, work and play, eating and sleeping. My hypothesis is that since a “raw, low-level” wearable sensor stream is intimately connected to the individual’s life, it provides the means to directly match similar events, statistically model habitual behavior and highlight hidden structures in a corpus of recorded memories. I approach the problem of computationally modeling daily human experience as a task of statistical data mining similar to the earlier efforts of speech researchers searching for the building block that were believed to make up speech. First we find the atomic immutable events that mark the succession of our daily activities. These are like the “phonemes” of our lives, but don’t necessarily take on their finite and discrete nature. Since our activities and behaviors operate at multiple time-scales from seconds to weeks, we look at how these events combine into sequences, and then sequences of sequences, and so on. These are the words, sentences and grammars of an individual’s daily experience. I have collected 100 days of wearable sensor data from an individual’s life. I show through quantitative experiments that clustering, classification, and prediction is feasible on a data set of this nature. I give methods and results for determining the similarity between memories recorded at different moments in time, which allow me to associate almost every moment of an individual’s life to another similar moment. I present models that accurately and automatically classify the sensor data into location and activity. Finally, I show how to use the redundancies in an individual’s life to predict his actions from his past behavior. Thesis Advisor: Alex P. Pentland Title: Toshiba Professor of Media Arts and Sciences

[1]  John P. Oakley,et al.  Storage and Retrieval for Image and Video Databases , 1993 .

[2]  Pattie Maes,et al.  Just-in-time information retrieval , 2000 .

[3]  Alex Pentland,et al.  Auditory Context Awareness via Wearable Computing , 1998 .

[4]  Wen Gao,et al.  Discriminative learning of additive noise and channel distortions for robust speech recognition , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[5]  W. Eric L. Grimson,et al.  Using adaptive tracking to classify and monitor activities in a site , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[6]  Alex Pentland,et al.  Photobook: tools for content-based manipulation of image databases , 1994, Electronic Imaging.

[7]  Wayne H. Ward,et al.  Speech recognition , 1997 .

[8]  Alex Pentland,et al.  Photobook: tools for content-based manipulation of image databases , 1994, Other Conferences.

[9]  Gregory D. Abowd,et al.  The Conference Assistant: combining context-awareness with wearable computing , 1999, Digest of Papers. Third International Symposium on Wearable Computers.

[10]  E. Spelke,et al.  Human Spatial Representation: Insights from Animals , 2002 .

[11]  Terrence J. Sejnowski,et al.  The “independent components” of natural scenes are edge filters , 1997, Vision Research.

[12]  B. Feiten,et al.  Automatic indexing of a sound database using self-organizing neural nets , 1994 .

[13]  Vannevar Bush,et al.  As we may think , 1945, INTR.

[14]  Bernhard Ronacher,et al.  How do bees learn and recognize visual patterns? , 1998, Biological Cybernetics.

[15]  HongJiang Zhang,et al.  Automatic video scene extraction by shot grouping , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[16]  Steve Mann,et al.  Wearable Computing: A First Step Toward Personal Imaging , 1997, Computer.

[17]  F ATTNEAVE,et al.  Dimensions of similarity. , 1950, The American journal of psychology.

[18]  Douglas D. O'Shaughnessy,et al.  Speech communication : human and machine , 1987 .

[19]  Bill N. Schilit,et al.  The PARCTAB mobile computing system , 1993, Proceedings of IEEE 4th Workshop on Workstation Operating Systems. WWOS-III.

[20]  P. Dourish,et al.  Context-Aware Computing , 2001 .

[21]  Alex Pentland,et al.  Face recognition using view-based and modular eigenspaces , 1994, Optics & Photonics.

[22]  河原 温,et al.  On Kawara : date paintings in 89 cities , 1992 .

[23]  Jon Orwant,et al.  Doppelgänger goes to school : machine learning for user modeling , 1993 .

[24]  Philip E. Agre,et al.  The dynamic structure of everyday life , 1988 .

[25]  Ramesh Jain,et al.  Storage and Retrieval for Still Image and Video Databases IV , 1996 .

[26]  Alex Pentland,et al.  Unsupervised clustering of ambulatory audio and video , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[27]  Nicolas Saint-Arnaud Classification of sound textures , 1995 .

[28]  Harold Thimbleby,et al.  People and Computers XII , 1997, Springer London.

[29]  Jodi Lynn Forget-Me-Not , 2003 .

[30]  Jennifer Healey,et al.  StartleCam: a cybernetic wearable camera , 1998, Digest of Papers. Second International Symposium on Wearable Computers (Cat. No.98EX215).

[31]  Alex Pentland,et al.  Visual contextual awareness in wearable computing , 1998, Digest of Papers. Second International Symposium on Wearable Computers (Cat. No.98EX215).

[32]  Yunxin Zhao A speaker-independent continuous speech recognition system using continuous mixture Gaussian density HMM of phoneme-sized units , 1993, IEEE Trans. Speech Audio Process..

[33]  Gregory D. Abowd,et al.  Context-aware computing , 2002 .

[34]  R. Shepard Attention and the metric structure of the stimulus space. , 1964 .

[35]  Patrick Kenny,et al.  Speech recognition in non-stationary adverse environments , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[36]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[37]  P. Hayes,et al.  Cognitive Wheels : The Frame Problem of AI , 2022 .

[38]  L. Barsalou The content and organization of autobiographical memories , 1988 .

[39]  Marvin Minsky,et al.  A framework for representing knowledge , 1974 .

[40]  Ales Leonardis,et al.  Robust localization using panoramic view-based recognition , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[41]  Henry Lieberman,et al.  Instructible Agents: Software that Just Keeps Getting Better , 1996, IBM Syst. J..

[42]  Adrienne Lehrer,et al.  Frames, Fields, and Contrasts : New Essays in Semantic and Lexical Organization , 1992 .

[43]  Michael G. Lamming,et al.  Does a video diary help recall? in a , 1992 .

[44]  L. Barsalou Frames, concepts, and conceptual fields , 1992 .

[45]  Luc Van Gool,et al.  Recognizing color patterns irrespective of viewpoint and illumination , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[46]  R. Pfeifer,et al.  A mobile robot employing insect strategies for navigation , 2000, Robotics Auton. Syst..

[47]  Jonathan Foote,et al.  A Similarity Measure for Automatic Audio Classification , 1997 .

[48]  Pieter D. Biemond,et al.  Wearable Sensor Badge & Sensor Jacket for Context Awareness , 1999 .

[49]  Matthew Turk,et al.  Perceptual user interfaces , 2000 .

[50]  Giridharan Iyengar,et al.  VideoBook: an experiment in characterization of video , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[51]  James Church,et al.  Wearable sensor badge and sensor jacket for context awareness , 1999, Digest of Papers. Third International Symposium on Wearable Computers.

[52]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[53]  Ulric Neisser,et al.  Remembering Reconsidered: Ecological and Traditional Approaches to the Study of Memory , 1990 .

[54]  G. Reeke The society of mind , 1991 .

[55]  Yasuyuki Sumi,et al.  C-MAP: Building a Context-Aware Mobile Assistant for Exhibition Tours , 1998, Community Computing and Support Systems.

[56]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[57]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[58]  John H. L. Hansen,et al.  Discrete-Time Processing of Speech Signals , 1993 .

[59]  Pietro Laface,et al.  Discriminative training of hidden Markov models using a classification measure criterion , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[60]  F. Arman,et al.  A Statistical Approach to Scene Change Detection , 1995 .

[61]  W. R. Garner The Processing of Information and Structure , 1974 .

[62]  Bill N. Schilit,et al.  Context-aware computing applications , 1994, Workshop on Mobile Computing Systems and Applications.

[63]  Shih-Fu Chang,et al.  Clustering methods for video browsing and annotation , 1996, Electronic Imaging.

[64]  T. Collett,et al.  Multiple stored views and landmark guidance in ants , 1998, Nature.

[65]  David J. Field,et al.  What Is the Goal of Sensory Coding? , 1994, Neural Computation.

[66]  Nilesh V. Patel,et al.  Statistical approach to scene change detection , 1995, Electronic Imaging.

[67]  Alex Pentland,et al.  Attentional Objects for Visual Context Understanding , 1999 .

[68]  M. Lamming,et al.  "Forget-me-not" Intimate Computing in Support of Human Memory , 1994 .