Probabilistic Mining of Socio-Geographic Routines From Mobile Phone Data

There is relatively little work on the investigation of large-scale human data in terms of multimodality for human activity discovery. In this paper, we suggest that human interaction data, or human proximity, obtained by mobile phone Bluetooth sensor data, can be integrated with human location data, obtained by mobile cell tower connections, to mine meaningful details about human activities from large and noisy datasets. We propose a model, called bag of multimodal behavior, that integrates the modeling of variations of location over multiple time-scales, and the modeling of interaction types from proximity. Our representation is simple yet robust to characterize real-life human behavior sensed from mobile phones, which are devices capable of capturing large-scale data known to be noisy and incomplete. We use an unsupervised approach, based on probabilistic topic models, to discover latent human activities in terms of the joint interaction and location behaviors of 97 individuals over the course of approximately a 10-month period using data from MIT's Reality Mining project. Some of the human activities discovered with our multimodal data representation include “going out from 7 pm-midnight alone” and “working from 11 am-5 pm with 3-5 other people,” further finding that this activity dominantly occurs on specific days of the week. Our methodology also finds dominant work patterns occurring on other days of the week. We further demonstrate the feasibility of the topic modeling framework for human routine discovery by predicting missing multimodal phone data at specific times of the day.

[1]  Albert-László Barabási,et al.  Understanding the Spreading Patterns of Mobile Phone Viruses , 2009, Science.

[2]  Albert-László Barabási,et al.  Understanding individual human mobility patterns , 2008, Nature.

[3]  A. Pentland,et al.  Eigenbehaviors: identifying structure in routine , 2009, Behavioral Ecology and Sociobiology.

[4]  Nathan Eagle,et al.  Persistence and periodicity in a dynamic proximity network , 2012, ArXiv.

[5]  C. Elkan,et al.  Topic Models , 2008 .

[6]  Naranker Dulay,et al.  Activity Inference through Sequence Alignment , 2009, LoCA.

[7]  T. Jebara,et al.  CitySenseTM : multiscale space time clustering of GPS points and trajectories , 2009 .

[8]  Gregor Heinrich Parameter estimation for text analysis , 2009 .

[9]  Eric P. Xing,et al.  MedLDA: maximum margin supervised topic models for regression and classification , 2009, ICML '09.

[10]  Alex Pentland,et al.  VibeFones: Socially Aware Mobile Phones , 2006, 2006 10th IEEE International Symposium on Wearable Computers.

[11]  Eric P. Xing,et al.  MedLDA: maximum margin supervised topic models , 2012, J. Mach. Learn. Res..

[12]  Daniel Gatica-Perez,et al.  Learning and predicting multimodal daily life patterns from cell phones , 2009, ICMI-MLMI '09.

[13]  Thomas L. Griffiths,et al.  Learning author-topic models from text corpora , 2010, TOIS.

[14]  Matthai Philipose,et al.  Towards Activity Databases: Using Sensors and Statistical Models to Summarize People's Lives , 2006, IEEE Data Eng. Bull..

[15]  S. Garrod,et al.  Group Discussion as Interactive Dialogue or as Serial Monologue: The Influence of Group Size , 2000, Psychological science.

[16]  Dieter Fox,et al.  Location-Based Activity Recognition , 2005, KI.

[17]  Wei Pan,et al.  SoundSense: scalable sound sensing for people-centric applications on mobile phones , 2009, MobiSys '09.

[18]  Daniel Gatica-Perez,et al.  What did you do today?: discovering daily routines from large-scale mobile data , 2008, ACM Multimedia.

[19]  Alex Pentland,et al.  Sensible Organizations: Technology and Methodology for Automatically Measuring Organizational Behavior , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[20]  C. Rodriguez-Sickert,et al.  The dynamics of a mobile phone network , 2007, 0712.4031.

[21]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[22]  Bernt Schiele,et al.  Discovery of activity patterns using topic models , 2008 .

[23]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[24]  G. Madey,et al.  Uncovering individual and collective human dynamics from mobile phone records , 2007, 0710.2939.