Incremental Learning with Accuracy Prediction of Social and Individual Properties from Mobile-Phone Data

As truly ubiquitous wearable computers, mobile phones are quickly becoming the primary source for social, behavioral, and environmental sensing and data collection. Today's smart phones are equipped with increasingly more sensors and accessible data types that enable the collection of literally dozens of signals regarding the phone, its user, and their environment. A great deal of research effort in academia and industry is put into mining this data for higher level sense-making, such as understanding user context, inferring social networks, learning individual features, and so on. In many cases this analysis work is the result of exploratory forays and trial-and-error. Adding to the challenge, the devices themselves are limited platforms, hence data collection campaign must be carefully designed in order to collect the signals in the appropriate frequency, avoiding the exhausting the the device's limited battery and processing power. Currently however, there is no structured methodology for the design of mobile data collection and analysis initiatives. In this work we investigate the properties of learning and inference of real world data collected via mobile phones over time. In particular, we analyze how the ability to predict individual parameters and social links is incrementally enhanced with the accumulation of additional data. To do so we use the Friends and Family dataset, containing rich data signals gathered from the smart phones of 140 adult members of an MIT based young-family residential community for over a year, and is one of the most comprehensive mobile phone datasets gathered in academia to date. We develop several models for predicting social and individual properties from sensed mobile phone data over time, including detection of life-partners, ethnicity, and whether a person is a student or not. Finally, we propose a method for predicting the maximal learning accuracy possible for the learning task at hand, based on an initial set of measurements. This has various practical implications, such as better design of mobile data collection campaigns, or evaluating of planned analysis strategies.

[1]  Krishna P. Gummadi,et al.  You are who you know: inferring user profiles in online social networks , 2010, WSDM '10.

[2]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[3]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[4]  Meir Kalech,et al.  Who is going to win the next Association for the Advancement of Artificial Intelligence Fellowship Award? Evaluating researchers by mining bibliographic data , 2011, J. Assoc. Inf. Sci. Technol..

[5]  M. Kalmijn,et al.  Intermarriage and homogamy: causes, patterns, trends. , 1998, Annual review of sociology.

[6]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[7]  Jeff Burke,et al.  Campaignr: A Framework for Participatory Data Collection on Mobile Phones , 2007 .

[8]  Jon M. Kleinberg,et al.  The link-prediction problem for social networks , 2007, J. Assoc. Inf. Sci. Technol..

[9]  A. Pentland,et al.  Life in the network: The coming age of computational social science: Science , 2009 .

[10]  Boleslaw K. Szymanski,et al.  Community detection using a neighborhood strength driven Label Propagation Algorithm , 2011, 2011 IEEE Network Science Workshop.

[11]  N. Eagle,et al.  Network Diversity and Economic Development , 2010, Science.

[12]  Alex Pentland,et al.  Composite Social Network for Predicting Mobile Apps Installation , 2011, AAAI.

[13]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[14]  Albert-László Barabási,et al.  Understanding individual human mobility patterns , 2008, Nature.

[15]  Jennifer L. Glanville,et al.  BIRDS OF A FEATHER : Homophily in Social Networks , 2014 .

[16]  A. d’Onofrio A general framework for modeling tumor-immune system competition and immunotherapy: Mathematical analysis and biomedical inferences , 2005, 1309.3337.

[17]  Miroslaw Lachowicz,et al.  A general framework for modeling tumor-immune system competition at the mesoscopic level , 2012, Appl. Math. Lett..

[18]  P. Currie,et al.  Tyrannosaur Life Tables: An Example of Nonavian Dinosaur Population Biology , 2006, Science.

[19]  P. Rouvinen,et al.  Diffusion of Digital Mobile Telephony : Are Developing Countries Different? , 2006 .

[20]  Alex Pentland,et al.  Social fMRI: Investigating and shaping social mechanisms in the real world , 2011, Pervasive Mob. Comput..

[21]  M. McPherson,et al.  Birds of a Feather: Homophily in Social Networks , 2001 .

[22]  Peter Schoo,et al.  Infiltrating Critical Infrastructures with Next-Generation Attacks W32.Stuxnet as a Showcase Threat , 2010 .

[23]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[24]  Alex Pentland,et al.  Social sensing for epidemiological behavior change , 2010, UbiComp.

[25]  Christoph Meinel,et al.  Measuring Expertise in Online Communities , 2011, IEEE Intelligent Systems.

[26]  Zhigang Liu,et al.  The Jigsaw continuous sensing engine for mobile phone applications , 2010, SenSys '10.

[27]  Katarzyna Wac,et al.  Getting closer: an empirical investigation of the proximity of user to their smart phones , 2011, UbiComp '11.

[28]  Balachander Krishnamurthy,et al.  On the leakage of personally identifiable information via online social networks , 2010, Comput. Commun. Rev..

[29]  D. Lazer,et al.  Inferring Social Network Structure using Mobile Phone Data , 2006 .

[30]  Mark S. Ackerman,et al.  Personal and Ubiquitous Computing , 2004, Personal and Ubiquitous Computing.

[31]  David Lazer,et al.  Inferring friendship network structure by using mobile phone data , 2009, Proceedings of the National Academy of Sciences.

[32]  Aric Hagberg,et al.  Exploring Network Structure, Dynamics, and Function using NetworkX , 2008, Proceedings of the Python in Science Conference.

[33]  Alex Pentland,et al.  Reality mining: sensing complex social systems , 2006, Personal and Ubiquitous Computing.

[34]  Alex Pentland,et al.  Pervasive Sensing to Model Political Opinions in Face-to-Face Networks , 2011, Pervasive.

[35]  Alex Pentland,et al.  Sensible Organizations: Technology and Methodology for Automatically Measuring Organizational Behavior , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[36]  Daniel Gatica-Perez,et al.  Discovering human places of interest from multimodal mobile phone data , 2010, MUM.

[37]  Leonidas J. Guibas,et al.  Mobiscopes for Human Spaces , 2007, IEEE Pervasive Computing.