Extracting multistage screening rules from online dating activity data

Significance Online activity data—for example, from dating, housing search, or social networking websites—make it possible to study human behavior with unparalleled richness and granularity. However, researchers typically rely on statistical models that emphasize associations among variables rather than behavior of human actors. Harnessing the full informatory power of activity data requires models that capture decision-making processes and other features of human behavior. Our model aims to describe mate choice as it unfolds online. It allows for exploratory behavior and multiple decision stages, with the possibility of distinct evaluation rules at each stage. This framework is flexible and extendable, and it can be applied in other substantive domains where decision makers identify viable options from a larger set of possibilities. This paper presents a statistical framework for harnessing online activity data to better understand how people make decisions. Building on insights from cognitive science and decision theory, we develop a discrete choice model that allows for exploratory behavior and multiple stages of decision making, with different rules enacted at each stage. Critically, the approach can identify if and when people invoke noncompensatory screeners that eliminate large swaths of alternatives from detailed consideration. The model is estimated using deidentified activity data on 1.1 million browsing and writing decisions observed on an online dating site. We find that mate seekers enact screeners (“deal breakers”) that encode acceptability cutoffs. A nonparametric account of heterogeneity reveals that, even after controlling for a host of observable attributes, mate evaluation differs across decision stages as well as across identified groupings of men and women. Our statistical framework can be widely applied in analyzing large-scale data on multistage choices, which typify searches for “big ticket” items.

[1]  Ken-Hou Lin,et al.  Mate Selection in Cyberspace: The Intersection of Race, Gender, and Education1 , 2013, American Journal of Sociology.

[2]  D. McFadden Conditional logit analysis of qualitative choice behavior , 1972 .

[3]  E. Shafir The Behavioral Foundations of Public Policy , 2013 .

[4]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[5]  N. Weinberg,et al.  Sifting and sorting : Personal contacts and hiring in a retail bank , 1997 .

[6]  N. Cowan,et al.  The Magical Mystery Four , 2010, Current directions in psychological science.

[7]  R. Tibshirani,et al.  Generalized additive models for medical research , 1986, Statistical methods in medical research.

[8]  Carl de Boor,et al.  A Practical Guide to Splines , 1978, Applied Mathematical Sciences.

[9]  W. Clark,et al.  Housing Market Search Behavior and Expected Utility Theory: 2. The Process of Search , 1982 .

[10]  Mark Hoekstra,et al.  The Effect of Attending the Flagship State University on Earnings: A Discontinuity-Based Approach , 2009, The Review of Economics and Statistics.

[11]  G. Yancey,et al.  Does Height Matter? An Examination of Height Preferences in Romantic Coupling , 2016 .

[12]  D. Bills Employers' Use of Job History Data for Making Hiring Decisions: A Fuller Specification of Job Assignment and Status Attainment , 1990 .

[13]  John W. Payne,et al.  The adaptive decision maker: Name index , 1993 .

[14]  G. Talarchek SEQUENTIAL ASPECTS OF RESIDENTIAL SEARCH AND SELECTION , 1982 .

[15]  John W. Payne,et al.  Task complexity and contingent processing in decision making: An information search and protocol analysis☆ , 1976 .

[16]  D. Kenrick,et al.  Age preferences in mates reflect sex differences in human reproductive strategies , 1992, Behavioral and Brain Sciences.

[17]  Eric J. Johnson,et al.  The adaptive decision maker , 1993 .

[18]  John R. Hauser,et al.  Consideration-set heuristics ☆ , 2014 .

[19]  S. Zimmerman NORC at the University of Chicago The University of Chicago The Returns to College Admission for Academically Marginal Students , 2014 .

[20]  Derek S. Young,et al.  Mixtures of regressions with changepoints , 2014, Stat. Comput..

[21]  Fred M. Feinberg,et al.  Capturing Flexible Heterogeneous Utility Curves: A Bayesian Spline Approach , 2007, Manag. Sci..

[22]  Allen Newell,et al.  Human Problem Solving. , 1973 .

[23]  Gary H. McClelland,et al.  Effects of choice task on attribute memory , 1987 .

[24]  Frank E. Harrell,et al.  Ordinal Logistic Regression , 2001 .

[25]  Günter J. Hitsch,et al.  Matching and Sorting in Online Dating , 2008 .

[26]  Kevin Lewis The limits of racial prejudice , 2013, Proceedings of the National Academy of Sciences.

[27]  Frank E. Harrell,et al.  Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis , 2001 .