Empirically-Derived Behavioral Rules in Agent-Based Models Using Decision Trees Learned from Questionnaire Data

With the increasing trend in exploring the use of agent-based models in empirical contexts, this paper reflects on the use of decision trees learned from questionnaire data as behavioral models for the agents. Decision trees are machine learning algorithms most commonly used in the data mining literature, especially for smaller datasets where other techniques such as Bayesian Networks cannot be applied. In agent-based modelling contexts, decision trees have the advantage over some other machine learning techniques in that the results are more transparent, and can be critiqued by domain experts without a background in computing or artificial intelligence. However, decision trees are sensitive to the way in which they are constructed, particularly with respect to preprocessing. We describe the processes by which the decision trees were derived in the context of a model of everyday pro-environmental behavior at work, comparing various preprocessing methods and exploring their differences.

[1]  Fionn Murtagh,et al.  Ward’s Hierarchical Agglomerative Clustering Method: Which Algorithms Implement Ward’s Criterion? , 2011, Journal of Classification.

[2]  Thomas Berger,et al.  Land use decisions in developing countries and their representation in multi-agent systems , 2006 .

[3]  Giuseppe Carrus,et al.  Environmental Beliefs and Endorsement of Sustainable Development Principles in Water Conservation , 2008 .

[4]  Masoud Nikravesh,et al.  Feature Extraction - Foundations and Applications , 2006, Feature Extraction.

[5]  Troy D. Abel,et al.  A Value-Belief-Norm Theory of Support for Social Movements: The Case of Environmentalism , 1999 .

[6]  Amparo Alonso-Betanzos,et al.  An Agent-Based Model for Simulating Environmental Behavior in an Educational Organization , 2014, Neural Processing Letters.

[7]  Daniel G. Brown,et al.  Effects of Heterogeneity in Residential Preferences on an Agent-Based Model of Urban Sprawl , 2006 .

[8]  Carl A. Kallgren,et al.  A focus theory of normative conduct: Recycling the concept of norms to reduce littering in public places. , 1990 .

[9]  K. Hornik,et al.  Model-Based Recursive Partitioning , 2008 .

[10]  Olivier Barreteau,et al.  An Agent-Based Model to Explore Game Setting Effects on Attitude Change During a Role Playing Game Session , 2013, J. Artif. Soc. Soc. Simul..

[11]  Ignacio Rojas,et al.  Advances in Artificial Neural Networks and Computational Intelligence , 2015, Neural Processing Letters.

[12]  Michael Agar,et al.  My Kingdom for a Function: Modeling Misadventures of the Innumerate , 2003, J. Artif. Soc. Soc. Simul..

[13]  Tran Ngoc Trung,et al.  Participatory Simulation of Land-Use Changes in the Northern Mountains of Vietnam: the Combined Use of an Agent-Based Model, a Role-Playing Game, and a Geographic Information System , 2005 .

[14]  S. Schwartz Normative Influences on Altruism , 1977 .

[15]  Javier Bajo,et al.  Trends in Practical Applications of Agents, Multi-Agent Systems and Sustainability , 2015 .

[16]  Francisco Herrera,et al.  Tutorial on practical tips of the most influential data preprocessing algorithms in data mining , 2016, Knowl. Based Syst..

[17]  Umberto Gostoli A Cognitively Founded Model of the Social Emergence of Lexicon , 2008, J. Artif. Soc. Soc. Simul..

[18]  Gary Polhill,et al.  Empirical agent-based modelling of everyday pro-environmental behaviours at work , 2014 .

[19]  R. Likert “Technique for the Measurement of Attitudes, A” , 2022, The SAGE Encyclopedia of Research Design.

[20]  Verónica Bolón-Canedo,et al.  A review of feature selection methods on synthetic data , 2013, Knowledge and Information Systems.

[21]  Huan Liu,et al.  Consistency-based search in feature selection , 2003, Artif. Intell..

[22]  G Gigerenzer,et al.  Reasoning the fast and frugal way: models of bounded rationality. , 1996, Psychological review.

[23]  Jeffrey S. Simonoff,et al.  Tree Induction Vs Logistic Regression: A Learning Curve Analysis , 2001, J. Mach. Learn. Res..

[24]  Huan Liu,et al.  Efficient Feature Selection via Analysis of Relevance and Redundancy , 2004, J. Mach. Learn. Res..

[25]  Scott Moss,et al.  Growing Qawm: an Evidence-Driven Declarative Model of Afghan Power Structures , 2008, Adv. Complex Syst..

[26]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Verónica Bolón-Canedo,et al.  Recent advances and emerging challenges of feature selection in the context of big data , 2015, Knowl. Based Syst..

[28]  J. D. Groot,et al.  Mean or green? Value orientations, morality and prosocial behaviour , 2008 .

[29]  Giuseppe Carrus,et al.  Distinguishing the sources of normative influence on proenvironmental behaviors , 2011 .

[30]  Georg Holtz,et al.  Generating Social Practices , 2014, J. Artif. Soc. Soc. Simul..

[31]  L. Steg,et al.  The Oxford handbook of environmental and conservation psychology , 2012 .

[32]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[33]  Mark A. Hall,et al.  Correlation-based Feature Selection for Machine Learning , 2003 .

[34]  Huan Liu,et al.  Discretization: An Enabling Technique , 2002, Data Mining and Knowledge Discovery.

[35]  Sarah Davis,et al.  Complexity, land-use modeling, and the human dimension: Fundamental challenges for mapping unknown outcome spaces , 2008 .

[36]  L. Steg,et al.  The Significance of Hedonic Values for Environmentally Relevant Attitudes, Preferences, and Actions , 2014 .

[37]  G. Gigerenzer,et al.  Probabilistic mental models: a Brunswikian theory of confidence. , 1991, Psychological review.

[38]  Huan Liu,et al.  Searching for Interacting Features , 2007, IJCAI.

[39]  Usama M. Fayyad,et al.  Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning , 1993, IJCAI.

[40]  S. Schwartz Universals in the Content and Structure of Values: Theoretical Advances and Empirical Tests in 20 Countries , 1992 .

[41]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[42]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[43]  Amparo Alonso-Betanzos,et al.  Influence of internal values and social networks for achieving sustainable organizations , 2014, ECAI.

[44]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[45]  Amparo Alonso-Betanzos,et al.  Designing Decision Trees for Representing Sustainable Behaviours in Agents , 2015, PAAMS.

[46]  Wenjie Sun,et al.  Spatially explicit experiments for the exploration of land‐use decision‐making dynamics , 2006, Int. J. Geogr. Inf. Sci..

[47]  Nicholas Mark Gotts,et al.  Case-Based Reasoning, Social Dilemmas, and a New Equilibrium Concept , 2004, J. Artif. Soc. Soc. Simul..

[48]  E. Ostrom,et al.  Empirically Based, Agent-based models , 2006 .

[49]  T. Caliński,et al.  A dendrite method for cluster analysis , 1974 .

[50]  L. Steg,et al.  The value of environmental self-identity: The relationship between biospheric values, environmental self-identity and environmental preferences, intentions and behaviour , 2013 .

[51]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[52]  Erin Bohensky,et al.  Behaviour and space in agent-based modelling: Poverty patterns in East Kalimantan, Indonesia , 2013, Environ. Model. Softw..

[53]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .