From T-Mazes to Labyrinths: Learning from Model-Based Feedback

Many organizational actions need not have any immediate or direct payoff consequence but set the stage for subsequent actions that bring the organization toward some actual payoff. Learning in such settings poses the challenge of credit assignment (Minsky 1961), that is, how to assign credit for the overall outcome of a sequence of actions to each of the antecedent actions. To explore the process of learning in such contexts, we create a formal model in which the actors develop a mental model of the value of stage-setting actions as a complex problem-solving task is repeated. Partial knowledge, either of particular states in the problem space or inefficient and circuitous routines through the space, is shown to be quite valuable. Because of the interdependence of intelligent action when a sequence of actions must be identified, however, organizational knowledge is relatively fragile. As a consequence, while turnover may stimulate search and have largely benign implications in less interdependent task settings, it is very destructive of the organization's near-term performance when the learning problem requires a complementarity among the actors' knowledge.

[1]  H. Greve Performance, Aspirations, and Risky Organizational Change , 1998 .

[2]  J. March,et al.  Adaptation as Information Restriction: The Hot Stove Effect , 2001 .

[3]  Arthur L. Samuel,et al.  Some Studies in Machine Learning Using the Game of Checkers , 1967, IBM J. Res. Dev..

[4]  J. Forrester Industrial Dynamics , 1997 .

[5]  Theresa K. Lant,et al.  Aspiration Level Adaptation: An Empirical Exploration , 1992 .

[6]  A. L. Samuel,et al.  Some studies in machine learning using the game of checkers. II: recent progress , 1967 .

[7]  Giovanni Dosi,et al.  The nature and dynamics of organizational capabilities , 2001 .

[8]  A G Barto,et al.  Toward a modern theory of adaptive networks: expectation and prediction. , 1981, Psychological review.

[9]  Herbert A. Simon,et al.  The Sciences of the Artificial , 1970 .

[10]  Sean R Eddy,et al.  What is dynamic programming? , 2004, Nature Biotechnology.

[11]  James D. Thompson Organizations in Action , 1967 .

[12]  Araújo,et al.  An Evolutionary theory of economic change , 1983 .

[13]  L. Argote Organizational Learning: Creating, Retaining and Transferring Knowledge , 1999 .

[14]  J. Sterman Misperceptions of feedback in dynamic decision making , 1989 .

[15]  M. Mazumdar,et al.  Avoiding Complexity Catastrophe in Coevolutionary Pockets: Strategies for Rugged Landscapes , 1999 .

[16]  Tony Curzon Price,et al.  Emergence: From Chaos to Order by John H. Holland , 1998, J. Artif. Soc. Soc. Simul..

[17]  Colin Camerer Progress in Behavioral Game Theory , 1997 .

[18]  Stuart A. Kauffman,et al.  ORIGINS OF ORDER , 2019, Origins of Order.

[19]  Louis E. Yelle THE LEARNING CURVE: HISTORICAL REVIEW AND COMPREHENSIVE SURVEY , 1979 .

[20]  S. Mezias,et al.  MANAGING DISCONTINUOUS CHANGE: A SIMULATION STUDY OF ORGANIZATIONAL LEARNING AND ENTREPRENEURSHIP , 1990 .

[21]  R. Katz The Effects of Group Longevity on Project Communication and Performance. , 1982 .

[22]  Kathleen M. Carley,et al.  Organizational response: the cost performance tradeoff , 1997 .

[23]  Anne S. Miner,et al.  Ugly Duckling No More: Pasts and Futures of Organizational Learning Research , 1996 .

[24]  J. March,et al.  An introduction to models in the social sciences , 1975 .

[25]  Kathleen M. Carley Organizational Learning and Personnel Turnover , 1992 .

[26]  M. Shubik,et al.  A Behavioral Theory of the Firm. , 1964 .

[27]  Erik R. Larsen,et al.  Adaptive Learning in Organizations: A System Dynamics-Based Exploration , 1997 .

[28]  S. Winter,et al.  An evolutionary theory of economic change , 1983 .

[29]  Chris Watkins,et al.  Learning from delayed rewards , 1989 .

[30]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[31]  Daniel A. Levinthal,et al.  Learning from Experience in Organizations , 1985 .

[32]  Jan W. Rivkin,et al.  Balancing Search and Stability: Interdependencies Among Elements of Organizational Design , 2003, Manag. Sci..

[33]  Daniel A. Levinthal Adaptation on rugged landscapes , 1997 .

[34]  D CohenMichael,et al.  Organizational Routines Are Stored as Procedural Memory , 1994 .

[35]  Jan W. Rivkin Imitation of Complex Strategies , 2000 .

[36]  J. Walsh Managerial and Organizational Cognition: Notes from a Trip Down Memory Lane , 1995 .

[37]  S. Mezias,et al.  An Organizational Learning Model of Convergence and Reorientation , 1992 .

[38]  S. Mezias,et al.  The three faces of corporate renewal: institution, revolution, and evolution , 1993 .

[39]  G. Huber Organizational Learning: The Contributing Processes and the Literatures , 1991 .

[40]  John H. Holland,et al.  Induction: Processes of Inference, Learning, and Discovery , 1987, IEEE Expert.

[41]  Daniel A. Levinthal,et al.  A model of adaptive organizational search , 1981 .

[42]  Theresa K. Lant,et al.  Computer simulations of organizations as experiential learning systems: implications for organization theory , 1994 .

[43]  M. Sastry Problems and Paradoxes in a Model of Punctuated Organizational Change , 1997 .

[44]  Michael X Cohen,et al.  Organizational Routines Are Stored as Procedural Memory: Evidence from a Laboratory Study , 1994 .

[45]  Michael X Cohen,et al.  Harnessing Complexity: Organizational Implications of a Scientific Frontier , 2000 .

[46]  John N. Tsitsiklis,et al.  Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.

[47]  Leslie Pack Kaelbling,et al.  Learning in embedded systems , 1993 .

[48]  永福 智志 The Organization of Learning , 2005, Journal of Cognitive Neuroscience.

[49]  Zenas Block,et al.  Milestones for successful venture planning , 2006 .

[50]  John D. Sterman,et al.  Business dynamics : systems thinking and modelling for acomplex world , 2002 .

[51]  James G. March,et al.  Adaptive Coordination of a Learning Team , 1987 .

[52]  Charles R. Schwenk,et al.  TOP MANAGEMENT, STRATEGY AND ORGANIZATIONAL KNOWLEDGE STRUCTURES , 1992 .

[53]  Steffen Bayer,et al.  Business dynamics: Systems thinking and modeling for a complex world , 2004 .

[54]  Jerker Denrell Comment: The Performance of Performance , 2004 .

[55]  Daniel A. Levinthal,et al.  Looking Forward and Looking Backward: Cognitive and Experiential Search , 2000 .

[56]  D. Plaut,et al.  Learning in Dynamic Decision Tasks: Computational Model and Empirical Evidence , 1997 .

[57]  Daniel A. Levinthal Organizational Capabilities in Complex Worlds , 2001 .

[58]  Nelson P. Repenning,et al.  Capability Traps and Self-Confirming Attribution Errors in the Dynamics of Process Improvement , 2002 .

[59]  G MarchJames,et al.  Adaptation as Information Restriction , 2001 .

[60]  Linda Argote,et al.  Group Learning Curves: The Effects of Turnover and Task Complexity on Group Performance1 , 1995 .

[61]  Erhard Bruderer,et al.  Organizational Evolution, Learning, and Selection: A Genetic-Algorithm-Based Model , 1996 .

[62]  J. Harrington,et al.  Centralization vs. Decentralization in a Multi-Unit Organization: A Computational Model of a Retail Chain as a Multi-Agent Adaptive System , 2000 .

[63]  D. Sterman,et al.  Misperceptions of Feedback in a Dynamic Decision Making Experiment , 1989 .