Cognitive User Modeling Computed by a Proposed Dialogue Strategy Based on an Inductive Game Theory

This paper advocates the concept of user modeling (UM), which involves dialogue strategies. We focus on human-machine collaboration, which is endowed with human-like capabilities and in this regard, UM could be related to cognitive modeling, which deals with issues of perception, behavioral decision and selective attention by humans. In our UM, approximating a pay-off matrix or function will be the method employed in order to estimate user's pay-offs, which is basically calculated by user's action. Our proposed computation method allows dialogue strategies to be determined by maximizing mutual expectations of the pay-off matrix. We validated the proposed computation using a social game called ``Iterative Prisoner's Dilemma (IPD)'' that is usually used for modeling social relationships based on reciprocal altruism. Furthermore, we also allowed the pay-off matrix to be used with a probability distribution function. That is, we assumed that a person's pay-off could fluctuate over time, but that the fluctuation could be utilized in order to avoid dead reckoning in a true pay-off matrix. Accordingly, the computational structure is reminiscent of the regularization implicated by the machine learning theory. In a way, we are convinced that the crucial role of dialogue strategies is to enable user models to be smoother by approximating probabilistic pay-off functions. That is, their user models can be more accurate or more precise since the H. Asai et al.: Cognitive User Modeling Computed by a Proposed Dialogue Strategy Based on

[1]  G. Debreu ON THE CONTINUITY PROPERTIES OF PARETIAN UTILITY , 1963 .

[2]  Leslie G. Valiant,et al.  A theory of the learnable , 1984, STOC '84.

[3]  M. Kaneko,et al.  Inductive Game Theory: Discrimination and Prejudices , 1999 .

[4]  Norman I. Badler,et al.  Design of a Virtual Human Presenter , 2000, IEEE Computer Graphics and Applications.

[5]  R. Boyd,et al.  Is the repeated prisoner's dilemma a good model of reciprocal altruism? , 1988 .

[6]  Pat Langley,et al.  Learning to Predict User Operations for Adaptive Scheduling , 1998, AAAI/IAAI.

[7]  Gerhard Fischer,et al.  User Modeling in Human–Computer Interaction , 2001, User Modeling and User-Adapted Interaction.

[8]  M. Legerstee,et al.  Precursors to the development of intention at 6 months: understanding people and their actions. , 2000, Developmental psychology.

[9]  A. Newell Unified Theories of Cognition , 1990 .

[10]  Henry Lieberman,et al.  Letizia: An Agent That Assists Web Browsing , 1995, IJCAI.

[11]  William W. Cohen,et al.  Recommendation as Classification: Using Social and Content-Based Information in Recommendation , 1998, AAAI/IAAI.

[12]  Thomas Rist,et al.  Integrating reactive and scripted behaviors in a life-like presentation agent , 1998, AGENTS '98.

[13]  Yves Rosseel,et al.  A new EM algorithm using the Tikhonov regularizer and the GMB-REM robot's position estimation system. , 2001 .

[14]  A. Ben Hamza,et al.  Unifying probabilistic and variational estimation , 2002, IEEE Signal Process. Mag..

[15]  W. Hamilton,et al.  The Evolution of Cooperation , 1984 .

[16]  Howard Rachlin,et al.  Probability of reciprocation in repeated prisoner's dilemma games , 2001 .

[17]  Jari P. Kaipio,et al.  Tikhonov regularization and prior information in electrical impedance tomography , 1998, IEEE Transactions on Medical Imaging.

[18]  G. Debreu Mathematical Economics: Continuity properties of Paretian utility , 1964 .

[19]  Hiroshi Tsujino,et al.  Semantic Rewiring Mechanism of Neural Cross-Modal Integration Based on Spatial and Temporal Properties of Attention , 2002, Neurocomputing.

[20]  R. Nesse Evolutionary explanations of emotions , 1990, Human nature.

[21]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .