Socially aware conversational agents

