Modeling discussion topics in interactions with a tablet reading primer

CloudPrimer is a tablet-based interactive reading primer that aims to foster early literacy skills and shared parent-child reading through user-targeted discussion topic suggestions. The tablet application records discussions between parents and children as they read a story and leverages this information, in combination with a common sense knowledge base, to develop discussion topic models. The long-term goal of the project is to use such models to provide context-sensitive discussion topic suggestions to parents during the shared reading activity in order to enhance the interactive experience and foster parental engagement in literacy education. In this paper, we present a novel approach for using commonsense reasoning to effectively model topics of discussion in unstructured dialog. We introduce a metric for localizing concepts that the users are interested in at a given moment in the dialog and extract a time sequence of words of interest. We then present algorithms for topic modeling and refinement that leverage semantic knowledge acquired from ConceptNet, a commonsense knowledge base. We evaluate the performance of our algorithms using transcriptions of audio recordings of parent-child pairs interacting with a tablet application, and compare the output of our algorithms to human-generated topics. Our results show that words of interest and discussion topics selected by our algorithm closely match those identified by human readers.

[1]  Catherine Havasi,et al.  ConceptNet: A lexical resource for common sense knowledge , 2009 .

[2]  Stephen F. Smith,et al.  CMRadar: A Personal Assistant Agent for Calendar Management , 2004, AAAI.

[3]  Steven Bird,et al.  NLTK: The Natural Language Toolkit , 2002, ACL 2006.

[4]  Neil T. Heffernan,et al.  Addressing the testing challenge with a web-based e-assessment system that tutors as it assesses , 2006, WWW '06.

[5]  Martin Porter,et al.  Snowball: A language for stemming algorithms , 2001 .

[6]  George Karypis,et al.  A Comparison of Document Clustering Techniques , 2000 .

[7]  O. Korat,et al.  Reading electronic and printed books with and without adult instruction: effects on emergent reading , 2010 .

[8]  Hanna M. Wallach,et al.  Topic modeling: beyond bag-of-words , 2006, ICML.

[9]  Candace L. Sidner,et al.  COLLAGEN: A Collaboration Manager for Software Interface Agents , 1998, User Modeling and User-Adapted Interaction.

[10]  Adina Shamir,et al.  The educational electronic book as a tool for supporting children's emergent literacy in low versus middle SES groups , 2008, Comput. Educ..

[11]  Heiko Hausendorf,et al.  Patterns of adult-child interaction as a mechanism of discourse acquisition☆ , 1992 .

[12]  Abdenour Bouzouane,et al.  A smart home agent for plan recognition , 2006, AAMAS '06.

[13]  Andreas Krause,et al.  Data association for topic intensity tracking , 2006, ICML '06.

[14]  Henry A. Kautz,et al.  Real-time crowd labeling for deployable activity recognition , 2013, CSCW.

[15]  Steven Bird,et al.  NLTK: The Natural Language Toolkit , 2002, ACL.

[16]  Regina Barzilay,et al.  Bayesian Unsupervised Topic Segmentation , 2008, EMNLP.

[17]  M. Susan Burns,et al.  Starting Out Right: A Guide to Promoting Children's Reading Success , 2000 .

[18]  Panagiotis G. Ipeirotis,et al.  Running Experiments on Amazon Mechanical Turk , 2010, Judgment and Decision Making.

[19]  Lydia B. Chilton,et al.  Personalized Online Education - A Crowdsourcing Challenge , 2012, HCOMP@AAAI.

[20]  Scott Douglass,et al.  Large Declarative Memories in ACT-R , 2009 .

[21]  Juan Enrique Ramos,et al.  Using TF-IDF to Determine Word Relevance in Document Queries , 2003 .