A Noun Phrase Analysis Tool for Mining Online Community Conversations

Online communities are creating a growing legacy of texts in bulletin board postings, chat, blogs, and similar venues. These texts record conversation, knowledge exchange, and shifts in focus as groups grow, mature, and decline; they represent a rich history of group interaction and an opportunity to explore the purpose and development of online communities. However, the quantity of data these communities create is vast, and examining their processes in a timely manner requires automated analysis. This raises questions about how to conduct automated analyses and what we can gain from them: Can we gain an idea of community interests, priorities, and operation from automated examination of the texts of postings and of patterns of posting behavior? Can we mine stored texts to discover patterns of language and interaction that characterize a community?
