Seeking activity: on the trail of users in open and community source frameworks

Usage data captured and logged by computers has long been an essential source of information for software developers, support services personnel, usability designers, and learning researchers [1, 2]. Whether from mainframes, file servers, network devices, or workstations, the user event data logged in its many forms has served as an essential source of information for those who need to improve software, analyze problems, monitor security, track workflow, report on resource usage, evaluate learning activities etc. With today's generation of open and community source web-based frameworks, however, new challenges arise as to how, where, and when user activity gets captured and analyzed. These frameworks' flexibility in allowing easy integration of different applications, presentation technologies, middleware, and data sources has side effects on usage data: fragmented logs in a wide range of formats often bestrewn across many locations. This paper focuses on common issues faced especially by academic computing support personnel who need to gather and analyze user activity information within heterogeneous, distributed open source web frameworks like Sakai and uPortal. As described in this paper, these kinds of challenges can be met by drawing upon techniques for coordinated distributed event monitoring along with some basic data mining and data visualization approaches. In particular, this paper describes a work-in-progress to develop an approach towards building a distributed capture and analysis systems for a large production deployment of the Sakai Collaboration and Learning Environment in order to meet a wide range of tracking, monitoring, and reporting log analysis in one university setting.

[1]  Arun Sen,et al.  Current trends in web data analysis , 2006, CACM.

[2]  Fatos Xhafa,et al.  A layered framework for evaluating on-line collaborative learning interactions , 2006, Int. J. Hum. Comput. Stud..

[3]  Daniel T. Larose,et al.  Data mining methods and models , 2006 .

[4]  Jeff Smith LabDisplay: bringing computer lab management into the new millennium , 2005, SIGUCCS '05.

[5]  Owen G. McGrath Balancing act: community and local requirements in an open source development process , 2006, SIGUCCS.

[6]  Fabio Paternò,et al.  Intelligent analysis of user interactions with web applications , 2002, IUI '02.

[7]  S. K. Adam,et al.  Strategies for managing the execution of unauthorized programs on a university computer network , 2003 .

[8]  Chris Amelung Using Social Context and E-Learner Identity as a Framework for an E-Learning Notification System , 2007 .

[9]  Elena Gaudioso,et al.  Data mining to support tutoring in virtual learning communities: experiences and challenges , 2005 .

[10]  Ian H. Witten,et al.  Data mining - practical machine learning tools and techniques, Second Edition , 2005, The Morgan Kaufmann series in data management systems.

[11]  Zdravko Markov,et al.  Data mining the web - uncovering patterns in web content, structure, and usage , 2007 .

[12]  Ian Witten,et al.  Data Mining , 2000 .

[13]  Chih-Ming Chen,et al.  Mining learner profile utilizing association rule for web-based learning diagnosis , 2007, Expert systems with applications.

[14]  Martin Theus,et al.  Interactive Data Visualization using Mondrian , 2002 .

[15]  Zdravko Markov,et al.  Comprar Data Mining the Web: Uncovering Patterns in Web Content, Structure, and Usage | Daniel T. Larose | 9780471666554 | Wiley , 2007 .

[16]  Sebastián Ventura,et al.  Educational data mining: A survey from 1995 to 2005 , 2007, Expert Syst. Appl..

[17]  Myra Spiliopoulou,et al.  Analysis of navigation behaviour in web sites integrating multiple information systems , 2000, The VLDB Journal.

[18]  Nathaniel S. Borenstein,et al.  CMU's Andrew project: a retrospective , 1996, CACM.

[19]  Owen G. McGrath Building an instructional portal: channeling the writing lab , 2002, SIGUCCS '02.

[20]  Marti A. Hearst,et al.  The state of the art in automating usability evaluation of user interfaces , 2001, CSUR.

[21]  Absatz Computer Support for Collaborative Learning Environments , 2005 .

[22]  Erno Lehtinen,et al.  Evaluating CSCL log files by social network analysis , 1999, CSCL.

[23]  Osmar R. Zaïane,et al.  Clustering Web sessions by sequence alignment , 2002, Proceedings. 13th International Workshop on Database and Expert Systems Applications.