Towards a Hypermedia-enabled and Web-based Data Analysis Framework

Making better business decisions is often the key to gaining competitive advantage. An important factor in effective decision-making is the ability to access, understand, and utilise business information easily. In this paper, we propose a framework that focuses on integrating data analysis tools into the Web and providing hypermedia functionality to them, and supporting the development of hypermedia functionalities with Web usage mining techniques. In this framework, data analysis tools play the role of utilizing historical data to discover useful information and improve the process of business decisions. Augmenting data analysis tools with rich hypermedia functionality would streamline access to and provide rich navigational features around related information, while augmenting hypermedia facilities with Web usage mining would allow them to dynamically restructure the Web site to fit users’ browsing needs based on past navigation history. We believe that the proposed framework should result in new ways to view and manage data analysis tools and help users easily browse Web sites.

[1]  Joonhee Yoo,et al.  Hypermedia: a design philosophy , 1999, CSUR.

[2]  G. Halasz Frank,et al.  Reflections on NoteCards: seven issues for the next generation of hypermedia systems , 1987, CACM.

[3]  Anupam Joshi,et al.  On Mining Web Access Logs , 2000, ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery.

[4]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[5]  Florent Masseglia,et al.  WebTool: An Integrated Framework for Data Mining , 1999, DEXA.

[6]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[7]  Cyrus Shahabi,et al.  Knowledge discovery from users Web-page navigation , 1997, Proceedings Seventh International Workshop on Research Issues in Data Engineering. High Performance Database Management for Large-Scale Applications.

[8]  Oren Etzioni,et al.  Adaptive Web sites , 2000, CACM.

[9]  Fabio Vitali,et al.  Extending HTML in a Principled Way with Displets , 1997, Comput. Networks.

[10]  Jaideep Srivastava,et al.  Web usage mining: discovery and applications of usage patterns from Web data , 2000, SKDD.

[11]  Steven J. DeRose,et al.  Xml linking language (xlink), version 1. 0 , 2000, WWW 2000.

[12]  Jeff Conklin,et al.  Hypertext: An Introduction and Survey , 1987, Computer.

[13]  J. Palous,et al.  Machine Learning and Data Mining , 2002 .

[14]  Anders Berglund,et al.  Extensible Stylesheet Language (XSL) Version 1.0 , 1998 .

[15]  Harri Oinas-Kukkonen,et al.  Fourth generation hypermedia: some missing links for the World Wide Web , 1997, Int. J. Hum. Comput. Stud..

[16]  Norman Walsh A Guide to XML , 1997, World Wide Web J..

[17]  Herbert A. Simon,et al.  Applications of machine learning and rule induction , 1995, CACM.

[18]  Chao-Min Chiu Reengineering Information Systems with Xml , 2000, Inf. Syst. Manag..

[19]  Nicole Yankelovich,et al.  InterNote: extending a hypermedia framework to support annotative collaboration , 1989, Hypertext.

[20]  Marc Najork,et al.  Mercator: A scalable, extensible Web crawler , 1999, World Wide Web.

[21]  Steven M. Drucker,et al.  Intermedia: the concept and the construction of a seamless information environment , 1988, Computer.

[22]  Dan Connolly,et al.  The Evolution of Web Documents: The Ascent of XML , 1997, World Wide Web J..

[23]  Stephen R. Gardner Building the data warehouse , 1998, CACM.

[24]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[25]  Dominic A. Orchard,et al.  XML Linking Language (XLink) Version 1. 0. World Wide Web Consortium, Proposed Recommendation PR - x , 2000 .

[26]  Philip S. Yu,et al.  Efficient Data Mining for Path Traversal Patterns , 1998, IEEE Trans. Knowl. Data Eng..

[27]  Bernard Widrow,et al.  The basic ideas in neural networks , 1994, CACM.

[28]  Jerome H. Friedman,et al.  Flexible Metric Nearest Neighbor Classification , 1994 .

[29]  Umeshwar Dayal,et al.  From User Access Patterns to Dynamic Hypertext Linking , 1996, Comput. Networks.

[30]  Sharon C. Adler Previous version: , 1997 .

[31]  Jakob Nielsen,et al.  The art of navigating through hypertext , 1990, CACM.

[32]  Randall H. Trigg Guided tours and tabletops: tools for communicating in a hypertext environment , 1988, TOIS.

[33]  Sergio A. Alvarez,et al.  Efficient Adaptive-Support Association Rule Mining for Recommender Systems , 2004, Data Mining and Knowledge Discovery.

[34]  Jaideep Srivastava,et al.  Web mining: information and pattern discovery on the World Wide Web , 1997, Proceedings Ninth IEEE International Conference on Tools with Artificial Intelligence.

[35]  Chao-Min Chiu Towards integrating hypermedia and information systems on the web , 2003, Inf. Manag..

[36]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[37]  Thomas A. Runkler,et al.  Web mining with relational clustering , 2003, Int. J. Approx. Reason..

[38]  Yoon Ho Cho,et al.  A personalized recommender system based on web usage mining and decision tree induction , 2002, Expert Syst. Appl..

[39]  Tao Luo,et al.  Using sequential and non-sequential patterns in predictive Web usage mining tasks , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[40]  Jakob Nielsen,et al.  Multimedia and Hypertext: The Internet and Beyond , 1995 .

[41]  Harri Oinas-Kukkonen,et al.  Hypertext functionality , 1999, CSUR.

[42]  James Clark,et al.  XSL Transformations (XSLT) Version 1.0 , 1999 .

[43]  Anupam,et al.  Mining Web Access Logs Using Relational Competitive Fuzzy Clustering , 1999 .

[44]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery: An Overview , 1996, Advances in Knowledge Discovery and Data Mining.

[45]  Vannevar Bush,et al.  As we may think , 1945, INTR.

[46]  Frank G. Halasz,et al.  Reflections on NoteCards: seven issues for the next generation of hypermedia systems , 1987, CACM.

[47]  Gregory Piatetsky-Shapiro,et al.  Advances in Knowledge Discovery and Data Mining , 2004, Lecture Notes in Computer Science.

[48]  C. M. Sperberg-McQueen,et al.  Extensible Markup Language (XML) , 1997, World Wide Web J..

[49]  Florent Masseglia,et al.  The PSP Approach for Mining Sequential Patterns , 1998, PKDD.

[50]  Pat Langley,et al.  An Analysis of Bayesian Classifiers , 1992, AAAI.

[51]  Wooju Kim,et al.  Combination of multiple classifiers for the customer's purchase behavior prediction , 2003, Decis. Support Syst..

[52]  Mohammed J. Zaki,et al.  SPADE: An Efficient Algorithm for Mining Frequent Sequences , 2004, Machine Learning.

[53]  Joonhee Yoo,et al.  Towards a relationship navigation analysis , 2000, Proceedings of the 33rd Annual Hawaii International Conference on System Sciences.

[54]  Ramakrishnan Srikant,et al.  Mining Sequential Patterns: Generalizations and Performance Improvements , 1996, EDBT.

[55]  Kyoungro Yoon,et al.  Querying structured hyperdocuments , 1996, Proceedings of HICSS-29: 29th Hawaii International Conference on System Sciences.

[56]  Bradley P. Allen,et al.  Case-based reasoning: business applications , 1994, CACM.

[57]  Oren Etzioni,et al.  Towards adaptive Web sites: Conceptual framework and case study , 1999, Artif. Intell..

[58]  Michael Bieber,et al.  Toward hypermedia support for information relationship management , 2001, J. Inf. Sci..

[59]  Tao Luo,et al.  Discovery and Evaluation of Aggregate Usage Profiles for Web Personalization , 2004, Data Mining and Knowledge Discovery.

[60]  toExcel Extensible Stylesheet Language: Xsl Version 1.0 , 1999 .

[61]  Jaideep Srivastava,et al.  Automatic personalization based on Web usage mining , 2000, CACM.