Resource Aware Distributed Knowledge Discovery

In the introduction it was argued that ubiquitous knowledge discovery systems have to be able to sense their environment and receive data from other devices, to adapt continuously to changing environmental conditions (including their own condition) and evolving user habits and need be capable of predictive self-diagnosis. In the last chapter, resource constraints arising from ubiquitous environments have been discussed in some detail. It has been argued that algorithms have to be resource-aware because of real-time constraints and of limited computing and battery power as well as communication resources.

[1]  Geoffrey I. Webb,et al.  The Need for Low Bias Algorithms in Classification Learning from Large Data Sets , 2002, PKDD.

[2]  Haimonti Dutta,et al.  Orthogonal decision trees , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[3]  Ran Wolff,et al.  Hierarchical decision tree induction in distributed genomic databases , 2005, IEEE Transactions on Knowledge and Data Engineering.

[4]  Ossama Younis,et al.  HEED: a hybrid, energy-efficient, distributed clustering approach for ad hoc sensor networks , 2004, IEEE Transactions on Mobile Computing.

[5]  Rajeev Motwani,et al.  Approximate Frequency Counts over Data Streams , 2012, VLDB.

[6]  M. R. Genesereth,et al.  Knowledge Interchange Format Version 3.0 Reference Manual , 1992, LICS 1992.

[7]  Mohamed Medhat Gaber,et al.  Learning from Data Streams: Processing Techniques in Sensor Networks , 2007 .

[8]  Shai Ben-David,et al.  Detecting Change in Data Streams , 2004, VLDB.

[9]  S. Muthukrishnan,et al.  Data streams: algorithms and applications , 2005, SODA '03.

[10]  Geoff Hulten,et al.  Catching up with the Data: Research Issues in Mining Data Streams , 2001, DMKD.

[11]  Jan Komorowski,et al.  Principles of Data Mining and Knowledge Discovery , 2001, Lecture Notes in Computer Science.

[12]  Yelena Yesha,et al.  Data Mining: Next Generation Challenges and Future Directions , 2004 .

[13]  Alessandra Russo,et al.  Advances in Artificial Intelligence – SBIA 2004 , 2004, Lecture Notes in Computer Science.

[14]  Charu C. Aggarwal,et al.  Data Streams: Models and Algorithms (Advances in Database Systems) , 2006 .

[15]  Geoff Hulten,et al.  Mining time-changing data streams , 2001, KDD '01.

[16]  João Gama,et al.  Learning with Drift Detection , 2004, SBIA.

[17]  Ping Chen,et al.  Using the fractal dimension to cluster datasets , 2000, KDD '00.

[18]  Dipti Verma,et al.  Data Mining: Next Generation Challenges and Future Directions , 2012 .

[19]  Daniel Barbará,et al.  Requirements for clustering data streams , 2002, SKDD.

[20]  Graham Cormode,et al.  An improved data stream summary: the count-min sketch and its applications , 2004, J. Algorithms.

[21]  João Gama,et al.  An Adaptive Prequential Learning Framework for Bayesian Network Classifiers , 2006, PKDD.

[22]  Timothy W. Finin,et al.  KQML as an agent communication language , 1994, CIKM '94.

[23]  Graham Cormode,et al.  Conquering the Divide: Continuous Clustering of Distributed Data Streams , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[24]  Gert Cauwenberghs,et al.  Incremental and Decremental Support Vector Machine Learning , 2000, NIPS.

[25]  Mario Cannataro,et al.  Distributed data mining on the grid , 2002, Future Gener. Comput. Syst..

[26]  Peter Edwards,et al.  Agent-Based Knowledge Discovery , 1995 .

[27]  Salvatore J. Stolfo,et al.  JAM: Java Agents for Meta-Learning over Distributed Databases , 1997, KDD.

[28]  Johannes Fürnkranz,et al.  Knowledge Discovery in Databases: PKDD 2006, 10th European Conference on Principles and Practice of Knowledge Discovery in Databases, Berlin, Germany, September 18-22, 2006, Proceedings , 2006, PKDD.

[29]  J. Andel Sequential Analysis , 2022, The SAGE Encyclopedia of Research Design.

[30]  Francesco Vatalaro,et al.  Ambient Intelligence: The Evolution of Technology, Communication and Cognition Towards the Future of Human-Computer Interaction , 2005 .

[31]  Ran Wolff,et al.  Distributed Data Mining in Peer-to-Peer Networks , 2006, IEEE Internet Computing.

[32]  João Gama,et al.  Hierarchical Clustering of Time-Series Data Streams , 2008, IEEE Transactions on Knowledge and Data Engineering.

[33]  Charu C. Aggarwal,et al.  Data Streams - Models and Algorithms , 2014, Advances in Database Systems.

[34]  Douglas B. Moran,et al.  The Open Agent Architecture: A Framework for Building Distributed Software Systems , 1999, Appl. Artif. Intell..

[35]  Jennifer Widom,et al.  Models and issues in data stream systems , 2002, PODS.

[36]  João Gama,et al.  Accurate decision trees for mining high-speed data streams , 2003, KDD '03.

[37]  Hisham M. Haddad,et al.  Proceedings of the 2008 ACM Symposium on Applied Computing (SAC), Fortaleza, Ceara, Brazil, March 16-20, 2008 , 2008, SAC.

[38]  Martín Farach-Colton LATIN 2004: Theoretical Informatics , 2004, Lecture Notes in Computer Science.

[39]  Geoff Hulten,et al.  Mining high-speed data streams , 2000, KDD '00.

[40]  Nikunj C. Oza,et al.  Online Ensemble Learning , 2000, AAAI/IAAI.

[41]  Philip S. Yu,et al.  A framework for resource-aware knowledge discovery in data streams: a holistic approach with its application to clustering , 2006, SAC '06.

[42]  André Carlos Ponce de Leon Ferreira de Carvalho,et al.  Cluster-based novel concept detection in data streams applied to intrusion detection in computer networks , 2008, SAC '08.

[43]  R. Schapire The Strength of Weak Learnability , 1990, Machine Learning.

[44]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[45]  Russ Bubley,et al.  Randomized algorithms , 1995, CSUR.