Compression Schemes for Mining Large Datasets

[1]  Michael Stonebraker,et al.  A comparison of approaches to large-scale data analysis , 2009, SIGMOD Conference.

[2]  T. Ravindra Babu,et al.  On Simultaneous Selection of Prototypes and Features in Large Data , 2005, PReMI.

[3]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[4]  Erik D. Goodman,et al.  Simultaneous Feature Extraction and Selection Using a Masking Genetic Algorithm , 1997 .

[5]  Sankar K. Pal,et al.  Pattern Recognition Algorithms for Data Mining , 2004 .

[6]  Domenico Rosaci,et al.  Agent clustering based on semantic negotiation , 2008, TAAS.

[7]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[8]  S. Canu,et al.  Training Invariant Support Vector Machines using Selective Sampling , 2005 .

[9]  Robert P. W. Duin,et al.  Using two-class classifiers for multiclass classification , 2002, Object recognition supported by user interaction for service robots.

[10]  Chengqi Zhang,et al.  F-trade: an agent-mining symbiont for financial services , 2007, AAMAS '07.

[11]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[12]  Jiawei Han,et al.  Discriminative Frequent Pattern Analysis for Effective Classification , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[13]  Sam Kwong,et al.  Genetic algorithms: concepts and applications [in engineering design] , 1996, IEEE Trans. Ind. Electron..

[14]  Stephen Marshall,et al.  Convergence Criteria for Genetic Algorithms , 2000, SIAM J. Comput..

[15]  Jan M. Van Campenhout,et al.  On the Possible Orderings in the Measurement Selection Problem , 1977, IEEE Transactions on Systems, Man, and Cybernetics.

[16]  Robert E. Schapire,et al.  The strength of weak learnability , 1990, Mach. Learn..

[17]  Liang Dong,et al.  Starfish: A Self-tuning System for Big Data Analytics , 2011, CIDR.

[18]  Nicholas R. Jennings,et al.  Towards a Theory of Cooperative Problem Solving , 1994, MAAMAW.

[19]  Kagan Tumer,et al.  Efficient agent-based cluster ensembles , 2006, AAMAS '06.

[20]  Din J. Wasem,et al.  Mining of Massive Datasets , 2014 .

[21]  Federico Girosi,et al.  Training support vector machines: an application to face detection , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[22]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[23]  R. Sabourin,et al.  Feature subset selection using genetic algorithms for handwritten digit recognition , 2001, Proceedings XIV Brazilian Symposium on Computer Graphics and Image Processing.

[24]  Ming-Syan Chen,et al.  On the Design and Applicability of Distance Functions in High-Dimensional Data Space , 2009, IEEE Trans. Knowl. Data Eng..

[25]  C. G. Hilborn,et al.  The Condensed Nearest Neighbor Rule , 1967 .

[26]  T. Ravindra Babu,et al.  Classification of run-length encoded binary data , 2007, Pattern Recognit..

[27]  Akira Suzuki,et al.  Feature Selection for Character Recognition Using Genetic Algorithm , 2009, 2009 Fourth International Conference on Innovative Computing, Information and Control (ICICIC).

[28]  Khalid Sayood,et al.  Introduction to Data Compression , 1996 .

[29]  Jack Sklansky,et al.  A note on genetic algorithms for large-scale feature selection , 1989, Pattern Recognit. Lett..

[30]  Yoram Singer,et al.  Reducing Multiclass to Binary: A Unifying Approach for Margin Classifiers , 2000, J. Mach. Learn. Res..

[31]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[32]  Aditya Krishna Menon,et al.  Random projections and applications to dimensionality reduction , 2007 .

[33]  Tian Zhang,et al.  BIRCH: an efficient data clustering method for very large databases , 1996, SIGMOD '96.

[34]  Theodore Johnson,et al.  Squashing flat files flatter , 1999, KDD '99.

[35]  Jacques Ferber,et al.  Multi-agent systems - an introduction to distributed artificial intelligence , 1999 .

[36]  T. Ravindra Babu,et al.  Hybrid learning scheme for data mining applications , 2004, Fourth International Conference on Hybrid Intelligent Systems (HIS'04).

[37]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[38]  T. Ravindra Babu,et al.  Multiagent Systems for Large Data Clustering , 2009, Data Mining and Multi-agent Integration.

[39]  Abraham Silberschatz,et al.  HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads , 2009, Proc. VLDB Endow..

[40]  T. Ravindra Babu,et al.  Comparison of genetic algorithm based prototype selection schemes , 2001, Pattern Recognit..

[41]  Paul Zikopoulos,et al.  Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data , 2011 .

[42]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[43]  Richard J. Enbody,et al.  Further Research on Feature Selection and Classification Using Genetic Algorithms , 1993, ICGA.

[44]  Josef Kittler,et al.  Floating search methods in feature selection , 1994, Pattern Recognit. Lett..

[45]  Jihoon Yang,et al.  Feature Subset Selection Using a Genetic Algorithm , 1998, IEEE Intell. Syst..

[46]  Bilge Karaçali,et al.  Fast minimization of structural risk by nearest neighbor rule , 2003, IEEE Trans. Neural Networks.

[47]  Pedro M. Domingos Occam's Two Razors: The Sharp and the Blunt , 1998, KDD.

[48]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[49]  Michal Pechoucek,et al.  A framework for agent-based distributed machine learning and data mining , 2007, AAMAS '07.

[50]  C. A. Murthy,et al.  Proceedings of the First international conference on Pattern Recognition and Machine Intelligence , 2005 .

[51]  Mike Loukides,et al.  What Is Data Science , 2011 .

[52]  Glenn Fung,et al.  Multicategory Proximal Support Vector Machine Classifiers , 2005, Machine Learning.

[53]  Joseph M. Hellerstein,et al.  MAD Skills: New Analysis Practices for Big Data , 2009, Proc. VLDB Endow..

[54]  Leslie G. Valiant,et al.  A theory of the learnable , 1984, STOC '84.

[55]  T. Ravindra Babu,et al.  Compression Schemes for Mining Large Datasets: A Machine Learning Perspective , 2013 .