Data Mining: Concepts and Techniques, 3rd edition

The book Knowledge Discovery in Databases, edited by Piatetsky-Shapiro and Frawley [PSF91], is an early collection of research papers on knowledge discovery from data. The book Advances in Knowledge Discovery and Data Mining, edited by Fayyad, Piatetsky-Shapiro, Smyth, and Uthurusamy [FPSSe96], is a collection of later research results on knowledge discovery and data mining. Many data mining books have been published in recent years, including Predictive Data Mining by Weiss and Indurkhya [WI98], Data Mining Solutions: Methods and Tools for Solving Real-World Problems by Westphal and Blaxton [WB98], Mastering Data Mining: The Art and Science of Customer Relationship Management by Berry and Linoff [BL99], Building Data Mining Applications for CRM by Berson, Smith, and Thearling [BST99], Data Mining: Practical Machine Learning Tools and Techniques by Witten and Frank [WF05], Principles of Data Mining (Adaptive Computation and Machine Learning) by Hand, Mannila, and Smyth [HMS01], The Elements of Statistical Learning by Hastie, Tibshirani, and Friedman [HTF01], Data Mining: Introductory and Advanced Topics by Dunham, and Data Mining: Multimedia, Soft Computing, and Bioinformatics by Mitra and Acharya [MA03]. There are also books containing collections of papers on particular aspects of knowledge discovery, such as Machine Learning and Data Mining: Methods and Applications, edited by Michalski, Bratko, and Kubat [MBK98], and Relational Data Mining, edited by Dzeroski and Lavrac [De01], as well as many tutorial notes on data mining from major database, data mining, and machine learning conferences.

[1]  Eli Upfal,et al.  Stochastic models for the Web graph , 2000, Proceedings 41st Annual Symposium on Foundations of Computer Science.

[2]  Rajeev Motwani,et al.  Computing Iceberg Queries Efficiently , 1998, VLDB.

[3]  Ronald L. Rivest,et al.  Inferring Decision Trees Using the Minimum Description Length Principle , 1989, Inf. Comput..

[4]  John F. Roddick,et al.  An Updated Bibliography of Temporal, Spatial, and Spatio-temporal Data Mining Research , 2000, TSDM.

[5]  Duncan J. Watts,et al.  Six Degrees: The Science of a Connected Age , 2003 .

[6]  David J. DeWitt,et al.  Equi-depth multidimensional histograms , 1988, SIGMOD '88.

[7]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1991 .

[8]  Jiawei Han,et al.  Metarule-Guided Mining of Multi-Dimensional Association Rules Using Data Cubes , 1997, KDD.

[9]  Divesh Srivastava,et al.  Answering Queries with Aggregation Using Views , 1996, VLDB.

[10]  Alberto O. Mendelzon,et al.  Querying the World Wide Web , 1996, Fourth International Conference on Parallel and Distributed Information Systems.

[11]  Theodore Johnson,et al.  Mining database structure; or, how to build a data quality browser , 2002, SIGMOD '02.

[12]  Lawrence B. Holder,et al.  Knowledge discovery in molecular biology: Identifying structural regularities in proteins , 1999, Intell. Data Anal..

[13]  Jeffrey Scott Vitter,et al.  Data cube approximation and histograms via wavelets , 1998, CIKM '98.

[14]  Philip S. Yu,et al.  An effective hash-based algorithm for mining association rules , 1995, SIGMOD '95.

[15]  Dragutin Petkovic,et al.  Query by Image and Video Content: The QBIC System , 1995, Computer.

[16]  Kyuseok Shim,et al.  Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time-Series Databases , 1995, VLDB.

[17]  Michael J. Franklin,et al.  Streaming Queries over Streaming Data , 2002, VLDB.

[18]  Jiawei Han,et al.  A fast distributed algorithm for mining association rules , 1996, Fourth International Conference on Parallel and Distributed Information Systems.

[19]  A. Guttman,et al.  R-trees: a dynamic index structure for spatial searching , 1984 .

[20]  Yannis E. Ioannidis,et al.  Selectivity Estimation Without the Attribute Value Independence Assumption , 1997, VLDB.

[21]  R. Higgins Analysis for Financial Management , 2004 .

[22]  Erik Thomsen,et al.  OLAP Solutions - Building Multidimensional Information Systems , 1997 .

[23]  Barbara Hubbard,et al.  The World According to Wavelets , 1996 .

[24]  David J. DeWitt,et al.  NiagaraCQ: a scalable continuous query system for Internet databases , 2000, SIGMOD 2000.

[25]  John F. Roddick,et al.  On the impact of knowledge discovery and data mining , 2000 .

[26]  Daniel S. Hirschberg,et al.  The Time Complexity of Decision Tree Induction , 1995 .

[27]  Ricardo Baeza-Yates,et al.  Information Retrieval: Data Structures and Algorithms , 1992 .

[28]  Peter Clark,et al.  The CN2 Induction Algorithm , 1989, Machine Learning.

[29]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[30]  Johannes Gehrke,et al.  RainForest—A Framework for Fast Decision Tree Construction of Large Datasets , 1998, Data Mining and Knowledge Discovery.

[31]  Giulia Pagallo,et al.  Learning DNF by Decision Trees , 1989, IJCAI.

[32]  E. R. Bareiss,et al.  Protos: An Exemplar-Based Learning Apprentice , 1988, Int. J. Man Mach. Stud..

[33]  Hamid Pirahesh,et al.  Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals , 1996, Data Mining and Knowledge Discovery.

[34]  Philip S. Yu,et al.  A new framework for itemset generation , 1998, PODS '98.

[35]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[36]  Christos Faloutsos,et al.  Access methods for text , 1985, CSUR.

[37]  Jiawei Han,et al.  Efficient mining of partial periodic patterns in time series database , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[38]  Jorma Rissanen,et al.  SLIQ: A Fast Scalable Classifier for Data Mining , 1996, EDBT.

[39]  S. Avner Discovery of comprehensible symbolic rules in a neural network , 1995, Proceedings First International Symposium on Intelligence in Neural and Biological Systems. INBS'95.

[40]  Vic Barnett,et al.  Outliers in Statistical Data , 1980 .

[41]  Christian Borgelt,et al.  Mining molecular fragments: finding relevant substructures of molecules , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[42]  Helen J. Wang,et al.  Online aggregation , 1997, SIGMOD '97.

[43]  Charu C. Aggarwal,et al.  On the design and quantification of privacy preserving data mining algorithms , 2001, PODS.

[44]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[45]  Yixin Chen,et al.  Multi-Dimensional Regression Analysis of Time-Series Data Streams , 2002, VLDB.

[46]  Ramakrishnan Srikant,et al.  Mining Association Rules with Item Constraints , 1997, KDD.

[47]  Jiawei Han,et al.  Community Mining from Multi-relational Networks , 2005, PKDD.

[48]  Vipin Kumar,et al.  Introduction to Data Mining , 2022, Data Mining and Machine Learning Applications.

[49]  Phyllis Koton,et al.  Reasoning about Evidence in Causal Explanations , 1988, AAAI.

[50]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[51]  Jude W. Shavlik,et al.  Using neural networks for data mining , 1997, Future Gener. Comput. Syst..

[52]  Jiawei Han,et al.  Maintenance of discovered association rules in large databases: an incremental updating technique , 1996, Proceedings of the Twelfth International Conference on Data Engineering.

[53]  Ramakrishnan Srikant,et al.  Mining Sequential Patterns: Generalizations and Performance Improvements , 1996, EDBT.

[54]  Jude W. Shavlik,et al.  Extracting Refined Rules from Knowledge-Based Neural Networks , 1993, Machine Learning.

[55]  C. J. Huberty,et al.  Applied Discriminant Analysis , 1994 .

[56]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[57]  Edward R. Tufte,et al.  Envisioning Information , 1990 .

[58]  Sean R. Eddy,et al.  Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids , 1998 .

[59]  Finn Verner Jensen,et al.  Introduction to Bayesian Networks , 2008, Innovations in Bayesian Networks.

[60]  Jon M. Kleinberg,et al.  Applications of linear algebra in information retrieval and hypertext analysis , 1999, PODS '99.

[61]  Philip S. Yu,et al.  Mining concept-drifting data streams using ensemble classifiers , 2003, KDD '03.

[62]  Jacques Bertin,et al.  Graphics and graphic information-processing , 1981 .

[63]  A. Agresti An introduction to categorical data analysis , 1997 .

[64]  Edward R. Tufte Visual explanations: images and quantities, evidence and narrative , 1997 .

[65]  Sreerama K. Murthy,et al.  Automatic Construction of Decision Trees from Data: A Multi-Disciplinary Survey , 1998, Data Mining and Knowledge Discovery.

[66]  S. Muthukrishnan,et al.  Data streams: algorithms and applications , 2005, SODA '03.

[67]  Qiming Chen,et al.  PrefixSpan: mining sequential patterns efficiently by prefix-projected pattern growth , 2001, Proceedings 17th International Conference on Data Engineering.

[68]  Philip S. Yu,et al.  Fast algorithms for projected clustering , 1999, SIGMOD '99.

[69]  Paul S. Bradley,et al.  Compressed data cubes for OLAP aggregate query approximation on continuous dimensions , 1999, KDD '99.

[70]  G. V. Kass An Exploratory Technique for Investigating Large Quantities of Categorical Data , 1980 .

[71]  Federico Girosi,et al.  An improved training algorithm for support vector machines , 1997, Neural Networks for Signal Processing VII. Proceedings of the 1997 IEEE Signal Processing Society Workshop.

[72]  Jiawei Han,et al.  TFP: an efficient algorithm for mining top-k frequent closed itemsets , 2005, IEEE Transactions on Knowledge and Data Engineering.

[73]  Heikki Mannila,et al.  Finding interesting rules from large sets of discovered association rules , 1994, CIKM '94.

[74]  Hiroshi Motoda,et al.  Feature Selection for Knowledge Discovery and Data Mining , 1998, The Springer International Series in Engineering and Computer Science.

[75]  Anthony K. H. Tung,et al.  Constraint-based clustering in large databases , 2001, ICDT.

[76]  Donald E. Brown,et al.  A comparison of decision tree classifiers with backpropagation neural networks for multimodal classification problems , 1992, Pattern Recognit..

[77]  Philipp Slusallek,et al.  Introduction to real-time ray tracing , 2005, SIGGRAPH Courses.

[78]  Jiawei Han,et al.  SeqIndex: Indexing Sequences by Sequential Pattern Analysis , 2005, SDM.

[79]  Herbert A. Simon,et al.  Scientific discovery: computational explorations of the creative process , 1987 .

[80]  Yi Zhang,et al.  Entropy-based subspace clustering for mining numerical data , 1999, KDD '99.

[81]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[82]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[83]  Marvin Minsky,et al.  Perceptrons: An Introduction to Computational Geometry , 1969 .

[84]  H. Edelsbrunner,et al.  Efficient algorithms for agglomerative hierarchical clustering methods , 1984 .

[85]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[86]  Nimrod Megiddo,et al.  Discovery-Driven Exploration of OLAP Data Cubes , 1998, EDBT.

[87]  J. Nadal,et al.  Learning in feedforward layered networks: the tiling algorithm , 1989 .

[88]  Stephen Northcutt,et al.  Network intrusion detection , 2003 .

[89]  Ryszard S. Michalski,et al.  AQ15: Incremental Learning of Attribute-Based Descriptions from Examples: The Method and User's Guide , 1986 .

[90]  F. Rosenblatt,et al.  The perceptron: a probabilistic model for information storage and organization in the brain , 1958, Psychological review.

[91]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1972 .

[92]  David Loshin Enterprise knowledge management: the data quality approach , 2000 .

[93]  Philip S. Yu,et al.  CrossMine: Efficient Classification Across Multiple Database Relations , 2004, Constraint-Based Mining and Inductive Databases.

[94]  Jiawei Han,et al.  Discovery of Multiple-Level Association Rules from Large Databases , 1995, VLDB.

[95]  Jiawei Han,et al.  Exploration of the power of attribute-oriented induction in data mining , 1995, KDD 1995.

[96]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[97]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[98]  Ehud Gudes,et al.  Computing frequent graph patterns from semistructured data , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[99]  Lotfi A. Zadeh,et al.  Fuzzy Sets , 1996, Inf. Control..

[100]  Rakesh Agrawal,et al.  Parallel Mining of Association Rules: Design, Implementation and Experience , 1999 .

[101]  J. Neter,et al.  Applied Linear Statistical Models (3rd ed.). , 1992 .

[102]  F. Ramsey,et al.  The statistical sleuth : a course in methods of data analysis , 2002 .

[103]  Hannu Toivonen,et al.  Sampling Large Databases for Association Rules , 1996, VLDB.

[104]  Larry P. English Improving Data Warehouse and Business Information Quality: Methods for Reducing Costs and Increasing Profits , 1999 .

[105]  D. Watts,et al.  Small Worlds: The Dynamics of Networks between Order and Randomness , 2001 .

[106]  Inderpal Singh Mumick,et al.  Selection of views to materialize in a data warehouse , 1997, IEEE Transactions on Knowledge and Data Engineering.

[107]  Mohammed J. Zaki,et al.  CHARM: An Efficient Algorithm for Closed Itemset Mining , 2002, SDM.

[108]  Philip J. Stone,et al.  Experiments in induction , 1966 .

[109]  Michelangelo Ceci,et al.  Mining Model Trees: A Multi-relational Approach , 2003, ILP.

[110]  Huan Liu,et al.  Chi2: feature selection and discretization of numeric attributes , 1995, Proceedings of 7th IEEE International Conference on Tools with Artificial Intelligence.

[111]  Surajit Chaudhuri,et al.  An overview of data warehousing and OLAP technology , 1997, SGMD.

[112]  Mohammed J. Zaki Scalable Algorithms for Association Mining , 2000, IEEE Trans. Knowl. Data Eng..

[113]  Hongjun Lu,et al.  NeuroRule: A Connectionist Approach to Data Mining , 1995, VLDB.

[114]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[115]  Jiawei Han,et al.  gSpan: graph-based substructure pattern mining , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[116]  Yasuhiko Morimoto,et al.  Computing Optimized Rectilinear Regions for Association Rules , 1997, KDD.

[117]  Rafael C. González,et al.  Local Determination of a Moving Contrast Edge , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[118]  Thomas C. Redman,et al.  Data Quality Management and Technology , 1992 .

[119]  Mark Sullivan,et al.  Quasi-cubes: exploiting approximations in multidimensional databases , 1997, SGMD.

[120]  Mark Buchanan,et al.  Nexus: Small Worlds and the Groundbreaking Science of Networks , 2002 .

[121]  Chris Clifton,et al.  Privacy-preserving k-means clustering over vertically partitioned data , 2003, KDD '03.

[122]  Wei-Ying Ma,et al.  Locality preserving indexing for document representation , 2004, SIGIR '04.

[123]  G. Reinsel,et al.  Introduction to Mathematical Statistics (4th ed.). , 1980 .

[124]  Umeshwar Dayal,et al.  Multi-dimensional sequential pattern mining , 2001, CIKM '01.

[125]  Manoranjan Dash,et al.  Dimensionality reduction of unsupervised data , 1997, Proceedings Ninth IEEE International Conference on Tools with Artificial Intelligence.

[126]  Johannes Gehrke,et al.  MAFIA: a maximal frequent itemset algorithm for transactional databases , 2001, Proceedings 17th International Conference on Data Engineering.

[127]  Benjamin Van Roy,et al.  Solving Data Mining Problems Through Pattern Recognition , 1997 .

[128]  Takashi Washio,et al.  State of the art of graph-based data mining , 2003, SKDD.

[129]  Gerald Salton,et al.  Automatic text processing , 1988 .

[130]  R. Shanmugam Introduction to Time Series and Forecasting , 1997 .

[131]  Laks V. S. Lakshmanan,et al.  QC-trees: an efficient summary structure for semantic OLAP , 2003, SIGMOD '03.

[132]  Matthew Richardson,et al.  Mining knowledge-sharing sites for viral marketing , 2002, KDD.

[133]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[134]  Saso Dzeroski,et al.  Multi-relational data mining: an introduction , 2003, SKDD.

[135]  Jiawei Han,et al.  Efficient and Effective Clustering Methods for Spatial Data Mining , 1994, VLDB.

[136]  Sergio A. Alvarez,et al.  Efficient Adaptive-Support Association Rule Mining for Recommender Systems , 2004, Data Mining and Knowledge Discovery.

[137]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[138]  Jiawei Han,et al.  Attribute-Oriented Induction in Relational Databases , 1991, Knowledge Discovery in Databases.

[139]  Sudipto Guha,et al.  ROCK: a robust clustering algorithm for categorical attributes , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[140]  W. Hays,et al.  Statistics (3rd ed.). , 1982 .

[141]  Divesh Srivastava,et al.  On computing correlated aggregates over continual data streams , 2001, SIGMOD '01.

[142]  Ben Taskar,et al.  Learning Probabilistic Models of Relational Structure , 2001, ICML.

[143]  Rupert G. Miller,et al.  Survival Analysis , 2022, The SAGE Encyclopedia of Research Design.

[144]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[145]  Luc De Raedt,et al.  Top-down induction of logical decision trees , 1997 .

[146]  Igor Kononenko,et al.  On Biases in Estimating Multi-Valued Attributes , 1995, IJCAI.

[147]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[148]  Huan Liu,et al.  Discretization: An Enabling Technique , 2002, Data Mining and Knowledge Discovery.

[149]  Nicolas Pasquier,et al.  Discovering Frequent Closed Itemsets for Association Rules , 1999, ICDT.

[150]  Charles Elkan,et al.  Boosting and Naive Bayesian learning , 1997 .

[151]  Clement T. Yu,et al.  Principles of Database Query Processing for Advanced Applications , 1997 .

[152]  Wynne Hsu,et al.  Integrating Classification and Association Rule Mining , 1998, KDD.

[153]  Vasant Dhar,et al.  Abstract-Driven Pattern Discovery in Databases , 1992, IEEE Trans. Knowl. Data Eng..

[154]  Nicholas J. Belkin,et al.  Information filtering and information retrieval: two sides of the same coin? , 1992, CACM.

[155]  Yann LeCun,et al.  Optimal Brain Damage , 1989, NIPS.

[156]  Casimir A. Kulikowski,et al.  Computer Systems That Learn: Classification and Prediction Methods from Statistics, Neural Nets, Machine Learning and Expert Systems , 1990 .

[157]  Wojciech Ziarko,et al.  The Discovery, Analysis, and Representation of Data Dependencies in Databases , 1991, Knowledge Discovery in Databases.

[158]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[159]  Madhuri S. Mulekar Data Mining: Multimedia, Soft Computing, and Bioinformatics , 2004, Technometrics.

[160]  Chris Clifton,et al.  SECURITY AND PRIVACY IMPLICATIONS OF DATA MINING , 1996 .

[161]  Tong Zhang,et al.  Text Mining: Predictive Methods for Analyzing Unstructured Information , 2004 .

[162]  George H. John Behind-the-scenes data mining: a report on the KDD-98 panel , 1999, SKDD.

[163]  Jiong Yang,et al.  SPIN: mining maximal frequent subgraphs from graph databases , 2004, KDD.

[164]  Jennifer Widom,et al.  Continuous queries over data streams , 2001, SGMD.

[165]  Tian Zhang,et al.  BIRCH: an efficient data clustering method for very large databases , 1996, SIGMOD '96.

[166]  Andreas D. Baxevanis,et al.  Bioinformatics - a practical guide to the analysis of genes and proteins , 2001, Methods of biochemical analysis.

[167]  Hiroshi Motoda,et al.  Feature Extraction, Construction and Selection: A Data Mining Perspective , 1998 .

[168]  Kenneth A. Ross,et al.  Complex Aggregation at Multiple Granularities , 1998, EDBT.

[169]  Janet L. Kolodner,et al.  Case-Based Reasoning , 1989, IJCAI 1989.

[170]  Christos Faloutsos,et al.  Fast subsequence matching in time-series databases , 1994, SIGMOD '94.

[171]  Jerome H. Friedman,et al.  A Recursive Partitioning Decision Rule for Nonparametric Classification , 1977, IEEE Transactions on Computers.

[172]  Kyuseok Shim,et al.  PUBLIC: A Decision Tree Classifier that Integrates Building and Pruning , 1998, Data Mining and Knowledge Discovery.

[173]  Shashi Shekhar,et al.  Spatial Databases - Accomplishments and Research Needs , 1999, IEEE Trans. Knowl. Data Eng..

[174]  Jon M. Kleinberg,et al.  The Web as a Graph: Measurements, Models, and Methods , 1999, COCOON.

[175]  Howard J. Hamilton,et al.  Efficient Attribute-Oriented Generalization for Knowledge Discovery from Large Databases , 1998, IEEE Trans. Knowl. Data Eng..

[176]  Jack E. Olson,et al.  Data Quality: The Accuracy Dimension , 2003 .

[177]  Jon Louis Bentley,et al.  Quad trees a data structure for retrieval on composite keys , 1974, Acta Informatica.

[178]  Andreas Wierse,et al.  Information Visualization in Data Mining and Knowledge Discovery , 2001 .

[179]  Jiawei Han,et al.  Mining closed relational graphs with connectivity constraints , 2005, 21st International Conference on Data Engineering (ICDE'05).

[180]  Lawrence B. Holder,et al.  Substructure Discovery in the SUBDUE System , 1994, KDD Workshop.

[181]  Sridhar Ramaswamy,et al.  On the Discovery of Interesting Patterns in Association Rules , 1998, VLDB.

[182]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[183]  Laks V. S. Lakshmanan,et al.  Optimization of constrained frequent set queries with 2-variable constraints , 1999, SIGMOD '99.

[184]  Zbigniew Michalewicz,et al.  Genetic Algorithms + Data Structures = Evolution Programs , 1996, Springer Berlin Heidelberg.

[185]  Ramakrishnan Srikant,et al.  Mining generalized association rules , 1995, Future Gener. Comput. Syst..

[186]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[187]  Michael Stonebraker,et al.  Readings in Database Systems , 1988 .

[188]  Henrik Madsen,et al.  Introduction to Generalized Linear Models , 2012 .

[189]  John C. Russ,et al.  The image processing handbook (3. ed.) , 1995 .

[190]  Christopher K. Riesbeck,et al.  Inside Case-Based Reasoning , 1989 .

[191]  David W. Embley,et al.  Record-boundary discovery in Web documents , 1999, SIGMOD '99.

[192]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.

[193]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM algorithm plus discussions on the paper , 1977 .

[194]  Pedro M. Domingos Mining Social Networks for Viral Marketing , 2022 .

[195]  Teuvo Kohonen,et al.  Self-organization and associative memory: 3rd edition , 1989 .

[196]  Edward R. Tufte,et al.  The Visual Display of Quantitative Information , 1986 .

[197]  Jennifer Widom,et al.  Research problems in data warehousing , 1995, CIKM '95.

[198]  Jiawei Han,et al.  GeoMiner: a system prototype for spatial data mining , 1997, SIGMOD '97.

[199]  Pedro M. Domingos The RISE system: conquering without separating , 1994, Proceedings Sixth International Conference on Tools with Artificial Intelligence. TAI 94.

[200]  J. Ross Quinlan,et al.  Unknown Attribute Values in Induction , 1989, ML.

[201]  Sankar K. Pal,et al.  Fuzzy models for pattern recognition : methods that search for structures in data , 1992 .

[202]  Qiang Yang,et al.  Plan Mining by Divide-and-Conquer , 1999, 1999 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery.

[203]  Jiawei Han,et al.  DBMiner: A System for Mining Knowledge in Large Relational Databases , 1996, KDD.

[204]  Mohammed J. Zaki Efficient enumeration of frequent sequences , 1998, CIKM '98.

[205]  Alberto O. Mendelzon,et al.  Database techniques for the World-Wide Web: a survey , 1998, SGMD.

[206]  Z. Pawlak Rough Sets: Theoretical Aspects of Reasoning about Data , 1991 .

[207]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[208]  Lise Getoor,et al.  Iterative record linkage for cleaning and integration , 2004, DMKD '04.

[209]  Raymond T. Ng,et al.  Finding Aggregate Proximity Relationships and Commonalities in Spatial Data Mining , 1996, IEEE Trans. Knowl. Data Eng..

[210]  Arie Shoshani,et al.  OLAP and statistical databases: similarities and differences , 1997, PODS '97.

[211]  Alberto O. Mendelzon,et al.  Similarity-based queries for time series data , 1997, SIGMOD '97.

[212]  Stuart J. Russell,et al.  Local Learning in Probabilistic Networks with Hidden Variables , 1995, IJCAI.

[213]  Yossi Matias,et al.  New sampling-based summary statistics for improving approximate query answers , 1998, SIGMOD '98.

[214]  Laks V. S. Lakshmanan,et al.  Mining frequent itemsets with convertible constraints , 2001, Proceedings 17th International Conference on Data Engineering.

[215]  Joshua Zhexue Huang,et al.  Extensions to the k-Means Algorithm for Clustering Large Data Sets with Categorical Values , 1998, Data Mining and Knowledge Discovery.

[216]  Jiawei Han,et al.  Meta-Rule-Guided Mining of Association Rules in Relational Databases , 1995, KDOOD/TDOOD.

[217]  Sebastian Thrun,et al.  Text Classification from Labeled and Unlabeled Documents using EM , 2000, Machine Learning.

[218]  M. Stone Cross‐Validatory Choice and Assessment of Statistical Predictions , 1976 .

[219]  Philip S. Yu,et al.  Mining Asynchronous Periodic Patterns in Time Series Data , 2003, IEEE Trans. Knowl. Data Eng..

[220]  Ronald K. Klimberg,et al.  Applications of Data Mining , 2007 .

[221]  David Konopnicki,et al.  W3QS: A Query System for the World-Wide Web , 1995, VLDB.

[222]  Leonid Khachiyan,et al.  Cubegrades: Generalizing Association Rules , 2002, Data Mining and Knowledge Discovery.

[223]  D. Ellis Visual explanations: Images and quantities , 1997 .

[224]  Thomas G. Dietterich,et al.  Bioinformatics The Machine Learning Approach 2nd ed. , 2001 .

[225]  Shigeo Abe  Pattern Classification , 2001, Springer London.

[226]  Dimitrios Gunopulos,et al.  Automatic subspace clustering of high dimensional data for data mining applications , 1998, SIGMOD '98.

[227]  Ben Taskar,et al.  Probabilistic Classification and Clustering in Relational Data , 2001, IJCAI.

[228]  Ingrid Daubechies,et al.  Ten Lectures on Wavelets , 1992 .

[229]  Sudipto Guha,et al.  Clustering data streams , 2000, Proceedings 41st Annual Symposium on Foundations of Computer Science.

[230]  Soumen Chakrabarti,et al.  Integrating the document object model with hyperlinks for enhanced topic distillation and information extraction , 2001, WWW '01.

[231]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[232]  Saul Greenberg,et al.  How people revisit web pages: empirical findings and implications for the design of history systems , 1997, Int. J. Hum. Comput. Stud..

[233]  Jiawei Han,et al.  Star-Cubing: Computing Iceberg Cubes by Top-Down and Bottom-Up Integration , 2003, Very Large Data Bases Conference.

[234]  Anders Krogh,et al.  Introduction to the theory of neural computation , 1994, The advanced book program.

[235]  Madhu Sudan,et al.  A statistical perspective on data mining , 1997, Future Gener. Comput. Syst..

[236]  Yves Chauvin,et al.  Backpropagation: theory, architectures, and applications , 1995 .

[237]  Wray L. Buntine,et al.  A Further Comparison of Splitting Rules for Decision-Tree Induction , 1992, Machine Learning.

[238]  Laks V. S. Lakshmanan,et al.  A declarative language for querying and restructuring the Web , 1996, Proceedings RIDE '96. Sixth International Workshop on Research Issues in Data Engineering.

[239]  Raghu Ramakrishnan,et al.  Bottom-up computation of sparse and Iceberg CUBE , 1999 .

[240]  Richard A. Davis,et al.  Introduction to time series and forecasting , 1998 .

[241]  Jeffrey Scott Vitter,et al.  Random sampling with a reservoir , 1985, TOMS.

[242]  Ralph Kimball,et al.  The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling , 1996 .

[243]  J. Ross Quinlan,et al.  Bagging, Boosting, and C4.5 , 1996, AAAI/IAAI, Vol. 1.

[244]  William Frawley,et al.  Knowledge Discovery in Databases , 1991 .

[245]  John F. Roddick,et al.  Geographic Data Mining and Knowledge Discovery , 2001 .

[246]  Agnar Aamodt,et al.  Case-Based Reasoning: Foundational Issues, Methodological Variations, and System Approaches , 1994, AI Commun..

[247]  Melanie Mitchell,et al.  An introduction to genetic algorithms , 1996 .

[248]  Jie Wu,et al.  Small Worlds: The Dynamics of Networks between Order and Randomness , 2003 .

[249]  Douglas H. Fisher,et al.  A Case Study of Incremental Concept Induction , 1986, AAAI.

[250]  Raghu Ramakrishnan,et al.  Database Management Systems , 1976 .

[251]  Jay L. Devore,et al.  Probability and statistics for engineering and the sciences , 1982 .

[252]  Jiawei Han,et al.  Document clustering using locality preserving indexing , 2005, IEEE Transactions on Knowledge and Data Engineering.

[253]  Robert L. Grossman,et al.  Data Mining for Scientific and Engineering Applications , 2001, Massive Computing.

[254]  Chabane Djeraba Data mining from multimedia , 2007, Int. J. Parallel Emergent Distributed Syst..

[255]  Sharon L. Milgram,et al.  The Small World Problem , 1967 .

[256]  Stephen Muggleton,et al.  Efficient Induction of Logic Programs , 1990, ALT.

[257]  Jiawei Han,et al.  CPAR: Classification based on Predictive Association Rules , 2003, SDM.

[258]  Teuvo Kohonen,et al.  Self-organized formation of topologically correct feature maps , 2004, Biological Cybernetics.

[259]  Richard Y. Wang,et al.  Anchoring data quality dimensions in ontological foundations , 1996, CACM.

[260]  Chris Clifton,et al.  Defining Privacy for Data Mining , 2002 .

[261]  Aidong Zhang,et al.  WaveCluster: A Multi-Resolution Clustering Approach for Very Large Spatial Databases , 1998, VLDB.

[262]  P. Utgoff,et al.  Multivariate Decision Trees , 1995, Machine Learning.

[263]  Jian Pei,et al.  Efficient computation of Iceberg cubes with complex measures , 2001, SIGMOD '01.

[264]  David W. Aha,et al.  Tolerating Noisy, Irrelevant and Novel Attributes in Instance-Based Learning Algorithms , 1992, Int. J. Man Mach. Stud..

[265]  Mathias Kirsten,et al.  Relational Distance-Based Clustering , 1998, ILP.

[266]  Hans-Peter Kriegel,et al.  VisDB: database exploration using multidimensional visualization , 1994, IEEE Computer Graphics and Applications.

[267]  Gregory F. Cooper,et al.  The Computational Complexity of Probabilistic Inference Using Bayesian Belief Networks , 1990, Artif. Intell..

[268]  Thomas Gärtner,et al.  Kernels and Distances for Structured Data , 2004, Machine Learning.

[269]  Jeffrey C. Schlimmer Learning and Representation Change , 1987, AAAI.

[270]  Hans-Peter Kriegel,et al.  Algorithms for Characterization and Trend Detection in Spatial Databases , 1998, KDD.

[271]  John Mingers,et al.  An Empirical Comparison of Pruning Methods for Decision Tree Induction , 1989, Machine Learning.

[272]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[273]  Vladimir Vapnik,et al.  On the uniform convergence of relative frequencies of events to their probabilities , 1971 .

[274]  Willi Klösgen,et al.  A Support System for Interpreting Statistical Data , 1991, Knowledge Discovery in Databases.

[275]  Jian Pei,et al.  CLOSET: An Efficient Algorithm for Mining Frequent Closed Itemsets , 2000, ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery.

[276]  Ryszard S. Michalski,et al.  Data-Driven Constructive Induction: A Methodology and its Applications , 1998 .

[277]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[278]  Wei-Ying Ma,et al.  Organizing WWW images based on the analysis of page layout and Web link structure , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME).

[279]  I. Bratko,et al.  Learning decision rules in noisy domains , 1987 .

[280]  Hans-Peter Kriegel,et al.  OPTICS: ordering points to identify the clustering structure , 1999, SIGMOD '99.

[281]  Mathias Kirsten,et al.  Extending K-Means Clustering to First-Order Representations , 2000, ILP.

[282]  S. Pizer,et al.  The Image Processing Handbook , 1994 .

[283]  Philip S. Yu,et al.  Data Mining: An Overview from a Database Perspective , 1996, IEEE Trans. Knowl. Data Eng..

[284]  S. Wasserman,et al.  Models and Methods in Social Network Analysis , 2005 .

[285]  Wei Wang,et al.  Efficient mining of frequent subgraphs in the presence of isomorphism , 2003, Third IEEE International Conference on Data Mining.

[286]  Jennifer Widom,et al.  Clustering association rules , 1997, Proceedings 13th International Conference on Data Engineering.

[287]  Joseph M. Hellerstein,et al.  Potter's Wheel: An Interactive Data Cleaning System , 2001, VLDB.

[288]  Peter J. Haas,et al.  Interactive data Analysis: The Control Project , 1999, Computer.

[289]  Wei-Ying Ma,et al.  VIPS: a Vision-based Page Segmentation Algorithm , 2003 .

[290]  S. Muthukrishnan,et al.  Mining Deviants in a Time Series Database , 1999, VLDB.

[291]  Hiroki Arimura,et al.  Efficient Substructure Discovery from Large Semi-Structured Data , 2001, IEICE Trans. Inf. Syst..

[292]  Wei-Ying Ma,et al.  Hierarchical clustering of WWW image search results using visual, textual and link information , 2004, MULTIMEDIA '04.

[293]  Dan Gusfield,et al.  Algorithms on Strings, Trees, and Sequences - Computer Science and Computational Biology , 1997 .

[294]  D. Bates,et al.  Mixed-Effects Models in S and S-PLUS , 2001 .

[295]  Paul Wintz,et al.  Digital image processing (2nd ed.) , 1987 .

[296]  Jennifer Widom,et al.  Models and issues in data stream systems , 2002, PODS.

[297]  Jorma Rissanen,et al.  MDL-Based Decision Tree Pruning , 1995, KDD.

[298]  Ron Kohavi,et al.  Mining e-commerce data: the good, the bad, and the ugly , 2001, KDD '01.

[299]  Heikki Mannila,et al.  Theoretical frameworks for data mining , 2000, SKDD.

[300]  Christopher Dean,et al.  Quakefinder: A Scalable Data Mining System for Detecting Earthquakes from Space , 1996, KDD.

[301]  Keinosuke Fukunaga,et al.  Bayes Error Estimation Using Parzen and k-NN Procedures , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[302]  Dorian Pyle,et al.  Data Preparation for Data Mining , 1999 .

[303]  Gregory Piatetsky-Shapiro,et al.  Advances in Knowledge Discovery and Data Mining , 2004, Lecture Notes in Computer Science.

[304]  Dimitrios Gunopulos,et al.  Discovering similar multidimensional trajectories , 2002, Proceedings 18th International Conference on Data Engineering.

[305]  Umeshwar Dayal,et al.  FreeSpan: frequent pattern-projected sequential pattern mining , 2000, KDD '00.

[306]  Peter C. Cheeseman,et al.  Bayesian Classification (AutoClass): Theory and Results , 1996, Advances in Knowledge Discovery and Data Mining.

[307]  Thomas G. Dietterich,et al.  Learning with Many Irrelevant Features , 1991, AAAI.

[308]  Kyuseok Shim,et al.  SPIRIT: Sequential Pattern Mining with Regular Expression Constraints , 1999, VLDB.

[309]  Ivan Bratko,et al.  Machine Learning and Data Mining: Methods and Applications , 1998 .

[310]  Giri Kumar Tayi,et al.  Enhancing data quality in data warehouse environments , 1999, CACM.

[311]  W. Scott Spangler,et al.  Learning Useful Rules from Inconclusive Data , 1991, Knowledge Discovery in Databases.

[312]  A. Dobson An introduction to generalized linear models , 1990 .

[313]  Raúl E. Valdés-Pérez,et al.  Principles of Human Computer Collaboration for Knowledge Discovery in Science , 1999, Artif. Intell..

[314]  Raymond T. Ng,et al.  A Unified Notion of Outliers: Properties and Computation , 1997, KDD.

[315]  Jiawei Han,et al.  Discovering Web access patterns and trends by applying OLAP and data mining technology on Web logs , 1998, Proceedings IEEE International Forum on Research and Technology Advances in Digital Libraries -ADL'98-.

[316]  Hongjun Lu,et al.  On computing, storing and querying frequent patterns , 2003, KDD '03.

[317]  Veda C. Storey,et al.  A Framework for Analysis of Data Quality Research , 1995, IEEE Trans. Knowl. Data Eng..

[318]  Jiawei Han,et al.  MultiMediaMiner: a system prototype for multimedia data mining , 1998, SIGMOD '98.

[319]  Jon Louis Bentley,et al.  An Algorithm for Finding Best Matches in Logarithmic Expected Time , 1977, TOMS.

[320]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[321]  Jiawei Han,et al.  Discovery of Spatial Association Rules in Geographic Information Databases , 1995, SSD.

[322]  Ke Wang,et al.  Mining frequent item sets by opportunistic projection , 2002, KDD.

[323]  Jennifer Widom,et al.  The Lorel query language for semistructured data , 1997, International Journal on Digital Libraries.

[324]  Jiawei Han,et al.  MM-Cubing: computing Iceberg cubes by factorizing the lattice space , 2004, Proceedings. 16th International Conference on Scientific and Statistical Database Management, 2004.

[325]  Johannes Gehrke,et al.  Querying and mining data streams: you only get one look a tutorial , 2002, SIGMOD '02.

[326]  Wray L. Buntine Operations for Learning with Graphical Models , 1994, J. Artif. Intell. Res..

[327]  M. A. Wincek Applied Statistical Time Series Analysis , 1990 .

[328]  P. Fayers,et al.  The Visual Display of Quantitative Information , 1990 .

[329]  Lise Getoor,et al.  Link mining: a new data mining challenge , 2003, SKDD.

[330]  Wei-Ying Ma,et al.  Block-level link analysis , 2004, SIGIR '04.

[331]  James P. Egan,et al.  Signal detection theory and ROC analysis , 1975 .

[332]  Christos Faloutsos,et al.  Efficient Similarity Search In Sequence Databases , 1993, FODO.

[333]  Peter J. Haas,et al.  The New Jersey Data Reduction Report , 1997 .

[334]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems , 1988 .

[335]  Abraham Silberschatz,et al.  Database System Concepts , 1980 .

[336]  J A Swets,et al.  Measuring the accuracy of diagnostic systems. , 1988, Science.

[337]  Michael Stonebraker,et al.  Monitoring Streams - A New Class of Data Management Applications , 2002, VLDB.

[338]  Avi Pfeffer,et al.  SPOOK: A system for probabilistic object-oriented knowledge representation , 1999, UAI.

[339]  Tom M. Mitchell,et al.  Generalization as Search , 2002 .

[340]  Robert Tibshirani,et al.  An Introduction to the Bootstrap , 1994 .

[341]  William S. Cleveland,et al.  Visualizing Data , 1993 .

[342]  Jude W. Shavlik,et al.  in Advances in Neural Information Processing , 1996 .

[343]  Dimitrios Gunopulos,et al.  On-Line Discovery of Dense Areas in Spatio-temporal Databases , 2003, SSTD.

[344]  Kyuseok Shim,et al.  WALRUS: A Similarity Retrieval Algorithm for Image Databases , 2004, IEEE Trans. Knowl. Data Eng..

[345]  Sung-Hyon Myaeng,et al.  A practical hypertext categorization method using links and incrementally available class information , 2000, SIGIR '00.

[346]  Robert V. Hogg,et al.  Introduction to Mathematical Statistics. , 1966 .

[347]  Heikki Mannila,et al.  The power of sampling in knowledge discovery , 1994, PODS '94.

[348]  Cheng Yang,et al.  Efficient discovery of error-tolerant frequent itemsets in high dimensions , 2001, KDD '01.

[349]  Michael J. Pazzani,et al.  Learning Collaborative Information Filters , 1998, ICML.

[350]  Jiong Yang,et al.  STING: A Statistical Information Grid Approach to Spatial Data Mining , 1997, VLDB.

[351]  Douglas B. Terry,et al.  Using collaborative filtering to weave an information tapestry , 1992, CACM.

[352]  Pedro M. Domingos,et al.  Beyond Independence: Conditions for the Optimality of the Simple Bayesian Classifier , 1996, ICML.

[353]  Douglas B. Terry,et al.  Continuous queries over append-only databases , 1992, SIGMOD '92.

[354]  W. Loh,et al.  Tree-Structured Classification via Generalized Discriminant Analysis. , 1988 .

[355]  Isabelle Guyon,et al.  Discovering Informative Patterns and Data Cleaning , 1996, Advances in Knowledge Discovery and Data Mining.

[356]  Xifeng Yan,et al.  CloSpan: Mining Closed Sequential Patterns in Large Datasets , 2003, SDM.

[357]  Christos Faloutsos,et al.  Efficient retrieval of similar time sequences under time warping , 1998, Proceedings 14th International Conference on Data Engineering.

[358]  Alex Berson,et al.  Data Warehousing, Data Mining, and OLAP , 1997 .

[359]  Bernard Widrow,et al.  Neural networks: applications in industry, business and science , 1994, CACM.

[360]  Oliver Günther,et al.  Multidimensional access methods , 1998, CSUR.

[361]  W. Loh,et al.  SPLIT SELECTION METHODS FOR CLASSIFICATION TREES , 1997 .

[362]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[363]  Thomas Hofmann,et al.  Probabilistic latent semantic indexing , 1999, SIGIR '99.

[364]  Usama M. Fayyad,et al.  The Attribute Selection Problem in Decision Tree Generation , 1992, AAAI.

[365]  Stephen I. Gallant,et al.  Neural network learning and expert systems , 1993 .

[366]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[367]  Jiong Yang,et al.  CLUSEQ: efficient and effective sequence clustering , 2003, Proceedings 19th International Conference on Data Engineering.

[368]  Patrick Valduriez,et al.  Join indices , 1987, TODS.

[369]  Ashish Gupta,et al.  Materialized views: techniques, implementations, and applications , 1999 .

[370]  Xintao Wu,et al.  Using Loglinear Models to Compress Datacube , 2000, Web-Age Information Management.

[371]  Bei Yu,et al.  A cross-collection mixture model for comparative text mining , 2004, KDD.

[372]  George Karypis,et al.  Automated Approaches for Classifying Structures , 2002, BIOKDD.

[373]  Bernhard Schölkopf,et al.  Shrinking the Tube: A New Support Vector Regression Algorithm , 1998, NIPS.

[374]  Paul M. Aoki Generalizing "search" in generalized search trees , 1998, Proceedings 14th International Conference on Data Engineering.

[375]  Sridhar Ramaswamy,et al.  Cyclic association rules , 1998, Proceedings 14th International Conference on Data Engineering.

[376]  Dennis Shasha,et al.  Declarative Data Cleaning: Language, Model, and Algorithms , 2001, VLDB.

[377]  Philip S. Yu,et al.  Finding generalized projected clusters in high dimensional spaces , 2000, SIGMOD 2000.

[378]  Leo Katz,et al.  A new status index derived from sociometric analysis , 1953 .

[379]  Jiawei Han,et al.  Data-Driven Discovery of Quantitative Rules in Relational Databases , 1993, IEEE Trans. Knowl. Data Eng..

[380]  Michel Manago,et al.  Induction of Decision Trees from Complex Structured Data , 1991, Knowledge Discovery in Databases.

[381]  Stavros Christodoulakis,et al.  Message files , 1982, TOIS.

[382]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[383]  Jiawei Han,et al.  An Efficient Two-Step Method for Classification of Spatial Data , 1998 .

[384]  Jiawei Han,et al.  Object-Based Selective Materialization for Efficient Implementation of Spatial Data Cubes , 2000, IEEE Trans. Knowl. Data Eng..

[385]  Paul S. Bradley,et al.  Scaling Clustering Algorithms to Large Databases , 1998, KDD.

[386]  Gregory F. Cooper,et al.  A Bayesian method for the induction of probabilistic networks from data , 1992, Machine Learning.

[387]  Michael J. A. Berry,et al.  Mastering Data Mining: The Art and Science of Customer Relationship Management , 1999 .

[388]  Jonathan Pevsner,et al.  Bioinformatics and functional genomics , 2003 .

[389]  Thorsten Joachims,et al.  A statistical learning model of text classification for support vector machines , 2001, SIGIR '01.

[390]  Jacques Cohen,et al.  Bioinformatics—an introduction for computer scientists , 2004, CSUR.

[391]  Sunita Sarawagi,et al.  Integrating association rule mining with relational database systems: alternatives and implications , 1998, SIGMOD '98.

[392]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization , 1999, Advances in Kernel Methods.

[393]  Michael Stonebraker,et al.  Efficient organization of large multidimensional arrays , 1994, Proceedings of 1994 IEEE 10th International Conference on Data Engineering.

[394]  Christos Faloutsos,et al.  Prediction and indexing of moving objects with unknown motion patterns , 2004, SIGMOD '04.

[395]  Hans-Peter Kriegel,et al.  Density-Connected Sets and their Application for Trend Detection in Spatial Databases , 1997, KDD.

[396]  Shamkant B. Navathe,et al.  Mining for strong negative associations in a large database of customer transactions , 1998, Proceedings 14th International Conference on Data Engineering.

[397]  Laks V. S. Lakshmanan,et al.  Quotient Cube: How to Summarize the Semantics of a Data Cube , 2002, VLDB.

[398]  Hans-Peter Kriegel,et al.  Spatial Data Mining: A Database Approach , 1997, SSD.

[399]  Hans-Peter Kriegel,et al.  Visual classification: an interactive approach to decision tree construction , 1999, KDD '99.

[400]  Jiawei Han,et al.  IncSpan: incremental mining of sequential patterns in large database , 2004, KDD.

[401]  Jeffrey F. Naughton,et al.  An array-based algorithm for simultaneous multidimensional aggregates , 1997, SIGMOD '97.

[402]  Samuel Madden,et al.  Continuously adaptive continuous queries over streams , 2002, SIGMOD '02.

[403]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[404]  Thomas G. Dietterich,et al.  A Comparative Review of Selected Methods for Learning from Examples , 1983 .

[405]  Kun Liu,et al.  VEDAS: A Mobile and Distributed Data Stream Mining System for Real-Time Vehicle Monitoring , 2004, SDM.

[406]  Michael J. Carey,et al.  Reducing the Braking Distance of an SQL Query Engine , 1998, VLDB.

[407]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD 2000.

[408]  Laks V. S. Lakshmanan,et al.  Exploratory mining and pruning optimizations of constrained associations rules , 1998, SIGMOD '98.

[409]  Sudipto Guha,et al.  Streaming-data algorithms for high-quality clustering , 2002, Proceedings 18th International Conference on Data Engineering.

[410]  William H. Press,et al.  The Art of Scientific Computing Second Edition , 1998 .

[411]  Rakesh Agrawal,et al.  SPRINT: A Scalable Parallel Classifier for Data Mining , 1996, VLDB.

[412]  Kristian G. Olesen,et al.  Practical Issues in Modeling Large Diagnostic Systems with Multiply Sectioned Bayesian Networks , 2000, Int. J. Pattern Recognit. Artif. Intell..

[413]  Martin Ester,et al.  Frequent term-based text clustering , 2002, KDD.

[414]  Hans-Peter Kriegel,et al.  LOF: identifying density-based local outliers , 2000, SIGMOD 2000.

[415]  Ryszard S. Michalski,et al.  A Theory and Methodology of Inductive Learning , 1983, Artificial Intelligence.

[416]  Tej Anand Opportunity explorer: Navigating large databases using knowledge discovery templates , 2004, Journal of Intelligent Information Systems.

[417]  Elena Baralis,et al.  Materialized Views Selection in a Multidimensional Database , 1997, VLDB.

[418]  Petra Perner,et al.  Data Mining on Multimedia Data , 2002, Lecture Notes in Computer Science.

[419]  Tariq Samad,et al.  Designing Application-Specific Neural Networks Using the Genetic Algorithm , 1989, NIPS.

[420]  Raymond J. Mooney,et al.  Content-boosted collaborative filtering for improved recommendations , 2002, AAAI/IAAI.

[421]  J. Snoeyink,et al.  Mining Spatial Motifs from Protein Structure Graphs , 2003 .

[422]  Jinyan Li,et al.  Efficient mining of emerging patterns: discovering trends and differences , 1999, KDD '99.

[423]  A. Voisard Spatial Query Languages , 2002 .

[424]  Pierre Baldi,et al.  Bioinformatics - the machine learning approach (2. ed.) , 2000 .

[425]  Dennis Shasha,et al.  Algorithmics and applications of tree and graph searching , 2002, PODS.

[426]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[427]  Alex Berson,et al.  Building Data Mining Applications for CRM , 1999 .

[428]  Johannes Gehrke,et al.  CACTUS—clustering categorical data using summaries , 1999, KDD '99.

[429]  Dimitrios Gunopulos,et al.  Efficient Mining of Spatiotemporal Patterns , 2001, SSTD.

[430]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[431]  James A. Hendler,et al.  The Semantic Web , 2001, Scientific American.

[432]  Jiawei Han,et al.  Classifying large data sets using SVMs with hierarchical clusters , 2003, KDD '03.

[433]  Hannu Toivonen,et al.  Finding Frequent Substructures in Chemical Compounds , 1998, KDD.

[434]  Ziv Bar-Yossef,et al.  Template detection via data mining and its applications , 2002, WWW.

[435]  Richard M. Karp,et al.  A simple algorithm for finding frequent elements in streams and bags , 2003, TODS.

[436]  Oren Etzioni,et al.  Adaptive Web Sites: Conceptual Cluster Mining , 1999, IJCAI.

[437]  F. A. Seiler,et al.  Numerical Recipes in C: The Art of Scientific Computing , 1989 .

[438]  Philip S. Yu,et al.  A Framework for Projected Clustering of High Dimensional Data Streams , 2004, VLDB.

[439]  G. De Soete,et al.  Clustering and Classification , 2019, Data-Driven Science and Engineering.

[440]  Soumen Chakrabarti,et al.  Enhanced topic distillation using text, markup tags, and hyperlinks , 2001, SIGIR '01.

[441]  Douglas H. Fisher,et al.  Improving Inference through Conceptual Clustering , 1987, AAAI.

[442]  Mike James,et al.  Classification Algorithms , 1986, Encyclopedia of Machine Learning and Data Mining.

[443]  Jiawei Han,et al.  Mining Compressed Frequent-Pattern Sets , 2005, VLDB.

[444]  Salvatore J. Stolfo,et al.  Toward Multi-Strategy Parallel & Distributed Learning in Sequence Analysis , 1993, ISMB.

[445]  Raymond J. Mooney,et al.  A probabilistic framework for semi-supervised clustering , 2004, KDD.

[446]  Jiawei Han,et al.  Dynamic Generation and Refinement of Concept Hierarchies for Knowledge Discovery in Databases , 1994, KDD Workshop.

[447]  Mohammed J. Zaki Efficiently mining frequent trees in a forest , 2002, KDD.

[448]  John Scott Social Network Analysis , 1988 .

[449]  Philip S. Yu,et al.  Cross-relational clustering with user's guidance , 2005, KDD '05.

[450]  Samuel Kaski,et al.  Self organization of a massive document collection , 2000, IEEE Trans. Neural Networks Learn. Syst..

[451]  David Maxwell Chickering,et al.  Learning Bayesian Networks: The Combination of Knowledge and Statistical Data , 1994, Machine Learning.

[452]  Philip S. Yu,et al.  A Framework for Clustering Evolving Data Streams , 2003, VLDB.

[453]  Wojciech Szpankowski,et al.  An efficient algorithm for detecting frequent subgraphs in biological networks , 2004, ISMB/ECCB.

[454]  Sushil Jajodia,et al.  Mining Temporal Relationships with Multiple Granularities in Time Sequences , 1998, IEEE Data Eng. Bull..

[455]  Shashi Shekhar,et al.  Spatial Databases: A Tour , 2003 .

[456]  Heikki Mannila,et al.  Discovery of Frequent Episodes in Event Sequences , 1997, Data Mining and Knowledge Discovery.

[457]  Chris Chatfield,et al.  The Analysis of Time Series: An Introduction , 1981 .

[458]  Sunita Sarawagi,et al.  Mining Surprising Patterns Using Temporal Description Length , 1998, VLDB.

[459]  Yehuda Lindell,et al.  A Statistical Theory for Quantitative Association Rules , 1999, KDD.

[460]  S. Lauritzen The EM algorithm for graphical association models with missing data , 1995 .

[461]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[462]  John A. Hartigan,et al.  Clustering Algorithms , 1975 .

[463]  Elisa Bertino,et al.  State-of-the-art in privacy preserving data mining , 2004, SGMD.

[464]  David Zipser,et al.  Feature Discovery by Competitive Learning , 1986, Cogn. Sci..

[465]  Alexandre V. Evfimievski,et al.  Privacy preserving mining of association rules , 2002, Inf. Syst..

[466]  Philip S. Yu,et al.  Substructure similarity search in graph databases , 2005, SIGMOD '05.

[467]  Sunita Sarawagi,et al.  Intelligent Rollups in Multidimensional OLAP Data , 2001, VLDB.

[468]  Jiawei Han,et al.  Resource and Knowledge Discovery in Global Information Systems: A Preliminary Design and Experiment , 1995, KDD.

[469]  Umeshwar Dayal,et al.  A data-warehouse/OLAP framework for scalable telecommunication tandem traffic analysis , 2000, Proceedings of 16th International Conference on Data Engineering.

[470]  Paul E. Utgoff,et al.  Decision Tree Induction Based on Efficient Tree Restructuring , 1997, Machine Learning.

[471]  S. Grossberg,et al.  Pattern Recognition by Self-Organizing Neural Networks , 1991 .

[472]  E. Tufte,et al.  The visual display of quantitative information , 1984, The SAGE Encyclopedia of Research Design.

[473]  Ralf Hartmut Güting An introduction to spatial database systems , 2005, The VLDB Journal.

[474]  Jian Pei,et al.  CLOSET+: searching for the best strategies for mining frequent closed itemsets , 2003, KDD '03.

[475]  Geoff Hulten,et al.  Mining time-changing data streams , 2001, KDD '01.

[476]  Giuseppe Psaila,et al.  Querying Shapes of Histories , 1995, VLDB.

[477]  David W. Aha,et al.  Simplifying decision trees: A survey , 1997, The Knowledge Engineering Review.

[478]  Wei-Ying Ma,et al.  Block-based web search , 2004, SIGIR '04.

[479]  Philip S. Yu,et al.  Outlier detection for high dimensional data , 2001, SIGMOD '01.

[480]  Jiawei Han,et al.  High-Dimensional OLAP: A Minimal Cubing Approach , 2004, VLDB.

[481]  Gregory Piatetsky-Shapiro,et al.  Discovery, Analysis, and Presentation of Strong Rules , 1991, Knowledge Discovery in Databases.

[482]  S. Muthukrishnan,et al.  Surfing Wavelets on Streams: One-Pass Summaries for Approximate Aggregate Queries , 2001, VLDB.

[483]  Jiawei Han,et al.  Mining coherent dense subgraphs across massive biological networks for functional discovery , 2005, ISMB.

[484]  Raymond T. Ng,et al.  Algorithms for Mining Distance-Based Outliers in Large Datasets , 1998, VLDB.

[485]  Belur V. Dasarathy,et al.  Nearest neighbor (NN) norms: NN pattern classification techniques , 1991 .

[486]  Joan Feigenbaum,et al.  Factorization in Experiment Generation , 1986, AAAI.

[487]  Jörg Rech,et al.  Knowledge Discovery in Databases , 2001, Künstliche Intell..

[488]  Jon M. Kleinberg,et al.  Inferring Web communities from link topology , 1998, HYPERTEXT '98.

[489]  Johannes Fürnkranz,et al.  Incremental Reduced Error Pruning , 1994, ICML.

[491]  Kathryn B. Laskey,et al.  Network Fragments: Representing Knowledge for Constructing Probabilistic Models , 1997, UAI.

[492]  Elena Baralis,et al.  Designing Templates for Mining Association Rules , 2004, Journal of Intelligent Information Systems.

[493]  Donato Malerba,et al.  A Further Comparison of Simplification Methods for Decision-Tree Induction , 1995, AISTATS.

[494]  Michael S. Waterman,et al.  Introduction to Computational Biology: Maps, Sequences and Genomes , 1998 .

[495]  Bill Gates,et al.  Business @ the Speed of Thought: Succeeding in the Digital Economy , 2000 .

[496]  Dennis Shasha,et al.  High Performance Discovery In Time Series: Techniques And Case Studies (Monographs in Computer Science) , 2004 .

[497]  Tomasz Imielinski,et al.  MSQL: A Query Language for Database Mining , 1999, Data Mining and Knowledge Discovery.

[498]  Giuseppe Psaila,et al.  A New SQL-like Operator for Mining Association Rules , 1996, VLDB.

[499]  John Mingers,et al.  Neural Networks, Decision Tree Induction and Discriminant Analysis: an Empirical Comparison , 1994 .

[500]  Alan M. Frieze,et al.  A general model of web graphs , 2003, Random Struct. Algorithms.

[501]  João Meidanis,et al.  Introduction to computational molecular biology , 1997 .

[502]  Michael W. Berry,et al.  Survey of Text Mining: Clustering, Classification, and Retrieval , 2007 .

[503]  David J. Spiegelhalter,et al.  Machine Learning, Neural and Statistical Classification , 2009 .

[504]  Jiawei Han,et al.  MAIDS: mining alarming incidents from data streams , 2004, SIGMOD '04.

[505]  Ethem Alpaydin,et al.  Introduction to Machine Learning (Adaptive Computation and Machine Learning) , 2004 .

[506]  Jianyong Wang,et al.  Mining sequential patterns by pattern-growth: the PrefixSpan approach , 2004, IEEE Transactions on Knowledge and Data Engineering.

[507]  Pat Langley,et al.  Static Versus Dynamic Sampling for Data Mining , 1996, KDD.

[508]  Jiawei Han,et al.  Mining recurrent items in multimedia with progressive resolution refinement , 2000, Proceedings of 16th International Conference on Data Engineering.

[509]  Johannes Gehrke,et al.  BOAT—optimistic decision tree construction , 1999, SIGMOD '99.

[510]  Jack Sklansky,et al.  On Automatic Feature Selection , 1988, Int. J. Pattern Recognit. Artif. Intell..

[511]  Kenneth C. Laudon,et al.  Markets and privacy , 1993, CACM.

[512]  Kenneth A. Ross,et al.  Fast Computation of Sparse Datacubes , 1997, VLDB.

[513]  Zhaohui Tang,et al.  Data Mining with SQL Server 2005 , 2005 .

[514]  Andrzej Skowron,et al.  The Discernibility Matrices and Functions in Information Systems , 1992, Intelligent Decision Support.

[515]  Daniel A. Keim,et al.  An Efficient Approach to Clustering in Large Multimedia Databases with Noise , 1998, KDD.

[516]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[517]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[518]  Stuart J. Russell,et al.  BLOG: Probabilistic Models with Unknown Objects , 2005, IJCAI.

[519]  Valdis E. Krebs,et al.  Mapping Networks of Terrorist Cells , 2001 .

[520]  V. Barnett,et al.  Applied Linear Statistical Models , 1975 .

[521]  Agnès Voisard,et al.  Spatial Databases: With Application to GIS , 2001 .

[522]  Christos Faloutsos,et al.  Graphs over time: densification laws, shrinking diameters and possible explanations , 2005, KDD '05.

[523]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[524]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[525]  Thomas C. Redman,et al.  Data Quality: The Field Guide , 2001 .

[526]  Sunita Sarawagi,et al.  Modeling multidimensional databases , 1997, Proceedings 13th International Conference on Data Engineering.

[527]  R. Fletcher Practical Methods of Optimization , 1988 .

[528]  Jeffrey D. Ullman,et al.  Implementing data cubes efficiently , 1996, SIGMOD '96.

[529]  Alberto O. Mendelzon,et al.  WebOQL: restructuring documents, databases, and webs , 1999 .

[530]  D. Krane,et al.  Fundamental Concepts of Bioinformatics , 2002 .

[531]  John A. Major,et al.  Selecting among rules induced from a hurricane database , 1993, Journal of Intelligent Information Systems.

[532]  Christos Faloutsos,et al.  Online data mining for co-evolving time sequences , 2000, Proceedings of 16th International Conference on Data Engineering.

[533]  Heikki Mannila,et al.  Principles of Data Mining , 2001, Undergraduate Topics in Computer Science.

[534]  George Kollios,et al.  Mining, indexing, and querying historical spatiotemporal data , 2004, KDD.

[535]  Yelena Yesha,et al.  Data Mining: Next Generation Challenges and Future Directions , 2004 .

[536]  Jiawei Han,et al.  Efficient Polygon Amalgamation Methods for Spatial OLAP and Spatial Data Mining , 1999, SSD.

[537]  Goetz Graefe,et al.  Multi-table joins through bitmapped join indices , 1995, SGMD.

[538]  Anthony K. H. Tung,et al.  Carpenter: finding closed patterns in long biological datasets , 2003, KDD '03.

[539]  Aristides Gionis,et al.  Approximating a collection of frequent sets , 2004, KDD.

[540]  Philip S. Yu,et al.  Clustering through decision tree construction , 2000, CIKM '00.

[541]  Abraham Silberschatz,et al.  What Makes Patterns Interesting in Knowledge Discovery Systems , 1996, IEEE Trans. Knowl. Data Eng..

[542]  Roberto J. Bayardo,et al.  Efficiently mining long patterns from databases , 1998, SIGMOD '98.

[543]  Joseph Revelli,et al.  The Image Processing Handbook, 4th Edition , 2003, J. Electronic Imaging.

[544]  Ramakrishnan Srikant,et al.  Mining newsgroups using networks arising from social behavior , 2003, WWW '03.

[545]  Jennifer Neville,et al.  Learning relational probability trees , 2003, KDD '03.

[546]  Jaideep Srivastava,et al.  Web usage mining: discovery and applications of usage patterns from Web data , 2000, SKDD.

[547]  Joost N. Kok,et al.  A quickstart in frequent structure mining can make a difference , 2004, KDD.

[548]  Jaideep Srivastava,et al.  Selecting the right interestingness measure for association patterns , 2002, KDD.

[549]  Hongjun Lu,et al.  Condensed cube: an effective approach to reducing data cube size , 2002, Proceedings 18th International Conference on Data Engineering.

[550]  Witold Pedrycz,et al.  Data Mining Methods for Knowledge Discovery , 1998, IEEE Trans. Neural Networks.

[551]  Jesus Mena,et al.  Investigative Data Mining for Security and Criminal Detection , 2002 .

[552]  Jian Pei,et al.  Mining Multi-Dimensional Constrained Gradients in Data Cubes , 2001, VLDB.

[553]  Rakesh Agrawal,et al.  Privacy-preserving data mining , 2000, SIGMOD 2000.

[554]  Pat Langley,et al.  Elements of Machine Learning , 1995 .

[555]  Richard A. Johnson,et al.  Applied Multivariate Statistical Analysis , 1983 .

[556]  Christopher R. Westphal,et al.  Data Mining Solutions: Methods and Tools for Solving Real-World Problems , 1998 .

[557]  J. Devore,et al.  Statistics: The Exploration and Analysis of Data , 1986 .

[558]  Ralph Kimball,et al.  The Data Warehouse Lifecycle Toolkit: Expert Methods for Designing, Developing and Deploying Data Warehouses with CD Rom , 1998 .

[559]  Hans-Peter Kriegel,et al.  Knowledge Discovery in Large Spatial Databases: Focusing Techniques for Efficient Class Identification , 1995, SSD.

[560]  Jiawei Han,et al.  Towards on-line analytical mining in large databases , 1998, SGMD.

[561]  Christos Faloutsos,et al.  Advanced Database Systems , 1997, Lecture Notes in Computer Science.

[562]  I. Kononenko,et al.  Attribute Selection for Modeling , 1997 .

[563]  David G. Stork,et al.  Pattern Classification (2nd ed.) , 1999 .

[564]  David J. DeWitt,et al.  NiagaraCQ: a scalable continuous query system for Internet databases , 2000, SIGMOD '00.

[565]  R. Mike Cameron-Jones,et al.  FOIL: A Midterm Report , 1993, ECML.

[566]  Jiawei Han,et al.  BIDE: efficient mining of frequent closed sequences , 2004, Proceedings. 20th International Conference on Data Engineering.

[567]  Andrew W. Moore,et al.  Tractable group detection on large link data sets , 2003, Third IEEE International Conference on Data Mining.

[568]  Jiawei Han,et al.  Generalization-Based Data Mining in Object-Oriented Databases Using an Object Cube Model , 1998, Data Knowl. Eng..

[569]  Jon M. Kleinberg,et al.  Mining the Web's Link Structure , 1999, Computer.

[570]  J. Ross Quinlan,et al.  Simplifying Decision Trees , 1987, Int. J. Man Mach. Stud..

[571]  Lotfi A. Zadeh,et al.  Commonsense Knowledge Representation Based on Fuzzy Logic , 1983, Computer.

[572]  Tomasz Imielinski,et al.  DataMine: Application Programming Interface and Query Language for Database Mining , 1996, KDD.

[573]  Chen Wang,et al.  Scalable mining of large disk-based graph databases , 2004, KDD.

[574]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[575]  Jon M. Kleinberg,et al.  A Microeconomic View of Data Mining , 1998, Data Mining and Knowledge Discovery.

[576]  John Riedl,et al.  GroupLens: an open architecture for collaborative filtering of netnews , 1994, CSCW '94.

[577]  Carlo Zaniolo,et al.  Metaqueries for Data Mining , 1996, Advances in Knowledge Discovery and Data Mining.

[578]  ZhaoHui Tang,et al.  Building data mining solutions with OLE DB for DM and XML for analysis , 2005, SIGMOD Record.

[579]  Prabhakar Raghavan,et al.  Information retrieval algorithms: a survey , 1997, SODA '97.

[580]  Mong-Li Lee,et al.  Image Mining: Trends and Developments , 2002, Journal of Intelligent Information Systems.

[581]  Joseph L. Hellerstein,et al.  Mining partially periodic event patterns with unknown periods , 2001, Proceedings 17th International Conference on Data Engineering.

[582]  Lise Getoor,et al.  Link-Based Classification , 2003, Encyclopedia of Machine Learning and Data Mining.

[583]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[584]  Ramakrishnan Srikant,et al.  The Quest Data Mining System , 1996, KDD.

[585]  Jan-Ming Ho,et al.  Discovering informative content blocks from Web documents , 2002, KDD.

[586]  Raghu Ramakrishnan,et al.  Bottom-up computation of sparse and Iceberg CUBE , 1999, SIGMOD '99.

[587]  Ralf Hartmut Güting,et al.  An introduction to spatial database systems , 1994, VLDB J..

[588]  David M. Pennock,et al.  Statistical relational learning for document mining , 2003, Third IEEE International Conference on Data Mining.

[589]  Mohammed J. Zaki,et al.  SPADE: An Efficient Algorithm for Mining Frequent Sequences , 2004, Machine Learning.

[590]  Alain Degenne,et al.  Introducing Social Networks , 1999 .

[591]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[592]  Heikki Mannila,et al.  A database perspective on knowledge discovery , 1996, CACM.

[593]  P. Bickel,et al.  Mathematical Statistics: Basic Ideas and Selected Topics , 1977 .

[594]  J. Ross Quinlan,et al.  An Empirical Comparison of Genetic and Decision-Tree Classifiers , 1988, ML.

[595]  Julian R. Ullmann,et al.  An Algorithm for Subgraph Isomorphism , 1976, J. ACM.

[596]  Thomas G. Dietterich,et al.  Readings in Machine Learning , 1991 .

[597]  Pat Langley,et al.  Models of Incremental Concept Formation , 1990, Artif. Intell..

[598]  V. S. Subrahmanian Principles of Multimedia Database Systems , 1998 .

[599]  Hong-Ye Gao,et al.  Wavelet analysis [for signal processing] , 1996 .

[600]  George Karypis,et al.  CHAMELEON: A Hierarchical Clustering Algorithm Using Dynamic Modeling , 1999 .

[601]  Anthony K. H. Tung,et al.  Spatial clustering in the presence of obstacles , 2001, Proceedings 17th International Conference on Data Engineering.

[602]  Ronald R. Yager,et al.  Fuzzy sets, neural networks, and soft computing , 1994 .

[603]  Jeffrey F. Naughton,et al.  Materialized View Selection for Multidimensional Datasets , 1998, VLDB.

[604]  Jeffrey F. Naughton,et al.  Letter from the Special Issue Editor , 1997, IEEE Data Eng. Bull..

[605]  Michael J. A. Berry,et al.  Data Mining Techniques: For Marketing, Sales, and Customer Relationship Management , 2004 .

[606]  Philip S. Yu,et al.  Clustering by pattern similarity in large data sets , 2002, SIGMOD '02.

[607]  Wynne Hsu,et al.  Using General Impressions to Analyze Discovered Classification Rules , 1997, KDD.

[608]  Jiawei Han,et al.  Summarizing itemset patterns: a profile-based approach , 2005, KDD '05.

[609]  Alexander J. Smola,et al.  Support Vector Regression Machines , 1996, NIPS.

[610]  Prabhakar Raghavan,et al.  A Linear Method for Deviation Detection in Large Databases , 1996, KDD.

[611]  Stephen J. Wright,et al.  Numerical Optimization , 2018, Fundamental Statistical Inference.

[612]  Theodore Johnson,et al.  Exploratory Data Mining and Data Cleaning , 2003 .

[613]  J. R. Quinlan Learning With Continuous Classes , 1992 .

[614]  Philip S. Yu,et al.  Graph indexing: a frequent structure-based approach , 2004, SIGMOD '04.

[615]  Son K. Dao,et al.  Dealing with Semantic Heterogeneity by Generalization-Based Data Mining Techniques , 2007 .

[616]  David A. Padua,et al.  Parallel mining of closed sequential patterns , 2005, KDD '05.

[617]  Paul E. Green,et al.  K-modes Clustering , 2001, J. Classif..

[618]  Christos Faloutsos,et al.  FastMap: a fast algorithm for indexing, data-mining and visualization of traditional and multimedia datasets , 1995, SIGMOD '95.

[619]  Ke Wang,et al.  Building Hierarchical Classifiers Using Class Proximity , 1999, VLDB.

[620]  Usama M. Fayyad,et al.  What Should Be Minimized in a Decision Tree? , 1990, AAAI.

[621]  W. H. Inmon,et al.  Building the data warehouse , 1992 .

[622]  Matthew Richardson,et al.  Mining the network value of customers , 2001, KDD '01.

[623]  Y.-S. Shih,et al.  Families of splitting criteria for classification trees , 1999, Stat. Comput..

[624]  Ada Wai-Chee Fu,et al.  Finding Structure and Characteristics of Web Documents for Classification , 2000, ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery.

[625]  Sudipto Guha,et al.  CURE: an efficient clustering algorithm for large databases , 1998, SIGMOD '98.

[626]  Rajeev Rastogi,et al.  Processing complex aggregate queries over data streams , 2002, SIGMOD '02.

[627]  Guruprasad Madhavan Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins (3rd Edition). Edited by Andreas D. Baxevanis and B. F. Francis Ouellette. Hardcover, October 2004, 540 pages. US $79.95, ISBN: 0-471-47878-4 , 2006, Annals of Biomedical Engineering.

[628]  Tom M. Mitchell,et al.  Version Spaces: A Candidate Elimination Approach to Rule Learning , 1977, IJCAI.

[629]  J. L. Hodges,et al.  Discriminatory Analysis - Nonparametric Discrimination: Consistency Properties , 1989 .

[630]  Christopher M. Bishop,et al.  Neural networks for pattern recognition , 1995 .

[631]  David S. Stoffer,et al.  Time series analysis and its applications , 2000 .

[632]  R. Bone Discovery , 1938, Nature.

[633]  Patrick E. O'Neil,et al.  Improved query performance with variant indexes , 1997, SIGMOD '97.

[634]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[635]  David Heckerman,et al.  Bayesian Networks for Knowledge Discovery , 1996, Advances in Knowledge Discovery and Data Mining.

[636]  Raymond J. Mooney,et al.  Symbolic and Neural Learning Algorithms: An Experimental Comparison , 1991, Machine Learning.

[637]  Jiawei Han,et al.  Generalization and decision tree induction: efficient classification in data mining , 1997, Proceedings Seventh International Workshop on Research Issues in Data Engineering. High Performance Database Management for Large-Scale Applications.

[638]  Howard J. Hamilton,et al.  Knowledge discovery and measures of interest , 2001 .

[639]  Renée J. Miller,et al.  Association rules over interval data , 1997, SIGMOD '97.

[640]  J. Wootton Introduction to computational biology: Maps, sequences and genomes; Interdisciplinary statistics , 1997 .

[641]  Jennifer Widom,et al.  Database Systems: The Complete Book , 2001 .

[642]  Xuehua Shen,et al.  Context-sensitive information retrieval using implicit feedback , 2005, SIGIR '05.

[643]  Yasuhiko Morimoto,et al.  Data mining using two-dimensional optimized association rules: scheme, algorithms, and visualization , 1996, SIGMOD '96.

[644]  George Karypis,et al.  Frequent subgraph discovery , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[645]  Bradley P. Allen,et al.  Case-based reasoning: business applications , 1994, CACM.

[646]  Michael Ian Shamos,et al.  Computational geometry: an introduction , 1985 .

[647]  Padhraic Smyth,et al.  Image database exploration: progress and challenges , 1993 .

[648]  Mohammed J. Zaki,et al.  PlanMine: Sequence Mining for Plan Failures , 1998, KDD.

[649]  Usama M. Fayyad,et al.  Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning , 1993, IJCAI.

[650]  R. Nakano,et al.  Medical diagnostic expert system based on PDP model , 1988, IEEE 1988 International Conference on Neural Networks.

[651]  Albert-László Barabási,et al.  Linked - how everything is connected to everything else and what it means for business, science, and everyday life , 2003 .

[652]  Jian Pei,et al.  CMAR: accurate and efficient classification based on multiple class-association rules , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[653]  Jon Kleinberg,et al.  The link prediction problem for social networks , 2003, CIKM '03.

[654]  S. Gabel,et al.  Using Neural Networks , 2003 .

[655]  Jeffrey F. Naughton,et al.  On the Computation of Multidimensional Aggregates , 1996, VLDB.

[656]  Dennis Shasha,et al.  StatStream: Statistical Monitoring of Thousands of Data Streams in Real Time , 2002, VLDB.

[657]  Teuvo Kohonen,et al.  Self-Organization and Associative Memory, Third Edition , 1989, Springer Series in Information Sciences.

[658]  Brian D. Ripley,et al.  Pattern Recognition and Neural Networks , 1996 .

[659]  Philip S. Yu,et al.  On demand classification of data streams , 2004, KDD.

[660]  Albert-László Barabási,et al.  Emergence of scaling in random networks , 1999, Science.

[661]  Stuart L. Crawford Extensions to the CART Algorithm , 1989, Int. J. Man Mach. Stud..

[662]  Jaideep Srivastava,et al.  Web Mining — Concepts, Applications, and Research Directions , 2004 .

[663]  Wai Lam,et al.  Bayesian Network Refinement Via Machine Learning Approach , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[664]  Saso Dzeroski,et al.  Inductive Logic Programming: Techniques and Applications , 1993 .

[665]  George H. John Enhancements to the data mining process , 1997 .

[666]  Michael Sherman Probability and Statistics for Engineering and the Sciences (6th ed.) , 2006 .

[667]  R. Michalski,et al.  Learning from Observation: Conceptual Clustering , 1983 .

[668]  C. G. Hilborn,et al.  The Condensed Nearest Neighbor Rule , 1967 .

[669]  David J. Maguire,et al.  Geographical information systems : principles and applications , 1991 .

[670]  Rob Mattison,et al.  Data Warehousing and Data Mining for Telecommunications , 1997 .

[671]  Usama M. Fayyad,et al.  Automating the Analysis and Cataloging of Sky Surveys , 1996, Advances in Knowledge Discovery and Data Mining.

[672]  Jiawei Han,et al.  CloseGraph: mining closed frequent graph patterns , 2003, KDD '03.

[673]  Kotagiri Ramamohanarao,et al.  Making Use of the Most Expressive Jumping Emerging Patterns for Classification , 2001, Knowledge and Information Systems.

[674]  Huan Liu,et al.  Subspace clustering for high dimensional data: a review , 2004, SIGKDD Explorations.

[675]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[676]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[677]  Geoff Hulten,et al.  Mining high-speed data streams , 2000, KDD '00.

[678]  Wei-Yin Loh,et al.  A Comparison of Prediction Accuracy, Complexity, and Training Time of Thirty-Three Old and New Classification Algorithms , 2000, Machine Learning.

[679]  Le Gruenwald,et al.  A survey of data mining and knowledge discovery software tools , 1999, SIGKDD Explorations.

[680]  Dan Gusfield Algorithms on Strings, Trees, and Sequences - Computer Science and Computational Biology , 1997 .

[681]  Sunita Sarawagi,et al.  i3: Intelligent, Interactive Investigation of OLAP data cubes , 2000, SIGMOD Conference.

[682]  Philip S. Yu,et al.  Mining Frequent Patterns in Data Streams at Multiple Time Granularities , 2002 .

[683]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[684]  Salvatore J. Stolfo,et al.  Experiments on multistrategy learning by meta-learning , 1993, CIKM '93.

[685]  Raghu Ramakrishnan,et al.  Probabilistic Optimization of Top N Queries , 1999, VLDB.

[686]  Robert A. Jacobs,et al.  Increased rates of convergence through learning rate adaptation , 1987, Neural Networks.

[687]  Takashi Washio,et al.  An Apriori-Based Algorithm for Mining Frequent Substructures from Graph Data , 2000, PKDD.