Enhancing data analysis with noise removal
Hui Xiong | Vipin Kumar | Michael S. Steinbach | Gaurav Pandey
[1] Oliver Günther,et al. Multidimensional access methods , 1998, CSUR.
[2] Hans-Peter Kriegel,et al. LOF: identifying density-based local outliers , 2000, SIGMOD '00.
[3] Rynson W. H. Lau,et al. Knowledge and Data Engineering for e-Learning Special Issue of IEEE Transactions on Knowledge and Data Engineering , 2008 .
[4] Alexander Dekhtyar,et al. Information Retrieval , 2018, Lecture Notes in Computer Science.
[5] Leonid Portnoy,et al. Intrusion detection with unlabeled data using clustering , 2000 .
[6] Tian Zhang,et al. BIRCH: an efficient data clustering method for very large databases , 1996, SIGMOD '96.
[7] Hans-Peter Kriegel,et al. A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.
[8] Salvatore J. Stolfo,et al. Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem , 1998, Data Mining and Knowledge Discovery.
[9] Ron Kohavi,et al. Wrappers for Feature Subset Selection , 1997, Artif. Intell.
[10] Vipin Kumar,et al. Finding Clusters of Different Sizes, Shapes, and Densities in Noisy, High Dimensional Data , 2003, SDM.
[11] Charles Elkan,et al. An Efficient Domain-Independent Algorithm for Detecting Approximately Duplicate Database Records , 1997, DMKD.
[12] Raymond T. Ng,et al. Distance-based outliers: algorithms and applications , 2000, The VLDB Journal.
[13] Aidong Zhang,et al. WaveCluster: A Multi-Resolution Clustering Approach for Very Large Spatial Databases , 1998, VLDB.
[14] Stephen D. Bay,et al. Mining distance-based outliers in near linear time with randomization and a simple pruning rule , 2003, KDD '03.
[15] Ken Orr,et al. Data quality and systems theory , 1998, CACM.
[16] Chinatsu Aone,et al. Fast and effective text mining using linear-time document clustering , 1999, KDD '99.
[17] Tok Wang Ling,et al. IntelliClean: a knowledge-based intelligent data cleaner , 2000, KDD '00.
[18] P. Tan,et al. Mining Hyperclique Patterns with Confidence Pruning , 2003 .
[19] Sridhar Ramaswamy,et al. Efficient algorithms for mining outliers from large data sets , 2000, SIGMOD '00.
[20] P. Bork,et al. Functional organization of the yeast proteome by systematic analysis of protein complexes , 2002, Nature.
[21] Dennis Shasha,et al. Declarative Data Cleaning: Language, Model, and Algorithms , 2001, VLDB.
[22] Carla E. Brodley,et al. Improving automated land cover mapping by identifying and eliminating mislabeled observations from training data , 1996, IGARSS '96. 1996 International Geoscience and Remote Sensing Symposium.
[23] Carla E. Brodley,et al. Identifying Mislabeled Training Data , 1999, J. Artif. Intell. Res..
[24] Jaideep Srivastava,et al. Selecting the right objective measure for association analysis , 2004, Inf. Syst..
[25] Victoria J. Hodge,et al. A Survey of Outlier Detection Methodologies , 2004, Artificial Intelligence Review.
[26] Martin F. Porter,et al. An algorithm for suffix stripping , 1997, Program.
[27] Clara Pizzuti,et al. Fast Outlier Detection in High Dimensional Spaces , 2002, PKDD.
[28] Vipin Kumar,et al. WebACE: a Web agent for document categorization and exploration , 1998, AGENTS '98.
[29] Tomasz Imielinski,et al. Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.
[30] D. Botstein,et al. Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.
[31] Salvatore J. Stolfo,et al. The merge/purge problem for large databases , 1995, SIGMOD '95.
[33] Anil K. Jain,et al. Algorithms for Clustering Data , 1988 .
[34] Vipin Kumar,et al. Introduction to Data Mining (First Edition) , 2005 .
[35] Hans-Peter Kriegel,et al. Density-Based Clustering in Spatial Databases: The Algorithm GDBSCAN and Its Applications , 1998, Data Mining and Knowledge Discovery.
[36] Xiaoli Li,et al. Eliminating noisy information in Web pages for data mining , 2003, KDD '03.
[37] Yiming Yang,et al. Noise reduction in a statistical approach to text categorization , 1995, SIGIR '95.
[38] Rakesh Agrawal,et al. Mining association rules between sets of items in large databases , 1993 .
[39] Thomas Redman,et al. The impact of poor data quality on the typical enterprise , 1998, CACM.
[40] Thomas C. Redman. The impact of poor data quality on the typical enterprise , 1998 .
[41] Dennis Shasha,et al. AJAX: an extensible data cleaning tool , 2000, SIGMOD '00.
[42] Sudipto Guha,et al. CURE: an efficient clustering algorithm for large databases , 1998, SIGMOD '98.
[43] Hans-Peter Kriegel,et al. LOF: identifying density-based local outliers , 2000, SIGMOD 2000.
[44] Hui Xiong,et al. Mining strong affinity association patterns in data sets with skewed support distribution , 2003, Third IEEE International Conference on Data Mining.