Exploring the power of heterogeneous information sources
暂无分享,去创建一个
Jiawei Han | Jing Gao | Jiawei Han | Jing Gao
[1] Andrew McCallum,et al. Automating the Construction of Internet Portals with Machine Learning , 2000, Information Retrieval.
[2] Geoffrey E. Hinton,et al. Adaptive Mixtures of Local Experts , 1991, Neural Computation.
[3] Philip S. Yu,et al. Mining concept-drifting data streams using ensemble classifiers , 2003, KDD '03.
[4] Bianca Zadrozny,et al. Learning and evaluating classifiers under sample selection bias , 2004, ICML.
[5] Rich Caruana,et al. Ensemble selection from libraries of models , 2004, ICML.
[6] Geoff Hulten,et al. Mining time-changing data streams , 2001, KDD '01.
[7] Mark Crovella,et al. Mining anomalies using traffic feature distributions , 2005, SIGCOMM '05.
[8] Stephen D. Bay,et al. Detecting Group Differences: Mining Contrast Sets , 2001, Data Mining and Knowledge Discovery.
[9] Ben Taskar,et al. Multi-View Learning over Structured and Non-Identical Outputs , 2008, UAI.
[10] Rong Ge,et al. Joint cluster analysis of attribute data and relationship data , 2008, ACM Trans. Knowl. Discov. Data.
[11] Sameer Singh,et al. Novelty detection: a review - part 1: statistical approaches , 2003, Signal Process..
[12] Ian Davidson,et al. Discovering Contexts and Contextual Outliers Using Random Walks in Graphs , 2009, 2009 Ninth IEEE International Conference on Data Mining.
[13] Inderjit S. Dhillon,et al. Clustering with Bregman Divergences , 2005, J. Mach. Learn. Res..
[14] Philip S. Yu,et al. A probabilistic framework for relational clustering , 2007, KDD '07.
[15] Anil K. Jain,et al. Clustering ensembles: models of consensus and weak partitions , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[16] Vipin Kumar,et al. A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs , 1998, SIAM J. Sci. Comput..
[17] Prabhakar Raghavan,et al. A Linear Method for Deviation Detection in Large Databases , 1996, KDD.
[18] Andreas Thor,et al. Evaluation of entity resolution approaches on real-world match problems , 2010, Proc. VLDB Endow..
[19] Rahul Malik,et al. VideoMule: a consensus learning approach to multi-label classification from noisy user-generated videos , 2009, MM '09.
[20] Michael Jiang,et al. Monitoring multi-tier clustered systems with invariant metric relationships , 2008, SEAMS '08.
[21] Philip S. Yu,et al. Combining multiple clusterings by soft correspondence , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).
[22] Joann J. Ordille,et al. Data integration: the teenage years , 2006, VLDB.
[23] Kenji Yamanishi,et al. Network anomaly detection based on Eigen equation compression , 2009, KDD.
[24] David B. Dunson,et al. Bayesian Data Analysis , 2010 .
[25] Bogdan E. Popescu,et al. PREDICTIVE LEARNING VIA RULE ENSEMBLES , 2008, 0811.1679.
[26] Rajeev Motwani,et al. The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.
[27] M E J Newman,et al. Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.
[28] Eric Bauer,et al. An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants , 1999, Machine Learning.
[29] Sunita Sarawagi,et al. Domain Adaptation of Conditional Probability Models Via Feature Subsetting , 2007, PKDD.
[30] Bernhard Schölkopf,et al. Ranking on Data Manifolds , 2003, NIPS.
[31] J. Besag. Spatial Interaction and the Statistical Analysis of Lattice Systems , 1974 .
[32] Ian H. Witten,et al. Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .
[33] John Shawe-Taylor,et al. Two view learning: SVM-2K, Theory and Practice , 2005, NIPS.
[34] Yun Chi,et al. Combining link and content for community detection: a discriminative approach , 2009, KDD.
[35] Alexander Zien,et al. Transductive support vector machines for structured variables , 2007, ICML '07.
[36] Charles A. Micchelli,et al. A Spectral Regularization Framework for Multi-Task Structure Learning , 2007, NIPS.
[37] Hans-Peter Kriegel,et al. LOF: identifying density-based local outliers , 2000, SIGMOD 2000.
[38] Jinyan Li,et al. Efficient mining of emerging patterns: discovering trends and differences , 1999, KDD '99.
[39] Xiaowei Xu,et al. SCAN: a structural clustering algorithm for networks , 2007, KDD '07.
[40] Nasser M. Nasrabadi,et al. Pattern Recognition and Machine Learning , 2006, Technometrics.
[41] Joydeep Ghosh,et al. Cluster ensembles , 2011, Data Clustering: Algorithms and Applications.
[42] Avrim Blum,et al. The Bottleneck , 2021, Monopsony Capitalism.
[43] Wei Fan,et al. Systematic data selection to mine concept-drifting data streams , 2004, KDD.
[44] Hisashi Kashima,et al. Eigenspace-based anomaly detection in computer systems , 2004, KDD.
[45] Shashi Shekhar,et al. Detecting graph-based spatial outliers: algorithms and applications (a summary of results) , 2001, KDD '01.
[46] Philip S. Yu,et al. Truth Discovery with Multiple Conflicting Information Providers on the Web , 2008, IEEE Trans. Knowl. Data Eng..
[47] Leo Breiman,et al. Bagging Predictors , 1996, Machine Learning.
[48] Gerhard Widmer,et al. Learning in the Presence of Concept Drift and Hidden Contexts , 1996, Machine Learning.
[49] Ian Davidson,et al. Flexible constrained spectral clustering , 2010, KDD.
[50] Alon Y. Halevy,et al. Semantic Integration Research in the Database Community : A Brief Survey , 2005 .
[51] Kagan Tumer,et al. Analysis of decision boundaries in linearly combined neural classifiers , 1996, Pattern Recognit..
[52] Nguyen Lu Dang Khoa,et al. Robust Outlier Detection Using Commute Time and Eigenspace Embedding , 2010, PAKDD.
[53] Ben Taskar,et al. Introduction to statistical relational learning , 2007 .
[54] Fabio Gagliardi Cozman,et al. Semi-Supervised Learning of Mixture Models , 2003, ICML.
[55] Jiawei Han,et al. Modeling Probabilistic Measurement Correlations for Problem Determination in Large-Scale Distributed Systems , 2009, 2009 29th IEEE International Conference on Distributed Computing Systems.
[56] Andrew McCallum,et al. Group and Topic Discovery from Relations and Their Attributes , 2005, NIPS.
[57] Giorgio Valentini,et al. Supervised and Unsupervised Ensemble Methods and their Applications , 2008 .
[58] Philip S. Yu,et al. Effective estimation of posterior probabilities: explaining the accuracy of randomized decision tree approaches , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).
[59] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.
[60] Kun Zhang,et al. Forecasting Skewed Biased Stochastic Ozone Days: Analyses and Solutions , 2006, Sixth International Conference on Data Mining (ICDM'06).
[61] Yoav Freund,et al. A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.
[62] Bernhard Schölkopf,et al. Estimating the Support of a High-Dimensional Distribution , 2001, Neural Computation.
[63] Bhavani M. Thuraisingham,et al. Cloud-based malware detection for evolving data streams , 2011, ACM Trans. Manag. Inf. Syst..
[64] Raymond J. Mooney,et al. A probabilistic framework for semi-supervised clustering , 2004, KDD.
[65] Xiaojin Zhu,et al. Seeing stars when there aren’t many stars: Graph-based semi-supervised learning for sentiment categorization , 2006 .
[66] Jing Gao,et al. A Novel Framework for Incorporating Labeled Examples into Anomaly Detection , 2006, SDM.
[67] Jiawei Han,et al. Hierarchical aggregate classification with limited supervision for data reduction in wireless sensor networks , 2011, SenSys.
[68] Carla E. Brodley,et al. Solving cluster ensemble problems by bipartite graph partitioning , 2004, ICML.
[69] Arun Ross,et al. Information fusion in biometrics , 2003, Pattern Recognit. Lett..
[70] Raymond T. Ng,et al. Distance-based outliers: algorithms and applications , 2000, The VLDB Journal.
[71] Ji-Rong Wen,et al. Scalable community discovery on textual data with relations , 2008, CIKM '08.
[72] Eleazar Eskin,et al. Anomaly Detection over Noisy Data using Learned Probability Distributions , 2000, ICML.
[73] Naoki Abe,et al. Proximity-Based Anomaly Detection Using Sparse Structure Learning , 2009, SDM.
[74] Steffen Bickel,et al. Discriminative learning for differing training and test distributions , 2007, ICML '07.
[75] Taher H. Haveliwala. Topic-Sensitive PageRank: A Context-Sensitive Ranking Algorithm for Web Search , 2003, IEEE Trans. Knowl. Data Eng..
[76] Jennifer Widom,et al. Models and issues in data stream systems , 2002, PODS.
[77] Masashi Sugiyama,et al. Mixture Regression for Covariate Shift , 2006, NIPS.
[78] Jon Kleinberg,et al. Authoritative sources in a hyperlinked environment , 1999, SODA '98.
[79] Philip S. Yu,et al. Mining Extremely Skewed Trading Anomalies , 2004, EDBT.
[80] Deepak S. Turaga,et al. A Spectral Framework for Detecting Inconsistency across Multi-source Object Relationships , 2011, 2011 IEEE 11th International Conference on Data Mining.
[81] H. Shimodaira,et al. Improving predictive inference under covariate shift by weighting the log-likelihood function , 2000 .
[82] Xindong Wu,et al. Combining proactive and reactive predictions for data streams , 2005, KDD '05.
[83] Miroslaw Malek,et al. Using Hidden Semi-Markov Models for Effective Online Failure Prediction , 2007, 2007 26th IEEE International Symposium on Reliable Distributed Systems (SRDS 2007).
[84] Arindam Banerjee,et al. Bayesian cluster ensembles , 2009, Stat. Anal. Data Min..
[85] Graham Cormode,et al. Summarizing and Mining Skewed Data Streams , 2005, SDM.
[86] Samy Bengio,et al. Semi-supervised adapted HMMs for unusual event detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).
[87] D. L. Hall,et al. Mathematical Techniques in Multisensor Data Fusion , 1992 .
[88] Christopher J. C. Burges,et al. Spectral clustering and transductive learning with multiple views , 2007, ICML '07.
[89] Qiang Yang,et al. Boosting for transfer learning , 2007, ICML '07.
[90] Koby Crammer,et al. Learning from Multiple Sources , 2006, NIPS.
[91] Zhen Guo,et al. Tracking Probabilistic Correlation of Monitoring Data for Fault Detection in Complex Systems , 2006, International Conference on Dependable Systems and Networks (DSN'06).
[92] Jitendra Malik,et al. Normalized Cuts and Image Segmentation , 2000, IEEE Trans. Pattern Anal. Mach. Intell..
[93] Jiawei Han,et al. On Appropriate Assumptions to Mine Data Streams: Analysis and Practice , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).
[94] Rayid Ghani,et al. Analyzing the effectiveness and applicability of co-training , 2000, CIKM '00.
[95] VARUN CHANDOLA,et al. Anomaly detection: A survey , 2009, CSUR.
[96] Leo Breiman,et al. Random Forests , 2001, Machine Learning.
[97] Mark Crovella,et al. Distributed Spatial Anomaly Detection , 2008, IEEE INFOCOM 2008 - The 27th Conference on Computer Communications.
[98] Zhi-Hua Zhou,et al. Isolation Forest , 2008, 2008 Eighth IEEE International Conference on Data Mining.
[99] Aristides Gionis,et al. Clustering aggregation , 2005, 21st International Conference on Data Engineering (ICDE'05).
[100] Haifeng Chen,et al. Discovering likely invariants of distributed transaction systems for autonomic system management , 2006, 2006 IEEE International Conference on Autonomic Computing.
[101] Chris H. Q. Ding,et al. Solving Consensus and Semi-supervised Clustering Problems Using Nonnegative Matrix Factorization , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).
[102] Yizhou Sun,et al. Heterogeneous source consensus learning via decision propagation and negotiation , 2009, KDD.
[103] Nitesh V. Chawla,et al. Editorial: special issue on learning from imbalanced data sets , 2004, SKDD.
[104] J. Besag. On the Statistical Analysis of Dirty Pictures , 1986 .
[105] Thorsten Joachims,et al. Transductive Learning via Spectral Graph Partitioning , 2003, ICML.
[106] Andrew W. Moore,et al. Locally Weighted Learning , 1997, Artificial Intelligence Review.
[107] Gustavo E. A. P. A. Batista,et al. A study of the behavior of several methods for balancing machine learning training data , 2004, SKDD.
[108] Qiang Yang,et al. Semi-Supervised Learning with Very Few Labeled Training Examples , 2007, AAAI.
[109] Sham M. Kakade,et al. Multi-view clustering via canonical correlation analysis , 2009, ICML '09.
[110] Koby Crammer,et al. Analysis of Representations for Domain Adaptation , 2006, NIPS.
[111] Rich Caruana,et al. Multitask Learning , 1998, Encyclopedia of Machine Learning and Data Mining.
[112] Carlo Batini,et al. Methodologies for data quality assessment and improvement , 2009, CSUR.
[113] Kathryn B. Laskey,et al. Nonparametric Bayesian Clustering Ensembles , 2010, ECML/PKDD.
[114] Thorsten Joachims,et al. Detecting Concept Drift with Support Vector Machines , 2000, ICML.
[115] Diane J. Cook,et al. Graph-based anomaly detection , 2003, KDD '03.
[116] Philip S. Yu,et al. A General Framework for Mining Concept-Drifting Data Streams with Skewed Distributions , 2007, SDM.
[117] R. Polikar,et al. Ensemble based systems in decision making , 2006, IEEE Circuits and Systems Magazine.
[118] Stephen M. Smith,et al. Segmentation of brain MR images through a hidden Markov random field model and the expectation-maximization algorithm , 2001, IEEE Transactions on Medical Imaging.
[119] John Shawe-Taylor,et al. Canonical Correlation Analysis: An Overview with Application to Learning Methods , 2004, Neural Computation.
[120] Marcus A. Maloof,et al. Using additive expert ensembles to cope with concept drift , 2005, ICML.
[121] Sanjay Chawla,et al. On local spatial outliers , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).
[122] Ivor W. Tsang,et al. Domain adaptation from multiple sources via auxiliary classifiers , 2009, ICML '09.
[123] Adrian E. Raftery,et al. Bayesian model averaging: a tutorial (with comments by M. Clyde, David Draper and E. I. George, and a rejoinder by the authors , 1999 .
[124] Maurizio Lenzerini,et al. Data integration: a theoretical perspective , 2002, PODS.
[125] William E. Winkler,et al. The State of Record Linkage and Current Research Problems , 1999 .
[126] Zoubin Ghahramani,et al. Combining active learning and semi-supervised learning using Gaussian fields and harmonic functions , 2003, ICML 2003.
[127] Steffen Bickel,et al. Multi-view clustering , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).
[128] Sanjay Ranka,et al. Conditional Anomaly Detection , 2007, IEEE Transactions on Knowledge and Data Engineering.
[129] Latifur Khan,et al. Facing the reality of data stream classification: coping with scarcity of labeled data , 2012, Knowledge and Information Systems.
[130] Bernhard Schölkopf,et al. Learning with Local and Global Consistency , 2003, NIPS.
[131] Thomas G. Dietterich. Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.
[132] Yizhou Sun,et al. A Graph-Based Consensus Maximization Approach for Combining Multiple Supervised and Unsupervised Models , 2013, IEEE Transactions on Knowledge and Data Engineering.
[133] Anil K. Jain,et al. Data clustering: a review , 1999, CSUR.
[134] Philip S. Yu,et al. A General Model for Multiple View Unsupervised Learning , 2008, SDM.
[135] Lise Getoor,et al. Collective entity resolution in relational data , 2007, TKDD.
[136] Yizhou Sun,et al. Graph-based Consensus Maximization among Multiple Supervised and Unsupervised Models , 2009, NIPS.
[137] Bernhard Schölkopf,et al. Correcting Sample Selection Bias by Unlabeled Data , 2006, NIPS.
[138] Mikhail Belkin,et al. A Co-Regularization Approach to Semi-supervised Learning with Multiple Views , 2005 .
[139] Daniel Marcu,et al. Domain Adaptation for Statistical Classifiers , 2006, J. Artif. Intell. Res..
[140] Christos Faloutsos,et al. Spectral Analysis for Billion-Scale Graphs: Discoveries and Implementation , 2011, PAKDD.
[141] Ian Davidson,et al. On Sample Selection Bias and Its Efficient Correction via Model Averaging and Unlabeled Examples , 2007, SDM.
[142] S. Muthukrishnan,et al. Modeling skew in data streams , 2006, SIGMOD Conference.
[143] Wei Fan,et al. Heterogeneous cross domain ranking in latent space , 2009, CIKM.
[144] Shai Ben-David,et al. Detecting Change in Data Streams , 2004, VLDB.
[145] Christoph H. Lampert,et al. Learning Multi-View Neighborhood Preserving Projections , 2011, ICML.
[146] Qiang Yang,et al. A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.
[147] Jennifer Widom,et al. SimRank: a measure of structural-context similarity , 2002, KDD.
[148] Jiawei Han,et al. Knowledge transfer via multiple model local structure mapping , 2008, KDD.
[149] Jimeng Sun,et al. Neighborhood formation and anomaly detection in bipartite graphs , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).
[150] Ulrike von Luxburg,et al. A tutorial on spectral clustering , 2007, Stat. Comput..
[151] Jiawei Han,et al. Data Mining: Concepts and Techniques , 2000 .
[152] Deepak S. Turaga,et al. Consensus extraction from heterogeneous detectors to improve performance over network traffic anomaly detection , 2011, 2011 Proceedings IEEE INFOCOM.
[153] Chao Yang,et al. ARPACK users' guide - solution of large-scale eigenvalue problems with implicitly restarted Arnoldi methods , 1998, Software, environments, tools.
[154] Philip S. Yu,et al. Classifying Data Streams with Skewed Class Distributions and Concept Drifts , 2008, IEEE Internet Computing.
[155] Susan T. Dumais,et al. The Combination of Text Classifiers Using Reliability Indicators , 2016, Information Retrieval.
[156] Giovanni Seni,et al. Ensemble Methods in Data Mining: Improving Accuracy Through Combining Predictions , 2010, Ensemble Methods in Data Mining.
[157] Ruoming Jin,et al. MMIS07, 08: mining multiple information sources workshop report , 2008, SKDD.
[158] Eric A. Brewer,et al. Pinpoint: problem determination in large, dynamic Internet services , 2002, Proceedings International Conference on Dependable Systems and Networks.
[159] Zhenguo Li,et al. Constrained clustering via spectral regularization , 2009, CVPR.
[160] Gregory Z. Grudic,et al. Unsupervised Outlier Detection and Semi-Supervised Learning ; CU-CS-976-04 , 2004 .
[161] Hui Xiong,et al. Transfer learning from multiple source domains via consensus regularization , 2008, CIKM '08.
[162] Paramvir Bahl,et al. Towards highly reliable enterprise network services via inference of multi-level dependencies , 2007, SIGCOMM.
[163] Joydeep Ghosh,et al. Cluster Ensembles --- A Knowledge Reuse Framework for Combining Multiple Partitions , 2002, J. Mach. Learn. Res..
[164] Divesh Srivastava,et al. Record linkage: similarity measures and algorithms , 2006, SIGMOD Conference.
[165] Bhavani M. Thuraisingham,et al. Classification and Novel Class Detection in Concept-Drifting Data Streams under Time Constraints , 2011, IEEE Transactions on Knowledge and Data Engineering.