Query by document via a decomposition-based two-level retrieval approach
Laurence T. Yang | Yaoxue Zhang | Yue-Zhi Zhou | Zhiwei Li | Lei Zhang | Rui Cai | Linkai Weng
[1] Geoffrey Zweig,et al. Syntactic Clustering of the Web , 1997, Comput. Networks.
[2] Yi Zhang,et al. Novelty and redundancy detection in adaptive filtering , 2002, SIGIR '02.
[3] Alexandr Andoni,et al. Near-Optimal Hashing Algorithms for Approximate Nearest Neighbor in High Dimensions , 2006, 2006 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS'06).
[4] Chris H. Q. Ding,et al. Nonnegative Matrix Factorization and Probabilistic Latent Semantic Indexing: Equivalence, Chi-Square Statistic, and a Hybrid Method , 2006, AAAI.
[5] Andrei Z. Broder,et al. Efficient query evaluation using a two-level retrieval process , 2003, CIKM '03.
[6] Luiz André Barroso,et al. Web Search for a Planet: The Google Cluster Architecture , 2003, IEEE Micro.
[7] Thomas Hofmann,et al. Probabilistic latent semantic indexing , 1999, SIGIR '99.
[8] Justin Zobel,et al. Inverted files for text search engines , 2006, CSUR.
[9] Matthew Hurst,et al. A Language Model Approach to Keyphrase Extraction , 2003, ACL 2003.
[10] Peter D. Turney. Learning Algorithms for Keyphrase Extraction , 2000, Information Retrieval.
[11] Katerina T. Frantzi,et al. Incorporating Context Information for the Extraction of Terms , 1997, ACL.
[12] Sergey Brin,et al. The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.
[13] Olga Vechtomova,et al. Approaches to High Accuracy Retrieval: Phrase-Based Search Experiments in the HARD Track , 2004, TREC.
[14] T. Landauer,et al. Indexing by Latent Semantic Analysis , 1990 .
[15] Ian H. Witten,et al. Thesaurus based automatic keyphrase indexing , 2006, Proceedings of the 6th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '06).
[16] Daniel P. Lopresti,et al. Models and algorithms for duplicate document detection , 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).
[17] John Riedl,et al. Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.
[18] W. Bruce Croft,et al. Cluster-based retrieval using language models , 2004, SIGIR '04.
[19] Nando de Freitas,et al. An Introduction to MCMC for Machine Learning , 2004, Machine Learning.
[20] Michael I. Jordan,et al. Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res.
[21] Monika Henzinger,et al. Finding near-duplicate web pages: a large-scale evaluation of algorithms , 2006, SIGIR.
[22] Susumu Horiguchi,et al. Learning to classify short and sparse text & web with hidden topics from large-scale data collections , 2008, WWW.
[23] Padhraic Smyth,et al. Modeling General and Specific Aspects of Documents with a Probabilistic Topic Model , 2006, NIPS.
[24] T. Minka. Estimating a Dirichlet distribution , 2012 .
[25] Moses Charikar,et al. Similarity estimation techniques from rounding algorithms , 2002, STOC '02.
[26] John Wright,et al. Decomposing background topics from keywords by principal component pursuit , 2010, CIKM.
[27] Fan Deng,et al. Approximately detecting duplicates for streaming data using stable bloom filters , 2006, SIGMOD Conference.
[28] John D. Lafferty,et al. Model-based feedback in the language modeling approach to information retrieval , 2001, CIKM '01.
[29] Andrew W. Moore,et al. New Algorithms for Efficient High-Dimensional Nonparametric Classification , 2006, J. Mach. Learn. Res..
[30] Hector Garcia-Molina,et al. Finding replicated Web collections , 2000, SIGMOD '00.
[31] W. Bruce Croft,et al. LDA-based document models for ad-hoc retrieval , 2006, SIGIR.
[32] Mark Steyvers,et al. Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.
[33] Yi Zhang,et al. Exact Maximum Likelihood Estimation for Word Mixtures , 2002 .
[34] H. Sebastian Seung,et al. Algorithms for Non-negative Matrix Factorization , 2000, NIPS.
[35] Ophir Frieder,et al. Collection statistics for fast duplicate document detection , 2002, TOIS.
[36] Yin Yang,et al. Query by document , 2009, WSDM '09.