Efficient Inference, Search and Evaluation for Latent Variable Models of Text with Applications to Information Retrieval and Machine Translation
暂无分享,去创建一个
[1] Sophie Ahrens,et al. Recommender Systems , 2012 .
[2] Trevor Darrell,et al. Locality-Sensitive Hashing Using Stable Distributions , 2006 .
[3] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.
[4] Thomas Hofmann,et al. Probabilistic Latent Semantic Indexing , 1999, SIGIR Forum.
[5] John Tait,et al. Current Challenges in Patent Information Retrieval , 2011, The Information Retrieval Series.
[6] Jon Louis Bentley,et al. Multidimensional binary search trees used for associative searching , 1975, CACM.
[7] T. Minka. Estimating a Dirichlet distribution , 2012 .
[8] Michael J. Kurtz,et al. The NASA Astrophysics Data System: Overview , 2000, astro-ph/0002104.
[9] David Buttler,et al. Exploring Topic Coherence over Many Models and Many Topics , 2012, EMNLP.
[10] Chong Wang,et al. Reading Tea Leaves: How Humans Interpret Topic Models , 2009, NIPS.
[11] Srinivasan Parthasarathy,et al. Structure-based querying of proteins using wavelets , 2006, CIKM '06.
[12] John Langford,et al. Sparse Online Learning via Truncated Gradient , 2008, NIPS.
[13] C. R. Rao,et al. Diversity: its measurement, decomposition, apportionment and analysis , 1982 .
[14] Dan Roth,et al. An Unsupervised Learning Algorithm for Rank Aggregation , 2007, ECML.
[15] David A. Smith,et al. A Minimally Supervised Approach for Detecting and Ranking Document Translation Pairs , 2011, WMT@EMNLP.
[16] Robert Villa,et al. The effectiveness of query-specific hierarchic clustering in information retrieval , 2002, Inf. Process. Manag..
[17] Hinrich Schütze,et al. Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.
[18] John A. Swets,et al. Effectiveness of information retrieval methods , 1969 .
[19] Kevin P. Murphy,et al. Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.
[20] Yan Ke,et al. An efficient parts-based near-duplicate and sub-image retrieval system , 2004, MULTIMEDIA '04.
[21] Richard A. Harshman,et al. Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..
[22] Holger Schwenk,et al. On the Use of Comparable Corpora to Improve SMT performance , 2009, EACL.
[23] Wang Ling,et al. Microblogs as Parallel Corpora , 2013, ACL.
[24] Aren Jansen,et al. Efficient spoken term discovery using randomized algorithms , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.
[25] C. J. van Rijsbergen,et al. The use of hierarchic clustering in information retrieval , 1971, Inf. Storage Retr..
[26] Sudipto Guha,et al. Streaming and sublinear approximation of entropy and information distances , 2005, SODA '06.
[27] Eric P. Xing,et al. Symmetric Correspondence Topic Models for Multilingual Text Analysis , 2012, NIPS.
[28] Moses Charikar,et al. Similarity estimation techniques from rounding algorithms , 2002, STOC '02.
[29] Robert L. Mercer,et al. The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.
[30] Jimmy J. Lin,et al. No Free Lunch: Brute Force vs. Locality-Sensitive Hashing for Cross-lingual Pairwise Similarity , 2011, SIGIR '11.
[31] Pascale Fung,et al. Mining Very-Non-Parallel Corpora: Parallel Sentence and Lexicon Extraction via Bootstrapping and E , 2004, EMNLP.
[32] Edward A. Fox,et al. Combination of Multiple Searches , 1993, TREC.
[33] Radford M. Neal. Pattern Recognition and Machine Learning , 2007, Technometrics.
[34] Flemming Topsøe,et al. Some inequalities for information divergence and related measures of discrimination , 2000, IEEE Trans. Inf. Theory.
[35] Robert L. Mercer,et al. An Estimate of an Upper Bound for the Entropy of English , 1992, CL.
[36] Ruslan Salakhutdinov,et al. Evaluation methods for topic models , 2009, ICML '09.
[37] Richard O. Duda,et al. Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.
[38] Miles Osborne,et al. Streaming First Story Detection with application to Twitter , 2010, NAACL.
[39] Gabriella Kazai,et al. An analysis of human factors and label accuracy in crowdsourcing relevance judgments , 2013, Information Retrieval.
[40] Joon Ho Lee,et al. Combining multiple evidence from different properties of weighting schemes , 1995, SIGIR '95.
[41] Thomas L. Griffiths,et al. Integrating Topics and Syntax , 2004, NIPS.
[42] Michael I. Jordan,et al. An Introduction to Variational Methods for Graphical Models , 1999, Machine Learning.
[43] Lawrence K. Saul,et al. A Variational Approximation for Topic Modeling of Hierarchical Corpora , 2013, ICML.
[44] Thorsten Joachims,et al. Optimizing search engines using clickthrough data , 2002, KDD.
[45] Tao Li,et al. Product recommendation with temporal dynamics , 2012, Expert Syst. Appl..
[46] Thomas M. Cover,et al. Elements of Information Theory , 2005 .
[47] I. Csiszár. Why least squares and maximum entropy? An axiomatic approach to inference for linear inverse problems , 1991 .
[48] Christoph Tillmann,et al. A Simple Sentence-Level Extraction Algorithm for Comparable Data , 2009, NAACL.
[49] Jianhua Lin,et al. Divergence measures based on the Shannon entropy , 1991, IEEE Trans. Inf. Theory.
[50] Chris Callison-Burch,et al. Open Source Toolkit for Statistical Machine Translation: Factored Translation Models and Lattice Decoding , 2006 .
[51] Thomas L. Griffiths,et al. Online Inference of Topics with Latent Dirichlet Allocation , 2009, AISTATS.
[52] Cheng Yang,et al. Efficient acoustic index for music retrieval with various degrees of similarity , 2002, MULTIMEDIA '02.
[53] François Yvon,et al. Two Ways to Use a Noisy Parallel News Corpus for Improving Statistical Machine Translation , 2011, BUCC@ACL.
[54] Philipp Koehn,et al. Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.
[55] Hanna Wallach,et al. Structured Topic Models for Language , 2008 .
[56] Dragos Stefan Munteanu,et al. Improving Machine Translation Performance by Exploiting Non-Parallel Corpora , 2005, CL.
[57] James Allan,et al. Real-time Query Expansion in Relevance Models , 2006 .
[58] Francis R. Bach,et al. Online Learning for Latent Dirichlet Allocation , 2010, NIPS.
[59] Marti A. Hearst,et al. Reexamining the cluster hypothesis: scatter/gather on retrieval results , 1996, SIGIR '96.
[60] Son Bao Pham,et al. An Efficient Framework for Extracting Parallel Sentences from Non-Parallel Corpora , 2014, Fundam. Informaticae.
[61] Dong Zhou,et al. Latent Document Re-Ranking , 2009, EMNLP.
[62] W. Bruce Croft. Combining Approaches to Information Retrieval , 2002 .
[63] David Yarowsky,et al. Toward Statistical Machine Translation without Parallel Corpora , 2012, EACL 2012.
[64] Brendan T. O'Connor,et al. A Latent Variable Model for Geographic Lexical Variation , 2010, EMNLP.
[65] W. Bruce Croft,et al. Search Engines - Information Retrieval in Practice , 2009 .
[66] W. Bruce Croft,et al. LDA-based document models for ad-hoc retrieval , 2006, SIGIR.
[67] Daniel Jurafsky,et al. Studying the History of Ideas Using Topic Models , 2008, EMNLP.
[68] Matthew Lease,et al. Crowdsourcing for information retrieval , 2012, SIGF.
[69] Robert C. Moore. Fast and accurate sentence alignment of bilingual corpora , 2002, AMTA.
[70] Charles L. A. Clarke,et al. Efficient and effective spam filtering and re-ranking for large web datasets , 2010, Information Retrieval.
[71] James Allan,et al. A comparison of statistical significance tests for information retrieval evaluation , 2007, CIKM '07.
[72] Kristina Toutanova,et al. Extracting Parallel Sentences from Comparable Corpora using Document Level Alignment , 2010, NAACL.
[73] Timothy Baldwin,et al. Automatic Evaluation of Topic Coherence , 2010, NAACL.
[74] Wai Lam,et al. An unsupervised topic segmentation model incorporating word order , 2013, SIGIR.
[75] Philip Resnik,et al. Holistic Sentiment Analysis Across Languages: Multilingual Supervised Latent Dirichlet Allocation , 2010, EMNLP.
[76] James Allan,et al. A Comparative Study of Utilizing Topic Models for Information Retrieval , 2009, ECIR.
[77] David M. Blei,et al. Multilingual Topic Models for Unaligned Text , 2009, UAI.
[78] Jakob Uszkoreit,et al. Large Scale Parallel Document Mining for Machine Translation , 2010, COLING.
[79] Xiangji Huang,et al. TREC-CHEM: large scale chemical information retrieval evaluation at TREC , 2009, SIGF.
[80] Mark Steyvers,et al. Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.
[81] David Buttler,et al. Latent topic feedback for information retrieval , 2011, KDD.
[82] I-En Liao,et al. A library recommender system based on a personal ontology model and collaborative filtering technique for English collections , 2010, Electron. Libr..
[83] Javed A. Aslam,et al. An analysis of crowd workers mistakes for specific and complex relevance assessment task , 2013, CIKM.
[84] Chris Quirk,et al. Generative Models of Noisy Translations with Applications to Parallel Fragment Extraction , 2007 .
[85] James Allan,et al. Fast query expansion using approximations of relevance models , 2010, CIKM.
[86] A. McCallum,et al. Topical N-Grams: Phrase and Topic Discovery, with an Application to Information Retrieval , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).
[87] Andrew McCallum,et al. Polylingual Topic Models , 2009, EMNLP.
[88] John D. Lafferty,et al. A correlated topic model of Science , 2007, 0708.3601.
[89] Ellen M. Vdorhees,et al. The cluster hypothesis revisited , 1985, SIGIR '85.
[90] David M. W. Powers,et al. Applications and Explanations of Zipf’s Law , 1998, CoNLL.
[91] Radford M. Neal. Slice Sampling , 2003, The Annals of Statistics.
[92] Sanjay Ghemawat,et al. MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.
[93] John C. Platt,et al. Translingual Document Representations from Discriminative Projections , 2010, EMNLP.
[94] Jessica Enright,et al. A Fast Method for Parallel Document Identification , 2007, HLT-NAACL.
[95] Sunil Arya,et al. ANN: library for approximate nearest neighbor searching , 1998 .
[96] Andrew McCallum,et al. Rethinking LDA: Why Priors Matter , 2009, NIPS.
[97] James Allan,et al. A New Measure of the Cluster Hypothesis , 2009, ICTIR.
[98] Yee Whye Teh,et al. On Smoothing and Inference for Topic Models , 2009, UAI.
[99] Kirk Pruhs,et al. KDDCS: a load-balanced in-network data-centric storage scheme for sensor networks , 2006, CIKM '06.
[100] Judea Pearl,et al. Reverend Bayes on Inference Engines: A Distributed Hierarchical Approach , 1982, AAAI.
[101] W. Bruce Croft,et al. Indri : A language-model based search engine for complex queries ( extended version ) , 2005 .
[102] Jia Zeng,et al. Residual Belief Propagation for Topic Modeling , 2012, ADMA.
[103] W. Bruce Croft,et al. Transforming patents into prior-art queries , 2009, SIGIR.
[104] Jimmy J. Lin,et al. Why Not Grab a Free Lunch? Mining Large Corpora for Parallel Sentences to Improve Translation Modeling , 2012, NAACL.
[105] Duen-Ren Liu,et al. Product recommendation approaches: Collaborative filtering via customer lifetime value and customer demands , 2008, Expert Syst. Appl..
[106] David M. Blei,et al. Probabilistic topic models , 2012, Commun. ACM.
[107] Alistair Moffat,et al. Improvements that don't add up: ad-hoc retrieval results since 1998 , 2009, CIKM.
[108] Michael I. Jordan,et al. Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..
[109] W. Bruce Croft,et al. Document clustering: An evaluation of some experiments with the cranfield 1400 collection , 1975, Inf. Process. Manag..
[110] Andrew McCallum,et al. Database of NIH grants using machine-learned categories and graphical clustering , 2011, Nature Methods.
[111] Mauro Cettolo,et al. Mining parallel fragments from comparable texts , 2010, IWSLT.
[112] Philipp Koehn,et al. Europarl: A Parallel Corpus for Statistical Machine Translation , 2005, MTSUMMIT.
[113] W. Bruce Croft. A model of cluster searching bases on classification , 1980, Inf. Syst..
[114] John D. Lafferty,et al. Dynamic topic models , 2006, ICML.
[115] C. V. Jawahar,et al. Video retrieval by mimicking poses , 2012, ICMR '12.
[116] Mark Stevenson,et al. Evaluating Topic Coherence Using Distributional Semantics , 2013, IWCS.
[117] David M. Blei,et al. Sparse stochastic inference for latent Dirichlet allocation , 2012, ICML.
[118] Andrew McCallum,et al. Topics over time: a non-Markov continuous-time model of topical trends , 2006, KDD '06.
[119] Jon Louis Bentley,et al. An Algorithm for Finding Best Matches in Logarithmic Expected Time , 1977, TOMS.
[120] Sunil Arya,et al. Approximate nearest neighbor queries in fixed dimensions , 1993, SODA '93.
[121] W. Bruce Croft,et al. Relevance-Based Language Models , 2001, SIGIR '01.
[122] Van Rijsbergen,et al. Automatic information structuring and retrieval. , 1972 .
[123] W. Bruce Croft,et al. Cluster-based retrieval using language models , 2004, SIGIR '04.
[124] Patrick Pantel,et al. Randomized Algorithms and NLP: Using Locality Sensitive Hash Functions for High Speed Noun Clustering , 2005, ACL.
[125] Piotr Indyk,et al. Approximate nearest neighbors: towards removing the curse of dimensionality , 1998, STOC '98.
[126] Pablo Castells,et al. Probabilistic Score Normalization for Rank Aggregation , 2006, ECIR.