Efficient Inference, Search and Evaluation for Latent Variable Models of Text with Applications to Information Retrieval and Machine Translation

EFFICIENT INFERENCE, SEARCH AND EVALUATION FOR LATENT VARIABLE MODELS OF TEXT WITH APPLICATIONS TO INFORMATION RETRIEVAL AND MACHINE TRANSLATION

[1]  Sophie Ahrens,et al.  Recommender Systems , 2012 .

[2]  Trevor Darrell,et al.  Locality-Sensitive Hashing Using Stable Distributions , 2006 .

[3]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[4]  Thomas Hofmann,et al.  Probabilistic Latent Semantic Indexing , 1999, SIGIR Forum.

[5]  John Tait,et al.  Current Challenges in Patent Information Retrieval , 2011, The Information Retrieval Series.

[6]  Jon Louis Bentley,et al.  Multidimensional binary search trees used for associative searching , 1975, CACM.

[7]  T. Minka Estimating a Dirichlet distribution , 2012 .

[8]  Michael J. Kurtz,et al.  The NASA Astrophysics Data System: Overview , 2000, astro-ph/0002104.

[9]  David Buttler,et al.  Exploring Topic Coherence over Many Models and Many Topics , 2012, EMNLP.

[10]  Chong Wang,et al.  Reading Tea Leaves: How Humans Interpret Topic Models , 2009, NIPS.

[11]  Srinivasan Parthasarathy,et al.  Structure-based querying of proteins using wavelets , 2006, CIKM '06.

[12]  John Langford,et al.  Sparse Online Learning via Truncated Gradient , 2008, NIPS.

[13]  C. R. Rao,et al.  Diversity: its measurement, decomposition, apportionment and analysis , 1982 .

[14]  Dan Roth,et al.  An Unsupervised Learning Algorithm for Rank Aggregation , 2007, ECML.

[15]  David A. Smith,et al.  A Minimally Supervised Approach for Detecting and Ranking Document Translation Pairs , 2011, WMT@EMNLP.

[16]  Robert Villa,et al.  The effectiveness of query-specific hierarchic clustering in information retrieval , 2002, Inf. Process. Manag..

[17]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[18]  John A. Swets,et al.  Effectiveness of information retrieval methods , 1969 .

[19]  Kevin P. Murphy,et al.  Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.

[20]  Yan Ke,et al.  An efficient parts-based near-duplicate and sub-image retrieval system , 2004, MULTIMEDIA '04.

[21]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[22]  Holger Schwenk,et al.  On the Use of Comparable Corpora to Improve SMT performance , 2009, EACL.

[23]  Wang Ling,et al.  Microblogs as Parallel Corpora , 2013, ACL.

[24]  Aren Jansen,et al.  Efficient spoken term discovery using randomized algorithms , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.

[25]  C. J. van Rijsbergen,et al.  The use of hierarchic clustering in information retrieval , 1971, Inf. Storage Retr..

[26]  Sudipto Guha,et al.  Streaming and sublinear approximation of entropy and information distances , 2005, SODA '06.

[27]  Eric P. Xing,et al.  Symmetric Correspondence Topic Models for Multilingual Text Analysis , 2012, NIPS.

[28]  Moses Charikar,et al.  Similarity estimation techniques from rounding algorithms , 2002, STOC '02.

[29]  Robert L. Mercer,et al.  The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[30]  Jimmy J. Lin,et al.  No Free Lunch: Brute Force vs. Locality-Sensitive Hashing for Cross-lingual Pairwise Similarity , 2011, SIGIR '11.

[31]  Pascale Fung,et al.  Mining Very-Non-Parallel Corpora: Parallel Sentence and Lexicon Extraction via Bootstrapping and E , 2004, EMNLP.

[32]  Edward A. Fox,et al.  Combination of Multiple Searches , 1993, TREC.

[33]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[34]  Flemming Topsøe,et al.  Some inequalities for information divergence and related measures of discrimination , 2000, IEEE Trans. Inf. Theory.

[35]  Robert L. Mercer,et al.  An Estimate of an Upper Bound for the Entropy of English , 1992, CL.

[36]  Ruslan Salakhutdinov,et al.  Evaluation methods for topic models , 2009, ICML '09.

[37]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[38]  Miles Osborne,et al.  Streaming First Story Detection with application to Twitter , 2010, NAACL.

[39]  Gabriella Kazai,et al.  An analysis of human factors and label accuracy in crowdsourcing relevance judgments , 2013, Information Retrieval.

[40]  Joon Ho Lee,et al.  Combining multiple evidence from different properties of weighting schemes , 1995, SIGIR '95.

[41]  Thomas L. Griffiths,et al.  Integrating Topics and Syntax , 2004, NIPS.

[42]  Michael I. Jordan,et al.  An Introduction to Variational Methods for Graphical Models , 1999, Machine Learning.

[43]  Lawrence K. Saul,et al.  A Variational Approximation for Topic Modeling of Hierarchical Corpora , 2013, ICML.

[44]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[45]  Tao Li,et al.  Product recommendation with temporal dynamics , 2012, Expert Syst. Appl..

[46]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[47]  I. Csiszár Why least squares and maximum entropy? An axiomatic approach to inference for linear inverse problems , 1991 .

[48]  Christoph Tillmann,et al.  A Simple Sentence-Level Extraction Algorithm for Comparable Data , 2009, NAACL.

[49]  Jianhua Lin,et al.  Divergence measures based on the Shannon entropy , 1991, IEEE Trans. Inf. Theory.

[50]  Chris Callison-Burch,et al.  Open Source Toolkit for Statistical Machine Translation: Factored Translation Models and Lattice Decoding , 2006 .

[51]  Thomas L. Griffiths,et al.  Online Inference of Topics with Latent Dirichlet Allocation , 2009, AISTATS.

[52]  Cheng Yang,et al.  Efficient acoustic index for music retrieval with various degrees of similarity , 2002, MULTIMEDIA '02.

[53]  François Yvon,et al.  Two Ways to Use a Noisy Parallel News Corpus for Improving Statistical Machine Translation , 2011, BUCC@ACL.

[54]  Philipp Koehn,et al.  Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[55]  Hanna Wallach,et al.  Structured Topic Models for Language , 2008 .

[56]  Dragos Stefan Munteanu,et al.  Improving Machine Translation Performance by Exploiting Non-Parallel Corpora , 2005, CL.

[57]  James Allan,et al.  Real-time Query Expansion in Relevance Models , 2006 .

[58]  Francis R. Bach,et al.  Online Learning for Latent Dirichlet Allocation , 2010, NIPS.

[59]  Marti A. Hearst,et al.  Reexamining the cluster hypothesis: scatter/gather on retrieval results , 1996, SIGIR '96.

[60]  Son Bao Pham,et al.  An Efficient Framework for Extracting Parallel Sentences from Non-Parallel Corpora , 2014, Fundam. Informaticae.

[61]  Dong Zhou,et al.  Latent Document Re-Ranking , 2009, EMNLP.

[62]  W. Bruce Croft Combining Approaches to Information Retrieval , 2002 .

[63]  David Yarowsky,et al.  Toward Statistical Machine Translation without Parallel Corpora , 2012, EACL 2012.

[64]  Brendan T. O'Connor,et al.  A Latent Variable Model for Geographic Lexical Variation , 2010, EMNLP.

[65]  W. Bruce Croft,et al.  Search Engines - Information Retrieval in Practice , 2009 .

[66]  W. Bruce Croft,et al.  LDA-based document models for ad-hoc retrieval , 2006, SIGIR.

[67]  Daniel Jurafsky,et al.  Studying the History of Ideas Using Topic Models , 2008, EMNLP.

[68]  Matthew Lease,et al.  Crowdsourcing for information retrieval , 2012, SIGF.

[69]  Robert C. Moore Fast and accurate sentence alignment of bilingual corpora , 2002, AMTA.

[70]  Charles L. A. Clarke,et al.  Efficient and effective spam filtering and re-ranking for large web datasets , 2010, Information Retrieval.

[71]  James Allan,et al.  A comparison of statistical significance tests for information retrieval evaluation , 2007, CIKM '07.

[72]  Kristina Toutanova,et al.  Extracting Parallel Sentences from Comparable Corpora using Document Level Alignment , 2010, NAACL.

[73]  Timothy Baldwin,et al.  Automatic Evaluation of Topic Coherence , 2010, NAACL.

[74]  Wai Lam,et al.  An unsupervised topic segmentation model incorporating word order , 2013, SIGIR.

[75]  Philip Resnik,et al.  Holistic Sentiment Analysis Across Languages: Multilingual Supervised Latent Dirichlet Allocation , 2010, EMNLP.

[76]  James Allan,et al.  A Comparative Study of Utilizing Topic Models for Information Retrieval , 2009, ECIR.

[77]  David M. Blei,et al.  Multilingual Topic Models for Unaligned Text , 2009, UAI.

[78]  Jakob Uszkoreit,et al.  Large Scale Parallel Document Mining for Machine Translation , 2010, COLING.

[79]  Xiangji Huang,et al.  TREC-CHEM: large scale chemical information retrieval evaluation at TREC , 2009, SIGF.

[80]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[81]  David Buttler,et al.  Latent topic feedback for information retrieval , 2011, KDD.

[82]  I-En Liao,et al.  A library recommender system based on a personal ontology model and collaborative filtering technique for English collections , 2010, Electron. Libr..

[83]  Javed A. Aslam,et al.  An analysis of crowd workers mistakes for specific and complex relevance assessment task , 2013, CIKM.

[84]  Chris Quirk,et al.  Generative Models of Noisy Translations with Applications to Parallel Fragment Extraction , 2007 .

[85]  James Allan,et al.  Fast query expansion using approximations of relevance models , 2010, CIKM.

[86]  A. McCallum,et al.  Topical N-Grams: Phrase and Topic Discovery, with an Application to Information Retrieval , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[87]  Andrew McCallum,et al.  Polylingual Topic Models , 2009, EMNLP.

[88]  John D. Lafferty,et al.  A correlated topic model of Science , 2007, 0708.3601.

[89]  Ellen M. Vdorhees,et al.  The cluster hypothesis revisited , 1985, SIGIR '85.

[90]  David M. W. Powers,et al.  Applications and Explanations of Zipf’s Law , 1998, CoNLL.

[91]  Radford M. Neal Slice Sampling , 2003, The Annals of Statistics.

[92]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[93]  John C. Platt,et al.  Translingual Document Representations from Discriminative Projections , 2010, EMNLP.

[94]  Jessica Enright,et al.  A Fast Method for Parallel Document Identification , 2007, HLT-NAACL.

[95]  Sunil Arya,et al.  ANN: library for approximate nearest neighbor searching , 1998 .

[96]  Andrew McCallum,et al.  Rethinking LDA: Why Priors Matter , 2009, NIPS.

[97]  James Allan,et al.  A New Measure of the Cluster Hypothesis , 2009, ICTIR.

[98]  Yee Whye Teh,et al.  On Smoothing and Inference for Topic Models , 2009, UAI.

[99]  Kirk Pruhs,et al.  KDDCS: a load-balanced in-network data-centric storage scheme for sensor networks , 2006, CIKM '06.

[100]  Judea Pearl,et al.  Reverend Bayes on Inference Engines: A Distributed Hierarchical Approach , 1982, AAAI.

[101]  W. Bruce Croft,et al.  Indri : A language-model based search engine for complex queries ( extended version ) , 2005 .

[102]  Jia Zeng,et al.  Residual Belief Propagation for Topic Modeling , 2012, ADMA.

[103]  W. Bruce Croft,et al.  Transforming patents into prior-art queries , 2009, SIGIR.

[104]  Jimmy J. Lin,et al.  Why Not Grab a Free Lunch? Mining Large Corpora for Parallel Sentences to Improve Translation Modeling , 2012, NAACL.

[105]  Duen-Ren Liu,et al.  Product recommendation approaches: Collaborative filtering via customer lifetime value and customer demands , 2008, Expert Syst. Appl..

[106]  David M. Blei,et al.  Probabilistic topic models , 2012, Commun. ACM.

[107]  Alistair Moffat,et al.  Improvements that don't add up: ad-hoc retrieval results since 1998 , 2009, CIKM.

[108]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[109]  W. Bruce Croft,et al.  Document clustering: An evaluation of some experiments with the cranfield 1400 collection , 1975, Inf. Process. Manag..

[110]  Andrew McCallum,et al.  Database of NIH grants using machine-learned categories and graphical clustering , 2011, Nature Methods.

[111]  Mauro Cettolo,et al.  Mining parallel fragments from comparable texts , 2010, IWSLT.

[112]  Philipp Koehn,et al.  Europarl: A Parallel Corpus for Statistical Machine Translation , 2005, MTSUMMIT.

[113]  W. Bruce Croft A model of cluster searching bases on classification , 1980, Inf. Syst..

[114]  John D. Lafferty,et al.  Dynamic topic models , 2006, ICML.

[115]  C. V. Jawahar,et al.  Video retrieval by mimicking poses , 2012, ICMR '12.

[116]  Mark Stevenson,et al.  Evaluating Topic Coherence Using Distributional Semantics , 2013, IWCS.

[117]  David M. Blei,et al.  Sparse stochastic inference for latent Dirichlet allocation , 2012, ICML.

[118]  Andrew McCallum,et al.  Topics over time: a non-Markov continuous-time model of topical trends , 2006, KDD '06.

[119]  Jon Louis Bentley,et al.  An Algorithm for Finding Best Matches in Logarithmic Expected Time , 1977, TOMS.

[120]  Sunil Arya,et al.  Approximate nearest neighbor queries in fixed dimensions , 1993, SODA '93.

[121]  W. Bruce Croft,et al.  Relevance-Based Language Models , 2001, SIGIR '01.

[122]  Van Rijsbergen,et al.  Automatic information structuring and retrieval. , 1972 .

[123]  W. Bruce Croft,et al.  Cluster-based retrieval using language models , 2004, SIGIR '04.

[124]  Patrick Pantel,et al.  Randomized Algorithms and NLP: Using Locality Sensitive Hash Functions for High Speed Noun Clustering , 2005, ACL.

[125]  Piotr Indyk,et al.  Approximate nearest neighbors: towards removing the curse of dimensionality , 1998, STOC '98.

[126]  Pablo Castells,et al.  Probabilistic Score Normalization for Rank Aggregation , 2006, ECIR.