Duplicate Detection in Programming Question Answering Communities
Quan Z. Sheng | Wei Emma Zhang | Ermyas Abebe | Jey Han Lau | Wenjie Ruan
[1] Stephen E. Robertson,et al. Okapi at TREC-3 , 1994, TREC.
[2] Stephen E. Robertson,et al. Gatford Centre for Interactive Systems Research Department of Information , 1996 .
[3] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.
[4] Tong Zhang,et al. Solving large scale linear prediction problems using stochastic gradient descent algorithms , 2004, ICML.
[5] Carlo Strapparava,et al. Corpus-based and Knowledge-based Measures of Text Semantic Similarity , 2006, AAAI.
[6] Nitin Madnani,et al. Re-examining Machine Translation Metrics for Paraphrase Identification , 2012, NAACL.
[7] Yoav Freund,et al. A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.
[8] Christopher M. Bishop,et al. Classification and regression , 1997 .
[9] Seetha Hari,et al. Learning From Imbalanced Data , 2019, Advances in Computer and Electrical Engineering.
[10] Timothy Baldwin,et al. An Empirical Evaluation of doc2vec with Practical Insights into Document Embedding Generation , 2016, Rep4NLP@ACL.
[11] Ming Zhou,et al. Answering Questions with Complex Semantic Constraints on Open Knowledge Bases , 2015, CIKM.
[12] C. J. van Rijsbergen,et al. Probabilistic models of information retrieval based on measuring the divergence from randomness , 2002, TOIS.
[13] Andrew Chou,et al. Semantic Parsing on Freebase from Question-Answer Pairs , 2013, EMNLP.
[14] Christoph Treude,et al. How do programmers ask and answer questions on the web?: NIER track , 2011, 2011 33rd International Conference on Software Engineering (ICSE).
[15] Jeffrey Pennington,et al. Dynamic Pooling and Unfolding Recursive Autoencoders for Paraphrase Detection , 2011, NIPS.
[16] Chanchal Kumar Roy,et al. Mining Duplicate Questions of Stack Overflow , 2016, 2016 IEEE/ACM 13th Working Conference on Mining Software Repositories (MSR).
[17] Aixin Sun,et al. Topic Modeling for Short Texts with Auxiliary Word Embeddings , 2016, SIGIR.
[18] Tin Kam Ho,et al. The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell.
[19] Ashish Sureka,et al. Chaff from the wheat: characterization and modeling of deleted questions on stack overflow , 2014, WWW.
[20] Tat-Seng Chua,et al. Paraphrase Recognition via Dissimilarity Significance Classification , 2006, EMNLP.
[21] Idan Szpektor,et al. Learning from the past: answering new questions with past answers , 2012, WWW.
[22] Frederick Jelinek,et al. Interpolated estimation of Markov source parameters from sparse data , 1980 .
[23] Jonathan Berant,et al. Semantic Parsing via Paraphrasing , 2014, ACL.
[24] Wei-Yin Loh,et al. Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov.
[25] Quan Z. Sheng,et al. Detecting Duplicate Posts in Programming QA Communities via Latent Semantics and Association Rules , 2017, WWW.
[26] Jimmy J. Lin,et al. Multi-Perspective Sentence Similarity Modeling with Convolutional Neural Networks , 2015, EMNLP.
[27] Jacob Eisenstein,et al. Discriminative Improvements to Distributional Sentence Similarity , 2013, EMNLP.
[28] G. Golub,et al. Updating formulae and a pairwise algorithm for computing sample variances , 1979 .
[29] Andreas Christmann,et al. Support vector machines , 2008, Data Mining and Knowledge Discovery Handbook.
[30] Strother H. Walker,et al. Estimation of the probability of an event as a function of several independent variables. , 1967, Biometrika.
[31] Yong Yu,et al. Analyzing and Predicting Not-Answered Questions in Community-based Question Answering Services , 2011, AAAI.
[32] David Lo,et al. Multi-Factor Duplicate Question Detection in Stack Overflow , 2015, Journal of Computer Science and Technology.
[33] Christian S. Jensen,et al. A generalized framework of exploring category information for question retrieval in community question answer archives , 2010, WWW '10.
[34] Koby Crammer,et al. Online Passive-Aggressive Algorithms , 2003, J. Mach. Learn. Res.
[35] Éric Gaussier,et al. Information-based models for ad hoc IR , 2010, SIGIR '10.
[36] Christian S. Jensen,et al. Approaches to Exploring Category Information for Question Retrieval in Community Question-Answer Archives , 2012, TOIS.
[37] Hermann Ney,et al. A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.
[38] Fang Liu,et al. Improving Question Retrieval in Community Question Answering Using World Knowledge , 2013, IJCAI.
[39] Hermann Ney,et al. The Alignment Template Approach to Statistical Machine Translation , 2004, CL.
[40] Kai Wang,et al. A syntactic tree matching approach to finding similar questions in community-based QA services , 2009, SIGIR.
[41] ChengXiang Zhai,et al. A study of smoothing methods for language models applied to information retrieval , 2004, TOIS.
[42] Michael I. Jordan,et al. Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res.
[43] Michael Collins,et al. Discriminative Training Methods for Hidden Markov Models: Theory and Experiments with Perceptron Algorithms , 2002, EMNLP.
[44] Quoc V. Le,et al. Distributed Representations of Sentences and Documents , 2014, ICML.
[45] N. Altman. An Introduction to Kernel and Nearest-Neighbor Nonparametric Regression , 1992 .
[46] Christiane Fellbaum,et al. Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.