Generating E-Commerce Product Titles and Predicting their Quality

E-commerce platforms present products using titles that summarize product information. These titles cannot be created by hand, therefore an algorithmic solution is required. The task of automatically generating these titles given noisy user provided titles is one way to achieve the goal. The setting requires the generation process to be fast and the generated title to be both human-readable and concise. Furthermore, we need to understand if such generated titles are usable. As such, we propose approaches that (i) automatically generate product titles, (ii) predict their quality. Our approach scales to millions of products and both automatic and human evaluations performed on real-world data indicate our approaches are effective and applicable to existing e-commerce scenarios.

[1]  Colin Cherry,et al.  A Systematic Comparison of Smoothing Techniques for Sentence-Level BLEU , 2014, WMT@ACL.

[2]  Hermann Ney,et al.  Improved backing-off for M-gram language modeling , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[3]  Wenhu Chen,et al.  Guided Alignment Training for Topic-Aware Neural Machine Translation , 2016, AMTA.

[4]  Hirokazu Suzuki,et al.  Automatic Post-Editing based on SMT and its selective application by Sentence-Level Automatic Quality Evaluation , 2011, MTSUMMIT.

[5]  Hermann Ney,et al.  Computing Consensus Translation for Multiple Machine Translation Systems Using Enhanced Hypothesis Alignment , 2006, EACL.

[6]  Jade Goldstein-Stewart,et al.  The Use of MMR, Diversity-Based Reranking for Reordering Documents and Producing Summaries , 1998, SIGIR Forum.

[7]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[8]  Andy Way,et al.  Using TERp to Augment the System Combination for SMT , 2010, AMTA.

[9]  Alexander H. Waibel,et al.  Decoding Algorithm in Statistical Machine Translation , 1997, ACL.

[10]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[11]  Yoav Goldberg,et al.  A Primer on Neural Network Models for Natural Language Processing , 2015, J. Artif. Intell. Res..

[12]  Jürgen Schmidhuber,et al.  LSTM can Solve Hard Long Time Lag Problems , 1996, NIPS.

[13]  Richard M. Schwartz,et al.  Improved Word-Level System Combination for Machine Translation , 2007, ACL.

[14]  Philipp Koehn,et al.  Neural Machine Translation , 2017, ArXiv.

[15]  Ehud Reiter,et al.  A Structured Review of the Validity of BLEU , 2018, CL.

[16]  Matthew G. Snover,et al.  A Study of Translation Edit Rate with Targeted Human Annotation , 2006, AMTA.

[17]  Samuel Webb Williams,et al.  The Realities of Generating Natural Language from Databases , 1998 .

[18]  Francisco Casacuberta,et al.  Minimum Bayes-risk System Combination , 2011, ACL.

[19]  Yang Feng,et al.  Lattice-based System Combination for Statistical Machine Translation , 2009, EMNLP.

[20]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[21]  Nelson Morgan,et al.  The Elements of Automatic Summarization , 2011 .

[22]  Wei-Yun Ma,et al.  Hybrid System Combination for Machine Translation: An Integration of Phrase-level and Sentences-level Combination Approaches , 2014 .

[23]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[24]  Nicola Ueffing,et al.  Generating titles for millions of browse pages on an e-Commerce site , 2017, INLG.

[25]  Francisco Casacuberta,et al.  Improving the minimum Bayes’ risk combination of machine translation systems , 2013, IWSLT.

[26]  Loïc Barrault,et al.  Many , 2020, Definitions.

[27]  Alexander M. Rush,et al.  Abstractive Sentence Summarization with Attentive Recurrent Neural Networks , 2016, NAACL.

[28]  Spyridon Matsoukas,et al.  Trait-Based Hypothesis Selection For Machine Translation , 2012, HLT-NAACL.

[29]  Jonathan G. Fiscus,et al.  A post-processing system to yield reduced word error rates: Recognizer Output Voting Error Reduction (ROVER) , 1997, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings.

[30]  Kuldip K. Paliwal,et al.  Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..

[31]  Wei-Yun Ma,et al.  System Combination for Machine Translation through Paraphrasing , 2015, EMNLP.

[32]  Ralph Weischedel,et al.  A STUDY OF TRANSLATION ERROR RATE WITH TARGETED HUMAN ANNOTATION , 2005 .

[33]  Jason Weston,et al.  A Neural Attention Model for Abstractive Sentence Summarization , 2015, EMNLP.