From query to question in one click: suggesting synthetic questions to searchers

In Web search, users may remain unsatisfied for several reasons: the search engine may not be effective enough, or the query might not reflect their intent. Years of research have focused on providing the best user experience for the data available to the search engine. However, little has been done to address the cases in which relevant content for the specific user need has not yet been posted on the Web. One obvious solution is to ask other users directly to generate the missing content using Community Question Answering services such as Yahoo! Answers or Baidu Zhidao. However, formulating a full-fledged question after having issued a query requires some effort. Previous work has proposed automatically generating natural language questions from a given query, but not for scenarios in which a searcher is presented with a list of questions to choose from. We propose here to generate synthetic questions that the searcher can click to post them directly as questions on a Community Question Answering service. This imposes new constraints, as the questions will actually be shown to searchers, who will not appreciate an awkward style or redundancy. To this end, we introduce a learning-based approach that improves not only the relevance of the suggested questions to the original query, but also their grammatical correctness. In addition, since queries are often underspecified and ambiguous, we put a special emphasis on increasing the diversity of suggestions via a novel diversification mechanism. We conducted several experiments to evaluate our approach by comparing it to prior work. The experiments show that our algorithm improves question quality by 14% over prior work and that adding diversification reduces redundancy by 55%.
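
To make the idea of diversifying a suggestion list concrete, the following is a minimal sketch of greedy reranking in the style of maximal marginal relevance (MMR). It is not the diversification mechanism proposed in this work, which the abstract does not specify. The function names `jaccard` and `diversify`, the `trade_off` parameter, the token-overlap similarity, and the relevance scores in the example are all assumptions made for illustration.

```python
from typing import List, Tuple


def jaccard(a: str, b: str) -> float:
    """Token-overlap similarity between two questions (illustrative choice)."""
    tokens_a, tokens_b = set(a.lower().split()), set(b.lower().split())
    if not tokens_a or not tokens_b:
        return 0.0
    return len(tokens_a & tokens_b) / len(tokens_a | tokens_b)


def diversify(candidates: List[Tuple[str, float]], k: int,
              trade_off: float = 0.5) -> List[str]:
    """Greedily pick up to k questions, balancing relevance and novelty.

    candidates: (question, relevance) pairs; the relevance scores are
    assumed to come from some upstream ranker (hypothetical here).
    trade_off: weight on relevance; (1 - trade_off) penalizes redundancy.
    """
    remaining = list(candidates)
    selected: List[Tuple[str, float]] = []

    def marginal_score(item: Tuple[str, float]) -> float:
        question, relevance = item
        # Redundancy is the highest similarity to anything already chosen.
        redundancy = max(
            (jaccard(question, chosen) for chosen, _ in selected),
            default=0.0,
        )
        return trade_off * relevance - (1.0 - trade_off) * redundancy

    while remaining and len(selected) < k:
        best = max(remaining, key=marginal_score)
        selected.append(best)
        remaining.remove(best)
    return [question for question, _ in selected]


suggestions = diversify(
    [
        ("how do i fix a flat bike tire", 0.90),
        ("how do i fix a flat bicycle tire", 0.88),
        ("where can i buy a bike tire", 0.60),
    ],
    k=2,
)
```

In this toy example the second candidate is nearly a paraphrase of the first, so the less relevant but more distinct third candidate takes the second slot. Raising `trade_off` toward 1.0 would favor relevance and let the near-duplicate back in.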
