Neural Instant Search for Music and Podcast

Over recent years, podcasts have emerged as a novel medium for sharing and broadcasting information over the Internet. Audio streaming platforms originally designed for music content, such as Amazon Music, Pandora, and Spotify, have reported a rapid growth, with millions of users consuming podcasts every day. With podcasts emerging as a new medium for consuming information, the need to develop information access systems that enable efficient and effective discovery from a heterogeneous collection of music and podcasts is more important than ever. However, information access in such domains still remains understudied. In this work, we conduct a large-scale log analysis to study and compare podcast and music search behavior on Spotify, a major audio streaming platform. Our findings suggest that there exist fundamental differences in user behavior while searching for podcasts compared to music. Specifically, we identify the need to improve podcast search performance. We propose a simple yet effective transformer-based neural instant search model that retrieves items from a heterogeneous collection of music and podcast content. Our model takes advantage of multi-task learning to optimize for a ranking objective in addition to a query intent type identification objective. Our experiments on large-scale search logs show that the proposed model significantly outperforms strong baselines for both podcast and music queries.

[1]  Milad Shokouhi,et al.  Time-sensitive query auto-completion , 2012, SIGIR '12.

[2]  Mounia Lalmas,et al.  Aggregated Search , 2011, Advanced Topics in Information Retrieval.

[3]  Joemon M. Jose,et al.  Recent and robust query auto-completion , 2014, WWW.

[4]  Larry P. Heck,et al.  Learning deep structured semantic models for web search using clickthrough data , 2013, CIKM.

[5]  Jaana Kekäläinen,et al.  Cumulated gain-based evaluation of IR techniques , 2002, TOIS.

[6]  Bhaskar Mitra,et al.  Exploring Session Context using Distributed Representations of Queries and Reformulations , 2015, SIGIR.

[7]  Ziv Bar-Yossef,et al.  Context-sensitive query auto-completion , 2011, WWW.

[8]  Jiawei Han,et al.  adaQAC: Adaptive Query Auto-Completion via Implicit Negative Feedback , 2015, SIGIR.

[9]  Hongbo Deng,et al.  A two-dimensional click model for query auto-completion , 2014, SIGIR.

[10]  Giuseppe Ottaviano,et al.  Space-efficient data structures for Top-k completion , 2013, WWW '13.

[11]  Mohand Boughanem,et al.  Aggregated search: A new information retrieval paradigm , 2014, CSUR.

[12]  Craig MacDonald,et al.  Comparing Approaches for Query Autocompletion , 2015, SIGIR.

[13]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[14]  Honghui Chen,et al.  Query Auto-Completion Based on Word2vec Semantic Similarity , 2018 .

[15]  W. Bruce Croft,et al.  From Neural Re-Ranking to Neural Ranking: Learning a Sparse Representation for Inverted Indexing , 2018, CIKM.

[16]  M. de Rijke,et al.  Learning from homologous queries and semantically related terms for query auto completion , 2016, Inf. Process. Manag..

[17]  Jean Garcia-Gathright,et al.  Developing Evaluation Metrics for Instant Search Using Mixed Methods Methods , 2019, SIGIR.

[18]  M. de Rijke,et al.  Attention-based Hierarchical Neural Query Suggestion , 2018, SIGIR.

[19]  Dae Hoon Park,et al.  A Neural Language Model for Query Auto-Completion , 2017, SIGIR.

[20]  W. Bruce Croft,et al.  A Deep Relevance Matching Model for Ad-hoc Retrieval , 2016, CIKM.

[21]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[22]  W. Bruce Croft,et al.  A Deep Look into Neural Ranking Models for Information Retrieval , 2019, Inf. Process. Manag..

[23]  M. de Rijke,et al.  Personalized document re-ranking based on Bayesian probabilistic matrix factorization , 2014, SIGIR.

[24]  Tat-Seng Chua,et al.  Neural Collaborative Filtering , 2017, WWW.

[25]  Pu-Jen Cheng,et al.  Learning user reformulation behavior for query auto-completion , 2014, SIGIR.

[26]  Hongbo Deng,et al.  Analyzing User's Sequential Behavior in Query Auto-Completion via Markov Processes , 2015, SIGIR.

[27]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[28]  W. Bruce Croft,et al.  Guided Transformer: Leveraging Multiple External Sources for Representation Learning in Conversational Search , 2020, SIGIR.

[29]  Kyunghyun Cho,et al.  Passage Re-ranking with BERT , 2019, ArXiv.

[30]  W. Bruce Croft,et al.  Neural Ranking Models with Weak Supervision , 2017, SIGIR.

[31]  Hamed Zamani,et al.  Situational Context for Ranking in Personal Search , 2017, WWW.

[32]  Alistair Moffat,et al.  A Taxonomy of Query Auto Completion Modes , 2017, ADCS.

[33]  Joemon M. Jose,et al.  Evaluating aggregated search pages , 2012, SIGIR '12.

[34]  Bhaskar Mitra,et al.  Query Auto-Completion for Rare Prefixes , 2015, CIKM.

[35]  Milad Shokouhi,et al.  Learning to personalize query auto-completion , 2013, SIGIR.

[36]  Ganesh Venkataraman,et al.  Instant Search: A Hands-on Tutorial , 2016, SIGIR.

[37]  Jean Garcia-Gathright,et al.  Just Give Me What I Want: How People Use and Evaluate Music Search , 2019, CHI.

[38]  Zhiyong Lu,et al.  Personalized neural language models for real-world query auto completion , 2018, NAACL.

[39]  W. Bruce Croft,et al.  Performance Prediction for Non-Factoid Question Answering , 2019, ICTIR.

[40]  Yannis Tzitzikas,et al.  Advancing Search Query Autocompletion Services with More and Better Suggestions , 2010, ICWE.

[41]  Mari Ostendorf,et al.  Personalized Language Model for Query Auto-Completion , 2018, ACL.

[42]  M. de Rijke,et al.  A Survey of Query Auto Completion in Information Retrieval , 2016, Found. Trends Inf. Retr..