A Plan for Ancillary Copyright: Original Snippets

The snippets that web search engines generate for their result presentation are extracted from the retrieved web pages, reusing pieces of text that match a user’s query. Copyright owners of the retrieved web pages are typically not asked for usage rights. This long-time practice now faces increasing backlash from news publishers, legal action, and even new legislation in Germany and Spain: the so-called ancillary copyright for news publishers. This copyright law restricts the fair use of intellectual property of news publishers, allowing them to raise claims for monetary compensation when their text is reused, even within snippets. If passed at the EU level, ancillary copyright could severely impact future information system development. This paper promotes a “technological remedy”, namely, to synthesize true original snippets without text reuse.

[1]  Michalis Vazirgiannis,et al.  Automated snippet generation for online advertising , 2013, CIKM.

[2]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[3]  Bowen Zhou,et al.  Abstractive Text Summarization using Sequence-to-sequence RNNs and Beyond , 2016, CoNLL.

[4]  Ed H. Chi,et al.  The singularity is not near: slowing growth of Wikipedia , 2009, Int. Sym. Wikis.

[5]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[6]  Ziyang Liu,et al.  Query biased snippet generation in XML search , 2008, SIGMOD Conference.

[7]  Matthias Hagen,et al.  Clickbait Detection , 2016, ECIR.

[8]  Christopher D. Manning,et al.  Get To The Point: Summarization with Pointer-Generator Networks , 2017, ACL.

[9]  Thorsten Joachims,et al.  Eye-tracking analysis of user behavior in WWW search , 2004, SIGIR '04.

[10]  David Maxwell,et al.  A Study of Snippet Length and Informativeness: Behaviour, Performance and User Experience , 2017, SIGIR.

[11]  Falk Scholer,et al.  Constructing query-biased summaries: a comparison of human and system generated snippets , 2010, IIiX.

[12]  Ryen W. White,et al.  Finding relevant documents using top ranking sentences: an evaluation of two alternative schemes , 2002, SIGIR '02.

[13]  Haofen Wang,et al.  Snippet Generation for Semantic Web Search Engines , 2008, ASWC.

[14]  Hugh E. Williams,et al.  Fast generation of result snippets in web search , 2007, SIGIR.

[15]  Tamás D. Gedeon,et al.  What Snippet Size is Needed in Mobile Web Search? , 2017, CHIIR.

[16]  Marti A. Hearst,et al.  Improving Search Results Quality by Customizing Summary Lengths , 2008, ACL.

[17]  Mark Sanderson,et al.  Advantages of query biased summaries in information retrieval , 1998, SIGIR '98.

[18]  Alexander M. Rush,et al.  Abstractive Sentence Summarization with Attentive Recurrent Neural Networks , 2016, NAACL.

[19]  Phyllis B. Baxendale,et al.  Machine-Made Index for Technical Literature - An Experiment , 1958, IBM J. Res. Dev..

[20]  Vishal Gupta,et al.  Recent automatic text summarization techniques: a survey , 2016, Artificial Intelligence Review.

[21]  Jason Weston,et al.  A Neural Attention Model for Abstractive Sentence Summarization , 2015, EMNLP.

[22]  Pavel Braslavski,et al.  Search Snippet Evaluation at Yandex: Lessons Learned and Future Directions , 2011, CLEF.

[23]  Brent J. Hecht,et al.  The Substantial Interdependence of Wikipedia and Google: A Case Study on the Relationship Between Peer Production Communities and Information Technologies , 2017, ICWSM.

[24]  Edward Cutrell,et al.  What are you looking for?: an eye-tracking study of information usage in web search , 2007, CHI.

[25]  Ryen W. White,et al.  The Use of Implicit Evidence for Relevance Feedback in Web Retrieval , 2002, ECIR.