Heterogeneous Queries for Synoptic and Phrasal Search: Notebook for PAN at CLEF 2014

This paper describes the architecture of the source retrieval system used at the PAN 2014 lab on uncovering plagiarism, authorship, and social software misuse. The system is based on the systems used in previous years at PAN 2013 [6] and PAN 2012 [5]. The majority of their features were adopted, with some improvements described in this paper. The source retrieval subsystem forms an integral part of a modern system for plagiarism discovery.