Identifying Manipulated Offerings on Review Portals

Recent work has developed supervised methods for detecting deceptive opinion spam: fake reviews written to sound authentic and deliberately mislead readers. Whereas past work has focused on identifying individual fake reviews, this paper aims to identify offerings (e.g., hotels) that contain fake reviews. We introduce a semi-supervised manifold ranking algorithm for this task, which relies on a small set of labeled individual reviews for training. Then, in the absence of gold-standard labels at the offering level, we introduce a novel evaluation procedure that ranks artificial instances of real offerings, where each artificial offering contains a known number of injected deceptive reviews. Experiments on a novel dataset of hotel reviews show that the proposed method outperforms state-of-the-art learning baselines.
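The abstract does not spell out the ranking algorithm, so the following is only a generic illustration of what manifold ranking usually means: scores from a few labeled seed nodes are propagated over a similarity graph by iterating f <- alpha * S f + (1 - alpha) * y, where S is the symmetrically normalized affinity matrix. The function name `manifold_rank`, the toy graph, the seed vector, and the value of `alpha` are all assumptions made for this sketch and are not taken from the paper.

```python
import math


def manifold_rank(weights, seeds, alpha=0.85, iters=200):
    """Generic manifold ranking by iteration (illustrative, not the paper's method).

    weights: symmetric affinity matrix W as a list of lists (zero diagonal).
    seeds:   initial score vector y (e.g. 1.0 for nodes labeled deceptive).
    Returns the score vector after `iters` updates of
    f <- alpha * S f + (1 - alpha) * y, with S = D^{-1/2} W D^{-1/2}.
    """
    n = len(weights)
    # Node degrees; fall back to 1.0 so isolated nodes do not divide by zero.
    deg = [sum(row) or 1.0 for row in weights]
    # Symmetric normalization of the affinity matrix.
    s = [[weights[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
         for i in range(n)]
    f = list(seeds)
    for _ in range(iters):
        f = [alpha * sum(s[i][j] * f[j] for j in range(n))
             + (1 - alpha) * seeds[i]
             for i in range(n)]
    return f


# Hypothetical toy graph: a chain 0 - 1 - 2 - 3 with a weak link between 1 and 2.
toy_weights = [
    [0.0, 1.0, 0.0, 0.0],
    [1.0, 0.0, 0.1, 0.0],
    [0.0, 0.1, 0.0, 1.0],
    [0.0, 0.0, 1.0, 0.0],
]
toy_seeds = [1.0, 0.0, 0.0, 0.0]  # only node 0 carries a (made-up) label
scores = manifold_rank(toy_weights, toy_seeds)
ranking = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
```

In this toy setting the scores fall off with graph distance from the seed node, which is the behavior a semi-supervised ranker relies on when only a small set of labeled reviews is available. How the paper builds its graph over reviews and offerings is not stated in the abstract and is not modeled here.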

Claire Cardie | Jiwei Li | Myle Ott
