A User Study on the Automated Assessment of Reviews

Reviews are text-based feedback provided by a reviewer to the author of a submission. Reviews play a crucial role in providing feedback to people who make assessment decisions (e.g. deciding a student’s grade, purchase decision of a product). It is therefore important to ensure that reviews are of a good quality. In our work we focus on the study of academic reviews. A review is considered to be of a good quality if it can help the author identify mistakes in their work, and help them learn possible ways of fixing them. Metareviewing is the process of evaluating reviews. An automated metareviewing process could provide quick and reliable feedback to reviewers on their assessment of authors’ submissions. Timely feedback on reviews could help reviewers correct their assessments and provide more useful and effective feedback to authors. In this paper we investigate the usefulness of metrics such as review relevance, content type, tone, quantity and plagiarism in determining the quality of reviews. We conducted a study on 24 participants, who used the automated assessment feature on Expertiza, a collaborative peer-reviewing system. The aim of the study is to identify reviewers’ perception of the usefulness of the automated assessment feature and its different metrics. Results suggest that participants find relevance to be the most important and quantity to be the least important in determining a review’s quality. Participants also found the system’s feedback from metrics such as content type and plagiarism to be most useful and informative.

[1]  N. Black,et al.  Development of the review quality instrument (RQI) for assessing peer reviews of manuscripts. , 1999, Journal of clinical epidemiology.

[2]  Virpi Roto,et al.  Understanding, scoping and defining user experience: a survey approach , 2009, CHI.

[3]  E.F. Gehringer,et al.  Work in Progress: Reusable Learning Objects Through Peer Review: The Expertiza Approach , 2006, Proceedings. Frontiers in Education. 36th Annual Conference.

[4]  Martin Ester,et al.  Review recommendation: personalized prediction of the quality of online reviews , 2011, CIKM '11.

[5]  Edward F. Gehringer,et al.  A word-order based graph representation for relevance identification , 2012, CIKM.

[6]  Daniel Marcu,et al.  Finding the WRITE Stuff: Automatic Identification of Discourse Structure in Student Essays , 2003, IEEE Intell. Syst..

[7]  P. Foltz,et al.  Content-based feedback 1 Supporting content-based feedback in online writing evaluation with LSA , 2000 .

[8]  Ravi Kumar,et al.  Matching Reviews to Objects using a Language Model , 2009, EMNLP.

[9]  Bing Liu,et al.  Opinion observer: analyzing and comparing opinions on the Web , 2005, WWW '05.

[10]  Ee-Peng Lim,et al.  Detecting product review spammers using rating behaviors , 2010, CIKM.

[11]  Christian D. Schunn,et al.  Scaffolded writing and rewriting in the discipline: A web-based reciprocal peer review system , 2007, Comput. Educ..

[12]  Ivan Titov,et al.  Modeling online reviews with multi-grain topic models , 2008, WWW.

[13]  Klemens Böhm,et al.  Reviewing the reviewers: A study of author perception on peer reviews in computer science , 2010, 6th International Conference on Collaborative Computing: Networking, Applications and Worksharing (CollaborateCom 2010).

[14]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[15]  A. DeRycker Reviewing Peer Reviews: A Rule-Based Approach , 2010 .

[16]  Elena Tognini-Bonelli,et al.  Corpus Linguistics at Work , 2002, Computational Linguistics.

[17]  Mike Kuniavsky Smart Things: Ubiquitous Computing User Experience Design: Ubiquitous Computing User Experience Design , 2010 .

[18]  Christian D. Schunn,et al.  The nature of feedback: how different types of peer feedback affect writing performance , 2009 .

[19]  Kwangsu Cho Machine Classification of Peer Comments in Physics , 2008, EDM.

[20]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[21]  Richong Zhang,et al.  Review recommendation with graphical model and EM algorithm , 2010, WWW '10.

[22]  Edward F. Gehringer Expertiza: Managing Feedback in Collaborative Learning , 2010 .

[23]  Roy Rada,et al.  Collaborative hypermedia in a classroom setting , 1994 .

[24]  Christian D. Schunn,et al.  Assessing Reviewer's Performance Based on Mining Problem Localization in Peer-Review Data , 2010, EDM.