SCiMet: Stable, sCalable and reliable Metric-based framework for quality assessment in collaborative content generation systems

Abstract In collaborative content generation (CCG), such as publishing scientific articles, a group of contributors collaboratively generates artifacts available through a venue. The main concern in such systems is the quality. A remarkable range of research considers quality metrics partially when dealing with the quality of artifacts, contributors, and venues. However, such approaches have several drawbacks. One of the most notable ones is that they are not comprehensive in terms of the metrics to evaluate all entities, including artifacts, contributors, and venues. Also, they are vulnerable to potential attacks. In this paper, we propose a novel iterative definition in which the quality of artifacts, collaborators, and venues are defined interconnectedly. In our framework, the quality of an artifact is defined based on the quality of its contributors, venue, references, and citations. The quality of a contributor is defined based on the quality of his artifacts, collaborators, and the venues. Quality of a venue is defined based on both quality of artifacts and contributors. We propose a data model, formulations, and an algorithm for the proposed approach. We also compare the robustness of our approach against malicious manipulations with two well-known related approaches. The comparison results show the superiority of our method over other related approaches.

[1]  Aleksandar Ignjatovic,et al.  An Iterative Method for Calculating Robust Rating Scores , 2015, IEEE Transactions on Parallel and Distributed Systems.

[2]  Boualem Benatallah,et al.  Quality Control in Crowdsourcing , 2018, ACM Comput. Surv..

[3]  J. E. Hirsch,et al.  An index to quantify an individual's scientific research output , 2005, Proc. Natl. Acad. Sci. USA.

[4]  Carl T. Bergstrom Eigenfactor Measuring the value and prestige of scholarly journals , 2007 .

[5]  H. Stanley,et al.  The science of science: from the perspective of complex systems , 2017 .

[6]  Ding-wei Huang Positive correlation between quality and quantity in academic journals , 2016, J. Informetrics.

[7]  Loet Leydesdorff,et al.  A review of theory and practice in scientometrics , 2015, Eur. J. Oper. Res..

[8]  Mohsen Kahani,et al.  A metric Suite for Systematic Quality Assessment of Linked Open Data , 2020, ArXiv.

[9]  Iraklis Varlamis,et al.  Detecting rising stars in dynamic collaborative networks , 2017, J. Informetrics.

[10]  Boualem Benatallah,et al.  Harnessing Implicit Teamwork Knowledge to Improve Quality in Crowdsourcing Processes , 2014, 2014 IEEE 7th International Conference on Service-Oriented Computing and Applications.

[11]  Aleksandar Ignjatovic,et al.  Rating through Voting: An Iterative Method for Robust Rating , 2012, ArXiv.

[12]  Johan Bollen,et al.  A Principal Component Analysis of 39 Scientific Impact Measures , 2009, PloS one.

[13]  Elisa Bertino,et al.  Quality Control in Crowdsourcing Systems: Issues and Directions , 2013, IEEE Internet Computing.

[14]  Lorenzo Vigentini,et al.  An Iterative Algorithm for Reputation Aggregation in Multi-dimensional and Multinomial Rating Systems , 2015, SEC.

[15]  Paul Van Dooren,et al.  Iterative Filtering in Reputation Systems , 2010, SIAM J. Matrix Anal. Appl..

[16]  Ludo Waltman,et al.  A review of the literature on citation impact indicators , 2015, J. Informetrics.

[17]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[18]  Albert-László Barabási,et al.  Collective credit allocation in science , 2014, Proceedings of the National Academy of Sciences.

[19]  Ludo Waltman,et al.  Constructing bibliometric networks: A comparison between full and fractional counting , 2016, J. Informetrics.

[20]  Suzanne K. Lewis,et al.  Comparison of several author indices for gauging academic productivity , 2019, Informatics in Medicine Unlocked.

[21]  Steffen Fritz,et al.  Assessing quality of volunteer crowdsourcing contributions: lessons from the Cropland Capture game , 2016, Int. J. Digit. Earth.

[22]  David Johnstone,et al.  Factors influencing the decision to crowdsource: A systematic literature review , 2015, Information Systems Frontiers.

[23]  Sandro Morasca,et al.  Property-Based Software Engineering Measurement , 1996, IEEE Trans. Software Eng..

[24]  Salil S. Kanhere,et al.  A Reputation Framework for Social Participatory Sensing Systems , 2014, Mob. Networks Appl..

[25]  Stefano Nasini,et al.  Research impact in co-authorship networks: a two-mode analysis , 2017, J. Informetrics.

[26]  S. Haustein,et al.  Characterizing Social Media Metrics of Scholarly Papers: The Effect of Document Properties and Collaboration Patterns , 2015, PloS one.

[27]  Carl T. Bergstrom,et al.  Author-level Eigenfactor metrics: Evaluating the influence of authors, institutions, and countries within the social science research network community , 2013, J. Assoc. Inf. Sci. Technol..

[28]  Yi-Cheng Zhang,et al.  Information filtering via Iterative Refinement , 2006, ArXiv.

[29]  Karim R. Lakhani,et al.  Citations Systematically Misrepresent the Quality and Impact of Research Articles: Survey and Experimental Evidence from Thousands of Citers , 2020, ArXiv.

[30]  Yan Yan,et al.  The impact of collaboration and knowledge networks on citations , 2017, J. Informetrics.

[31]  Rasmus A. X. Persson,et al.  Bibliometric author evaluation through linear regression on the coauthor network , 2015, J. Informetrics.

[32]  Yannis Manolopoulos,et al.  Gazing at the skyline for star scientists , 2016, J. Informetrics.

[33]  S. Schmid Five years post-DORA: promoting best practices for research assessment , 2017, Molecular biology of the cell.

[34]  Elisa Bertino,et al.  Social-collaborative determinants of content quality in online knowledge production systems: comparing Wikipedia and Stack Overflow , 2018, Social Network Analysis and Mining.

[35]  Jeffrey Braithwaite,et al.  Comprehensive Researcher Achievement Model (CRAM): a framework for measuring researcher achievement, impact and influence derived from a systematic literature review of metrics and models , 2019, BMJ Open.

[36]  Kristina Lerman,et al.  Dynamics of Content Quality in Collaborative Knowledge Production , 2017, ICWSM.

[37]  Ronald Rousseau,et al.  Positive correlation between journal production and journal impact factors , 2016, J. Informetrics.

[38]  Geert Poels,et al.  Distance-based software measurement: necessary and sufficient properties for software measures , 2000, Inf. Softw. Technol..

[39]  R. Cagan The San Francisco Declaration on Research Assessment , 2013, Disease Models & Mechanisms.

[40]  Jie Tang,et al.  ArnetMiner: extraction and mining of academic social networks , 2008, KDD.