Contributions to assess the reproducibility and the agreement of respondents in CATA tasks

Abstract In consideration of the widespread use of check-all-that-apply (CATA) questions in sensory and consumer research, the investigation of how the panel performs in this methodology is deemed necessary. Checking the reliability of this type of data is relevant, mostly in situations where, as a rapid method, it is obtained from consumers and in less controlled conditions such as online studies. While an excellent performance is expected and required from trained subjects, when working with consumers it is convenient to explore the data obtained, since this can also hint at how seriously they are taking the task, their levels of fatigue, boredom, etc. In this work, we have developed some complementary tools to existing ones to be able to evaluate statistically the reproducibility of both the respondents and the panel. The assessment of the reliability of the panel would not be complete without other methodologies dedicated to the reproducibility at the panel level of the products and of the terms, and to the assessment of the agreements between pairs of respondents, and between each respondent and the panel solution. For those criteria, methodologies based on multiple factor analysis on contingency tables and McNemar test are presented. To illustrate the methodology developed, the statistical complements were applied on data from two studies. The studies, one involving emotions and another one sensory characterisation data, were conducted in two different test–retest sessions.

[1]  Morten Meilgaard,et al.  Sensory Evaluation Techniques , 2020 .

[2]  Jian Bi,et al.  AGREEMENT AND RELIABILITY ASSESSMENTS FOR PERFORMANCE OF SENSORY DESCRIPTIVE PANEL , 2003 .

[3]  Gastón Ares,et al.  Check-all-that-apply (CATA) responses elicited by consumers: Within-assessor reproducibility and stability of sensory product characterizations , 2013 .

[4]  Mónica Bécue-Bertaut,et al.  Multiple Factor Analysis for Contingency Tables in the FactoMineR Package , 2013, R J..

[5]  Y. Escoufier LE TRAITEMENT DES VARIABLES VECTORIELLES , 1973 .

[6]  Frank Rossi,et al.  Assessing sensory panelist performance using repeatability and reproducibility measures , 2001 .

[7]  Jérôme Pagès,et al.  Collection and analysis of perceived product inter-distances using multiple factor analysis: Application to the study of 10 white wines from the Loire Valley , 2005 .

[8]  Herbert L. Meiselman,et al.  Development of a method to measure consumer emotions associated with foods , 2010 .

[9]  Sébastien Lê,et al.  FactoMineR: An R Package for Multivariate Analysis , 2008 .

[10]  Sébastien Lê,et al.  STATISTICAL ANALYSIS OF HIERARCHICAL SORTING DATA , 2011 .

[11]  Michael Meyners,et al.  Panel and panelist agreement for product comparisons in studies of Temporal Dominance of Sensations , 2011 .

[12]  V. Vovk,et al.  Combining P-Values Via Averaging , 2012, Biometrika.

[13]  Jérôme Pagès,et al.  A principal axes method for comparing contingency tables: MFACT , 2004, Comput. Stat. Data Anal..

[14]  Dominique Valentin,et al.  Comparison of conventional descriptive analysis and a citation frequency-based descriptive method for odor profiling: An application to Burgundy Pinot noir wines , 2010 .

[15]  Jérôme Pagès,et al.  Multiple factor analysis (AFMULT package) , 1994 .

[16]  Morton B. Brown 400: A Method for Combining Non-Independent, One-Sided Tests of Significance , 1975 .

[17]  Jérôme Pagès,et al.  Rating of products through scores and free-text assertions: Comparing and combining both , 2008 .

[18]  Martin Kermit,et al.  Assessing the performance of a sensory panel–panellist monitoring and tracking , 2005 .

[19]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[20]  Jérôme Pagès,et al.  Testing the significance of the RV coefficient , 2008, Comput. Stat. Data Anal..

[21]  Dominique Valentin,et al.  Aroma properties of young Spanish monovarietal white wines: a study using sorting task, list of terms and frequency of citation , 2008 .

[22]  John C. Castura,et al.  Existing and new approaches for the analysis of CATA data , 2013 .

[23]  J. Gower Generalized procrustes analysis , 1975 .

[24]  Tormod Næs,et al.  Performance indices in descriptive sensory analysis – A complimentary screening tool for assessor and panel performance , 2013 .

[25]  Michael J. Hautus,et al.  Comparison of performance in the A-Not A, 2-AFC, and same-different tests for the flavor discrimination of margarines: The effect of cognitive decision strategies , 2007 .

[26]  Sara R. Jaeger,et al.  The impact of evoked consumption contexts and appropriateness on emotion responses , 2014 .

[27]  Patrick J. F. Groenen,et al.  Modern Multidimensional Scaling: Theory and Applications , 2003 .

[28]  Robert Sabatier,et al.  The ACT (STATIS method) , 1994 .

[29]  Sébastien Lê,et al.  Exploratory Multivariate Analysis by Example Using R , 2010 .

[30]  Gastón Ares,et al.  Investigation of the number of consumers necessary to obtain stable sample and descriptor configurations from check-all-that-apply (CATA) questions , 2014 .