Meta-Heuristics in Short Scale Construction: Ant Colony Optimization and Genetic Algorithm

The advent of large-scale assessment and the more frequent use of longitudinal and multivariate approaches to measurement in psychological, educational, and sociological research have increased the demand for psychometrically sound short scales. Shortening scales saves valuable administration time but may yield inadequate measures, because reducing an item set can (a) change the internal structure of the measure, (b) lower reliability and measurement precision, (c) produce measures that cannot effectively discriminate between persons across the intended ability spectrum, and (d) weaken test-criterion relations. Approaches to abbreviating measures differ in how well they handle these problems. We therefore compare the quality and efficiency of three item selection strategies for deriving short scales from an existing long version: a Stepwise COnfirmatory Factor Analytical approach (SCOFA) that maximizes factor loadings, and two metaheuristics, namely Ant Colony Optimization (ACO) with a tailored, user-defined optimization function and a Genetic Algorithm (GA) with an unspecific cost-reduction function. SCOFA-compiled short versions were highly reliable but had poor validity. In contrast, both metaheuristics outperformed SCOFA and produced efficient and psychometrically sound short versions (unidimensional, reliable, sensitive, and valid). We discuss the circumstances under which ACO and GA produce equivalent results and provide recommendations for conditions in which it is advisable to use a metaheuristic with an unspecific out-of-the-box optimization function.
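To make the idea of a metaheuristic with a user-defined optimization function more concrete, the following is a minimal sketch of ACO-style item selection. It is not the implementation used in the study: the function names (`simulate`, `cronbach_alpha`, `aco_select`), the simulated data, and all parameter values are assumptions made for illustration, and Cronbach's alpha serves only as a simple stand-in for a tailored criterion, which would typically combine model fit, reliability, sensitivity, and validity.

```python
import random
from statistics import pvariance


def simulate(n_persons=200, seed=1):
    """Hypothetical data: items 0-5 measure one trait, items 6-11 are noise."""
    rng = random.Random(seed)
    data = []
    for _ in range(n_persons):
        theta = rng.gauss(0, 1)  # latent trait of this person
        good = [theta + rng.gauss(0, 0.5) for _ in range(6)]
        noise = [rng.gauss(0, 1) for _ in range(6)]
        data.append(good + noise)
    return data


def cronbach_alpha(data, items):
    """Cronbach's alpha of the selected item columns (stand-in criterion)."""
    k = len(items)
    item_vars = sum(pvariance([row[j] for row in data]) for j in items)
    total_var = pvariance([sum(row[j] for j in items) for row in data])
    return k / (k - 1) * (1 - item_vars / total_var)


def aco_select(data, n_keep, fitness, n_ants=20, n_iter=30,
               evaporation=0.1, seed=0):
    """Pick n_keep items by pheromone-weighted sampling (simplified ACO)."""
    rng = random.Random(seed)
    n_items = len(data[0])
    pheromone = [1.0] * n_items  # every item starts equally attractive
    best_items, best_fit = None, float("-inf")
    for _ in range(n_iter):
        for _ in range(n_ants):
            # Each ant draws items without replacement, weighted by pheromone.
            pool = list(range(n_items))
            chosen = []
            for _ in range(n_keep):
                weights = [pheromone[j] for j in pool]
                pick = rng.choices(pool, weights=weights, k=1)[0]
                pool.remove(pick)
                chosen.append(pick)
            fit = fitness(data, chosen)
            if fit > best_fit:
                best_items, best_fit = sorted(chosen), fit
        # Evaporate all trails, then reinforce the best solution found so far.
        pheromone = [(1 - evaporation) * p for p in pheromone]
        for j in best_items:
            pheromone[j] += max(best_fit, 0.0)
    return best_items, best_fit


data = simulate()
items, fit = aco_select(data, n_keep=4, fitness=cronbach_alpha)
print("selected items:", items, "alpha:", round(fit, 3))
```

Because the criterion is passed in as an ordinary function, a different optimization function can be swapped in without touching the search loop. That separation is the design choice this sketch is meant to show.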
