Best-worst scaling: consistency of preferences with discrete choice experiments and stability over time

This paper addresses two research questions regarding best-worst scaling (BWS): (a) Are preferences estimated from BWS tasks consistent with those from discrete choice experiments (DCEs)? and (b)  Are population preferences estimated from BWS tasks stable over time? The context of this research was the estimation of a set of preference weights for a number of social care related quality of life indicators, which are used to describe dimensions of people's lives that may be expected to be influenced by social care. This research programme was funded by the UK Department of Health through their HTA programme, and the UK Office for National Statistics through the Treasury's ‘Invest to Save' programme. To address the first question, 300 interviews were undertaken with members of the general public in which they were asked to complete both DCEs and BWS tasks. This approach allowed efficient collection of the data and avoided possible concerns that a matched-sample approach might lead to biases in unobserved characteristics between the groups presented with each preference-elicitation method. The order of the choice tasks was randomised across individuals to avoid bias. The empirical analysis undertaken on the BWS and DCE data revealed that the two methods produced comparable results. Given that the data suggest these methods produce consistent results we can then consider the wider advantages and disadvantages of each method. These are discussed and the decision to take forward and further develop the best-worst scaling task for future phases of the research is explained. The second question, on the stability of preferences, can be addressed from data that we collected in two subsequent phases of data collection using best-worst scaling experiments. In each wave, 500 members of the general public were interviewed to establish their preference weights. These two waves of data collection were undertaken a year apart. Whilst the instrument was subjected to some minor improvements between the two years, this data offers some valuable insights in to the stability of population preferences that have been collected using a best-worst scaling instrument. This paper provides some useful empirical findings on these key questions, although it is acknowledged that the findings may in part be contingent on the context of the study. The paper provides useful insights for other researchers similarly interested in exploring alternative methods available for eliciting data capable of supporting the estimation of discrete choice models for valuation studies and also discusses the benefits that best-worst scaling tasks can provide for some applications.

[1]  H. Bleichrodt A new explanation for the difference between time trade-off utilities and standard gamble utilities. , 2002, Health economics.

[2]  Michel Bierlaire,et al.  BIOGEME: a free package for the estimation of discrete choice models , 2003 .

[3]  Chong W. Kim,et al.  How do the public value different outcomes of social care , 2010 .

[4]  R. Sander Overview: Outcomes of Social Care for Older People and Carers , 1999 .

[5]  P. Burge,et al.  Estimating the value of social care. , 2010, Journal of health economics.

[6]  J. Brazier,et al.  Outcomes of Social Care for Adults , 2012 .

[7]  T. Wykes,et al.  The development of a measure of social care outcome for older people. Funded/commissioned by: Department of Health , 2002 .

[8]  Andrew Daly,et al.  Calculating Errors for Measures Derived from Choice Modeling Estimates , 2012 .

[9]  T. Flynn Using Conjoint Analysis and Choice Experiments to Estimate QALY Values , 2012, PharmacoEconomics.

[10]  T. Peters,et al.  Best--worst scaling: What it can do for health care research and how to do it. , 2007, Journal of health economics.

[11]  Younger adults' understanding of questions for a service user experience survey: a report to the Information Centre for Health and Social Care , 2006 .

[12]  Joanna Coast,et al.  Using discrete choice experiments to understand preferences for quality of life. Variance-scale heterogeneity matters. , 2010, Social science & medicine.

[13]  J. Louviere,et al.  The Role of the Scale Parameter in the Estimation and Comparison of Multinomial Logit Models , 1993 .

[14]  J. Louviere,et al.  Probabilistic models of set-dependent and attribute-level best-worst choice , 2008 .

[15]  J. Bond,et al.  Developing attributes for a generic quality of life measure for older people: preferences or capabilities? , 2006, Social science & medicine.

[16]  Julie Ratcliffe,et al.  Measuring and valuing health benefits for economic evaluation in adolescence: an assessment of the practicality and validity of the child health utility 9D in the Australian adolescent population. , 2012, Value in health : the journal of the International Society for Pharmacoeconomics and Outcomes Research.

[17]  J. Malley,et al.  Younger Adults' Understanding of Questions for a Service User Experience Survey. Funded/commissioned by: The Health and Social Care Information Centre , 2006 .

[18]  A. Petch,et al.  Outcomes Important to People With Intellectual Disabilities , 2008 .

[19]  Martin Knapp,et al.  Measuring and understanding social services outputs , 2005 .

[20]  T. Flynn Valuing citizen and patient preferences in health: recent developments in three types of best–worst scaling , 2010, Expert review of pharmacoeconomics & outcomes research.

[21]  J. Louviere,et al.  Determining the Appropriate Response to Evidence of Public Concern: The Case of Food Safety , 1992 .