Sequence analysis of call record data: exploring the role of different cost settings

Sequence analysis is widely used in life course research and more recently has been applied by survey methodologists to summarize complex call record data. However, summary variables derived in this way have proved ineffective for post-survey adjustments, owing to weak correlations with key survey variables. We reflect on the underlying optimal matching algorithm and test the sensitivity of the output to input parameters or ‘costs’, which must be specified by the analyst. The results illustrate the complex relationship between these costs and the output variables which summarize the call record data. Regardless of the choice of costs, there was a low correlation between the summary variables and the key survey variables, limiting the scope for bias reduction. The analysis is applied to call records from the Irish Longitudinal Study on Ageing, which is a nationally representative, face-to-face household survey

[1]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.

[2]  Gabriele B. Durrant,et al.  Evaluating the Measurement Error of Interviewer Observed Paradata , 2013 .

[3]  Frauke Kreuter,et al.  Analyzing contact sequences in call record data. Potential and limitations of sequence indicators for nonresponse adjustments in the European Social Survey , 2009 .

[4]  F. Kreuter,et al.  Paradata for Nonresponse Error Investigation , 2013 .

[5]  Gabriele B. Durrant,et al.  Analysing interviewer call record data by using a multilevel discrete-time event history modelling approach , 2013 .

[6]  Joop J. Hox,et al.  Multilevel modeling: When and why , 1998 .

[7]  Laurent Lesnard Socio-Temporal Patterns Setting Cost in Optimal Matching to Uncover Contemporaneous , 2010 .

[9]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data: Little/Statistical Analysis with Missing Data , 2002 .

[10]  Paul P. Biemer,et al.  Using level‐of‐effort paradata in non‐response adjustments with application to field surveys , 2013 .

[11]  B. West,et al.  The Quality of Paradata: A Literature Review , 2013 .

[12]  Pia Bagnaschi,et al.  [How much does it cost?]. , 2015, Krankenpflege. Soins infirmiers.

[13]  Andrew Abbott,et al.  Sequence Analysis and Optimal Matching Techniques for Social Science Data , 2004 .

[14]  David L. Vannette,et al.  Interviewer speech and the success of survey invitations , 2013 .

[15]  A. Abbott,et al.  Measuring Resemblance in Sequence Data: An Optimal Matching Analysis of Musicians' Careers , 1990, American Journal of Sociology.

[16]  Xavier Bry,et al.  Harpoon or Bait? A Comparison of Various Metrics in Fishing for Sequence Patterns , 2012 .

[17]  Peter Lynn,et al.  Separating refusal bias and non-contact bias: evidence from UK national surveys , 2002 .

[18]  Peter J. Smith,et al.  Sequence analysis as a tool for investigating call record data , 2014 .

[19]  Robert M. Groves,et al.  Nonresponse in Household Interview Surveys: Groves/Nonresponse , 1998 .

[20]  Ulrich Kohler,et al.  Sequence Analysis with Stata , 2006 .

[21]  Jelke Bethlehem,et al.  Handbook of Nonresponse in Household Surveys: Bethlehem/Handbook of Nonresponse , 2011 .

[22]  Lawrence L. Wu Some Comments on “Sequence Analysis and Optimal Matching Methods in Sociology: Review and Prospect” , 2000 .

[23]  Mick P. Couper,et al.  MEASURING SURVEY QUALITY IN A CASIC ENVIRONMENT , 2002 .

[24]  Annelies G. Blom,et al.  Nonresponse bias adjustments: what can process data contribute? , 2009 .

[25]  F. Billari,et al.  Classifying life course trajectories: a comparison of latent class and sequence analysis , 2012 .

[26]  F. Peracchi,et al.  Survey response and survey characteristics: microlevel evidence from the European Community Household Panel , 2005 .

[27]  G. Savva,et al.  Design and Methodology of The Irish Longitudinal Study on Ageing , 2013, Journal of the American Geriatrics Society.

[28]  Ivano Bison,et al.  OM Matters: The Interaction Effects between Indel and Substitution Costs , 2009 .

[29]  Silke Aisenbrey,et al.  New Life for Old Ideas: The "Second Wave" of Sequence Analysis Bringing the "Course" Back Into the Life Course , 2010 .

[30]  Gilbert Ritschard,et al.  Analyzing and Visualizing State Sequences in R with TraMineR , 2011 .

[31]  Gabriele B. Durrant,et al.  Sequence analysis as a graphical tool for investigating call record data , 2014 .

[32]  Li Jiang,et al.  LEVEL-OF-EFFORT PARADATA AND NONRESPONSE ADJUSTMENT MODELS FOR A NATIONAL FACE-TO-FACE SURVEY , 2014 .

[33]  Fiona Steele,et al.  Analysis of Multivariate Social Science Data, Second Edition , 2008 .

[34]  R. Little,et al.  Does Weighting for Nonresponse Increase the Variance of Survey Means? (Conference Paper) , 2004 .

[35]  Matissa N. Hollister,et al.  Is Optimal Matching Suboptimal? , 2009 .

[36]  D. Rubin,et al.  Statistical Analysis with Missing Data , 1988 .

[37]  Jelke Bethlehem,et al.  Handbook of Nonresponse in Household Surveys , 2011 .

[38]  Philipp Bucher,et al.  How Much Does It Cost? , 2009 .

[39]  R. Groves Nonresponse Rates and Nonresponse Bias in Household Surveys , 2006 .

[40]  M. Savage,et al.  Ascription into Achievement: Models of Career Systems at Lloyds Bank, 1890-1970 , 1996, American Journal of Sociology.

[41]  Joel Levine But What Have You Done for Us Lately? , 2000 .

[42]  Robert M. Groves,et al.  Using proxy measures and other correlates of survey outcomes to adjust for non‐response: examples from multiple surveys , 2010 .

[44]  J. Michael Brick,et al.  Unit Nonresponse and Weighting Adjustments: A Critical Review , 2013 .

[45]  Dominique Joye,et al.  Patterns of Contact Attempts in Surveys , 2014 .

[46]  A. Abbott,et al.  Sequence Analysis and Optimal Matching Methods in Sociology , 2000 .