A framework for analysing social sequences

Longitudinal categorical data arise in many diverse areas of the social sciences and methods for its analysis have taken two broad directions. Heuristically, one can attempt to model the state space (i.e., the categories) or the sequence space (i.e., the subjects), typically with event history models or optimal matching respectively. This study proposes a more general framework for inference from such data which acknowledges not only the analytic approach (split into stochastic models and algorithmic differencing) but also hypothesis, sequences, categorisation and representation. The individual sequence can be thought of as a map from time to the state space. The hypothesis relates to how these maps are similar and how they deviate from this structure. The analytical frameworks define what is assumed, what is uncertain, and how this is modelled. The categories of the state variable define what is considered pivotal as an event. Representations create explorative tools and describe structure, as well as communicating high dimensional inferences. It is the interaction between these ideas which is fundamental to making inferences, as well as their relationship to time, which is essential to the social science treatment of sequences. Thus, the analysis should not prefer one approach to analysis over another but appreciate the origin of the data and the theory under examination.

[1]  Henri Poincaré,et al.  The Value of Science , 1905 .

[2]  T. Kuhn,et al.  The Structure of Scientific Revolutions. , 1964 .

[3]  S. Altmann,et al.  Sociobiology of rhesus monkeys. II. Stochastics of social communication. , 1965, Journal of theoretical biology.

[4]  P. Meehl Theory-Testing in Psychology and Physics: A Methodological Paradox , 1967, Philosophy of Science.

[5]  D. J. Bartholomew,et al.  Stochastic Models for Social Processes. , 1968 .

[6]  Brian Everitt,et al.  Cluster analysis , 1974 .

[7]  David Sankoff,et al.  Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison , 1983 .

[8]  Joseph B. Kruskall,et al.  The Symmetric Time-Warping Problem : From Continuous to Discrete , 1983 .

[9]  Roger Bakeman,et al.  Observing Interaction: An Introduction to Sequential Analysis , 1986 .

[10]  Chris Chatfield,et al.  Problem Solving: A Statistician's Guide , 1988 .

[11]  A. Abbott,et al.  Measuring Resemblance in Sequence Data: An Optimal Matching Analysis of Musicians' Careers , 1990, American Journal of Sociology.

[12]  A. Raftery,et al.  Estimation and Modelling Repeated Patterns in High Order Markov Chains with the Mixture Transition Distribution Model , 1994 .

[13]  T. Taris,et al.  Measuring the Agreement between Sequences , 1995 .

[14]  A. Abbott Sequence analysis: new methods for old ideas , 1995 .

[15]  R. Bakeman,et al.  Log-linear approaches to lag-sequential analysis when consecutive codes may and cannot repeat. , 1995 .

[16]  David J. Hand,et al.  Statistics and the theory of measurement , 1996 .

[17]  T. Kuhn The structure of scientific revolutions, 3rd ed. , 1996 .

[18]  Roger Bakeman,et al.  Testing sequential association: Estimating exact p values using sampled permutations. , 1996 .

[19]  Edward R. Tufte Visual explanations: images and quantities, evidence and narrative , 1997 .

[20]  Andrew Abbott,et al.  2. Sequence Comparison Viaalignment and Gibbs Sampling: A Formal Analysis of the Emergence of the Modern Sociological Article , 1997 .

[21]  W C Wilson,et al.  Activity Pattern Analysis by Means of Sequence-Alignment Methods , 1998 .

[22]  Jennifer Smith,et al.  What Have You Done for Us Lately , 1999 .

[23]  Mary Blair-Loy,et al.  Career Patterns of Executive Women in Finance: An Optimal Matching Analysis1 , 1999, American Journal of Sociology.

[24]  Susan Leigh Star,et al.  Sorting Things Out: Classification and Its Consequences , 1999 .

[25]  Andrew Abbott,et al.  Reply to Levine and Wu , 2000 .

[26]  Joel Levine But What Have You Done for Us Lately? , 2000 .

[27]  Paul A. Gore,et al.  11 – Cluster Analysis , 2000 .

[28]  Lawrence L. Wu Some Comments on “Sequence Analysis and Optimal Matching Methods in Sociology: Review and Prospect” , 2000 .

[29]  Andrew Abbott,et al.  Time Matters: On Theory and Method , 2001 .

[30]  Martin Gren,et al.  Time-geography matters , 2001 .

[31]  A. Raftery,et al.  Bayesian Multidimensional Scaling and Choice of Dimension , 2001 .

[32]  Nigel Thrift,et al.  Timespace: Geographies of Temporality , 2001 .

[33]  Y. Ben-Shlomo,et al.  A life course approach to chronic disease epidemiology: conceptual models, empirical challenges and interdisciplinary perspectives. , 2002, International journal of epidemiology.

[34]  K. Prandy Ideal types, stereotypes and classes. , 2002, The British journal of sociology.

[35]  J. Singer,et al.  Applied Longitudinal Data Analysis , 2003 .

[36]  Lawrence L. Wu Event History Models for Life Course Analysis , 2003 .

[37]  Peter Green,et al.  Highly Structured Stochastic Systems , 2003 .

[38]  Cees H. Elzinga,et al.  Sequence Similarity , 2003 .

[39]  William A. V. Clark,et al.  Housing Careers in the United States, 1968-93: Modelling the Sequencing of Housing States , 2003 .

[40]  J. Mortimer,et al.  Handbook of the life course , 2003 .

[41]  Wolff-Michael Roth,et al.  Emergence of Graphing Practices in Scientific Research , 2004 .

[42]  Geert Wets,et al.  Mining Navigation Patterns Using a Sequence Alignment Method , 2004, Knowl. Inf. Syst..

[43]  P. Green,et al.  Bayesian Markov chain Monte Carlo sequence analysis reveals varying neutral substitution patterns in mammalian evolution. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[44]  J. Mcardle Five Steps in Latent Curve Modeling with Longitudinal Life-span Data , 2005 .

[45]  A. Mira Highly Structured Stochastic Systems. Peter J. Green, Nils Lid Hjort, and Sylvia Richardson , 2005 .

[46]  Raffaella Piccarreta,et al.  Analyzing Demographic Life Courses through Sequence Analysis , 2005 .

[47]  Jean-Marie Le Goff,et al.  Incitations for Interdisciplinarity in Life Course Research , 2005 .

[48]  V. Marshall Agency, Events, and Structure at the End of the Life Course☆ , 2005 .

[49]  R. Levy Why look at life courses in an interdisciplinary perspective , 2005 .

[50]  R. Levy,et al.  Towards an interdisciplinary perspective on the life course , 2005 .

[51]  Ulrich Kohler,et al.  Sequence Analysis with Stata , 2006 .

[52]  Clarke Wilson,et al.  Reliability of Sequence-Alignment Analysis of Social Processes: Monte Carlo Tests of Clustalg Software , 2006 .

[53]  D. Stark,et al.  Social Times of Network Spaces: Network Sequences and Foreign Investment in Hungary1 , 2006, American Journal of Sociology.

[54]  Laurent Lesnard,et al.  Optimal Matching and Social Sciences , 2006 .

[55]  Lefteris Angelis,et al.  Validation and interpretation of Web users' sessions clusters , 2007, Inf. Process. Manag..

[56]  Paul Higgs,et al.  Optimal Matching Analysis Using Ideal Types to Describe the Lifecourse: An Illustration of How Histories of Work, Partnerships and Housing Relate to Quality of Life in Early Old Age , 2007 .

[57]  Raffaella Piccarreta,et al.  Clustering work and family trajectories by using a divisive algorithm , 2007 .

[58]  Aart C. Liefbroer,et al.  De-standardization of Family-Life Trajectories of Young Adults: A Cross-National Comparison Using Sequence Analysis , 2007 .

[59]  Gary Pollock,et al.  Holistic trajectories: a study of combined employment, housing and family careers by using multiple‐sequence analysis , 2007 .

[60]  A. Raftery,et al.  Model-Based Clustering With Dissimilarities: A Bayesian Approach , 2007 .

[61]  Noam Shoval,et al.  Sequence Alignment as a Method for Human Activity Analysis in Space and Time , 2007 .

[62]  Christian Brzinsky-Fay,et al.  Lost in transition: labour market entry sequences of school leavers in Europe , 2007 .

[63]  S. Iske Educational Research Online: E-Learning Sequences Analyzed By Means Of Optimal-Matching , 2008 .

[64]  N. Robette,et al.  Comparing Qualitative Harmonic Analysis and Optimal Matching: An Exploratory Study of Occupational Trajectories , 2008 .

[65]  Glenn Firebaugh,et al.  Seven Rules for Social Research , 2008 .

[66]  Peter Martin,et al.  Beyond Transitions: Applying Optimal Matching Analysis to Life Course Research , 2008 .

[67]  Philipp Bucher,et al.  How Much Does It Cost? , 2009 .

[68]  Matissa N. Hollister,et al.  Is Optimal Matching Suboptimal? , 2009 .

[69]  Silke Aisenbrey,et al.  New Life for Old Ideas: The "Second Wave" of Sequence Analysis Bringing the "Course" Back Into the Life Course , 2010 .

[70]  Christian Brzinsky-Fay,et al.  New Developments in Sequence Analysis , 2010 .

[71]  Brendan Halpin,et al.  Optimal Matching Analysis and Life-Course Data: The Importance of Duration , 2010 .

[72]  Anna Reid,et al.  On Becoming a Statistician—A Qualitative View , 2010 .

[73]  Raffaella Piccarreta,et al.  Exploring sequences: a graphical tool based on multi‐dimensional scaling , 2010 .