Clustering work and family trajectories by using a divisive algorithm

Summary. We present an approach to the construction of clusters of life course trajectories and use it to obtain ideal types of trajectories that can be interpreted and analysed meaningfully. We represent life courses as sequences on a monthly timescale and apply optimal matching analysis to compute dissimilarities between individuals. We introduce a new divisive clustering algorithm which has features that are in common with both Ward's agglomerative algorithm and classification and regression trees. We analyse British Household Panel Survey data on the employment and family trajectories of women. Our method produces clusters of sequences for which it is straightforward to determine who belongs to each cluster, making it easier to interpret the relative importance of life course factors in distinguishing subgroups of the population. Moreover our method gives guidance on selecting the number of clusters.

[1]  A. Abbott,et al.  Measuring Resemblance in Sequence Data: An Optimal Matching Analysis of Musicians' Careers , 1990, American Journal of Sociology.

[2]  J. Morgan,et al.  Problems in the Analysis of Survey Data, and a Proposal , 1963 .

[3]  Stefani Scherer Stepping-Stones or Traps? , 2004 .

[4]  T. Caliński,et al.  A dendrite method for cluster analysis , 1974 .

[5]  A. Fielding,et al.  Binary Segmentation in Survey Analysis with Particular Reference to AID , 1977 .

[6]  T. Falbo,et al.  Female Labour Market Behaviour and Fertility: A Rational-Choice Approach , 1992 .

[7]  J. Jong-Gierveld,et al.  Female Labour Market Behaviour and Fertility , 1991 .

[8]  G. N. Lance,et al.  Computer programs for monothetic classification ("Association analysis") , 1965, Comput. J..

[9]  W. T. Williams,et al.  Dissimilarity Analysis: a new Technique of Hierarchical Sub-division , 1964, Nature.

[10]  John C. Henretta Shirley Dex (ed.), Life and Work History Analyses: Qjzalitative and Quantitative Developments , Routledge, London, 244 pp., pbk. £12.99, ISBN 0 415 05338 2. , 1992, Ageing and Society.

[11]  A. Scott,et al.  297. Note: On the Edwards and Cavalli-Sforza Method of Cluster Analysis , 1971 .

[12]  J. Ermisch,et al.  Early motherhood and later partnerships , 2005 .

[13]  A W EDWARDS,et al.  A METHOD FOR CLUSTER ANALYSIS. , 1965, Biometrics.

[14]  J. Hobcraft,et al.  Childhood Poverty, Early Motherhood and Adult Social Exclusion , 1999, The British journal of sociology.

[15]  Lawrence L. Wu Some Comments on “Sequence Analysis and Optimal Matching Methods in Sociology: Review and Prospect” , 2000 .

[16]  L. Hubert Monotone invariant clustering procedures , 1973 .

[17]  J. Mirowsky Age at First Birth, Health, and Mortality∗ , 2005, Journal of health and social behavior.

[18]  Richard D. Wiggins,et al.  Transitions from school to work in a changing social context , 2001 .

[19]  M. Savage,et al.  Ascription into Achievement: Models of Career Systems at Lloyds Bank, 1890-1970 , 1996, American Journal of Sociology.

[20]  Andrew Abbott,et al.  Reply to Levine and Wu , 2000 .

[21]  A. Chevalier,et al.  The long-run labour market consequences of teenage motherhood in Britain , 2000 .

[22]  S. Stansfeld,et al.  Partnership history and mental health over time , 2003, Journal of epidemiology and community health.

[23]  Miguel A. Malo,et al.  Employment status mobility from a life-cycle perspective , 2003 .

[24]  T. Chan,et al.  Optimal Matching Analysis: A Methodological Note on Studying Career Mobility , 1995 .

[25]  Joseph B. Kruskal,et al.  Time Warps, String Edits, and Macromolecules , 1999 .

[26]  A. Abbott Sequence analysis: new methods for old ideas , 1995 .

[27]  Catherine Hakim,et al.  A New Approach to Explaining Fertility Patterns: Preference Theory , 2003 .

[28]  Raffaella Piccarreta,et al.  Sequence Analysis of BHPS Life Course Data , 2005 .

[29]  A. Abbott,et al.  Sequence Analysis and Optimal Matching Methods in Sociology , 2000 .

[30]  Joel Levine But What Have You Done for Us Lately? , 2000 .

[31]  Stefani Scherer,et al.  Early Career Patterns - A Comparison of Great Britain and West Germany , 2001 .

[32]  Brendan Halpin,et al.  Class careers as sequences : An optimal matching analysis of work-life histories , 1998 .

[33]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .

[34]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[35]  J. Ermisch,et al.  Early childbearing and housing choices , 2004 .

[36]  G. N. Lance,et al.  Note on a New Information-Statistic Classificatory Program , 1968, Comput. J..

[37]  W. T. Williams,et al.  Multivariate Methods in Plant Ecology: I. Association-Analysis in Plant Communities , 1959 .

[38]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[39]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[40]  C. Hakim,et al.  Lifestyle Preferences as Determinants of Women's Differentiated Labor Market Careers , 2002 .

[41]  Michael Anyadike-Danes,et al.  Predicting successful and unsuccessful transitions from school to work by using sequence methods , 2002 .