Local spatial biclustering and prediction of urban juvenile delinquency and recidivism

Using a novel database, ProDES, developed by the Crime and Justice Research Center at Temple University, this article investigates the relationship between spatial characteristics and juvenile delinquency and recidivism—the proportion of delinquents who commit crimes following completion of a court‐ordered program—in Philadelphia, PA. ProDES was originally a case‐based sample, where the cases were adjudicated in family court, 1994–2004. For our analysis, we focused attention on studying 6768 juvenile males from the data set. To address the difficult issue of nonstationarity in the data, we considered various two‐way clustering algorithms to group the juveniles into ‘types’ by way of the many variables that described the juveniles. Following different modeling scenarios, we applied the plaid biclustering algorithm in which a sequence of subsets (‘layers’) of both juveniles and variables are extracted from the data one layer at a time, but where overlapping layers are allowed. This type of ‘biclustering’ is a new way of studying juvenile‐offense data. We show that the juveniles within each layer can be viewed as spatially clustered. The layers were determined as descriptive tools to aid in identifying subsets of the data that could be useful in policy making. Statistical relationships of the variables and juveniles within each layer are then studied using neural network models. Results indicate that the methods of this paper are more successful in predicting juvenile recidivism in urban environments when different crimes are modeled as separate data sets rather than being pooled together as a single data set. © 2011 Wiley Periodicals, Inc. Statistical Analysis and Data Mining 2011

[1]  B. B. Brown,et al.  Adolescents' Relationships with Peers , 2013 .

[2]  Zoran Obradovic,et al.  The Effect of Neighborhood Characteristics and Spatial Spillover on Urban Juvenile Delinquency and Recidivism , 2011 .

[3]  José Luís Oliveira,et al.  Improving the performance of the iterative signature algorithm for the identification of relevant patterns , 2011, Stat. Anal. Data Min..

[4]  A. Getis The Analysis of Spatial Association by Use of Distance Statistics , 2010 .

[5]  J. Ord,et al.  Local Spatial Autocorrelation Statistics: Distributional Issues and an Application , 2010 .

[6]  T. Seeman,et al.  Neighborhood Effects on Health , 2010 .

[7]  A. Nobel,et al.  Finding large average submatrices in high dimensional data , 2009, 0905.1682.

[8]  John H. Maindonald,et al.  Modern Multivariate Statistical Techniques: Regression, Classification and Manifold Learning , 2009 .

[9]  P. Greenwood Prevention and Intervention Programs for Juvenile Offenders , 2008, The Future of children.

[10]  S. Kaski,et al.  Bayesian biclustering with the plaid model , 2008, 2008 IEEE Workshop on Machine Learning for Signal Processing.

[11]  Tonglin Zhang,et al.  Limiting distribution of the G statistics , 2008 .

[12]  Alan Julian Izenman,et al.  Modern Multivariate Statistical Techniques: Regression, Classification, and Manifold Learning , 2008 .

[13]  Friedrich Leisch,et al.  A toolbox for bicluster analysis in R , 2008 .

[14]  Alan Julian Izenman,et al.  Modern Multivariate Statistical Techniques , 2008 .

[15]  A. Izenman,et al.  Predicting Recidivism: Analyzing the Effects of Individual, Program and Neighborhoods with Cross-Classified Hierarchical Generalized Linear Modeling , 2007 .

[16]  J. Wright,et al.  Gender Differences in the Predictors of Juvenile Delinquency , 2007 .

[17]  Wojtek J. Krzanowski,et al.  Biclustering models for structured microarray data , 2005, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[18]  Wojtek J. Krzanowski,et al.  Improved biclustering of microarray data demonstrated through systematic performance tests , 2005, Comput. Stat. Data Anal..

[19]  Turner Hl Biclustering microarray data : some extensions of the plaid model. , 2005 .

[20]  L. Steinberg,et al.  Reentry of Young Offenders from the Justice System , 2004, Youth violence and juvenile justice.

[21]  M. Goodchild,et al.  Spatially integrated social science , 2004 .

[22]  Luc Anselin,et al.  SPATIAL ANALYSES OF HOMICIDE WITH AREAL DATA , 2004 .

[23]  Inderjit S. Dhillon,et al.  Minimum Sum-Squared Residue Co-Clustering of Gene Expression Data , 2004, SDM.

[24]  Bart De Moor,et al.  Biclustering microarray data by Gibbs sampling , 2003, ECCB.

[25]  Joseph T. Chang,et al.  Spectral biclustering of microarray data: coclustering genes and conditions. , 2003, Genome research.

[26]  Philip S. Yu,et al.  Enhanced biclustering on expression data , 2003, Third IEEE Symposium on Bioinformatics and Bioengineering, 2003. Proceedings..

[27]  G. Ambler Bayesian Two-way Clustering for Gene Expression Data , 2003 .

[28]  Peter A. Flach,et al.  Feature Selection with Labelled and Unlabelled Data , 2002 .

[29]  Roded Sharan,et al.  Discovering statistically significant biclusters in gene expression data , 2002, ISMB.

[30]  L. C. Gordon,et al.  Community differences in the association between parenting practices and child conduct problems , 2002 .

[31]  Richard M. Karp,et al.  Discovering local structure in gene expression data: the order-preserving submatrix problem , 2002, RECOMB '02.

[32]  B. Rankin,et al.  Social Contexts and Urban Adolescent Outcomes: The Interrelated Effects of Neighborhoods, Families, and Peers on African-American Youth , 2002 .

[33]  Inderjit S. Dhillon,et al.  Co-clustering documents and words using bipartite spectral graph partitioning , 2001, KDD '01.

[34]  Ingrid Gould Ellen,et al.  Neighborhood Effects on Health: Exploring the Links and Assessing the Evidence , 2001 .

[35]  Bruce H. Rankin,et al.  Neighborhood Poverty and the Social Isolation of Inner-City African American Families , 2000 .

[36]  George M. Church,et al.  Biclustering of Expression Data , 2000, ISMB.

[37]  M. Charlton,et al.  Quantitative geography : perspectives on spatial data analysis by , 2001 .

[38]  L. Kowaleski-Jones,et al.  Staying Out of Trouble: Community Resources and Problem Behavior Among High‐Risk Adolescents , 2000 .

[39]  L. Lazzeroni Plaid models for gene expression data , 2000 .

[40]  Stephanie J. Funk,et al.  Risk Assessment for Juveniles on Probation , 1999 .

[41]  P. Mazerolle Gender, general strain, and delinquency: An empirical examination , 1998 .

[42]  G. Breakwell Transitions through adolescence: Interpersonal domains and context - Graber,JA, BrooksGunn,J, Petersen,AC , 1997 .

[43]  S. Raudenbush,et al.  Neighborhoods and violent crime: a multilevel study of collective efficacy. , 1997, Science.

[44]  J. Mccord Violence and childhood in the inner city , 1997 .

[45]  D. Elliott,et al.  The Effects of Neighborhood Disadvantage on Adolescent Development , 1996 .

[46]  J. Beaman,et al.  Parents and peer group as mediators of the effect of community structure on adolescent problem behavior , 1996, American journal of community psychology.

[47]  Greg J. Duncan,et al.  Do Neighborhoods Influence Child and Adolescent Development? , 1993, American Journal of Sociology.

[48]  Kelvyn Jones,et al.  Specifying and estimating multilevel models for geographical research , 1991 .

[49]  Samuel R. Staley The Truly Disadvantaged: The Inner City, the Underclass, and Public Policy , 1989 .

[50]  M. Pohlmann The Truly Disadvantaged: The Inner City, the Underclass, and the Public Policy.William Julius Wilson , 1989 .

[51]  L. Anselin,et al.  Spatial Econometrics: Methods and Models , 1988 .

[52]  E. Ziegel COMPSTAT: Proceedings in Computational Statistics , 1988 .

[53]  W. Wilson,et al.  The Truly Disadvantaged: The Inner City, The Underclass, and Public Policy. , 1988 .

[54]  H. D. McKay,et al.  Juvenile Delinquency and Urban Areas , 1943 .