Multilevel Cross‐Dependent Binary Longitudinal Data

We introduce methodology for the analysis of multilevel binary data observed longitudinally, when the repeated longitudinal measurements are correlated. The proposed model is a logistic functional regression conditioned on three latent processes that describe the within-variability, the between-variability, and the cross-dependence of the repeated longitudinal measurements. We estimate the model components without mixed-effects modeling, relying instead on an approximation to the logistic link function. The primary objectives of this article are to highlight the challenges in estimating the model components, to compare two approximations to the logistic regression function, linear and exponential, and to discuss their advantages and limitations. The linear approximation is computationally efficient, whereas the exponential approximation applies to rare-event functional data. Our methods are inspired by and applied to a scientific experiment on spectral backscatter from long-range infrared light detection and ranging (LIDAR) data. The models are general and relevant to many new binary functional data sets, with or without dependence between repeated functional measurements.
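To make the contrast between the two approximations concrete, the sketch below compares the logistic function with a linear and an exponential surrogate. The abstract does not state the exact forms, so these are assumptions: the linear form is the first-order Taylor expansion of the logistic function around zero, 1/2 + x/4, and the exponential form is exp(x), which is close to the logistic function only when the event probability is small. The function names are illustrative, not taken from the article.

```python
import math

def expit(x):
    """Logistic (inverse-logit) function: 1 / (1 + exp(-x))."""
    return 1.0 / (1.0 + math.exp(-x))

def linear_approx(x):
    """Assumed linear form: first-order Taylor expansion of expit at 0."""
    return 0.5 + x / 4.0

def exponential_approx(x):
    """Assumed exponential form: expit(x) is close to exp(x) for very negative x."""
    return math.exp(x)

if __name__ == "__main__":
    # Compare the three functions at a few linear-predictor values.
    print(f"{'x':>6} {'expit':>10} {'linear':>10} {'exponential':>12}")
    for x in (-6.0, -4.0, -1.0, -0.2, 0.0, 0.2, 1.0):
        print(f"{x:6.1f} {expit(x):10.5f} {linear_approx(x):10.5f} "
              f"{exponential_approx(x):12.5f}")
```

Under these assumed forms, the linear surrogate tracks the logistic function only near probability 1/2 and can leave the interval [0, 1] for large |x|, while the exponential surrogate has relative error exp(x), which is small only when the linear predictor is strongly negative, that is, for rare events.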
