PERTURBED FACTOR ANALYSIS: ACCOUNTING FOR GROUP DIFFERENCES IN EXPOSURE PROFILES.

In this article, we investigate group differences in phthalate exposure profiles using NHANES data. Phthalates are a family of industrial chemicals used in plastics and as solvents. There is increasing evidence of adverse health effects of exposure to phthalates on reproduction and neuro-development, and concern about racial disparities in exposure. We would like to identify a single set of low-dimensional factors summarizing exposure to different chemicals, while allowing differences across groups. Improving on current multi-group additive factor models, we propose a class of Perturbed Factor Analysis (PFA) models that assume a common factor structure after perturbing the data via multiplication by a group-specific matrix. Bayesian inference algorithms are defined using a matrix normal hierarchical model for the perturbation matrices. The resulting model is just as flexible as current approaches in allowing arbitrarily large differences across groups but has substantial advantages that we illustrate in simulation studies. Applying PFA to NHANES data, we learn common factors summarizing exposures to phthalates, while showing clear differences across groups.

[1]  N. Kamimura,et al.  Phthalates impact human health: Epidemiological evidences and plausible mechanism of action. , 2017, Journal of hazardous materials.

[2]  Zhe Gan,et al.  Variational Autoencoder for Deep Learning of Images, Labels and Captions , 2016, NIPS.

[3]  Jana Schaich Borg,et al.  Bayesian time-aligned factor analysis of paired multivariate time series , 2019, J. Mach. Learn. Res..

[4]  Daniele Durante,et al.  A note on the multiplicative gamma process , 2016, 1610.03408.

[5]  Lorenzo Trippa,et al.  Bayesian multistudy factor analysis for high-throughput biological data , 2018, The Annals of Applied Statistics.

[6]  H. Lopes,et al.  Sparse Bayesian Factor Analysis When the Number of Factors Is Unknown , 2018, Bayesian Analysis.

[7]  Kyla W. Taylor,et al.  Associations between Personal Care Product Use Patterns and Breast Cancer Risk among White and Black Women in the Sister Study , 2018, Environmental health perspectives.

[8]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[9]  L. Schwartz On Bayes procedures , 1965 .

[10]  Yan Zhao,et al.  Age and Sex-Specific Relationships between Phthalate Exposures and Obesity in Chinese Children at Puberty , 2014, PloS one.

[11]  Neil D. Lawrence,et al.  Gaussian Process Latent Variable Models for Visualisation of High Dimensional Data , 2003, NIPS.

[12]  J. Meza,et al.  A principal factor analysis to characterize agricultural exposures among Nebraska veterans , 2016, Journal of Exposure Science and Environmental Epidemiology.

[13]  Michael A. West,et al.  BAYESIAN MODEL ASSESSMENT IN FACTOR ANALYSIS , 2004 .

[14]  J. Meeker,et al.  Racial and ethnic variations in phthalate metabolite concentration changes across full-term pregnancies , 2017, Journal of Exposure Science and Environmental Epidemiology.

[15]  Sungkyu Jung,et al.  Incorporating covariates into integrated factor analysis of multi‐view data , 2017, Biometrics.

[16]  J. S. Marron,et al.  Angle-based joint and individual variation explained , 2017, J. Multivar. Anal..

[17]  Jan Hannig,et al.  Non-iterative Joint and Individual Variation Explained , 2015, 1512.04060.

[18]  Eric F Lock,et al.  JOINT AND INDIVIDUAL VARIATION EXPLAINED (JIVE) FOR INTEGRATED ANALYSIS OF MULTIPLE DATA TYPES. , 2011, The annals of applied statistics.

[19]  D. Dunson,et al.  Sparse Bayesian infinite factor models. , 2011, Biometrika.

[20]  Christian Aßmann,et al.  Bayesian analysis of static and dynamic factor models: An ex-post approach towards the rotation problem , 2016 .

[21]  Debdeep Pati,et al.  Bayesian Semiparametric Multivariate Density Deconvolution , 2014, Journal of the American Statistical Association.

[22]  F. Perera,et al.  Prenatal Exposure to Phthalates and Childhood Body Size in an Urban Cohort , 2015, Environmental health perspectives.

[23]  E. George,et al.  Fast Bayesian Factor Analysis via Automatic Rotations to Sparsity , 2016 .

[24]  Damien McParland,et al.  CLUSTERING SOUTH AFRICAN HOUSEHOLDS BASED ON THEIR ASSET STATUS USING LATENT VARIABLE MODELS. , 2014, The annals of applied statistics.

[25]  David Dunson,et al.  Bayesian Factorizations of Big Sparse Tensors , 2013, Journal of the American Statistical Association.

[26]  Mi Jung Park,et al.  Phthalate exposure and childhood obesity , 2014, Annals of pediatric endocrinology & metabolism.

[27]  Lorenzo Trippa,et al.  Multi‐study factor analysis , 2016, Biometrics.

[28]  J. Brock,et al.  Racial disparity in maternal phthalates exposure; Association with racial disparity in fetal growth and birth outcomes. , 2019, Environment international.

[29]  Joaquin Quiñonero Candela,et al.  Local distance preservation in the GP-LVM through back constraints , 2006, ICML.

[30]  Yongseok Park,et al.  Meta‐analytic principal component analysis in integrative omics application , 2018, Bioinform..

[31]  Sharon X. Lee,et al.  Mixtures of Factor Analyzers with Fundamental Skew Symmetric Distributions , 2018, 1802.02467.