Multidataset independent subspace analysis extends independent vector analysis

Despite its multivariate nature, independent component analysis (ICA) is generally limited to univariate latents in the sense that each latent component is a scalar process. Independent subspace analysis (ISA), or multidimensional ICA (MICA), is a generalization of ICA which identifies latent independent vector components instead. While ISA/MICA considers multidimensional latent components within a single dataset, our work specifically considers the case of multiple datasets. Independent vector analysis (IVA) is a related technique that also considers multiple datasets explicitly but with a fixed and constrained model. Here, we first show that 1) ISA/MICA naturally extends to the case of multiple datasets (which we call MISA), and that 2) IVA is a special case of this extension. Then we develop an algorithm for MISA and demonstrate its performance on both IVA- and MISA-type problems. The benefit of these extensions is that the vector sources (or subspaces) capture higher order statistical dependence across datasets while retaining independence between subspaces. This is a promising model that can explore complex latent relations across multiple datasets and help identify novel biological traits for intricate mental illnesses such as schizophrenia.

[1]  Aapo Hyvärinen,et al.  Natural Image Statistics - A Probabilistic Approach to Early Computational Vision , 2009, Computational Imaging and Vision.

[2]  Te-Won Lee,et al.  Independent vector analysis (IVA): Multivariate approach for fMRI group study , 2008, NeuroImage.

[3]  Aapo Hyvärinen,et al.  FastISA: A fast fixed-point algorithm for independent subspace analysis , 2006, ESANN.

[4]  Quoc V. Le,et al.  ICA with Reconstruction Cost for Efficient Overcomplete Feature Learning , 2011, NIPS.

[5]  V. Calhoun,et al.  A Selective Review of Multimodal Fusion Methods in Schizophrenia , 2012, Front. Hum. Neurosci..

[6]  Vince D. Calhoun,et al.  ICA order selection based on consistency: Application to genotype data , 2012, 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[7]  Vince D. Calhoun,et al.  A statistically motivated framework for simulation of stochastic data fusion models applied to multimodal neuroimaging , 2014, NeuroImage.

[8]  Jean-François Cardoso,et al.  Multidimensional independent component analysis , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).