A review of multivariate analyses in imaging genetics

Recent advances in neuroimaging technology and molecular genetics provide the unique opportunity to investigate genetic influence on the variation of brain attributes. Since the year 2000, when the initial publication on brain imaging and genetics was released, imaging genetics has been a rapidly growing research approach with increasing publications every year. Several reviews have been offered to the research community focusing on various study designs. In addition to study design, analytic tools and their proper implementation are also critical to the success of a study. In this review, we survey recent publications using data from neuroimaging and genetics, focusing on methods capturing multivariate effects accommodating the large number of variables from both imaging data and genetic data. We group the analyses of genetic or genomic data into either a priori driven or data driven approach, including gene-set enrichment analysis, multifactor dimensionality reduction, principal component analysis, independent component analysis (ICA), and clustering. For the analyses of imaging data, ICA and extensions of ICA are the most widely used multivariate methods. Given detailed reviews of multivariate analyses of imaging data available elsewhere, we provide a brief summary here that includes a recently proposed method known as independent vector analysis. Finally, we review methods focused on bridging the imaging and genetic data by establishing multivariate and multiple genotype-phenotype-associations, including sparse partial least squares, sparse canonical correlation analysis, sparse reduced rank regression and parallel ICA. These methods are designed to extract latent variables from both genetic and imaging data, which become new genotypes and phenotypes, and the links between the new genotype-phenotype pairs are maximized using different cost functions. The relationship between these methods along with their assumptions, advantages, and limitations are discussed.

[1]  Ting Hu,et al.  Characterizing genetic interactions in human disease association studies using statistical epistasis networks , 2011, BMC Bioinformatics.

[2]  Jason H. Moore,et al.  BIOINFORMATICS REVIEW , 2005 .

[3]  Moo K. Chung,et al.  Spatially augmented LPboosting for AD classification with evaluations on the ADNI dataset , 2009, NeuroImage.

[4]  Thomas E. Nichols,et al.  The ENIGMA Consortium: large-scale collaborative analyses of neuroimaging and genetic data , 2014, Brain Imaging and Behavior.

[5]  C. F. Beckmann,et al.  Tensorial extensions of independent component analysis for multisubject FMRI analysis , 2005, NeuroImage.

[6]  Tülay Adali,et al.  Estimating the number of independent components for functional magnetic resonance imaging data , 2007, Human brain mapping.

[7]  Erika Cule,et al.  Significance testing in ridge regression for genetic data , 2011, BMC Bioinformatics.

[8]  Stacey Winham,et al.  Applications of multifactor dimensionality reduction to genome-wide data using the R package 'MDR'. , 2013, Methods in molecular biology.

[9]  Mark S. Cohen,et al.  Patterns of brain activation in people at risk for Alzheimer's disease. , 2000, The New England journal of medicine.

[10]  T. Adali,et al.  Unmixing fMRI with independent component analysis , 2006, IEEE Engineering in Medicine and Biology Magazine.

[11]  Florence Thibaut,et al.  Why schizophrenia genetics needs epigenetics: a review. , 2012, Psychiatria Danubina.

[12]  Shannon L. Risacher,et al.  A large scale multivariate parallel ICA method reveals novel imaging–genetic relationships for Alzheimer's disease in the ADNI cohort , 2012, NeuroImage.

[13]  V. Calhoun,et al.  Combining fMRI and SNP data to investigate connections between brain function and genetics using parallel ICA , 2009, Human brain mapping.

[14]  Scott M. Williams,et al.  New strategies for identifying gene-gene interactions in hypertension , 2002, Annals of medicine.

[15]  Jingyu Liu,et al.  Sparse canonical correlation analysis applied to fMRI and genetic data fusion , 2010, 2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[16]  Godfrey D Pearlson,et al.  Source density‐driven independent component analysis approach for fMRI data , 2005, Human brain mapping.

[17]  D. Hardoon,et al.  Correlation-based multivariate analysis of genetic influence on brain volume , 2009, Neuroscience Letters.

[18]  Vince D. Calhoun,et al.  Rare Copy Number Deletions Predict Individual Variation in Intelligence , 2011, PloS one.

[19]  Kenny Q. Ye,et al.  Gene Size Matters , 2012, PloS one.

[20]  Vince D. Calhoun,et al.  Human Neuroscience , 2022 .

[21]  Vince D. Calhoun,et al.  A review of multivariate methods for multimodal fusion of brain imaging data , 2012, Journal of Neuroscience Methods.

[22]  Shannon L. Risacher,et al.  Identifying quantitative trait loci via group-sparse multitask regression and feature selection: an imaging genetics study of the ADNI cohort , 2012, Bioinform..

[23]  V. Calhoun,et al.  An ICA with reference approach in identification of genetic variation and associated brain networks , 2012, Front. Hum. Neurosci..

[24]  Aapo Hyvärinen,et al.  A Fast Fixed-Point Algorithm for Independent Component Analysis of Complex Valued Signals , 2000, Int. J. Neural Syst..

[25]  Andrew J. Saykin,et al.  Voxelwise genome-wide association study (vGWAS) , 2010, NeuroImage.

[26]  Rebecca Campbell,et al.  A Comparison of Three Methods , 2003 .

[27]  Aapo Hyvärinen,et al.  Independent component analysis of fMRI group studies by self-organizing clustering , 2005, NeuroImage.

[28]  Doo Hyun Chung,et al.  Comparison of Invariant NKT Cells with Conventional T Cells by Using Gene Set Enrichment Analysis (GSEA) , 2011, Immune network.

[29]  F. Agakov,et al.  Abundant pleiotropy in human complex diseases and traits. , 2011, American journal of human genetics.

[30]  Vince D. Calhoun,et al.  Integrated Analysis of Gene Expression and Copy Number Data on Gene Shaving Using Independent Component Analysis , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[31]  S V Faraone,et al.  Genetics of Alzheimer's disease. , 1996, Journal of the Formosan Medical Association = Taiwan yi zhi.

[32]  Vince D. Calhoun,et al.  Rare Copy Number Deletions Predict Individual Variation in Human Brain Metabolite Concentrations in Individuals with Alcohol Use Disorders , 2011, Biological Psychiatry.

[33]  Scott M. Williams,et al.  A Simple and Computationally Efficient Approach to Multifactor Dimensionality Reduction Analysis of Gene-Gene Interactions for Quantitative Traits , 2013, PloS one.

[34]  Christian R. Marshall,et al.  Copy number variations and risk for schizophrenia in 22q11.2 deletion syndrome , 2008, Human molecular genetics.

[35]  Thomas E. Nichols,et al.  False positives in neuroimaging genetics using voxel-based morphometry data , 2011, NeuroImage.

[36]  A. Meyer-Lindenberg,et al.  Intermediate phenotypes and genetic mechanisms of psychiatric disorders , 2006, Nature Reviews Neuroscience.

[37]  Taesung Park,et al.  A novel method to identify high order gene-gene interactions in genome-wide association studies: Gene-based MDR , 2012, BMC Bioinformatics.

[38]  Michael I. Jordan,et al.  Kernel independent component analysis , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[39]  Annarita D'Addabbo,et al.  Comparative study of gene set enrichment methods , 2009, BMC Bioinformatics.

[40]  D. Weinberger,et al.  Imaging Genetics: Perspectives from Studies of Genetically Driven Variation in Serotonin Function and Corticolimbic Affective Processing , 2006, Biological Psychiatry.

[41]  J. H. Moore,et al.  Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. , 2001, American journal of human genetics.

[42]  Jessica A. Turner,et al.  Multifaceted genomic risk for brain function in schizophrenia , 2012, NeuroImage.

[43]  Jiang Gui,et al.  A computationally efficient hypothesis testing method for epistasis analysis using multifactor dimensionality reduction , 2009, Genetic epidemiology.

[44]  L. K. Hansen,et al.  Independent component analysis of functional MRI: what is signal and what is noise? , 2003, Current Opinion in Neurobiology.

[45]  Daniel R. Weinberger,et al.  Imaging genetics—days of future past , 2010, NeuroImage.

[46]  Antonio Moreno,et al.  Significant correlation between a set of genetic polymorphisms and a functional brain network revealed by feature selection and sparse Partial Least Squares , 2012, NeuroImage.

[47]  Roman Filipovych,et al.  Semi-supervised pattern classification of medical images: Application to mild cognitive impairment (MCI) , 2011, NeuroImage.

[48]  Vince D. Calhoun,et al.  PARALLEL INDEPENDENT COMPONENT ANALYSIS FOR MULTIMODAL ANALYSIS: APPLICATION TO FMRI AND EEG DATA , 2007, 2007 4th IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[49]  Giovanni Montana,et al.  Random forests on distance matrices for imaging genetics studies , 2013, Statistical applications in genetics and molecular biology.

[50]  Jessica A. Turner,et al.  Guided exploration of genomic risk for gray matter abnormalities in schizophrenia using parallel independent component analysis with reference , 2013, NeuroImage.

[51]  Russ B. Altman,et al.  Independent component analysis: Mining microarray data for fundamental human gene expression modules , 2010, J. Biomed. Informatics.

[52]  David P. Kreil,et al.  Independent component analysis of microarray data in the study of endometrial cancer , 2004, Oncogene.

[53]  Laurence Faivre,et al.  Recurrent rearrangements in synaptic and neurodevelopmental genes and shared biologic pathways in schizophrenia, autism, and mental retardation. , 2009, Archives of general psychiatry.

[54]  Patrik D'haeseleer,et al.  How does gene expression clustering work? , 2005, Nature Biotechnology.

[55]  Mark T. W. Ebbert,et al.  Genetics of Alzheimer's Disease , 2013, BioMed research international.

[56]  V. Calhoun,et al.  Methylation patterns in whole blood correlate with symptoms in schizophrenia patients. , 2014, Schizophrenia bulletin.

[57]  P. Visscher,et al.  Common polygenic variation contributes to risk of schizophrenia and bipolar disorder , 2009, Nature.

[58]  H. Akaike A new look at the statistical model identification , 1974 .

[59]  Vincent J Schmithorst,et al.  Comparison of three methods for generating group statistical inferences from independent component analysis of functional magnetic resonance imaging data , 2004, Journal of magnetic resonance imaging : JMRI.

[60]  J. Cardoso Infomax and maximum likelihood for blind source separation , 1997, IEEE Signal Processing Letters.

[61]  S. Lawrie,et al.  The influence of polygenic risk for bipolar disorder on neural activation assessed using fMRI , 2012, Translational Psychiatry.

[62]  D. Chakrabarti,et al.  A fast fixed - point algorithm for independent component analysis , 1997 .

[63]  Andreas Papassotiropoulos,et al.  Genetics of human episodic memory: dealing with complexity , 2011, Trends in Cognitive Sciences.

[64]  I. Gottesman,et al.  The endophenotype concept in psychiatry: etymology and strategic intentions. , 2003, The American journal of psychiatry.

[65]  M. Daly,et al.  PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes , 2003, Nature Genetics.

[66]  Vince D. Calhoun,et al.  ICA order selection based on consistency: Application to genotype data , 2012, 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[67]  Vince D. Calhoun,et al.  Parallel independent component analysis using an optimized neurovascular coupling for concurrent EEG-fMRI sources , 2011, 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[68]  R. Straub,et al.  Effect of COMT Val108/158 Met genotype on frontal lobe function and risk for schizophrenia , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[69]  M. L. Calle,et al.  Model‐Based Multifactor Dimensionality Reduction for detecting epistasis in case–control data in the presence of noise , 2011, Annals of human genetics.

[70]  Paul M. Thompson,et al.  Identification of gene pathways implicated in Alzheimer's disease using longitudinal imaging phenotypes with sparse regression☆ , 2012, NeuroImage.

[71]  Andrew J. Saykin,et al.  Hippocampal Atrophy as a Quantitative Trait in a Genome-Wide Association Study Identifying Novel Susceptibility Genes for Alzheimer's Disease , 2009, PloS one.

[72]  E. Oja,et al.  Independent Component Analysis , 2013 .

[73]  Trevor J. Hastie,et al.  Genome-wide association analysis by lasso penalized logistic regression , 2009, Bioinform..

[74]  F. Meinecke,et al.  Analysis of Multimodal Neuroimaging Data , 2011, IEEE Reviews in Biomedical Engineering.

[75]  Kent Hutchison,et al.  Identification of Genetic and Epigenetic Marks Involved in Population Structure , 2010, PloS one.

[76]  Ting Hu,et al.  Epistasis, complexity, and multifactor dimensionality reduction. , 2013, Methods in molecular biology.

[77]  Anders D. Børglum,et al.  Genome-wide association study identifies five new schizophrenia loci , 2011, Nature Genetics.

[78]  Daniel R Weinberger,et al.  Neuroimaging-genetic paradigms: a new approach to investigate the pathophysiology and treatment of cognitive deficits in schizophrenia. , 2006, Harvard review of psychiatry.

[79]  JiangDaxin,et al.  Cluster Analysis for Gene Expression Data , 2004 .

[80]  Vince D. Calhoun,et al.  2011 Ieee International Workshop on Machine Learning for Signal Processing Iva for Multi-subject Fmri Analysis: a Comparative Study Using a New Simulation Toolbox , 2022 .

[81]  Jingyu Liu,et al.  A multimodality ICA study - integrating genomic single nucleotide polymorphisms with functional neuroimaging data , 2008, 2008 IEEE International Conference on Bioinformatics and Biomeidcine Workshops.

[82]  De-Shuang Huang,et al.  Independent component analysis-based penalized discriminant method for tumor classification using gene expression data , 2006, Bioinform..

[83]  Vince D. Calhoun,et al.  Canonical Correlation Analysis for Data Fusion and Group Inferences , 2010, IEEE Signal Processing Magazine.

[84]  Te-Won Lee,et al.  Independent vector analysis (IVA): Multivariate approach for fMRI group study , 2008, NeuroImage.

[85]  M. Daly,et al.  Identification of risk loci with shared effects on five major psychiatric disorders: a genome-wide analysis , 2013, The Lancet.

[86]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[87]  Scott R Sponheim,et al.  Cumulative genetic risk and prefrontal activity in patients with schizophrenia. , 2013, Schizophrenia bulletin.

[88]  Johnny S. H. Kwan,et al.  GATES: a rapid and powerful gene-based association test using extended Simes procedure. , 2011, American journal of human genetics.

[89]  S. Cordwell,et al.  A proteome analysis of the anterior cingulate cortex gray matter in schizophrenia , 2006, Molecular Psychiatry.

[90]  H. Gunshin,et al.  A review of independent component analysis application to microarray gene expression data. , 2008, BioTechniques.

[91]  Jong-Hwan Lee,et al.  Independent vector analysis (IVA) for group fMRI processing of subcortical area , 2008, Int. J. Imaging Syst. Technol..

[92]  V. Calhoun,et al.  Association of genetic copy number variations at 11 q14.2 with brain regional volume differences in an alcohol use disorder population. , 2012, Alcohol.

[93]  V. Calhoun,et al.  Multisubject Independent Component Analysis of fMRI: A Decade of Intrinsic Networks, Default Mode, and Neurodiagnostic Discovery , 2012, IEEE Reviews in Biomedical Engineering.

[94]  Terrence J. Sejnowski,et al.  An Information-Maximization Approach to Blind Separation and Blind Deconvolution , 1995, Neural Computation.

[95]  Andreas Meyer-Lindenberg,et al.  The future of fMRI and genetics research , 2012, NeuroImage.

[96]  Andreas Meyer-Lindenberg,et al.  Imaging genetics of schizophrenia , 2010, Dialogues in clinical neuroscience.

[97]  T. Adali,et al.  Ieee Workshop on Machine Learning for Signal Processing Semi-blind Ica of Fmri: a Method for Utilizing Hypothesis-derived Time Courses in a Spatial Ica Analysis , 2022 .

[98]  Martin Styner,et al.  Projection Regression Models for Multivariate Imaging Phenotype , 2012, Genetic epidemiology.

[99]  T Jombart,et al.  Genetic markers in the playground of multivariate analysis , 2009, Heredity.

[100]  J. Mazziotta,et al.  Cerebral metabolic and cognitive decline in persons at genetic risk for Alzheimer's disease. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[101]  V. Calhoun,et al.  Semiblind spatial ICA of fMRI using spatial constraints , 2009, Human brain mapping.

[102]  Aidong Zhang,et al.  Cluster analysis for gene expression data: a survey , 2004, IEEE Transactions on Knowledge and Data Engineering.

[103]  Daniel R Weinberger,et al.  Intermediate phenotypes in psychiatric disorders. , 2011, Current opinion in genetics & development.

[104]  John Shawe-Taylor,et al.  Sparse canonical correlation analysis , 2009, Machine Learning.

[105]  S. Mccarroll,et al.  Copy-number variation and association studies of human disease , 2007, Nature Genetics.

[106]  Mayte Suárez-Fariñas,et al.  Evaluation of the Psoriasis Transcriptome across Different Studies by Gene Set Enrichment Analysis (GSEA) , 2010, PloS one.

[107]  Paul M. Thompson,et al.  Imaging genetics via sparse canonical correlation analysis , 2013, 2013 IEEE 10th International Symposium on Biomedical Imaging.

[108]  Vince D. Calhoun,et al.  A Parallel Independent Component Analysis Approach to Investigate Genomic Influence on Brain Function , 2008, IEEE Signal Processing Letters.

[109]  P. Visscher,et al.  A versatile gene-based test for genome-wide association studies. , 2010, American journal of human genetics.

[110]  Kai Wang,et al.  A principal components regression approach to multilocus genetic association studies , 2008, Genetic epidemiology.

[111]  Kurt Hornik,et al.  A quantitative comparison of functional MRI cluster analysis , 2004, Artif. Intell. Medicine.

[112]  J. Pekar,et al.  A method for making group inferences from functional MRI data using independent component analysis , 2001, Human brain mapping.

[113]  Vince D. Calhoun,et al.  A review of group ICA for fMRI data and ICA for joint inference of imaging, genetic, and ERP data , 2009, NeuroImage.

[114]  Vince D. Calhoun,et al.  Genetic Associations of Brain Structural Networks in Schizophrenia: A Preliminary Study , 2010, Biological Psychiatry.

[115]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[116]  Jun Zhu,et al.  A generalized combinatorial approach for detecting gene-by-gene and gene-by-environment interactions with application to nicotine dependence. , 2007, American journal of human genetics.

[117]  Vince D. Calhoun,et al.  A Pilot Study on Collective Effects of 22q13.31 Deletions on Gray Matter Concentration in Schizophrenia , 2012, PloS one.

[118]  Tülay Adali,et al.  Comparison of multi‐subject ICA methods for analysis of fMRI data , 2010, Human brain mapping.

[119]  Paul M. Thompson,et al.  Sparse reduced-rank regression detects genetic associations with voxel-wise longitudinal phenotypes in Alzheimer's disease , 2012, NeuroImage.

[120]  David Skibinski,et al.  Genetic markers , 1993, Nature.

[121]  Zhaoxia Yu,et al.  SNP-based pathway enrichment analysis for genome-wide association studies , 2011, BMC Bioinformatics.

[122]  Vince D. Calhoun,et al.  A projection pursuit algorithm to classify individuals using fMRI data: Application to schizophrenia , 2008, NeuroImage.

[123]  Li Shen,et al.  Genetic pathway‐based hierarchical clustering analysis of older adults with cognitive complaints and amnestic mild cognitive impairment using clinical and neuroimaging phenotypes , 2010, American journal of medical genetics. Part B, Neuropsychiatric genetics : the official publication of the International Society of Psychiatric Genetics.

[124]  Stephen M. Smith,et al.  Probabilistic independent component analysis for functional magnetic resonance imaging , 2004, IEEE Transactions on Medical Imaging.

[125]  Marit Holden,et al.  GSEA-SNP: applying gene set enrichment analysis to SNP data from genome-wide association studies , 2008, Bioinform..

[126]  Andreas Meyer-Lindenberg,et al.  False positives in imaging genetics , 2008, NeuroImage.

[127]  Gary Donohoe,et al.  Brain vs behavior: an effect size comparison of neuroimaging and cognitive studies of genetic risk for schizophrenia. , 2013, Schizophrenia bulletin.

[128]  Hao He,et al.  Three-way (N-way) fusion of brain imaging data based on mCCA+jICA and its application to discriminating schizophrenia , 2013, NeuroImage.

[129]  Thomas E. Nichols,et al.  Discovering genetic associations with high-dimensional neuroimaging phenotypes: A sparse reduced-rank regression approach , 2010, NeuroImage.

[130]  Alessandro Serretti,et al.  Genetics of Alzheimer's disease. A rapidly evolving field. , 2007, Journal of Alzheimer's disease : JAD.

[131]  김태수,et al.  Independent vector analysis = 독립 벡터 분석 , 2007 .

[132]  Shannon L. Risacher,et al.  Identifying disease sensitive and quantitative trait-relevant biomarkers from multidimensional heterogeneous imaging genetics data via sparse multimodal multitask learning , 2012, Bioinform..

[133]  박현욱,et al.  Independent component analysis를 이용한 fMRI 신호 분석 , 1999 .

[134]  Aapo Hyvärinen,et al.  A Fast Fixed-Point Algorithm for Independent Component Analysis , 1997, Neural Computation.

[135]  Te-Won Lee,et al.  Independent Vector Analysis: Definition and Algorithms , 2006, 2006 Fortieth Asilomar Conference on Signals, Systems and Computers.

[136]  E R Martin,et al.  Identification of significant association and gene-gene interaction of GABA receptor subunit genes in autism. , 2005, American journal of human genetics.

[137]  Vince D. Calhoun,et al.  A pilot multivariate parallel ICA study to investigate differential linkage between neural networks and genetic profiles in schizophrenia , 2010, NeuroImage.

[138]  Jean-Franois Cardoso High-Order Contrasts for Independent Component Analysis , 1999, Neural Computation.

[139]  Vince D. Calhoun,et al.  Genetic determinants of target and novelty-related event-related potentials in the auditory oddball response , 2009, NeuroImage.

[140]  Jiang Gui,et al.  A Robust Multifactor Dimensionality Reduction Method for Detecting Gene–Gene Interactions with Application to the Genetic Analysis of Bladder Cancer Susceptibility , 2011, Annals of human genetics.

[141]  Douglas W. Jones,et al.  Genotype Influences In Vivo Dopamine Transporter Availability in Human Striatum , 2000, Neuropsychopharmacology.

[142]  P. Thompson,et al.  Neuroimaging endophenotypes: Strategies for finding genes influencing brain structure and function , 2007, Human brain mapping.

[143]  A. Timperman,et al.  Proteome analysis. , 2004, Methods in molecular biology.

[144]  Vince D. Calhoun,et al.  Parallel ICA identifies sub-components of resting state networks that covary with behavioral indices , 2012, Front. Hum. Neurosci..