Mathematically universal and biologically consistent astrocytoma genotype encodes for transformation and predicts survival phenotype

DNA alterations have been observed in astrocytoma for decades. A copy-number genotype predictive of a survival phenotype was only discovered by using the generalized singular value decomposition (GSVD) formulated as a comparative spectral decomposition. Here, we use the GSVD to compare whole-genome sequencing (WGS) profiles of patient-matched astrocytoma and normal DNA. First, the GSVD uncovers a genome-wide pattern of copy-number alterations, which is bounded by patterns recently uncovered by the GSVDs of microarray-profiled patient-matched glioblastoma (GBM) and, separately, lower-grade astrocytoma and normal genomes. Like the microarray patterns, the WGS pattern is correlated with an approximately one-year median survival time. By filling in gaps in the microarray patterns, the WGS pattern reveals that this biologically consistent genotype encodes for transformation via the Notch together with the Ras and Shh pathways. Second, like the GSVDs of the microarray profiles, the GSVD of the WGS profiles separates the tumor-exclusive pattern from normal copy-number variations and experimental inconsistencies. These include the WGS technology-specific effects of guanine-cytosine content variations across the genomes that are correlated with experimental batches. Third, by identifying the biologically consistent phenotype among the WGS-profiled tumors, the GBM pattern proves to be a technology-independent predictor of survival and response to chemotherapy and radiation, statistically better than the patient's age and tumor's grade, the best other indicators, and MGMT promoter methylation and IDH1 mutation. We conclude that by using the complex structure of the data, comparative spectral decompositions underlie a mathematically universal description of the genotype-phenotype relations in cancer that other methods miss.

[1]  Alan Edelman,et al.  The Geometry of Algorithms with Orthogonality Constraints , 1998, SIAM J. Matrix Anal. Appl..

[2]  Xun Chen,et al.  Joint Blind Source Separation for Neurophysiological Data Analysis: Multiset and multimodal methods , 2016, IEEE Signal Processing Magazine.

[3]  F. Speleman,et al.  Constitutional translocation t(1;17)(p36.31-p36.13;q11.2-q12.1) in a neuroblastoma patient. Establishment of somatic cell hybrids and identification of PND/A12M2 on chromosome 1 and NF1/SCYA7 on chromosome 17 as breakpoint flanking single copy markers. , 1995, Oncogene.

[4]  Hongkai Ji,et al.  Hedgehog pathway-regulated gene networks in cerebellum development and tumorigenesis , 2010, Proceedings of the National Academy of Sciences.

[5]  Q. Su,et al.  Expression of Notch-1 and its ligands, Delta-like-1 and Jagged-1, is critical for glioma cell survival and proliferation. , 2005, Cancer research.

[6]  Geert Verbeke,et al.  Chromosome instability is common in human cleavage-stage embryos , 2009, Nature Medicine.

[7]  Jill S Barnholtz-Sloan,et al.  CBTRUS Statistical Report: Primary brain and other central nervous system tumors diagnosed in the United States in 2010–2014 , 2017, Neuro-oncology.

[8]  Gerhard G. Thallinger,et al.  Integrative omics analysis. A study based on Plasmodium falciparum mRNA and protein data , 2014, BMC Systems Biology.

[9]  Haesun Park,et al.  Generalizing discriminant analysis using the generalized singular value decomposition , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Yongcui Wang,et al.  Matrix factorization reveals aging-specific co-expression gene modules in the fat and muscle tissues in nonhuman primates , 2016, Scientific Reports.

[11]  O. Alter,et al.  A Higher-Order Generalized Singular Value Decomposition for Comparison of Global mRNA Expression from Multiple Organisms , 2011, PloS one.

[12]  P. Meltzer,et al.  Twelve amplified and expressed genes localized in a single domain in glioma , 1996, Human Genetics.

[13]  C. Loan Generalizing the Singular Value Decomposition , 1976 .

[14]  S. Grossman,et al.  Published glioblastoma clinical trials from 1980 to 2013: Lessons from the past and for the future. , 2016 .

[15]  Vince D. Calhoun,et al.  Multimodal Data Fusion Using Source Separation: Two Effective Models Based on ICA and IVA and Their Properties , 2015, Proceedings of the IEEE.

[16]  Sharon J. Diskin,et al.  Copy number variation at 1q21.1 associated with neuroblastoma , 2009, Nature.

[17]  J. Briscoe,et al.  Notch Activity Modulates the Responsiveness of Neural Progenitors to Sonic Hedgehog Signaling , 2015, Developmental cell.

[18]  Wim Van Paesschen,et al.  Canonical Correlation Analysis Applied to Remove Muscle Artifacts From the Electroencephalogram , 2006, IEEE Transactions on Biomedical Engineering.

[19]  Alexander Eckehart Urban,et al.  Comprehensive performance comparison of high-resolution array platforms for genome-wide Copy Number Variation (CNV) analysis in humans , 2017, BMC Genomics.

[20]  Desmond J. Higham,et al.  Exploring metabolic pathway disruption in the subchronic phencyclidine model of schizophrenia with the Generalized Singular Value Decomposition , 2011, BMC Systems Biology.

[21]  Enrico Petretto,et al.  Multi-tissue Analysis of Co-expression Networks by Higher-Order Generalized Singular Value Decomposition Identifies Functionally Coherent Transcriptional Modules , 2014, PLoS genetics.

[22]  Robert A. Weinberg,et al.  Creation of human tumour cells with defined genetic elements , 1999, Nature.

[23]  Naoto Endo,et al.  Disruption of a long-range cis-acting regulator for Shh causes preaxial polydactyly , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[24]  J. Lupski,et al.  Genomic rearrangements and sporadic disease , 2007, Nature Genetics.

[25]  M. Wigler,et al.  Circular binary segmentation for the analysis of array-based DNA copy number data. , 2004, Biostatistics.

[26]  I. Conboy,et al.  Imbalance between pSmad3 and Notch induces CDK inhibitors in old muscle stem cells , 2008, Nature.

[27]  D. Botstein,et al.  Generalized singular value decomposition for comparative analysis of genome-scale expression data sets of two different organisms , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[28]  V. Seshan,et al.  FACETS: allele-specific copy number and clonal heterogeneity analysis tool for high-throughput DNA sequencing , 2016, Nucleic acids research.

[29]  Shmuel Friedland,et al.  A New Approach to Generalized Singular Value Decomposition , 2005, SIAM J. Matrix Anal. Appl..

[30]  Orly Alter,et al.  GSVD Comparison of Patient-Matched Normal and Tumor aCGH Profiles Reveals Global Copy-Number Alterations Predicting Glioblastoma Multiforme Survival , 2012, PloS one.

[31]  Steven J. M. Jones,et al.  Comprehensive, Integrative Genomic Analysis of Diffuse Lower-Grade Gliomas. , 2015, The New England journal of medicine.

[32]  Th. Boveri,et al.  Concerning the Origin of Malignant Tumours , 2008 .

[33]  Th. Boveri Concerning the Origin of Malignant Tumours by Theodor Boveri. Translated and annotated by Henry Harris , 2008, Journal of Cell Science.

[34]  M. Saunders,et al.  Towards a Generalized Singular Value Decomposition , 1981 .

[35]  David Haussler,et al.  Human-Specific NOTCH2NL Genes Affect Notch Signaling and Cortical Neurogenesis , 2018, Cell.

[36]  S. Hochreiter,et al.  cn.MOPS: mixture of Poissons for discovering copy number variations in next-generation sequencing data with a low false discovery rate , 2012, Nucleic acids research.

[37]  M. Netsky,et al.  The longevity of patients with glioblastoma multiforme. , 1950, Journal of neurosurgery.

[38]  Mauricio O. Carneiro,et al.  The advantages of SMRT sequencing , 2013, Genome Biology.

[39]  John A. Berger,et al.  Jointly analyzing gene expression and copy number data in breast cancer using data reduction models , 2006, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[40]  S. Artavanis-Tsakonas,et al.  Modulation of notch signaling elicits signature tumors and inhibits hras1-induced oncogenesis in the mouse mammary epithelium. , 2004, The American journal of pathology.

[41]  D. Conrad,et al.  Global variation in copy number in the human genome , 2006, Nature.

[42]  Charles R. Johnson,et al.  Matrix Analysis, 2nd Ed , 2012 .

[43]  Ryan Mills,et al.  Comprehensive assessment of array-based platforms and calling algorithms for detection of copy number variants , 2011, Nature Biotechnology.

[44]  W. Hahn,et al.  Activation of Notch-1 signaling maintains the neoplastic phenotype in human Ras-transformed cells , 2002, Nature Medicine.

[45]  O. Alter,et al.  Platform-Independent Genome-Wide Pattern of DNA Copy-Number Alterations Predicting Astrocytoma Survival and Response to Treatment Revealed by the GSVD Formulated as a Comparative Spectral Decomposition , 2016, PloS one.

[46]  Joshua M. Korn,et al.  Comprehensive genomic characterization defines human glioblastoma genes and core pathways , 2008, Nature.

[47]  T. Graves,et al.  Finished sequence and assembly of the DUF1220-rich 1q21 region using a haploid human genome , 2014, BMC Genomics.

[48]  Gerald J Wyckoff,et al.  Human Lineage–Specific Amplification, Selection, and Neuronal Expression of DUF1220 Domains , 2006, Science.

[49]  David Haussler,et al.  The UCSC Genome Browser database: 2014 update , 2013, Nucleic Acids Res..

[50]  D. Lauffenburger,et al.  TNF-insulin crosstalk at the transcription factor GATA6 is revealed by a model that links signaling and transcriptomic data tensors , 2016, Science Signaling.

[51]  Andrea J. Liu,et al.  DNA Damage Follows Repair Factor Depletion and Portends Genome Variation in Cancer Cells after Pore Migration , 2017, Current Biology.

[52]  M. Delorenzi,et al.  MGMT methylation analysis of glioblastoma on the Infinium methylation BeadChip identifies two distinct CpG regions associated with gene silencing and outcome, yielding a prediction model for comparisons across datasets, tumor grades, and CIMP-status , 2012, Acta Neuropathologica.

[53]  Pierre-Antoine Absil,et al.  Elucidating the Altered Transcriptional Programs in Breast Cancer using Independent Component Analysis , 2007, PLoS Comput. Biol..

[54]  Orly Alter,et al.  Tensor GSVD of Patient- and Platform-Matched Tumor and Normal DNA Copy-Number Profiles Uncovers Chromosome Arm-Wide Patterns of Tumor-Exclusive Platform-Consistent Alterations Encoding for Cell Transformation and Predicting Ovarian Cancer Survival , 2015, PloS one.

[55]  U. Fischer,et al.  Genome-Wide Gene Amplification during Differentiation of Neural Progenitor Cells In Vitro , 2012, PloS one.

[56]  M. Scott,et al.  Patching the gaps in Hedgehog signalling , 2007, Nature Cell Biology.

[57]  Franklin T. Luk,et al.  Canonical correlations and generalized SVD: Applications and new algorithms , 1989 .

[58]  C. Sommer,et al.  Clinically distinct subgroups of glioblastoma multiforme studied by comparative genomic hybridization. , 1996, Laboratory investigation; a journal of technical methods and pathology.

[59]  Bert Vogelstein,et al.  Uncoupling of S phase and mitosis induced by anticancer agents in cells lacking p21 , 1996, Nature.

[60]  H. C. Corben,et al.  Classical Mechanics (2nd ed.) , 1961 .

[61]  Andreas W. Schreiber,et al.  Combining transcriptional datasets using the generalized singular value decomposition , 2008, BMC Bioinformatics.

[62]  D. Haussler,et al.  The Somatic Genomic Landscape of Glioblastoma , 2013, Cell.