Heterogeneous Data Fusion to Type Brain Tumor Biopsies

Current research in biomedical informatics involves analysis of multiple heterogeneous data sets. This includes patient demographics, clinical and pathology data, treatment history, patient outcomes as well as gene expression, DNA sequences and other information sources such as gene ontology. Analysis of these data sets could lead to better disease diagnosis, prognosis, treatment and drug discovery. In this paper, we use machine learning algorithms to create a novel framework to perform the heterogeneous data fusion on both metabolic and molecular datasets, including state-of-the-art high-resolution magic angle spinning (HRMAS) proton (1H) Magnetic Resonance Spectroscopy and gene transcriptome profiling, to intact brain tumor biopsies and to identify different profiles of brain tumors. Our experimental results show our novel framework outperforms any analysis using individual dataset.

[1]  A. Balmain,et al.  How many mutations are required for tumorigenesis? implications from human cancer data , 1993 .

[2]  Igor Kononenko,et al.  Estimating Attributes: Analysis and Extensions of RELIEF , 1994, ECML.

[3]  Huan Liu,et al.  Chi2: feature selection and discretization of numeric attributes , 1995, Proceedings of 7th IEEE International Conference on Tools with Artificial Intelligence.

[4]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[5]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[6]  F. Podo Tumour phospholipid metabolism , 1999, NMR in biomedicine.

[7]  Leo L. Cheng,et al.  Quantification of microheterogeneity in glioblastoma multiforme with ex vivo high-resolution magic-angle spinning (HRMAS) proton magnetic resonance spectroscopy. , 2000, Neuro-oncology.

[8]  Robert C. Thompson,et al.  The “chip” as a specific genetic tool , 2000, Biological Psychiatry.

[9]  D. Morvan,et al.  Melanoma tumors acquire a new phospholipid metabolism phenotype under cystemustine as revealed by high-resolution magic angle spinning proton nuclear magnetic resonance spectroscopy of intact tumor samples. , 2002, Cancer research.

[10]  T. Golub,et al.  Gene expression-based classification of malignant gliomas correlates better with survival than histological classification. , 2003, Cancer research.

[11]  Chris H. Q. Ding,et al.  Minimum redundancy feature selection from microarray gene expression data , 2003, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003.

[12]  Rafael A Irizarry,et al.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data. , 2003, Biostatistics.

[13]  S. Atlas,et al.  Magnetic resonance image–guided proteomics of human glioblastoma multiforme , 2003, Journal of magnetic resonance imaging : JMRI.

[14]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[15]  Tao Li,et al.  A comparative study of feature selection and multiclass classification methods for tissue classification based on gene expression , 2004, Bioinform..

[16]  L. Astrakas,et al.  Noninvasive Magnetic Resonance Spectroscopic Imaging Biomarkers to Predict the Clinical Grade of Pediatric Brain Tumors , 2004, Clinical Cancer Research.

[17]  C. Domeniconi,et al.  An Evaluation of Gene Selection Methods for Multi-class Microarray Data Classification , 2004 .

[18]  M. West,et al.  Gene expression profiling and genetic markers in glioblastoma survival. , 2005, Cancer research.

[19]  Vangelis Metsis,et al.  Spam Filtering with Naive Bayes - Which Naive Bayes? , 2006, CEAS.

[20]  S. Horvath,et al.  Relationship between Survival and Edema in Malignant Gliomas: Role of Vascular Endothelial Growth Factor and Neuronal Pentraxin 2 , 2007, Clinical Cancer Research.

[21]  L. Astrakas,et al.  Combination of high-resolution magic angle spinning proton magnetic resonance spectroscopy and microscale genomics to type brain tumor biopsies. , 2007, International journal of molecular medicine.

[22]  K. Aldape,et al.  Identification of noninvasive imaging surrogates for brain tumor gene-expression modules , 2008, Proceedings of the National Academy of Sciences.