Computational Intelligence Methods for Bioinformatics and Biostatistics: 15th International Meeting, CIBB 2018, Caparica, Portugal, September 6–8, 2018, Revised Selected Papers

Commonly, in gene expression microarray measurements multiple missing expression values are generated, and the proper handling of missing values is a critical task. To address the issue, in this paper a novel methodology, based on compressive sensing mechanism, is proposed in order to analyze gene expression data on the basis of topological characteristics of gene expression time series. The approach conceives, when data are recovered, their processing through a non-linear PCA for dimensional reduction and a Hierarchical Clustering Algorithm for agglomeration and visualization. Experiments have been performed on the yeast Saccharomyces cerevisiae dataset by considering different percentages of information loss. The approach highlights robust performance when high percentage of loss of information occurs and when few sampling data are available.

[1]  R. Irizarry,et al.  Missing data and technical variability in single‐cell RNA‐sequencing experiments , 2018, Biostatistics.

[2]  Antje Chang,et al.  BRENDA , the enzyme database : updates and major new developments , 2003 .

[3]  Nir Yosef,et al.  Bayesian Inference for a Generative Model of Transcriptome Profiles from Single-cell RNA Sequencing , 2018, bioRxiv.

[4]  Giancarlo Mauri,et al.  LASSIE: simulating large-scale models of biochemical systems on GPUs , 2017, BMC Bioinformatics.

[5]  Antonino Fiannaca,et al.  Variable Ranking Feature Selection for the Identification of Nucleosome Related Sequences , 2018, ADBIS.

[6]  Lei Shi,et al.  SABIO-RK—database for biochemical reaction kinetics , 2011, Nucleic Acids Res..

[7]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[8]  Ernst Wit,et al.  Extended differential geometric LARS for high-dimensional GLMs with general dispersion parameter , 2017, Statistics and Computing.

[9]  Antonino Fiannaca,et al.  miRTissue: a web application for the analysis of miRNA-target interactions in human tissues , 2018, BMC Bioinformatics.

[10]  David M. Blei,et al.  Variational Inference: A Review for Statisticians , 2016, ArXiv.

[11]  Yoshua Bengio,et al.  Gradient Flow in Recurrent Nets: the Difficulty of Learning Long-Term Dependencies , 2001 .

[12]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[13]  Mattia Antonino Di Gangi,et al.  Deep Learning Architectures for DNA Sequence Classification , 2016, WILF.

[14]  Aleksandra A. Kolodziejczyk,et al.  The technology and biology of single-cell RNA sequencing. , 2015, Molecular cell.

[15]  Z. Shao,et al.  Hsa-miR-206 represses the proliferation and invasion of breast cancer cells by targeting Cx43. , 2015, European review for medical and pharmacological sciences.

[16]  Chadrashekar Kagepura Thammaiah,et al.  Role of let-7 family microRNA in breast cancer , 2016, Non-coding RNA research.

[17]  Giancarlo Mauri,et al.  Gillespie’s Stochastic Simulation Algorithm on MIC coprocessors , 2016, The Journal of Supercomputing.

[18]  Alex A. Pollen,et al.  Low-coverage single-cell mRNA sequencing reveals cellular heterogeneity and activated signaling pathways in developing cerebral cortex , 2014, Nature Biotechnology.

[19]  Salvatore Gaglio,et al.  A Deep Learning Network for Exploiting Positional Information in Nucleosome Related Sequences , 2017, IWBBIO.

[20]  Ernst Wit,et al.  Differential geometric least angle regression: a differential geometric approach to sparse generalized linear models , 2013 .

[21]  Antonino Fiannaca,et al.  MiRNATIP: a SOM-based miRNA-target interactions predictor , 2016, BMC Bioinformatics.

[22]  John Eric Wilson Isozymes of mammalian hexokinase: structure, subcellular localization and metabolic function , 2003, Journal of Experimental Biology.

[23]  Sanghyuk Lee,et al.  miRGator v3.0: a microRNA portal for deep sequencing, expression profiling and mRNA targeting , 2012, Nucleic Acids Res..

[24]  D. Broomhead,et al.  A model of yeast glycolysis based on a consistent kinetic characterisation of all its enzymes , 2013, FEBS letters.

[25]  Antonino Fiannaca,et al.  A Deep Learning Approach to DNA Sequence Classification , 2015, CIBB.

[26]  C. Burge,et al.  Most mammalian mRNAs are conserved targets of microRNAs. , 2008, Genome research.

[27]  Riccardo Rizzo,et al.  Deep learning architectures for prediction of nucleosome positioning from sequences data , 2018, BMC Bioinformatics.

[28]  Jianqing Fan,et al.  Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties , 2001 .

[29]  Kevin P. Claffey,et al.  Differential protein expression profiles in estrogen receptor-positive and -negative breast cancer tissues using label-free quantitative proteomics. , 2010, Genes & cancer.

[30]  S. Linnarsson,et al.  Cell types in the mouse cortex and hippocampus revealed by single-cell RNA-seq , 2015, Science.

[31]  Emma Pierson,et al.  ZIFA: Dimensionality reduction for zero-inflated single-cell gene expression analysis , 2015, Genome Biology.

[32]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[33]  G. von Heijne,et al.  Tissue-based map of the human proteome , 2015, Science.

[34]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[35]  Anton J. Enright,et al.  Human MicroRNA Targets , 2004, PLoS biology.

[36]  Marco Beccuti,et al.  Overcoming the lack of kinetic information in biochemical reactions networks , 2017, PERV.

[37]  Masami Yokota Hirai,et al.  Identification of a Metabolic Reaction Network from Time-Series Data of Metabolite Concentrations , 2013, PloS one.

[38]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[39]  Ernst Wit,et al.  dglars: An R Package to Estimate Sparse Generalized Linear Models , 2014 .

[40]  E. Allgower,et al.  Introduction to Numerical Continuation Methods , 1987 .

[41]  H. Pazira High-dimensional variable selection for GLMs and survival models , 2017 .

[42]  Antonino Fiannaca,et al.  A Deep Learning Model for Epigenomic Studies , 2016, 2016 12th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS).

[43]  Valerie Speirs,et al.  Down-Regulation of miR-92 in Breast Epithelial Cells and in Normal but Not Tumour Fibroblasts Contributes to Breast Carcinogenesis , 2015, PloS one.

[44]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[45]  Antonino Fiannaca,et al.  Deep learning models for bacteria taxonomic classification of metagenomic data , 2018, BMC Bioinformatics.