DNetPRO: A network approach for low-dimensional signatures from high-throughput data

The objective of many high-throughput “omics” studies is to obtain a relatively low-dimensional set of observables - signature - for sample classification purposes (diagnosis, prognosis, stratification). We propose DNetPRO, Discriminant Analysis with Network PROcessing, a supervised signature identification method based on a bottom-up combinatorial approach that exploits the discriminant power of all variable pairs. The algorithm is easily scalable allowing efficient computing even for high number of observables (104 − 105). We show applications on real high-throughput genomic datasets in which our method outperforms existing results, or compares to them but with a smaller number of selected variables. Moreover the linearity of DNetPRO allows a clearer interpretation of the obtained signatures in comparison to non linear classification models

[1]  Geoffrey S Ginsburg,et al.  Personalized medicine: progress and promise. , 2011, Annual review of genomics and human genetics.

[2]  J. Beckmann,et al.  Reconciling evidence-based medicine and precision medicine in the era of big data: challenges and opportunities , 2016, Genome Medicine.

[3]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[4]  Adam A. Margolin,et al.  Assessing the clinical utility of cancer genomic and proteomic data across tumor types , 2014, Nature Biotechnology.

[5]  Mirko Francesconi,et al.  Overcoming resistance to conventional drugs in Ewing sarcoma and identification of molecular predictors of outcome. , 2009, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[6]  Atul J Butte,et al.  Robust meta-analysis of gene expression using the elastic net , 2015, Nucleic acids research.

[7]  Tiejun Tong,et al.  Gene Selection Using Iterative Feature Elimination Random Forests for Survival Outcomes , 2012, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[8]  E. Liebscher A Universal Selection Method in Linear Regression Models , 2012 .

[9]  T. M. Johnson Perspective on Precision Medicine in Oncology , 2017, Pharmacotherapy.

[10]  R. R. Hocking The analysis and selection of variables in linear regression , 1976 .

[11]  Aixia Guo,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2014 .