Multicriteria Gene Screening for Microarray Experiments

Over the past decade there has been an explosion in the amount of genomic data available to biomedical researchers due to advances in biotechnology. For example, using gene microarrays, it is now possible to probe a person’s gene expression profile over the more than 20,000 genes in the human genome. Signals extracted from gene microarray experiments can be linked to genetic factors underlying disease, development, and aging in a population. This has greatly accelerated the pace of gene discovery. However, the massive scale and experimental variability of genomic data makes extraction of biologically significant genetic information very challenging. One of the most important problems is to select a list of genes which are both biologically and statistically significant based on the outcomes of gene microarray experiments. We will describe a novel multicriteria method that we have developed for this gene selection problem that allows tight control of both minimum observable differential change (biological significance) and familywise error rate (statistical significance) and also provides a set of simultaneous confidence intervals for the differences.

[1]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[2]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[3]  Terry Speed,et al.  Normalization of cDNA microarray data. , 2003, Methods.

[4]  R. K. Ursem Multi-objective Optimization using Evolutionary Algorithms , 2009 .

[5]  Alfred O. Hero,et al.  Clustering gene expression signals from retinal microarray data , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  M. Eisen,et al.  Gene expression informatics —it's all in your mine , 1999, Nature Genetics.

[7]  C. Li,et al.  Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Christian Drosten,et al.  Characterization of a Novel Coronavirus Associated with Severe Acute Respiratory Syndrome , 2003, Science.

[9]  Trevor Hastie,et al.  Gene Shaving: a new class of clustering methods for expression arrays , 2000 .

[10]  Cheng Li,et al.  Model-based analysis of oligonucleotide arrays: model validation, design issues and standard error application , 2001, Genome Biology.

[11]  D Haussler,et al.  Knowledge-based analysis of microarray gene expression data by using support vector machines. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Ash A. Alizadeh,et al.  Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling , 2000, Nature.

[13]  Alfred O. Hero,et al.  Pareto-optimal methods for gene analysis , 2002 .

[14]  Obi L. Griffith,et al.  The Genome Sequence of the SARS-Associated Coronavirus , 2003, Science.

[15]  Alfred O. Hero,et al.  Pareto analysis for gene filtering in microarray experiments , 2002, 2002 11th European Signal Processing Conference.

[16]  A.O. Hero,et al.  Mathematical morphology applied to spot segmentation and quantification of gene microarray images , 2002, Conference Record of the Thirty-Sixth Asilomar Conference on Signals, Systems and Computers, 2002..

[17]  B. Commoner Is DNA the “secret of life”? , 1965, Clinical pharmacology and therapeutics.

[18]  J. Booth,et al.  Resampling-Based Multiple Testing. , 1994 .

[19]  William J. Wilson,et al.  Multivariate Statistical Methods , 2005, Technometrics.

[20]  Thomas E. Nichols,et al.  Thresholding of Statistical Maps in Functional Neuroimaging Using the False Discovery Rate , 2002, NeuroImage.

[21]  Philip H. Ramsey Nonparametric Statistical Methods , 1974, Technometrics.

[22]  Rafael A Irizarry,et al.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data. , 2003, Biostatistics.

[23]  Thorsten Dickhaus,et al.  Simultaneous Statistical Inference , 2014, Springer Berlin Heidelberg.

[24]  F. Collins,et al.  The Human Genome Project: Lessons from Large-Scale Biology , 2003, Science.

[25]  A. Hero,et al.  Posterior Pareto Front Analysis for Gene Filtering , .