Experiment Specific Expression Patterns

The differential analysis of genes between microarrays from several experimental conditions or treatments routinely estimates which genes change significantly between groups. As genes are never regulated individually observed behavior may be a consequence of changes in other genes. Existing approaches like co-expression analysis aim to resolve such patterns from a wide range of experiments. The knowledge of such a background set of experiments can be used to compute expected gene behavior based on known links. It is particularly interesting to detect previously unseen specific effects in other experiments. Here, a new method to spot genes deviating from expected behavior (PAttern DEviation SCOring - Padesco ) is devised. It uses linear regression models learned from a background set to arrive at gene specific prediction accuracy distributions. For a given experiment it is then decided whether each gene is predicted better or worse than expected. This provides a novel way to estimate the experiment specificityof each gene. We propose a validation procedure to estimate the detection of such specific candidates and show that these can be identified with an average accuracy of about 85 percent.

[1]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[2]  Homin K. Lee,et al.  Coexpression analysis of human genes across many microarray data sets. , 2004, Genome research.

[3]  Alex E. Lash,et al.  Gene Expression Omnibus: NCBI gene expression and hybridization array data repository , 2002, Nucleic Acids Res..

[4]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[5]  H Huland,et al.  Local intratumor immunotherapy of prostate cancer with interleukin-2 reduces tumor growth. , 1999, Anticancer research.

[6]  Trevor Hastie,et al.  Imputing Missing Data for Gene Expression Arrays , 2001 .

[7]  Joseph D. Schwartzman,et al.  Mice Lacking the Chemokine Receptor CCR1 Show Increased Susceptibility to Toxoplasma gondii Infection1 , 2001, The Journal of Immunology.

[8]  K. Kinzler,et al.  Cancer genes and the pathways they control , 2004, Nature Medicine.

[9]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[10]  J. Welsh,et al.  Analysis of gene expression identifies candidate markers and pharmacological targets in prostate cancer. , 2001, Cancer research.

[11]  Bernhard Schölkopf,et al.  Kernel Methods in Computational Biology , 2005 .

[12]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[13]  Haifeng Li,et al.  Integrative missing value estimation for microarray data , 2006, BMC Bioinformatics.

[14]  Ao Li,et al.  Missing value estimation for DNA microarray gene expression data by Support Vector Regression imputation and orthogonal coding scheme , 2006, BMC Bioinformatics.

[15]  Sanghyun Park,et al.  Extraction of Informative Genes from Integrated Microarray Data , 2008, ISMIS.

[16]  Gene H. Golub,et al.  Missing value estimation for DNA microarray gene expression data: local least squares imputation , 2005, Bioinform..

[17]  Brad T. Sherman,et al.  DAVID: Database for Annotation, Visualization, and Integrated Discovery , 2003, Genome Biology.

[18]  Srinivasan Parthasarathy,et al.  Construction of a reference gene association network from multiple profiling data: application to data analysis , 2007, Bioinform..

[19]  E. Keller,et al.  Interleukin-6 and prostate cancer progression. , 2001, Cytokine & growth factor reviews.

[20]  D. Koller,et al.  A module map showing conditional activity of expression modules in cancer , 2004, Nature Genetics.

[21]  Sarah J. Roberts-Thomson,et al.  Calcium and cancer: targeting Ca2+ transport , 2007, Nature Reviews Cancer.

[22]  I. Fabregat Dysregulation of apoptosis in hepatocellular carcinoma cells. , 2009, World journal of gastroenterology.

[23]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[24]  W. Isaacs,et al.  Interleukin‐2 transfected prostate cancer cells generate a local antitumor effect in vivo , 1994, The Prostate.

[25]  Carlos Prieto,et al.  Algorithm to find gene expression profiles of deregulation and identify families of disease-altered genes , 2006, Bioinform..

[26]  E. Lander,et al.  Gene expression correlates of clinical prostate cancer behavior. , 2002, Cancer cell.

[27]  Yudong D. He,et al.  Gene expression profiling predicts clinical outcome of breast cancer , 2002, Nature.

[28]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[29]  F. Wright,et al.  Gene expression profiling of isogenic cells with different TP53 gene dosage reveals numerous genes that are affected by TP53 dosage and identifies CSPG2 as a direct target of p53 , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Jan J. Battermann,et al.  Local therapy of cancer with free IL-2 , 2008, Cancer Immunology, Immunotherapy.

[31]  Nello Cristianini,et al.  Support vector machine classification and validation of cancer tissue samples using microarray expression data , 2000, Bioinform..

[32]  Rainer Spang,et al.  Finding disease specific alterations in the co-expression of genes , 2004, ISMB/ECCB.

[33]  Jianping Li,et al.  A multiple kernel support vector machine scheme for feature selection and rule extraction from gene expression data of cancer tissue , 2007, Artif. Intell. Medicine.

[34]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[35]  H. Llewelyn Roderick,et al.  Ca2+ signalling checkpoints in cancer: remodelling Ca2+ for cancer cell proliferation and survival , 2008, Nature Reviews Cancer.

[36]  Damien Chaussabel,et al.  Unique gene expression profiles of human macrophages and dendritic cells to phylogenetically distinct parasites. , 2003, Blood.

[37]  T. Barrette,et al.  Meta-analysis of microarrays: interstudy validation of gene expression profiles reveals pathway dysregulation in prostate cancer. , 2002, Cancer research.

[38]  Alexander J. Smola,et al.  Learning with Kernels: support vector machines, regularization, optimization, and beyond , 2001, Adaptive computation and machine learning series.

[39]  Elizabeth Garrett-Mayer,et al.  Cross-study validation and combined analysis of gene expression microarray data. , 2007, Biostatistics.

[40]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[41]  K. Struhl,et al.  A transcriptional signature and common gene networks link cancer with lipid metabolism and diverse human diseases. , 2010, Cancer cell.