Global gene expression analysis by combinatorial optimization

Generally, there is a trade-off between methods of gene expression analysis that are precise but labor-intensive, e.g. RT-PCR, and methods that scale up to global coverage but are not quite as quantitative, e.g. microarrays. In the present paper, we show how how a known method of gene expression profiling (K. Kato, Nucleic Acids Res. 23, 3685-3690 (1995)), which relies on a fairly small number of steps, can be turned into a global gene expression measurement by advanced data post-processing, with potentially little loss of accuracy. Post-processing here entails solving an ancillary combinatorial optimization problem. Validation is performed on in silico experiments generated from the FANTOM data base of full-length mouse cDNA. We present two variants of the method. One uses state-of-the-art commercial software for solving problems of this kind, the other a code developed by us specifically for this purpose, released in the public domain under GPL license.

[1]  K. Livak,et al.  DNA fingerprinting by sampled sequencing. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[2]  N. Samani,et al.  A simple and efficient method for the isolation of differentially expressed genes. , 1998, Journal of molecular biology.

[3]  Bruce Budowle,et al.  Evaluation of a multicapillary electrophoresis instrument for mitochondrial DNA typing. , 2003, Journal of forensic sciences.

[4]  S. P. Fodor,et al.  Light-directed, spatially addressable parallel chemical synthesis. , 1991, Science.

[5]  K. Kato,et al.  Description of the entire mRNA population by a 3' end cDNA fragment generated by class IIS restriction enzymes. , 1995, Nucleic acids research.

[6]  K. Kinzler,et al.  Serial Analysis of Gene Expression , 1995, Science.

[7]  E. Southern,et al.  Parallel synthesis and analysis of large numbers of related chemical compounds: applications to oligonucleotides. , 1994, Journal of biotechnology.

[8]  J. Rowley,et al.  Identifying novel transcripts and novel genes in the human genome by using novel SAGE tags , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[9]  T. Ohnishi,et al.  Expression profiling by iAFLP: A PCR-based method for genome-wide gene expression profiling. , 1999, Genome research.

[10]  Brian W. Kernighan,et al.  An efficient heuristic procedure for partitioning graphs , 1970, Bell Syst. Tech. J..

[11]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[12]  Jakub Orzechowski Westholm,et al.  Local search methods in gene expression analysis , 2002 .

[13]  Laurence A. Wolsey,et al.  Integer and Combinatorial Optimization , 1988 .

[14]  K. Livak,et al.  Real time quantitative PCR. , 1996, Genome research.

[15]  A. Pardee,et al.  Differential display of eukaryotic messenger RNA by means of the polymerase chain reaction. , 1992, Science.

[16]  William H. Press,et al.  Numerical recipes , 1990 .

[17]  E. Southern,et al.  Analyzing and comparing nucleic acid sequences by hybridization to arrays of oligonucleotides: evaluation using experimental models. , 1992, Genomics.

[18]  M. Erlander,et al.  TOGA: an automated parsing technology for analyzing expression of nearly all genes. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[19]  J. W. Simpson,et al.  Gene expression analysis by transcript profiling coupled to a gene database query , 1999, Nature Biotechnology.

[20]  D R Sibson,et al.  Increasing the average abundance of low-abundance cDNAs by ordered subdivision of cDNA populations. , 1997, Methods in molecular biology.

[21]  Dr. Zbigniew Michalewicz,et al.  How to Solve It: Modern Heuristics , 2004 .

[22]  S. Weissman,et al.  Analysis of differential gene expression by display of 3' end restriction fragments of cDNAs. , 1996, Proceedings of the National Academy of Sciences of the United States of America.