Cluster analysis and data visualization of large-scale gene expression data.

The discovery of any new gene requires an analysis of the expression context for that gene. Now that the cDNA and genomic sequencing projects are progressing at such a rapid rate, high throughput gene expression screening approaches are beginning to appear to take advantage of that data. We present a strategy for the analysis for large-scale quantitative gene expression measurement data from time course experiments. Our approach takes advantage of cluster analysis and graphical visualization methods to reveal correlated patterns of gene expression from time series data. The coherence of these patterns suggests an order that conforms to a notion of shared pathways and control processes that can be experimentally verified.

[1]  A. Lindenmayer Mathematical models for cellular interactions in development. II. Simple and branching filaments with two-sided inputs. , 1968, Journal of theoretical biology.

[2]  L Young,et al.  The Jackson Laboratory , 1998, Molecular medicine.

[3]  Philippe Froguel,et al.  Susceptibility to insulin dependent diabetes mellitus maps to a 4.1 kb segment of DNA spanning the insulin gene and associated VNTR , 1993, Nature Genetics.

[4]  Roland Somogyi,et al.  Modeling the complexity of genetic networks: Understanding multigenic and pleiotropic regulation , 1996, Complex..

[5]  A. M. Turing,et al.  The chemical basis of morphogenesis , 1952, Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences.

[6]  J. Barker,et al.  Developmental kinetics of GAD family mRNAs parallel neurogenesis in the rat spinal cord , 1995, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[7]  Stuart A. Kauffman,et al.  The origins of order , 1993 .

[8]  R. Somogyi,et al.  The gene expression matrix: towards the extraction of genetic network architectures , 1997 .

[9]  John Bowman Thomas,et al.  An introduction to statistical communication theory , 1969 .

[10]  Alan S. Lapedes,et al.  Covariation of Mutations in the V3 Loop of HIV-1: An Information Theoretic Analysis , 1995 .

[11]  S Fuhrman,et al.  Reveal, a general reverse engineering algorithm for inference of genetic network architectures. , 1998, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.