Primer on medical genomics. Part III: Microarray experiments and data analysis.

Genomics has been defined as the comprehensive study of whole sets of genes, gene products, and their interactions as opposed to the study of single genes or proteins. Microarray technology is one of many novel tools that are allowing global and high-throughput analysis of genes and gene products. In addition to an introduction on underlying principles, the current review focuses on the use of both complementary DNA and oligodeoxynucleotide microarrays in gene expression analysis. Genome-wide experiments generate a massive amount of data points that require systematic methods of analysis to extract biologically useful information. Accordingly, the current educational communication discusses different methods of data analysis, including supervised and unsupervised clustering algorithms. Illustrative clinical examples show clinical applications, including (1) identification of candidate genes or pathological pathways (ie, elucidation of pathogenesis); (2) identification of "new" molecular classes of diseases that may be relevant in disease reclassification, prognostication, and treatment selection (ie, class discovery); and (3) use of expression profiles of known disease classes to predict diagnosis and classification of unknown samples (ie, class prediction). The current review should serve as an introduction to the subject for clinician investigators, physicians and medical scientists in training, practicing clinicians, and other students of medicine.

[1]  John F. Heidelberg,et al.  Comparative genomic analysis of Vibrio cholerae: Genes that correlate with cholera endemic and pandemic disease , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[2]  G. Sherlock Analysis of large-scale gene expression data. , 2000, Current opinion in immunology.

[3]  G. Getz,et al.  Coupled two-way clustering analysis of gene microarray data. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[4]  R. Huang,et al.  Detection of multiple proteins in an antibody-based protein microarray system. , 2001, Journal of immunological methods.

[5]  N. Sampas,et al.  Molecular classification of cutaneous malignant melanoma by gene expression profiling , 2000, Nature.

[6]  Sayan Mukherjee,et al.  Molecular classification of multiple tumor types , 2001, ISMB.

[7]  R. Strausberg,et al.  High-throughput development and characterization of a genomewide collection of gene-based single nucleotide polymorphism markers by chip-based matrix-assisted laser desorption/ionization time-of-flight mass spectrometry. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[8]  D J Lockhart,et al.  Genome-wide detection of allelic imbalance using human SNPs and high-density DNA arrays. , 2000, Genome research.

[9]  M. Bittner,et al.  Expression profiling using cDNA microarrays , 1999, Nature Genetics.

[10]  A. Brazma,et al.  Gene expression data analysis. , 2001, FEBS letters.

[11]  A. Schulze,et al.  Navigating gene expression using microarrays — a technology review , 2001, Nature Cell Biology.

[12]  A. Sinha,et al.  Gene expression profile analysis by DNA microarrays: promise and pitfalls. , 2001, JAMA.

[13]  E. Southern,et al.  Molecular interactions on microarrays , 1999, Nature Genetics.

[14]  Yudong D. He,et al.  Functional Discovery via a Compendium of Expression Profiles , 2000, Cell.

[15]  J. van Helden,et al.  Interactive visualization and exploration of relationships between biological objects. , 2000, Trends in biotechnology.

[16]  U. Alon,et al.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[17]  M Schena,et al.  Microarrays: biotechnology's discovery platform for functional genomics. , 1998, Trends in biotechnology.

[18]  J. Dopazo,et al.  Methods and approaches in the analysis of gene expression data. , 2001, Journal of immunological methods.

[19]  E. Dougherty,et al.  Gene-expression profiles in hereditary breast cancer. , 2001, The New England journal of medicine.

[20]  B. Haab,et al.  Advances in protein microarray technology for protein expression and interaction profiling. , 2001, Current opinion in drug discovery & development.

[21]  S. Ekström,et al.  Protein chips based on recombinant antibody fragments: a highly sensitive approach as detected by mass spectrometry. , 2001, BioTechniques.

[22]  J. Lemonnier,et al.  Increased Expression of Protein Kinase Cα, Interleukin‐1α, and RhoA Guanosine 5′‐Triphosphatase in Osteoblasts Expressing the Ser252Trp Fibroblast Growth Factor 2 Apert Mutation: Identification by Analysis of Complementary DNA Microarray , 2001 .

[23]  W. Yung,et al.  Reactivation of insulin-like growth factor binding protein 2 expression in glioblastoma multiforme: a revelation by parallel gene expression profiling. , 1999, Cancer research.

[24]  E. Winzeler,et al.  Genomics, gene expression and DNA arrays , 2000, Nature.

[25]  S. P. Fodor,et al.  High density synthetic oligonucleotide arrays , 1999, Nature Genetics.

[26]  K R Hess,et al.  Microarrays: handling the deluge of data and extracting reliable information. , 2001, Trends in biotechnology.

[27]  T. Gaasterland,et al.  Making the most of microarray data , 2000, Nature Genetics.

[28]  Partha S. Vasisht Computational Analysis of Microarray Data , 2003 .

[29]  G. Church,et al.  Systematic determination of genetic network architecture , 1999, Nature Genetics.

[30]  Jason E. Stewart,et al.  Minimum information about a microarray experiment (MIAME)—toward standards for microarray data , 2001, Nature Genetics.

[31]  Jeffrey T. Chang,et al.  Basic microarray analysis: grouping and feature reduction. , 2001, Trends in biotechnology.

[32]  S. Fields,et al.  Proteomics. Proteomics in genomeland. , 2001, Science.

[33]  Ash A. Alizadeh,et al.  Towards a novel classification of human malignancies based on gene expression patterns , 2001, The Journal of pathology.

[34]  Todd,et al.  Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning , 2002, Nature Medicine.

[35]  A. Sajantila,et al.  Y-chromosomal SNPs in Finno-Ugric-speaking populations analyzed by minisequencing on microarrays. , 2001, Genome research.

[36]  T. Bumol,et al.  Genetic information, genomic technologies, and the future of drug discovery. , 2001, JAMA.

[37]  Ronald W. Davis,et al.  Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray , 1995, Science.

[38]  J. Sudbø,et al.  Gene-expression profiles in hereditary breast cancer. , 2001, The New England journal of medicine.

[39]  J. Lemonnier,et al.  Increased expression of protein kinase Calpha, interleukin-1alpha, and RhoA guanosine 5'-triphosphatase in osteoblasts expressing the Ser252Trp fibroblast growth factor 2 receptor Apert mutation: identification by analysis of complementary DNA microarray. , 2001, Journal of bone and mineral research : the official journal of the American Society for Bone and Mineral Research.

[40]  S. Fields Proteomics in Genomeland , 2001, Science.

[41]  David E. Misek,et al.  Distinctive molecular profiles of high-grade and low-grade gliomas based on oligonucleotide microarray analysis. , 2001, Cancer research.

[42]  George M. Whitesides,et al.  The art of Building Small. , 2001 .

[43]  D Haussler,et al.  Knowledge-based analysis of microarray gene expression data by using support vector machines. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[44]  Ash A. Alizadeh,et al.  Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling , 2000, Nature.

[45]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[46]  Jan Komorowski,et al.  Predicting Gene Function from Gene Expressions and Ontologies , 2000, Pacific Symposium on Biocomputing.

[47]  G. A. Whitmore,et al.  Importance of replication in microarray gene expression studies: statistical methods and evidence from repetitive cDNA hybridizations. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[48]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[49]  Uwe Claussen,et al.  Mass spectrometry meets chip technology: A new proteomic tool in cancer research? , 2001, Electrophoresis.

[50]  Andrea Califano,et al.  Analysis of Gene Expression Microarrays for Phenotype Classification , 2000, ISMB.

[51]  I N Sarkar,et al.  Systematic analysis of DNA microarray data: ordering and interpreting patterns of gene expression. , 2001, Genome research.

[52]  J. Mesirov,et al.  Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[53]  M. Gerstein,et al.  Global Analysis of Protein Activities Using Proteome Chips , 2001, Science.

[54]  F. Barany,et al.  Mutation Detection in K‐ras, BRCA1, BRCA2, and p53 Using PCR/LDR and a Universal DNA Microarray , 2000, Annals of the New York Academy of Sciences.

[55]  Jie Liu,et al.  Genetic events associated with arsenic‐induced malignant transformation: applications of cDNA microarray technology † , 2001, Molecular carcinogenesis.