Of genomes and proteomes.

The era of complete genome sequences has arrived and with it vast amounts of data which must be annotated, cross referenced, and placed within the regulatory networks which define the physiology of an organism. One eucaryotic and three procaryotic genomes have been completed and the data made available and another 50 sequences are expected to be completed by the end of the decade. One of the first steps in the new post genome era will be to decipher the functions of the huge numbers of new open reading frames. Various approaches to investigate what unknown genes do and how genes interact together within an organism are being undertaken, including (1) the simultaneous measurement of the expression levels of all genes in a cell and (2) the mapping and quantitation of all proteins expressed within a cell. The idea of systematically mapping and identifying the total protein complement of the genome (the "proteome") arose over 20 years ago when the separation of proteins from total cell extracts by two dimensional (2D) gel electrophoresis was developed. This review will focus on the use of 2D gel electrophoresis as the basis for constructing proteome maps and on the rapid advances in mass spectrometry which will allow the large-scale, automated identification of proteins which is necessary for the creation of such databases.

[1]  M. Karas,et al.  Influence of the wavelength in high-irradiance ultraviolet laser desorption mass spectrometry of organic molecules , 1985 .

[2]  Thierry Rabilloud,et al.  Sample application by in‐gel rehydration improves the resolution of two‐dimensional electrophoresis with immobilized pH gradients in the first dimension , 1994, Electrophoresis.

[3]  Joachim Klose,et al.  Two‐dimensional electrophoresis of proteins: An updated protocol and implications for a functional analysis of the genome , 1995, Electrophoresis.

[4]  P. Højrup,et al.  Rapid identification of proteins by peptide-mass fingerprinting , 1993, Current Biology.

[5]  T. Hunkapiller,et al.  Peptide mass maps: a highly informative approach to protein identification. , 1993, Analytical biochemistry.

[6]  P. Højrup,et al.  Use of mass spectrometric molecular weight information to identify proteins in sequence databases. , 1993, Biological mass spectrometry.

[7]  E Carafoli,et al.  Protein identification in DNA databases by peptide mass fingerprinting , 1994, Protein science : a publication of the Protein Society.

[8]  J R Scherrer,et al.  The MELANIE project: From a biopsy to automatic protein map interpretation by computer , 1991, Electrophoresis.

[9]  M. Wilm,et al.  Error-tolerant identification of peptides in sequence databases by peptide sequence tags. , 1994, Analytical chemistry.

[10]  B. Barrell,et al.  Life with 6000 Genes , 1996, Science.

[11]  J. Yates,et al.  An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database , 1994, Journal of the American Society for Mass Spectrometry.

[12]  J. Fenn,et al.  Electrospray interface for liquid chromatographs and mass spectrometers. , 1985, Analytical chemistry.

[13]  J R Yates,et al.  Protein sequencing by tandem mass spectrometry. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[14]  R. Fleischmann,et al.  Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. , 1995, Science.

[15]  C. Watanabe,et al.  Identifying proteins from two-dimensional gels by molecular mass searching of peptide fragments in protein sequence databases. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[16]  N G Anderson,et al.  Design and implementation of a prototype Human Protein Index. , 1982, Clinical chemistry.

[17]  L. Hood,et al.  Internal amino acid sequence analysis of proteins separated by one- or two-dimensional gel electrophoresis after in situ protease digestion on nitrocellulose. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[18]  P. O’Farrell High resolution two-dimensional electrophoresis of proteins. , 1975, The Journal of biological chemistry.

[19]  R Kaufmann,et al.  Mass spectrometric sequencing of linear peptides by product-ion analysis in a reflectron time-of-flight mass spectrometer using matrix-assisted laser desorption ionization. , 1993, Rapid communications in mass spectrometry : RCM.

[20]  E Gianazza,et al.  Isoelectric focusing in immobilized pH gradients: principle, methodology and some applications. , 1982, Journal of biochemical and biophysical methods.

[21]  R. Aebersold,et al.  Protein identification by solid phase microextraction—capillary zone electrophoresis—microelectrospray—tandem mass spectrometry , 1996, Nature Biotechnology.

[22]  L. Hood,et al.  Electroblotting onto activated glass. High efficiency preparation of proteins from analytical sodium dodecyl sulfate-polyacrylamide gels for direct sequence analysis. , 1986, The Journal of biological chemistry.

[23]  G. Gonnet,et al.  Protein identification by mass profile fingerprinting. , 1993, Biochemical and biophysical research communications.

[24]  F. Neidhardt,et al.  Patterns of protein synthesis in E. coli: a catalog of the amount of 140 individual proteins at different growth rates , 1978, Cell.

[25]  Ploeg,et al.  Identification of sulfate starvation-regulated genes in Escherichia coli: a gene cluster involved in the utilization of taurine as a sulfur source , 1996, Journal of bacteriology.

[26]  R. Fleischmann,et al.  The Minimal Gene Complement of Mycoplasma genitalium , 1995, Science.

[27]  M. Mann,et al.  Improved mass accuracy in matrix-assisted laser desorption/ionization time-of-flight mass spectrometry of peptides , 1994, Journal of the American Society for Mass Spectrometry.

[28]  N G Anderson,et al.  Twenty years of two‐dimensional electrophoresis: Past, present and future , 1996, Electrophoresis.

[29]  D. Inzé,et al.  Alterations in the phenotype of plant cells studied by NH(2)-terminal amino acid-sequence analysis of proteins electroblotted from two-dimensional gel-separated total extracts. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Joël Vandekerckhove,et al.  Comprehensive two‐dimensional gel protein databases offer a global approach to the analysis of human cells: The transformed amnion cells (AMA) master database and its link to genome DNA sequence data , 1990, Electrophoresis.

[31]  M. Quadroni,et al.  Analysis of global responses by protein and peptide fingerprinting of proteins isolated by two-dimensional gel electrophoresis. Application to the sulfate-starvation response of Escherichia coli. , 1996, European journal of biochemistry.