In the first issue of 2017 of this journal, Genomics, Proteomics & Bioinformatics, a special database article entitled “GSA: Genome Sequence Archive” [1] was published. The article briefly introduces the platform developed by its authors at the BIG Data Center (BIGD) of the Beijing Institute of Genomics (BIG), Chinese Academy of Sciences (CAS). The aim of the GSA project is to collect, integrate, and archive raw sequence data submitted by domestic and international users. It is one of the major activities carried out by a team of around 50 young bioinformaticians at BIGD. In addition to the GSA system, the team is working on several service-oriented bioinformatics projects, as described in one of their recent publications [2].

The past half century has witnessed great advances in molecular biology. The deciphering of the genetic code and the establishment of the central dogma, following the discovery of the DNA double helix, formed a solid theoretical basis for the life sciences. Meanwhile, the influential work of Frederick Sanger and others in determining peptide, tRNA, and DNA sequences, together with the fundamental efforts of John Kendrew and Max Perutz to solve the three-dimensional structures of proteins, marked the beginning of the accumulation of molecular biological data. Protein sequence databases
[1] Haruki Nakamura, et al. The Protein Data Bank at 40: reflecting on the past to prepare for the future. Structure, 2012.
[2] International Human Genome Sequencing Consortium. Initial sequencing and analysis of the human genome. Nature, 2001.
[3] SH Song, et al. The BIG Data Center: from deposition to integration to translation. Nucleic Acids Res., 2016.
[4] Walter Gilbert, et al. Towards a paradigm shift in biology. Nature, 1991.
[5] Nikos Kyrpides, et al. Genomes OnLine Database (GOLD) v.6: data updates and feature enhancements. Nucleic Acids Res., 2016.
[6] Christopher P Austin, et al. Prepublication data sharing. Nature, 2009.
[7] Jun Yu, et al. Bioinformatics in China: A Personal Perspective. PLoS Comput. Biol., 2008.
[8] Qian Zhang, et al. GSA: Genome Sequence Archive. Genom. Proteom. Bioinform., 2017.
[9] D. McCormick. Sequence the Human Genome. Bio/Technology, 1986.