PaGenBase: A Pattern Gene Database for the Global and Dynamic Understanding of Gene Function

Pattern genes are a group of genes that have a modularized expression behavior under serial physiological conditions. The identification of pattern genes will provide a path toward a global and dynamic understanding of gene functions and their roles in particular biological processes or events, such as development and pathogenesis. In this study, we present PaGenBase, a novel repository for the collection of tissue- and time-specific pattern genes, including specific genes, selective genes, housekeeping genes and repressed genes. The PaGenBase database is now freely accessible at http://bioinf.xmu.edu.cn/PaGenBase/. In the current version (PaGenBase 1.0), the database contains 906,599 pattern genes derived from the literature or from data mining of more than 1,145,277 gene expression profiles in 1,062 distinct samples collected from 11 model organisms. Four statistical parameters were used to quantitatively evaluate the pattern genes. Moreover, three methods (quick search, advanced search and browse) were designed for rapid and customized data retrieval. The potential applications of PaGenBase are also briefly described. In summary, PaGenBase will serve as a resource for the global and dynamic understanding of gene function and will facilitate high-level investigations in a variety of fields, including the study of development, pathogenesis and novel drug discovery.

[1]  J. Warrington,et al.  Comparison of human adult and fetal expression and identification of 535 housekeeping/maintenance genes. , 2000, Physiological genomics.

[2]  W. Willis,et al.  Haploinsufficiency of protamine-1 or -2 causes infertility in mice , 2001, Nature Genetics.

[3]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology Information: update , 2004, Nucleic acids research.

[4]  L. Domenjoud,et al.  Chromosomal localization of the human protamine genes, PRM1 and PRM2, to 16p13.3 by in situ hybridization , 1990, Human Genetics.

[5]  S. Batalov,et al.  A gene atlas of the mouse and human protein-encoding transcriptomes. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[6]  N. Dhanasekaran,et al.  Proliferation-specific genes activated by Galpha(12): a role for PDGFRalpha and JAK3 in Galpha(12)-mediated cell proliferation. , 2004, Cell biochemistry and biophysics.

[7]  M. Griswold,et al.  The Murine Testicular Transcriptome: Characterizing Gene Expression in the Testis During the Progression of Spermatogenesis1 , 2004, Biology of reproduction.

[8]  R. Barber,et al.  GAPDH as a housekeeping gene: analysis of GAPDH mRNA expression in a panel of 72 human tissues. , 2005, Physiological genomics.

[9]  K Dheda,et al.  Real-time RT-PCR normalisation; strategies and considerations , 2005, Genes and Immunity.

[10]  Yizheng Li,et al.  Detecting and profiling tissue-selective genes. , 2006, Physiological genomics.

[11]  Stuart Aitken,et al.  Mining housekeeping genes with a Naive Bayes classifier , 2006, BMC Genomics.

[12]  Y. Nishida,et al.  Housekeeping and tissue-specific genes in mouse tissues , 2007, BMC Genomics.

[13]  Tao Tao,et al.  GEPS: the Gene Expression Pattern Scanner , 2006, Nucleic Acids Res..

[14]  S. S. Koh,et al.  Identification of novel universal housekeeping genes by statistical analysis of microarray data. , 2007, Journal of biochemistry and molecular biology.

[15]  Jiang Qian,et al.  TiGER: A database for tissue-specific gene expression and regulation , 2008, BMC Bioinformatics.

[16]  W. Kamps,et al.  Evidence Based Selection of Housekeeping Genes , 2007, PloS one.

[17]  Joaquín Dopazo,et al.  GEPAS, a web-based tool for microarray data analysis and interpretation , 2008, Nucleic Acids Res..

[18]  T. Nikolskaya,et al.  A comprehensive functional analysis of tissue specificity of human gene expression , 2008, BMC Biology.

[19]  C. Lindskog,et al.  A Web-based Tool for in Silico Biomarker Discovery Based on Tissue-specific Protein Profiles in Normal and Cancer Tissues*S , 2008, Molecular & Cellular Proteomics.

[20]  Francisco S. Roque,et al.  A large-scale analysis of tissue-specific pathology and gene expression of human disease genes and complexes , 2008, Proceedings of the National Academy of Sciences.

[21]  Jon W. Huss,et al.  BioGPS: an extensible and customizable portal for querying and organizing gene annotation resources , 2009, Genome Biology.

[22]  Karl-Heinz Glatting,et al.  TissueDistributionDBs: a repository of organism-specific tissue-distribution profiles , 2010 .

[23]  T. Friedmann,et al.  Deficiency of the housekeeping gene hypoxanthine-guanine phosphoribosyltransferase (HPRT) dysregulates neurogenesis. , 2010, Molecular therapy : the journal of the American Society of Gene Therapy.

[24]  Chi Zhang,et al.  TiSGeD: a database for tissue-specific genes , 2010, Bioinform..

[25]  E. Lundberg,et al.  Towards a knowledge-based Human Protein Atlas , 2010, Nature Biotechnology.

[26]  Monte Westerfield,et al.  ZFIN: enhancements and updates to the zebrafish model organism database , 2010, Nucleic Acids Res..

[27]  Dennis B. Troup,et al.  NCBI GEO: archive for functional genomics data sets—10 years on , 2010, Nucleic Acids Res..

[28]  Shunmin He,et al.  Predicting Housekeeping Genes Based on Fourier Analysis , 2011, PloS one.

[29]  Adipocyte differentiation-specific gene transcriptional response to C18 unsaturated fatty acids plus insulin , 2012, Pflügers Archiv - European Journal of Physiology.

[30]  C. Pilarsky,et al.  Integrated Proteomic Profiling of Cell Line Conditioned Media and Pancreatic Juice for the Identification of Pancreatic Cancer Biomarkers , 2011, Molecular & Cellular Proteomics.

[31]  M. Long,et al.  Accelerated Recruitment of New Brain Development Genes into the Human Genome , 2011, PLoS biology.

[32]  Ibrahim Emam,et al.  ArrayExpress update—an archive of microarray and high-throughput sequencing-based functional genomics experiments , 2010, Nucleic Acids Res..

[33]  K. Steger,et al.  Prognostic markers for competent human spermatozoa: fertilizing capacity and contribution to the embryo. , 2011, International journal of andrology.

[34]  Lieven Thorrez,et al.  Tissue-specific disallowance of housekeeping genes: the other face of cell differentiation. , 2011, Genome research.

[35]  Sethuraman Panchanathan,et al.  FlyExpress: visual mining of spatiotemporal patterns for genes and publications in Drosophila embryogenesis , 2011, Bioinform..

[36]  Hao Wang,et al.  PaGeFinder: quantitative identification of spatiotemporal pattern genes , 2012, Bioinform..

[37]  D. Balciunas,et al.  The lineage-specific gene ponzr1 is essential for zebrafish pronephric and pharyngeal arch development , 2012, Development.

[38]  Ugur Sahin,et al.  RNA-Seq Atlas - a reference database for gene expression profiling in normal tissue by next-generation sequencing , 2012, Bioinform..

[39]  Gautier Koscielny,et al.  Ensembl 2012 , 2011, Nucleic Acids Res..

[40]  M. Purugganan,et al.  Genome-Wide Patterns of Arabidopsis Gene Expression in Nature , 2012, PLoS genetics.