The Gene Expression Omnibus Database

The Gene Expression Omnibus (GEO) database is an international public repository that archives and freely distributes high-throughput gene expression and other functional genomics data sets. Created in 2000 as a worldwide resource for gene expression studies, GEO has evolved with rapidly changing technologies and now accepts high-throughput data for many other data applications, including those that examine genome methylation, chromatin structure, and genome-protein interactions. GEO supports community-derived reporting standards that specify provision of several critical study elements including raw data, processed data, and descriptive metadata. The database not only provides access to data for tens of thousands of studies, but also offers various Web-based tools and strategies that enable users to locate data relevant to their specific interests, as well as to visualize and analyze the data. This chapter includes detailed descriptions of methods to query and download GEO data and use the analysis and visualization tools. The GEO homepage is at http://www.ncbi.nlm.nih.gov/geo/.

[1]  Stephen M. Mount,et al.  The genome sequence of Drosophila melanogaster. , 2000, Science.

[2]  Alex E. Lash,et al.  Gene Expression Omnibus: NCBI gene expression and hybridization array data repository , 2002, Nucleic Acids Res..

[3]  Ping Li,et al.  A radiosensitivity gene signature in predicting glioma prognostic via EMT pathway , 2014, Oncotarget.

[4]  Yuqian Cui,et al.  Downregulation of HNF1 homeobox B is associated with drug resistance in ovarian cancer. , 2014, Oncology reports.

[5]  Jason E. Stewart,et al.  Minimum information about a microarray experiment (MIAME)—toward standards for microarray data , 2001, Nature Genetics.

[6]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[7]  J. Berg Genome sequence of the nematode C. elegans: a platform for investigating biology. , 1998, Science.

[8]  Dennis B. Troup,et al.  NCBI GEO: mining millions of expression profiles—database and tools , 2004, Nucleic Acids Res..

[9]  B. Barrell,et al.  Life with 6000 Genes , 1996, Science.

[10]  Colin N. Dewey,et al.  Initial sequencing and comparative analysis of the mouse genome. , 2002 .

[11]  Ronald W. Davis,et al.  Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray , 1995, Science.

[12]  Sean R. Davis,et al.  NCBI GEO: archive for functional genomics data sets—update , 2012, Nucleic Acids Res..

[13]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[14]  Andreas D Baxevanis,et al.  Searching NCBI Databases Using Entrez , 2004, Current protocols in bioinformatics.

[15]  Eduardo Tejera,et al.  Co-expression network analysis and genetic algorithms for gene prioritization in preeclampsia , 2013, BMC Medical Genomics.

[16]  Ji Huang,et al.  [Serial analysis of gene expression]. , 2002, Yi chuan = Hereditas.

[17]  Andrew Smith Genome sequence of the nematode C-elegans: A platform for investigating biology , 1998 .

[18]  Gordon K Smyth,et al.  Statistical Applications in Genetics and Molecular Biology Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments , 2011 .

[19]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[20]  Ying Chen,et al.  ExpTreeDB: Web-based query and visualization of manually annotated gene expression profiling experiments of human and mouse from GEO , 2014, Bioinform..

[21]  Sean R. Davis,et al.  GEOquery: a bridge between the Gene Expression Omnibus (GEO) and BioConductor , 2007, Bioinform..