ENCODE whole-genome data in the UCSC genome browser (2011 update)

The ENCODE project is an international consortium with a goal of cataloguing all the functional elements in the human genome. The ENCODE Data Coordination Center (DCC) at the University of California, Santa Cruz serves as the central repository for ENCODE data. In this role, the DCC offers a collection of high-throughput, genome-wide data generated with technologies such as ChIP-Seq, RNA-Seq, DNA digestion and others. This data helps illuminate transcription factor-binding sites, histone marks, chromatin accessibility, DNA methylation, RNA expression, RNA binding and other cell-state indicators. It includes sequences with quality scores, alignments, signals calculated from the alignments, and in most cases, element or peak calls calculated from the signal data. Each data set is available for visualization and download via the UCSC Genome Browser (http://genome.ucsc.edu/). ENCODE data can also be retrieved using a metadata system that captures the experimental parameters of each assay. The ENCODE web portal at UCSC (http://encodeproject.org/) provides information about the ENCODE data and links for access.

[1]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[2]  M. King,et al.  Evolution at two levels in humans and chimpanzees. , 1975, Science.

[3]  M. Gerstein,et al.  Unlocking the secrets of the genome , 2009, Nature.

[4]  Christopher Gignoux,et al.  The 1000 Genomes Project: new opportunities for research and social challenges , 2010, Genome Medicine.

[5]  Timothy R Hughes,et al.  Conserved expression without conserved regulatory sequence: the more things change, the more they stay the same. , 2010, Trends in genetics : TIG.

[6]  John C. Castle,et al.  Evolutionarily conserved and diverged alternative splicing events show different expression and functional profiles , 2005, Nucleic acids research.

[7]  A. D. Hershey,et al.  INDEPENDENT FUNCTIONS OF VIRAL PROTEIN AND NUCLEIC ACID IN GROWTH OF BACTERIOPHAGE , 1952, The Journal of general physiology.

[8]  Morgan C. Giddings,et al.  Incorporating sequence information into the scoring function: a hidden Markov model for improved peptide identification , 2008, Bioinform..

[9]  J. Harrow,et al.  GENCODE: producing a reference annotation for ENCODE , 2006, Genome Biology.

[10]  S. Tenenbaum,et al.  Advances in RIP-chip analysis : RNA-binding protein immunoprecipitation-microarray profiling. , 2008, Methods in molecular biology.

[11]  Jason A Greenbaum,et al.  Construction of a genome-scale structural map at single-nucleotide resolution. , 2007, Genome research.

[12]  William Stafford Noble,et al.  Global mapping of protein-DNA interactions in vivo by digital genomic footprinting , 2009, Nature Methods.

[13]  Ting Wang,et al.  The UCSC Genome Browser Database: update 2009 , 2008, Nucleic Acids Res..

[14]  William Stafford Noble,et al.  Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project , 2007, Nature.

[15]  Galt P. Barber,et al.  BigWig and BigBed: enabling browsing of large distributed datasets , 2010, Bioinform..