论文信息 - TIARA genome database: update 2013 - 字舞流文

TIARA genome database: update 2013

The Total Integrated Archive of short-Read and Array (TIARA; http://tiara.gmi.ac.kr) database stores and integrates human genome data generated from multiple technologies including next-generation sequencing and high-resolution comparative genomic hybridization array. The TIARA genome browser is a powerful tool for the analysis of personal genomic information by exploring genomic variants such as SNPs, indels and structural variants simultaneously. As of September 2012, the TIARA database provides raw data and variant information for 13 sequenced whole genomes, 16 sequenced transcriptomes and 33 high resolution array assays. Sequencing reads are available at a depth of ∼30× for whole genomes and 50× for transcriptomes. Information on genomic variants includes a total of ∼9.56 million SNPs, 23 025 of which are non-synonymous SNPs, and ∼1.19 million indels. In this update, by adding high coverage sequencing of additional human individuals, the TIARA genome database now provides an extensive record of rare variants in humans. Following TIARA’s fundamentally integrative approach, new transcriptome sequencing data are matched with whole-genome sequencing data in the genome browser. Users can here observe, for example, the expression levels of human genes with allele-specific quantification. Improvements to the TIARA genome browser include the intuitive display of new complex and large-scale data sets.

Jong-Il Kim | Dongwan Hong | Sung-Soo Park | Thomas Bleazard | Jongkeun Lee | Hyunchul Jung | Young Seok Ju | Sujung Kim | Saet-Byeol Yu | Jeong-Sun Seo

[1] Joshua M. Korn,et al. Integrated detection and population-genetic analysis of SNPs and copy number variation , 2008, Nature Genetics.

[2] Ryan E. Mills,et al. Discovery of common Asian copy number variants using integrated high-resolution array CGH and massively parallel DNA sequencing , 2010, Nature Genetics.

[3] Adam A. Margolin,et al. The Cancer Cell Line Encyclopedia enables predictive modeling of anticancer drug sensitivity , 2012, Nature.

[4] B. Williams,et al. Mapping and quantifying mammalian transcriptomes by RNA-Seq , 2008, Nature Methods.

[5] Seungbok Lee,et al. A transforming KIF5B and RET gene fusion in lung adenocarcinoma revealed from whole-genome and transcriptome sequencing. , 2012, Genome research.

[6] Yonatan Aumann,et al. Efficient Calculation of Interval Scores for DNA Copy Number Data Analysis , 2005, RECOMB.

[7] Ewa Deelman,et al. New tools and methods for direct programmatic access to the dbSNP relational database , 2010, Nucleic Acids Res..

[8] L. Feuk,et al. Detection of large-scale variation in the human genome , 2004, Nature Genetics.

[9] Jin Soo Lee,et al. FX: an RNA-Seq analysis tool on the cloud , 2012, Bioinform..

[10] Hideaki Sugawara,et al. The Sequence Read Archive , 2010, Nucleic Acids Res..

[11] Gautier Koscielny,et al. Ensembl 2012 , 2011, Nucleic Acids Res..

[12] S. Gabriel,et al. Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1. , 2010, Cancer cell.

[13] Thomas D. Wu,et al. A highly annotated whole-genome sequence of a Korean individual , 2009, Nature.

[14] H. P. Kang,et al. Extensive genomic and transcriptional diversity identified through massively parallel DNA and RNA sequencing of eighteen Korean individuals , 2011, Nature Genetics.

[15] Rasko Leinonen,et al. The sequence read archive: explosive growth of sequencing data , 2011, Nucleic Acids Res..

[16] Serban Nacu,et al. Fast and SNP-tolerant detection of complex variants and splicing in short reads , 2010, Bioinform..

[17] Mingming Jia,et al. COSMIC: mining complete cancer genomes in the Catalogue of Somatic Mutations in Cancer , 2010, Nucleic Acids Res..

[18] David Haussler,et al. The UCSC Known Genes , 2006, Bioinform..

[19] Mary Goldman,et al. The UCSC Genome Browser database: extensions and updates 2011 , 2011, Nucleic Acids Res..

[20] Gary D Bader,et al. International network of cancer genome projects , 2010, Nature.

[21] Joshua M. Korn,et al. Comprehensive genomic characterization defines human glioblastoma genes and core pathways , 2008, Nature.

[22] Dongwan Hong,et al. Reference-unbiased copy number variant analysis using CGH microarrays , 2010, Nucleic acids research.

[23] K. Sirotkin,et al. The NCBI dbGaP database of genotypes and phenotypes , 2007, Nature Genetics.

[24] Benjamin J. Raphael,et al. Integrated Genomic Analyses of Ovarian Carcinoma , 2011, Nature.

[25] Jong-Il Kim,et al. TIARA: a database for accurate analysis of multiple personal genomes based on cross-technology , 2010, Nucleic Acids Res..