TOMATOMICS: A Web Database for Integrated Omics Information in Tomato

Solanum lycopersicum (tomato) is an important agronomic crop and a major model fruit-producing plant. To facilitate basic and applied research, comprehensive experimental resources and omics information on tomato have been developed and made available. Mutant lines and cDNA clones from a dwarf cultivar, Micro-Tom, are two of these genetic resources. Large-scale sequencing data for ESTs and full-length cDNAs from Micro-Tom continue to be gathered. In conjunction with the reference genome sequence of another cultivar, Heinz 1706, the Micro-Tom experimental resources have facilitated comprehensive functional analyses. To make omics information for tomato biology easier to acquire, we have integrated the information on the Micro-Tom experimental resources with the Heinz 1706 genome sequence. We have also inferred gene structures by comparing the Heinz 1706 genome sequence with transcriptome sequences, which comprise Micro-Tom full-length cDNAs and Heinz 1706 RNA-seq data stored in the KaFTom and Sequence Read Archive databases. To provide large-scale omics information with streamlined connectivity, we have developed and maintain a web database, TOMATOMICS (http://bioinf.mind.meiji.ac.jp/tomatomics/). In TOMATOMICS, information on the cDNA clone resources, full-length mRNA sequences, gene structures, expression profiles and functional annotations of genes is accessible through search functions and a genome browser with an intuitive graphical interface.
