ChloroSSRdb: a repository of perfect and imperfect chloroplastic simple sequence repeats (cpSSRs) of green plants

Simple sequence repeats (SSRs) are regions in DNA sequence that contain repeating motifs of length 1–6 nucleotides. These repeats are ubiquitously present and are found in both coding and non-coding regions of genome. A total of 534 complete chloroplast genome sequences (as on 18 September 2014) of Viridiplantae are available at NCBI organelle genome resource. It provides opportunity to mine these genomes for the detection of SSRs and store them in the form of a database. In an attempt to properly manage and retrieve chloroplastic SSRs, we designed ChloroSSRdb which is a relational database developed using SQL server 2008 and accessed through ASP.NET. It provides information of all the three types (perfect, imperfect and compound) of SSRs. At present, ChloroSSRdb contains 124 430 mined SSRs, with majority lying in non-coding region. Out of these, PCR primers were designed for 118 249 SSRs. Tetranucleotide repeats (47 079) were found to be the most frequent repeat type, whereas hexanucleotide repeats (6414) being the least abundant. Additionally, in each species statistical analyses were performed to calculate relative frequency, correlation coefficient and chi-square statistics of perfect and imperfect SSRs. In accordance with the growing interest in SSR studies, ChloroSSRdb will prove to be a useful resource in developing genetic markers, phylogenetic analysis, genetic mapping, etc. Moreover, it will serve as a ready reference for mined SSRs in available chloroplast genomes of green plants. Database URL: www.compubio.in/chlorossrdb/

[1]  Kai F. Müller,et al.  The evolution of the plastid chromosome in land plants: gene content, gene order, gene function , 2011, Plant Molecular Biology.

[2]  Abraham E. Tucker,et al.  Simple sequence repeat variation in the Daphnia pulex genome , 2010, BMC Genomics.

[3]  D. Botstein,et al.  Construction of a genetic linkage map in man using restriction fragment length polymorphisms. , 1980, American journal of human genetics.

[4]  J. Palmer,et al.  Chloroplast DNA systematics: a review of methods and data analysis , 1994 .

[5]  M. Emes,et al.  NONPHOTOSYNTHETIC METABOLISM IN PLASTIDS. , 2003, Annual review of plant physiology and plant molecular biology.

[6]  Hampapathalu A. Nagarajaram,et al.  MICdb: database of prokaryotic microsatellites , 2003, Nucleic Acids Res..

[7]  Hampapathalu A. Nagarajaram,et al.  Genome analysis IMEx : Imperfect Microsatellite Extractor , 2007 .

[8]  W. Powell,et al.  Chloroplast microsatellites: new tools for studies in plant ecology and evolution. , 2001, Trends in ecology & evolution.

[9]  Asheesh Shanker,et al.  MitoSatPlant: mitochondrial microsatellites database of viridiplantae. , 2014, Mitochondrion.

[10]  John M. Hancock The contribution of slippage-like processes to genome evolution , 1995, Journal of Molecular Evolution.

[11]  Y. Kashi,et al.  Simple sequence repeats as advantageous mutators in evolution. , 2006, Trends in genetics : TIG.

[12]  Thomas Tuschl,et al.  The growing catalog of small RNAs and their association with distinct Argonaute/Piwi family members , 2008, Development.

[13]  J. Jurka,et al.  Microsatellites in different eukaryotic genomes: survey and analysis. , 2000, Genome research.

[14]  A. Shanker,et al.  Bioinformatically mined simple sequence repeats in UniGene of Citrus sinensis , 2007 .

[15]  Atul Grover,et al.  EuMicroSatdb: A database for microsatellites in the sequenced genomes of eukaryotes , 2007, BMC Genomics.

[16]  H. Chandler Database , 1985 .

[17]  S. Miyagishima,et al.  Chloroplast DNA Replication Is Regulated by the Redox State Independently of Chloroplast Division in Chlamydomonas reinhardtii1[C][OA] , 2013, Plant Physiology.

[18]  Gaurav Sablok,et al.  ChloroMitoSSRDB: Open Source Repository of Perfect and Imperfect Repeats in Organelle Genomes for Evolutionary Genomics , 2012, DNA research : an international journal for rapid publication of reports on genes and genomes.

[19]  B. Faircloth,et al.  Primer3—new capabilities and interfaces , 2012, Nucleic acids research.

[20]  A. Shanker Combined data from chloroplast and mitochondrial genome sequences showed paraphyly of bryophytes , 2013 .

[21]  H. Daniell,et al.  Phylogenomic evidence of bryophytes’ monophyly using complete and incomplete data sets from chloroplast proteomes , 2011, Journal of Plant Biochemistry and Biotechnology.

[22]  D. Tautz,et al.  Simple sequences. , 1994, Current opinion in genetics & development.

[23]  Margaret Staton,et al.  CMD: a Cotton Microsatellite Database resource for Gossypium genomics , 2006, BMC Genomics.

[24]  W. Powell,et al.  How much effort is required to isolate nuclear microsatellites from plants? , 2003, Molecular ecology.

[25]  Dinesh Kumar,et al.  PIPEMicroDB: microsatellite database and primer generation tool for pigeonpea genome , 2013, Database J. Biol. Databases Curation.

[26]  David S. Goodsell,et al.  Mitochondrion , 2010, Biochemistry and molecular biology education : a bimonthly publication of the International Union of Biochemistry and Molecular Biology.

[27]  Paraphyly of bryophytes inferred using chloroplast sequences , 2013 .

[28]  W. Powell,et al.  Chloroplast microsatellites: new tools for studies in plant ecology and systematics , 2001 .

[29]  A. Shanker Mining of simple sequence repeats in chloroplast genome of a parasitic liverwort: Aneura mirabilis , 2013 .

[30]  A. Shanker Computationally mined microsatellites in chloroplast genome of Pellia endiviifolia , 2014 .

[31]  Ashutosh Kumar Singh,et al.  In silico mining in expressed sequences of Neurospora crassa for identification and abundance of microsatellites. , 2007, Microbiological research.