CandidaDB: a genome database for Candida albicans pathogenomics

CandidaDB is a database dedicated to the genome of the most prevalent systemic fungal pathogen of humans, Candida albicans. CandidaDB is based on an annotation of the Stanford Genome Technology Center C.albicans genome sequence data by the European Galar Fungail Consortium. CandidaDB Release 2.0 (June 2004) contains information pertaining to Assembly 19 of the genome of C.albicans strain SC5314. The current release contains 6244 annotated entries corresponding to 130 tRNA genes and 5917 protein-coding genes. For these, it provides tentative functional assignments along with numerous pre-run analyses that can assist the researcher in the evaluation of gene function for the purpose of specific or large-scale analysis. CandidaDB is based on GenoList, a generic relational data schema and a World Wide Web interface that has been adapted to the handling of eukaryotic genomes. The interface allows users to browse easily through genome data and retrieve information. CandidaDB also provides more elaborate tools, such as pattern searching, that are tightly connected to the overall browsing system. As the C.albicans genome is diploid and still incompletely assembled, CandidaDB provides tools to browse the genome by individual supercontigs and to examine information about allelic sequences obtained from complementary contigs. CandidaDB is accessible at http://genolist.pasteur.fr/CandidaDB.

[1]  Stephen G Oliver,et al.  Proteomic response to amino acid starvation inCandida albicans and Saccharomyces cerevisiae , 2004, Proteomics.

[2]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology Information: update , 2004, Nucleic acids research.

[3]  A. Krogh,et al.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. , 2001, Journal of molecular biology.

[4]  Cathy H. Wu,et al.  UniProt: the Universal Protein knowledgebase , 2004, Nucleic Acids Res..

[5]  M. Borodovsky,et al.  GeneMark.hmm: new solutions for gene finding. , 1998, Nucleic acids research.

[6]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[7]  Anton J. Enright,et al.  An efficient algorithm for large-scale detection of protein families. , 2002, Nucleic acids research.

[8]  Christophe d'Enfert,et al.  Stage‐specific gene expression of Candida albicans in human blood , 2003, Molecular microbiology.

[9]  Matthew Berriman,et al.  GeneDB: a resource for prokaryotic and eukaryotic organisms , 2004, Nucleic Acids Res..

[10]  S. Brunak,et al.  SHORT COMMUNICATION Identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites , 1997 .

[11]  Philippe Glaser,et al.  CAAT-Box, contigs-Assembly and Annotation Tool-Box for genome sequencing projects , 2004, Bioinform..

[12]  R. Wenzel Nosocomial candidemia: risk factors and attributable mortality. , 1995, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[13]  Kim Rutherford,et al.  Artemis: sequence visualization and annotation , 2000, Bioinform..

[14]  Ronald N. Jones,et al.  Bloodstream Infections Due to Candida Species: SENTRY Antimicrobial Surveillance Program in North America and Latin America, 1997-1998 , 2000, Antimicrobial Agents and Chemotherapy.

[15]  Michael Gribskov,et al.  Combining evidence using p-values: application to sequence homology searches , 1998, Bioinform..

[16]  S. Eddy,et al.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. , 1997, Nucleic acids research.

[17]  Dmitrij Frishman,et al.  The PEDANT genome database , 2003, Nucleic Acids Res..

[18]  B. Hube,et al.  Secreted lipases of Candida albicans: cloning, characterisation and expression analysis of a new gene family with at least ten members , 2000, Archives of Microbiology.

[19]  B. Hube,et al.  Multiplicity of genes encoding secreted aspartic proteinases in Candida species , 1994, Molecular microbiology.

[20]  D. Lipman,et al.  Improved tools for biological sequence comparison. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[21]  B. Dujon,et al.  Genomic Exploration of the Hemiascomycetous Yeasts: 3. Methods and strategies used for sequence analysis and annotation , 2000, FEBS letters.

[22]  George Newport,et al.  The diploid genome sequence of Candida albicans. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Alix T. Coste,et al.  Comparison of Gene Expression Profiles of Candida albicans Azole-Resistant Clinical Isolates and Laboratory Strains Exposed to Drugs Inducing Multidrug Transporters , 2004, Antimicrobial Agents and Chemotherapy.

[24]  Concha Gil,et al.  Analysis of the Candida albicans proteome. II. Protein information technology on the Net (update 2002). , 2003, Journal of chromatography. B, Analytical technologies in the biomedical and life sciences.

[25]  Kara Dolinski,et al.  Saccharomyces Genome Database (SGD) provides tools to identify and analyze sequences from Saccharomyces cerevisiae and related sequences from other organisms , 2004, Nucleic Acids Res..

[26]  K. Barker,et al.  Genome-Wide Expression Profile Analysis Reveals Coordinately Regulated Genes Associated with Stepwise Acquisition of Azole Resistance in Candida albicans Clinical Isolates , 2003, Antimicrobial Agents and Chemotherapy.

[27]  Antoine Danchin,et al.  SubtiList: the reference database for the Bacillus subtilis genome , 2002, Nucleic Acids Res..

[28]  Judith Berman,et al.  Candida albicans: A molecular revolution built on lessons from budding yeast , 2002, Nature Reviews Genetics.

[29]  Robert D. Finn,et al.  The Pfam protein families database , 2004, Nucleic Acids Res..

[30]  C. d’Enfert,et al.  Candida albicans Biofilms: a Developmental State Associated With Specific and Stable Gene Expression Patterns , 2004, Eukaryotic Cell.