NLSdb: database of nuclear localization signals

NLSdb is a database of nuclear localization signals (NLSs) and of nuclear proteins. NLSs are short stretches of residues that mediate the transport of nuclear proteins into the nucleus. The database contains 114 experimentally determined NLSs obtained through an extensive literature search. Using 'in silico mutagenesis', this set was extended to 308 experimental and potential NLSs. The final set matched over 43% of all known nuclear proteins and matched no currently known non-nuclear protein. NLSdb contains over 6000 predicted nuclear proteins, together with their targeting signals, from the PDB and SWISS-PROT/TrEMBL databases. It also contains over 12 500 predicted nuclear proteins from six entirely sequenced eukaryotic proteomes (Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans, Arabidopsis thaliana and Saccharomyces cerevisiae). NLS motifs often co-localize with DNA-binding regions; this observation was used to annotate over 1500 DNA-binding proteins as well. NLSdb can be accessed via the web site: http://cubic.bioc.columbia.edu/db/NLSdb/.
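
The acceptance criterion described above (a motif set that matches many nuclear proteins and no known non-nuclear protein) can be illustrated with a minimal Python sketch. It assumes that candidate motifs are written as regular expressions and that a candidate is kept only if it matches at least one nuclear sequence and no non-nuclear sequence; the abstract does not specify how 'in silico mutagenesis' generates candidates, so that step is not modelled here. The function names, the toy sequences and the candidate patterns are hypothetical and are not taken from NLSdb; only PKKKRKV, the well-known SV40 large T antigen NLS, is a real signal.

```python
import re


def accept_motif(motif, nuclear, non_nuclear):
    """Keep a candidate motif only if it matches at least one nuclear
    sequence and no non-nuclear sequence (assumed acceptance rule)."""
    pattern = re.compile(motif)
    hits_nuclear = any(pattern.search(seq) for seq in nuclear)
    hits_non_nuclear = any(pattern.search(seq) for seq in non_nuclear)
    return hits_nuclear and not hits_non_nuclear


def coverage(motifs, nuclear):
    """Fraction of nuclear sequences matched by at least one motif."""
    compiled = [re.compile(m) for m in motifs]
    covered = sum(1 for seq in nuclear if any(p.search(seq) for p in compiled))
    return covered / len(nuclear)


# Hypothetical toy sequences, for illustration only.
nuclear = ["MAPKKKRKVED", "MSGKKARKLTT", "MDELLGGAAST"]
non_nuclear = ["MKKLLRKAAGG", "MSTTPLLAGGV"]

# One real NLS (SV40 large T antigen) and two hypothetical relaxed variants.
candidates = ["PKKKRKV", "KK.RK", "KK"]

accepted = [m for m in candidates if accept_motif(m, nuclear, non_nuclear)]
nuclear_coverage = coverage(accepted, nuclear)

print(accepted)
print(f"coverage of nuclear set: {nuclear_coverage:.0%}")
```

In this toy run the over-general pattern `KK` is rejected because it also occurs in a non-nuclear sequence, whereas the relaxed pattern `KK.RK` is kept and raises coverage of the nuclear set beyond what the single experimental motif achieves.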
