Removing near-neighbour redundancy from large protein sequence collections
[1] Miguel A. Andrade-Navarro, et al. Automatic Annotation for Biological Sequences by Extraction of Keywords from MEDLINE Abstracts: Development of a Prototype System, 1997, ISMB.
[2] Chris Sander, et al. GeneQuiz: A Workbench for Sequence Analysis, 1994, ISMB.
[3] B. Barrell, et al. Life with 6000 Genes, 1996, Science.
[4] Graziano Pesole, et al. CLEANUP: a fast computer program for removing redundancies from nucleotide sequence databases, 1996, Comput. Appl. Biosci.
[5] G J Williams, et al. The Protein Data Bank: a computer-based archival file for macromolecular structures, 1978, Archives of biochemistry and biophysics.
[6] Isidore Rigoutsos, et al. FLASH: a fast look-up algorithm for string homology, 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.
[7] John C. Wootton, et al. Non-globular Domains in Protein Sequences: Automated Segmentation Using Complexity Measures, 1994, Comput. Chem.
[8] Thomas L. Madden, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, 1997, Nucleic acids research.
[9] Chris Sander, et al. Frame: detection of genomic sequencing errors, 1998, Bioinform.
[10] Rolf Apweiler, et al. The SWISS-PROT protein sequence data bank and its new supplement TREMBL, 1996, Nucleic Acids Res.
[11] M S Waterman, et al. Identification of common molecular subsequences, 1981, Journal of molecular biology.
[12] Larry Wall, et al. Programming Perl, 1991.
[13] U. Hobohm, et al. Selection of representative protein data sets, 1992, Protein science : a publication of the Protein Society.
[14] D. Lipman, et al. Improved tools for biological sequence comparison, 1988, Proceedings of the National Academy of Sciences of the United States of America.
[15] J. Wootton, et al. Construction of validated, non-redundant composite protein sequence databases, 1990, Protein engineering.