kClust: fast and sensitive clustering of large protein sequence databases
暂无分享,去创建一个
[1] B. Rost,et al. Consensus sequences improve PSI-BLAST through mimicking profile–profile alignments , 2007, Nucleic acids research.
[2] Gang Liu,et al. Automatic clustering of orthologs and inparalogs shared by multiple proteomes , 2006, ISMB.
[3] Tao Jiang,et al. SEED: efficient clustering of next-generation sequences , 2011, Bioinform..
[4] Robert C. Edgar,et al. BIOINFORMATICS APPLICATIONS NOTE , 2001 .
[5] P. Bork,et al. A human gut microbial gene catalogue established by metagenomic sequencing , 2010, Nature.
[6] Adam Godzik,et al. Tolerating some redundancy significantly speeds up clustering of large protein databases , 2002, Bioinform..
[7] A. Godzik,et al. Sequence clustering strategies improve remote homology recognitions while reducing search times. , 2002, Protein engineering.
[8] Tim J. P. Hubbard,et al. SCOP: a structural classification of proteins database , 1998, Nucleic Acids Res..
[9] E. Birney,et al. Pfam: the protein families database , 2013, Nucleic Acids Res..
[10] Martin Vingron,et al. The SYSTERS protein sequence cluster set , 2000, Nucleic Acids Res..
[11] Stefan Götz,et al. SIMAP—a comprehensive database of pre-calculated protein sequence similarities, domains, annotations and clusters , 2009, Nucleic Acids Res..
[12] Anton J. Enright,et al. An efficient algorithm for large-scale detection of protein families. , 2002, Nucleic acids research.
[13] Peter B. McGarvey,et al. UniRef: comprehensive and non-redundant UniProt reference clusters , 2007, Bioinform..
[14] M. Gerstein,et al. Annotation transfer for genomics: measuring functional divergence in multi-domain proteins. , 2001, Genome research.
[15] B. Haas,et al. A Catalog of Reference Genomes from the Human Microbiome , 2010, Science.
[16] W. Pearson. Effective protein sequence comparison. , 1996, Methods in enzymology.
[17] Christian E. V. Storm,et al. Automatic clustering of orthologs and in-paralogs from pairwise species comparisons. , 2001, Journal of molecular biology.
[18] Ting-Wen Chen,et al. DODO: an efficient orthologous genes assignment tool based on domain architectures. Domain based ortholog detection , 2010, BMC Bioinformatics.
[19] N Linial,et al. ProtoMap: Automatic classification of protein sequences, a hierarchy of protein families, and local maps of the protein space , 1999, Proteins.
[20] A. Biegert,et al. HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment , 2011, Nature Methods.
[21] A. Halpern,et al. The Sorcerer II Global Ocean Sampling Expedition: Northwest Atlantic through Eastern Tropical Pacific , 2007, PLoS biology.
[22] Poethig Rs,et al. Life with 25,000 genes. , 2001 .
[23] Vincent Miele,et al. Ultra-fast sequence clustering from similarity networks with SiLiX , 2011, BMC Bioinformatics.
[24] E. Myers,et al. Basic local alignment search tool. , 1990, Journal of molecular biology.
[25] Cathy H. Wu,et al. UniProt: the Universal Protein knowledgebase , 2004, Nucleic Acids Res..
[26] L. Holm,et al. The Pfam protein families database , 2005, Nucleic Acids Res..
[27] Zhengwei Zhu,et al. CD-HIT: accelerated for clustering the next-generation sequencing data , 2012, Bioinform..
[28] Bin Ma,et al. PatternHunter: faster and more sensitive homology search , 2002, Bioinform..
[29] Liisa Holm,et al. RSDB: representative protein sequence databases have high information content , 2000, Bioinform..
[30] Nathan Linial,et al. ProtoNet 6.0: organizing 10 million protein sequences in a compact hierarchical family tree , 2011, Nucleic Acids Res..
[31] U. Hobohm,et al. Selection of representative protein data sets , 1992, Protein science : a publication of the Protein Society.
[32] Darren A. Natale,et al. The COG database: an updated version includes eukaryotes , 2003, BMC Bioinformatics.
[33] Adam Godzik,et al. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences , 2006, Bioinform..
[34] Michael J. E. Sternberg,et al. Sequencing delivers diminishing returns for homology detection: implications for mapping the protein universe , 2010, Bioinform..
[35] Damian Szklarczyk,et al. eggNOG v3.0: orthologous groups covering 1133 organisms at 41 different taxonomic ranges , 2011, Nucleic Acids Res..
[36] D. Lipman,et al. Improved tools for biological sequence comparison. , 1988, Proceedings of the National Academy of Sciences of the United States of America.
[37] C. Stoeckert,et al. OrthoMCL: identification of ortholog groups for eukaryotic genomes. , 2003, Genome research.
[38] Feng Liu,et al. BioLMiner and the BioCreative II.5 challenge , 2010, BMC Bioinformatics.