Improving MinHash via the containment index with applications to metagenomic analysis
暂无分享,去创建一个
[1] Brian D. Ondov,et al. Mash: fast genome and metagenome distance estimation using MinHash , 2015, Genome Biology.
[2] Evan Bolton,et al. Database resources of the National Center for Biotechnology Information , 2017, Nucleic Acids Res..
[3] Andrei Z. Broder,et al. On the resemblance and containment of documents , 1997, Proceedings. Compression and Complexity of SEQUENCES 1997 (Cat. No.97TB100171).
[4] Carl Kingsford,et al. Fast Search of Thousands of Short-Read Sequencing Experiments , 2015, Nature Biotechnology.
[5] T. Thomas,et al. GemSIM: general, error-model based simulator of next-generation sequencing data , 2012, BMC Genomics.
[6] S. Tringe,et al. Tackling soil diversity with the assembly of large, complex metagenomes , 2014, Proceedings of the National Academy of Sciences.
[7] Björn Andersson,et al. Classification of DNA sequences using Bloom filters , 2010, Bioinform..
[8] Alan M. Frieze,et al. Min-Wise Independent Permutations , 2000, J. Comput. Syst. Sci..
[9] Sven Kosub,et al. A note on the triangle inequality for the Jaccard distance , 2016, Pattern Recognit. Lett..
[10] Edith Cohen,et al. Finding interesting associations without support pruning , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).
[11] Justin Chu,et al. BioBloom tools: fast, accurate and memory-efficient host species sequence screening using bloom filters , 2014, Bioinform..
[12] Luiz Irber,et al. sourmash: a library for MinHash sketching of DNA , 2016, J. Open Source Softw..
[13] Richard M. Karp,et al. Faster and More Accurate Sequence Alignment with SNAP , 2011, ArXiv.
[14] P. Flajolet,et al. HyperLogLog: the analysis of a near-optimal cardinality estimation algorithm , 2007 .
[15] Abhinandan Das,et al. Google news personalization: scalable online collaborative filtering , 2007, WWW '07.
[16] William Pugh,et al. Skip Lists: A Probabilistic Alternative to Balanced Trees , 1989, WADS.
[17] Burton H. Bloom,et al. Space/time trade-offs in hash coding with allowable errors , 1970, CACM.
[18] Páll Melsted,et al. Efficient counting of k-mers in DNA sequences using a bloom filter , 2011, BMC Bioinformatics.
[19] Rayan Chikhi,et al. Space-efficient and exact de Bruijn graph representation based on a Bloom filter , 2012, Algorithms for Molecular Biology.
[20] Arend Hintze,et al. Scaling metagenome sequence assembly with probabilistic de Bruijn graphs , 2011, Proceedings of the National Academy of Sciences.
[21] Michael A. Bender,et al. Don't Thrash: How to Cache Your Hash on Flash , 2011, Proc. VLDB Endow..
[22] Xiaolong Wu,et al. BLESS: Bloom filter-based error correction solution for high-throughput sequencing reads , 2014, Bioinform..
[23] Bin Fan,et al. Cuckoo Filter: Practically Better Than Bloom , 2014, CoNEXT.
[24] Harald Niederreiter,et al. Probability and computing: randomized algorithms and probabilistic analysis , 2006, Math. Comput..